MNIST Digit Recognition using Keras - Hello World in Neural Networks

This repository contains Python code that recognizes handwritten digits with a neural network. The data is available at the MNIST website. The neural network was built using Keras.

Environment Setup

This project uses Python 3+ and the following packages: Keras, TensorFlow, numpy, matplotlib, and various other dependencies. If you do not have such an environment, follow these steps:

1. Install Docker (https://docs.docker.com/engine/installation/)
2. Run the command 'docker pull romeo14/neuralnet-toolbox'
3. Run the command 'docker run -it romeo14/neuralnet-toolbox'
4. Run the command 'conda env list' from inside the container

If step 4 lists nn-tbx-cpu, the environment is ready.

Data Preparation

Before training the neural network, the MNIST data is loaded and prepared. This data can be found on the MNIST website. The data is split into training, validation, and test sets. Each image is a numpy array of shape (784, 1), and its label is a one-hot encoded vector over the 10 digits. For example, the label of the digit 1 is [0 1 0 0 0 0 0 0 0 0]. The data is spread uniformly over all the digits and is already in grayscale. No normalization or augmentation was applied. Code for exploring the data can be found in explore.py.
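The data layout described above can be illustrated with a minimal sketch. It assumes the standard 28x28 MNIST image size (28 * 28 = 784); the helper names one_hot and flatten_image and the all-zero dummy image are hypothetical and are not taken from explore.py or utils.py.

```python
import numpy as np

def one_hot(digit, num_classes=10):
    """Return a one-hot vector with a 1 at index `digit`."""
    label = np.zeros(num_classes, dtype=int)
    label[digit] = 1
    return label

def flatten_image(image_28x28):
    """Reshape a 28x28 grayscale image into a (784, 1) column vector."""
    return np.asarray(image_28x28).reshape(784, 1)

# Hypothetical stand-in for one MNIST image (all zeros); not real data.
dummy_image = np.zeros((28, 28))

print(flatten_image(dummy_image).shape)  # (784, 1)
print(one_hot(1))                        # [0 1 0 0 0 0 0 0 0 0]
```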

Neural Network

The neural network is a simple network with 3 layers. The first layer is the input layer with 784 neurons. The second layer consists of 15 hidden neurons. The output layer consists of 10 neurons and is activated by the softmax function. The neural network is built using Keras. The code for this can be found in train.py, which also prints a summary of the network through model.summary().
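To make the 784-15-10 layout concrete, here is a rough numpy sketch of a single forward pass. It is not the Keras model from train.py: the weights are random, the layers are assumed to be fully connected with biases, and the sigmoid activation on the hidden layer is an assumption, since only the softmax output is stated above.

```python
import numpy as np

def sigmoid(z):
    """Element-wise logistic function (assumed hidden activation)."""
    return 1.0 / (1.0 + np.exp(-z))

def softmax(z):
    """Turn a vector of scores into probabilities that sum to 1."""
    shifted = z - np.max(z)  # subtract the max for numerical stability
    exps = np.exp(shifted)
    return exps / exps.sum()

rng = np.random.default_rng(0)

# Random, untrained parameters with the layer sizes from the text.
W1 = rng.normal(scale=0.1, size=(15, 784))  # input -> hidden
b1 = np.zeros(15)
W2 = rng.normal(scale=0.1, size=(10, 15))   # hidden -> output
b2 = np.zeros(10)

def forward(x):
    """Map a flattened 784-pixel image to 10 class probabilities."""
    hidden = sigmoid(W1 @ x + b1)
    return softmax(W2 @ hidden + b2)

x = rng.random(784)  # hypothetical input, not a real MNIST image
probs = forward(x)
n_params = W1.size + b1.size + W2.size + b2.size

print(probs.shape, n_params)  # (10,) 11935
```

Under these assumptions the network has 784 * 15 + 15 + 15 * 10 + 10 = 11935 trainable parameters.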

Training

The network was trained with stochastic gradient descent (SGD) at a learning rate of 0.01 and a batch size of 128 until it reached a validation accuracy of 92%. The code for this can be found in train.py and looks like this:

train.py

if __name__ == '__main__':

    # load the data
    X_train, y_train, X_test, y_test, X_val, y_val = load_data()

    # get the model and print summary
    model = get_model()
    model.summary()

    # train the model
    model.compile(loss='categorical_crossentropy', metrics=['accuracy'], optimizer=SGD(lr=0.01))
    history = model.fit(
        X_train, y_train,
        validation_data=(X_val, y_val),
        batch_size=128,
        nb_epoch=30,  # this argument is named 'epochs' in Keras 2 and later
        verbose=2
    )

    # save the model to the filesystem
    save_model(model)

    # evaluate the model on the test data and print metrics
    metrics = model.evaluate(X_test, y_test, batch_size=128, verbose=2)
    print("Evaluated model on test data")
    # pair each metric name with its value (loss first, then accuracy)
    for metric_name, metric_value in zip(model.metrics_names, metrics):
        print('{} {}'.format(metric_name, metric_value))
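For readers curious what the SGD optimizer does on each batch, the following is a simplified numpy sketch of one mini-batch update for a single softmax layer trained with categorical cross-entropy. It is an illustration under stated assumptions, not what Keras runs internally: the data is synthetic, the layer has 20 inputs instead of 784, and the function names are hypothetical. Only the learning rate of 0.01 and the batch size of 128 come from the text above.

```python
import numpy as np

def softmax_rows(z):
    """Row-wise softmax for a batch of score vectors."""
    shifted = z - z.max(axis=1, keepdims=True)
    exps = np.exp(shifted)
    return exps / exps.sum(axis=1, keepdims=True)

def cross_entropy(W, b, X, Y):
    """Mean categorical cross-entropy of a softmax layer on (X, Y)."""
    P = softmax_rows(X @ W + b)
    return -np.mean(np.sum(Y * np.log(P + 1e-12), axis=1))

def sgd_step(W, b, X_batch, Y_batch, lr=0.01):
    """Apply one gradient step computed on a single mini-batch."""
    n = X_batch.shape[0]
    P = softmax_rows(X_batch @ W + b)
    grad_logits = (P - Y_batch) / n          # dLoss/dlogits
    W_new = W - lr * (X_batch.T @ grad_logits)
    b_new = b - lr * grad_logits.sum(axis=0)
    return W_new, b_new

# Synthetic batch of 128 samples with 20 features and 10 classes.
rng = np.random.default_rng(0)
X = rng.random((128, 20))
Y = np.eye(10)[rng.integers(0, 10, size=128)]  # one-hot labels

W = np.zeros((20, 10))
b = np.zeros(10)

loss_before = cross_entropy(W, b, X, Y)
W, b = sgd_step(W, b, X, Y)
loss_after = cross_entropy(W, b, X, Y)
print(loss_after < loss_before)  # True
```

During real training this step is repeated for every batch of 128 samples in each epoch.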

Prediction

Finally, the trained model was used to predict a few samples from the test dataset. The original and predicted labels for the selected samples were:

Dataset	Values
Original	[9, 5, 8, 0, 0, 8, 6, 6, 6, 8, 3, 5]
Predicted	[9, 4, 8, 0, 0, 8, 6, 6, 6, 8, 3, 5]
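The two rows above can be compared with a few lines of Python. This is only a small sketch using the values from the table; the variable names are chosen for illustration and do not come from predict.py.

```python
original = [9, 5, 8, 0, 0, 8, 6, 6, 6, 8, 3, 5]
predicted = [9, 4, 8, 0, 0, 8, 6, 6, 6, 8, 3, 5]

# Positions where the predicted label differs from the original label.
mismatches = [
    i for i, (truth, guess) in enumerate(zip(original, predicted))
    if truth != guess
]
accuracy = 1 - len(mismatches) / len(original)

print(mismatches)         # [1]
print(f"{accuracy:.3f}")  # 0.917
```

On this small sample only the second value differs: a 5 was predicted as a 4.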
