# Deep Discriminant Neural Network (MNIST dataset)

## Tools & Tech

- Python
- Keras
- TensorFlow

Building and training a deep neural network for classifying the MNIST dataset.

The MNIST dataset consists of 70,000 28x28 pixel grayscale images of handwritten digits: 60,000 for training and 10,000 for testing.
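Loading the dataset is straightforward, since MNIST ships with Keras; a minimal sketch (the reshaping and scaling choices here are illustrative, not taken from the repo):

```python
import tensorflow as tf

# MNIST ships with Keras: 60,000 training and 10,000 test images,
# each a 28x28 grayscale array labelled 0-9.
(x_train, y_train), (x_test, y_test) = tf.keras.datasets.mnist.load_data()

# Add a channel axis and scale pixel values to [0, 1] (a common, assumed choice).
x_train = x_train.reshape(-1, 28, 28, 1).astype("float32") / 255.0
x_test = x_test.reshape(-1, 28, 28, 1).astype("float32") / 255.0

print(x_train.shape, x_test.shape)  # (60000, 28, 28, 1) (10000, 28, 28, 1)
```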

The images in the MNIST database are a combination of two of NIST's databases: Special Database 1, containing digits written by high school students, and Special Database 3, containing digits written by employees of the United States Census Bureau.

## Architecture Approach

The implemented architecture is a variation of the LeNet-5 architecture.

### Original LeNet-5

LeNet-5 contains the basic building blocks of deep convolutional networks:

  1. Convolution layer
  2. Pooling layer
  3. Fully connected layer

LeNet-5 comprises 7 layers:

| Layer  | Layer Type      | Activation |
| ------ | --------------- | ---------- |
| Input  | Image           | -          |
| 1      | Convolution     | tanh       |
| 2      | Average Pooling | tanh       |
| 3      | Convolution     | tanh       |
| 4      | Average Pooling | tanh       |
| 5      | Convolution     | tanh       |
| 6      | FC              | tanh       |
| Output | FC              | softmax    |
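For reference, a minimal Keras sketch of this layout; the filter counts and layer sizes (6, 16, 120, 84) follow the original LeNet-5 paper rather than this table, and the trainable scaling that LeNet-5's subsampling layers applied before their tanh is omitted:

```python
from tensorflow.keras import layers, models

# Classic LeNet-5 layout (sizes assumed from the original paper).
lenet5 = models.Sequential([
    layers.Input(shape=(32, 32, 1)),                # LeNet-5 expects 32x32 inputs
    layers.Conv2D(6, (5, 5), activation="tanh"),    # 1: convolution
    layers.AveragePooling2D((2, 2)),                # 2: average pooling
    layers.Conv2D(16, (5, 5), activation="tanh"),   # 3: convolution
    layers.AveragePooling2D((2, 2)),                # 4: average pooling
    layers.Conv2D(120, (5, 5), activation="tanh"),  # 5: convolution
    layers.Flatten(),
    layers.Dense(84, activation="tanh"),            # 6: fully connected
    layers.Dense(10, activation="softmax"),         # output: fully connected
])
```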

### Variations on LeNet-5

- ReLU-softmax

ReLU-softmax was used in place of tanh-softmax. The following activation pairings were benchmarked:

  - ReLU-softmax
  - sigmoid-sigmoid
  - tanh-softmax

ReLU-softmax returned the best performance on the training and test data and was therefore implemented.
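A sketch of how such a benchmark might look; `build_variant` and its layer sizes are hypothetical stand-ins, since the README does not show the benchmarking code:

```python
from tensorflow.keras import layers, models

def build_variant(hidden_act, output_act):
    """Small stand-in model parameterised by the activation pairing."""
    return models.Sequential([
        layers.Input(shape=(28, 28, 1)),
        layers.Conv2D(32, (3, 3), activation=hidden_act),
        layers.MaxPooling2D((2, 2)),
        layers.Flatten(),
        layers.Dense(128, activation=hidden_act),
        layers.Dense(10, activation=output_act),
    ])

for hidden_act, output_act in [("relu", "softmax"),
                               ("sigmoid", "sigmoid"),
                               ("tanh", "softmax")]:
    model = build_variant(hidden_act, output_act)
    # compile, fit on the training split, then compare test accuracy
```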

- Max Pooling in place of Average Pooling

Max pooling was implemented instead of average pooling to reduce computation cost. Because the MNIST images have bright digits on a black background, max pooling, which keeps the strongest activation in each window, performs better than average pooling.
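In Keras the swap is a single layer change; for example:

```python
from tensorflow.keras import layers

# Max pooling keeps the strongest activation in each window, which suits
# MNIST's bright digits on a dark background.
pool = layers.MaxPooling2D(pool_size=(2, 2))        # used in this network
# pool = layers.AveragePooling2D(pool_size=(2, 2))  # original LeNet-5 choice
```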

- Batch Normalisation

Batch Normalisation was used between the layers of the network to speed up training and allow higher learning rates.
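A minimal functional-API sketch of the placement (the filter count is an assumption):

```python
from tensorflow.keras import layers, Input

inputs = Input(shape=(28, 28, 1))
x = layers.Conv2D(32, (3, 3), activation="relu")(inputs)
x = layers.BatchNormalization()(x)  # normalise activations batch-wise before the next layer
```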

- Dropout

In an effort to reduce interdependent learning amongst neurons and minimise overfitting, dropout was used within the network.
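For example (the 0.25 rate is an assumption; the README does not state the rates used):

```python
from tensorflow.keras import layers

# Randomly zeroes 25% of activations during training, so neurons
# cannot rely on specific co-activated neighbours.
drop = layers.Dropout(0.25)
```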

- Additional dense layer

An additional dense layer with ReLU activation and an output size of 256 was used.

The resulting architecture:

| Layer  | Layer Type         | Activation |
| ------ | ------------------ | ---------- |
| Input  | Image              | -          |
| 1      | Convolution        | relu       |
| 2      | Convolution        | relu       |
| 3      | BatchNormalization | relu       |
| 4      | Max Pooling        | -          |
| 5      | Dropout            | -          |
| 6      | Convolution        | relu       |
| 7      | Convolution        | relu       |
| 8      | BatchNormalization | relu       |
| 9      | Max Pooling        | -          |
| 10     | Dropout            | -          |
| 11     | FC                 | relu       |
| 12     | BatchNormalization | relu       |
| 13     | FC                 | relu       |
| 14     | BatchNormalization | relu       |
| 15     | FC                 | relu       |
| 16     | BatchNormalization | relu       |
| 17     | Dropout            | -          |
| Output | FC                 | softmax    |
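A minimal Keras sketch of this table; the filter counts, kernel sizes, 128/84 dense sizes, and dropout rates are assumptions, as the README only fixes the 256-unit dense layer:

```python
from tensorflow.keras import layers, models

model = models.Sequential([
    layers.Input(shape=(28, 28, 1)),
    layers.Conv2D(32, (3, 3), padding="same", activation="relu"),  # 1 (filters assumed)
    layers.Conv2D(32, (3, 3), padding="same", activation="relu"),  # 2
    layers.BatchNormalization(),                                   # 3
    layers.MaxPooling2D((2, 2)),                                   # 4
    layers.Dropout(0.25),                                          # 5 (rate assumed)
    layers.Conv2D(64, (3, 3), padding="same", activation="relu"),  # 6
    layers.Conv2D(64, (3, 3), padding="same", activation="relu"),  # 7
    layers.BatchNormalization(),                                   # 8
    layers.MaxPooling2D((2, 2)),                                   # 9
    layers.Dropout(0.25),                                          # 10 (rate assumed)
    layers.Flatten(),
    layers.Dense(256, activation="relu"),                          # 11: the additional 256-unit layer
    layers.BatchNormalization(),                                   # 12
    layers.Dense(128, activation="relu"),                          # 13 (size assumed)
    layers.BatchNormalization(),                                   # 14
    layers.Dense(84, activation="relu"),                           # 15 (size assumed)
    layers.BatchNormalization(),                                   # 16
    layers.Dropout(0.25),                                          # 17 (rate assumed)
    layers.Dense(10, activation="softmax"),                        # output
])

model.compile(optimizer="adam",
              loss="sparse_categorical_crossentropy",
              metrics=["accuracy"])
```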
