Gender Prediction Using Different CNN Architectures and Their Comparison

Motivation & Abstract

Gender prediction is in growing demand in computer vision, and the ability to predict a person's gender from an image has many uses. As the world becomes more digital by the day, it is viable to use technology such as AI to tackle such problems. For example, it can be used by biometric systems at airports, railway stations, and government buildings to verify people. The possible uses are limited only by creativity and imagination.

The AI community has no firm guidelines on how hyperparameters affect a model’s performance, and we have tried to shed some light on this issue. We also wanted to analyze why some state-of-the-art architectures perform the way they do.

Proposed Methodology

We aim to implement the VGG-16 and InceptionV3 CNN models on the “UTKFace” dataset, collect the results, and analyze them to identify possible reasons for the results obtained.

Dataset Description

UTKFace is a large-scale face dataset with a long age span (0 to 116 years old). It consists of over 20,000 face images annotated with age, gender, and ethnicity.
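UTKFace encodes these annotations in each file name as `[age]_[gender]_[race]_[date&time].jpg`, with gender `0` for male and `1` for female. The sketch below shows one way the gender label could be read from a file name. The function name and the example file names are hypothetical, and this is not necessarily how `Dataset_Seperation.py` in this repository does it.

```python
from pathlib import Path

# Gender codes used in UTKFace file names.
GENDER_LABELS = {0: "male", 1: "female"}


def parse_utkface_name(filename):
    """Return (age, gender_label) parsed from a UTKFace-style file name.

    Hypothetical helper for illustration; expects names such as
    '25_1_2_20170116174525125.jpg'.
    """
    parts = Path(filename).name.split("_")
    if len(parts) < 3:
        raise ValueError(f"unexpected file name: {filename}")
    age, gender = int(parts[0]), int(parts[1])
    if gender not in GENDER_LABELS:
        raise ValueError(f"unknown gender code {gender} in {filename}")
    return age, GENDER_LABELS[gender]


# Illustrative file name, not taken from the repository.
print(parse_utkface_name("25_1_2_20170116174525125.jpg"))
```

A helper like this could be used to sort images into ‘male’ and ‘female’ folders before training.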

Algorithm/Description of the Work

VGG-16

We used the standard VGG-16 architecture pre-trained on the ImageNet dataset for our gender classification task. The original 200x200-pixel images were resized to 224x224 before being fed into the model.

All the fully connected layers and the convolutional layers were frozen, i.e., their weights and biases were not updated by the ‘Adam’ optimizer.

InceptionV3

We used the standard InceptionV3 architecture pre-trained on the ImageNet dataset for our gender classification task. The original 200x200-pixel images were resized to 299x299 before being fed into the model.

All the fully connected layers and the convolutional layers were frozen, i.e., their weights and biases were not updated by the ‘RMSprop’ optimizer.
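The setup described for both models can be sketched in Keras roughly as follows. This is a minimal sketch, not the code from the notebooks: it assumes TensorFlow 2.x, and it assumes that a small new classification head (global average pooling plus a single sigmoid unit) is trained on top of the frozen pre-trained base. The text does not describe the head, so `build_frozen_classifier` and `CONFIGS` are hypothetical names.

```python
# Input size and optimizer per architecture, as stated in the text.
CONFIGS = {
    "vgg16": {"input_size": 224, "optimizer": "adam"},
    "inceptionv3": {"input_size": 299, "optimizer": "rmsprop"},
}


def build_frozen_classifier(name, weights="imagenet"):
    """Build a frozen pre-trained base with a trainable binary head (assumed)."""
    # Imported here so CONFIGS can be inspected without TensorFlow installed.
    import tensorflow as tf

    cfg = CONFIGS[name]
    size = cfg["input_size"]
    base_cls = {
        "vgg16": tf.keras.applications.VGG16,
        "inceptionv3": tf.keras.applications.InceptionV3,
    }[name]

    # include_top=False drops the original 1000-class ImageNet classifier.
    base = base_cls(weights=weights, include_top=False,
                    input_shape=(size, size, 3))
    # Freeze the base: the optimizer no longer updates its weights and biases.
    base.trainable = False

    model = tf.keras.Sequential([
        base,
        tf.keras.layers.GlobalAveragePooling2D(),
        # Assumed head: one sigmoid unit giving the probability of 'female'.
        tf.keras.layers.Dense(1, activation="sigmoid"),
    ])
    model.compile(optimizer=cfg["optimizer"],
                  loss="binary_crossentropy",
                  metrics=["accuracy"])
    return model
```

Calling `build_frozen_classifier("vgg16")` or `build_frozen_classifier("inceptionv3")` downloads the ImageNet weights on first use. Only the assumed head would be updated during training.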

Results

Model	Train Accuracy	Train Loss	Validation Accuracy	Validation Loss	Test Accuracy	Test Loss
VGG-16	88.9	27.13	83.86	40.11	80.38	48.19
InceptionV3	89.24	26.35	85.40	33.27	82.93	40.51
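One way to read these numbers is to compare how far each model's accuracy drops from the training set to held-out data. The short sketch below computes that gap from the accuracy values reported above. A smaller gap is commonly read as a sign of better generalization, but that is an interpretation and not a conclusion stated in this README. The helper name `accuracy_gap` is hypothetical.

```python
# Accuracy values copied from the results table above.
ACCURACY = {
    "VGG-16": {"train": 88.9, "validation": 83.86, "test": 80.38},
    "InceptionV3": {"train": 89.24, "validation": 85.40, "test": 82.93},
}


def accuracy_gap(model, split="test"):
    """Train accuracy minus accuracy on another split, in accuracy points."""
    scores = ACCURACY[model]
    return round(scores["train"] - scores[split], 2)


for name in ACCURACY:
    print(f"{name}: train-test gap = {accuracy_gap(name)}, "
          f"train-validation gap = {accuracy_gap(name, 'validation')}")
```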

Conclusion

For our gender classification task on the UTKFace dataset of over 20,000 images, we organized the dataset into ‘male’ and ‘female’ classes. The entire implementation was done on Google Colaboratory, which provided access to the high-end hardware of Google's servers. TensorFlow and Keras were the primary libraries used in the code.

We then used two CNN architectures pre-trained on the ImageNet dataset, namely VGG-16 and InceptionV3. Both models were set to train for 10 epochs, but both were stopped early because their accuracy did not improve further. From the results, InceptionV3 performs better, with higher accuracy and lower loss on the train, validation, and test sets. We analyzed the various factors that could explain this.
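The README does not say which mechanism cut training short. One common approach is patience-based early stopping, which Keras offers through its `EarlyStopping` callback. The plain-Python sketch below illustrates that logic only. The function name, the `patience` value, and the accuracy history are made up for illustration and are not results from this project.

```python
def early_stopping_epoch(val_accuracies, patience=2, min_delta=0.0):
    """Return the 1-based epoch at which patience-based early stopping halts.

    Training stops once validation accuracy has failed to improve by more
    than min_delta for `patience` consecutive epochs. If that never happens,
    all epochs are run.
    """
    best = float("-inf")
    epochs_without_improvement = 0
    for epoch, acc in enumerate(val_accuracies, start=1):
        if acc > best + min_delta:
            best = acc
            epochs_without_improvement = 0
        else:
            epochs_without_improvement += 1
            if epochs_without_improvement >= patience:
                return epoch
    return len(val_accuracies)


# Hypothetical validation-accuracy history for a run planned for 10 epochs.
history = [0.78, 0.82, 0.84, 0.84, 0.83, 0.85, 0.85, 0.86, 0.86, 0.86]
print("stopped after epoch", early_stopping_epoch(history, patience=2))
```

With this made-up history, accuracy stalls after the third epoch, so the run ends well before the planned 10 epochs.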

Brains behind this project:

Utkarsh Aggarwal 😄

Anvit Agarwal 👽

Shikhar Shrivastava 😄


anvit1618/Gender_classification_VGG-16
