Optical Character Recognition

Machine learning OCR with Convolutional Recurrent Neural Network (CRNN). The input shape is (32, 400), so if your dataset has different shapes, you need to resize the inputs or modify the architecture.

How does it work

generate dataset

python3 generate_dateset.py

Train the model

There're many ways to train the model, you can just run python3 crnn-ctc.py on your machine, but it probably has poor performance if you do not have a high-performance GPU (or TPU) and CUDA support. Or you can train your model in the cloud (e.g. AWS), but it's absolutely not free. The way I recommend is Colab, it provides you great GPUs (and TPUs) and is totally free.

You can take a look at ocr.ipynb.

Load pre-trained model

Download best_model.hdf5 and put it into the current directory.

A pre-trained model here: best_model.hdf5

Make a prediction

python3 generate_dataset.py 1 | xargs python3 predict.py

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
ocr		ocr
.gitignore		.gitignore
Arial.ttf		Arial.ttf
Microsoft Sans Serif.ttf		Microsoft Sans Serif.ttf
README.md		README.md
Roboto-Regular.ttf		Roboto-Regular.ttf
character-segment-cnn.py		character-segment-cnn.py
crnn-ctc.py		crnn-ctc.py
generate_dataset.py		generate_dataset.py
ocr.ipynb		ocr.ipynb
predict.py		predict.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Optical Character Recognition

How does it work

generate dataset

Train the model

Load pre-trained model

Make a prediction

About

Releases

Packages

Languages

FX-HAO/optical-character-recognition

Folders and files

Latest commit

History

Repository files navigation

Optical Character Recognition

How does it work

generate dataset

Train the model

Load pre-trained model

Make a prediction

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages