CNN_LSTM_CTC_Tensorflow

The images are first processed by a CNN to extract features, then these extracted features are fed into a LSTM for character recognition.

CNN+LSTM+CTC based OCR(Optical Character Recognition) implemented using tensorflow.

I trained a model with 80k images using this code and got 99.98% accuracy on test dataset (20k images). The images in both dataset:

Overview

This project is based on the great work from here

Below improvements are made:

correct the time step direction
Previously the time step direction is channel, which is incorrect. Now it has been corrected to the width direction. see here for more discussion on this issue.
optimize training scripts
Previously all training images are loaded into memroy, now a simple image generator is used to generate training batch.
metrics implementation implement the character and word accuracy in tensorflow.

Dataset

please see this issue about dataset， the lable file (a .txt file) is in the same folder with images after extracting .tar.gz file.

Prerequisite

TensorFlow 1.4
Numpy

Train the model.

python ./train_model.py

Inference

python ./eval_model.py

Name		Name	Last commit message	Last commit date
Latest commit History 42 Commits
data		data
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
cnn_lstm_otc_ocr.py		cnn_lstm_otc_ocr.py
eval_model.py		eval_model.py
helper.py		helper.py
main.py		main.py
ocr_datasets.py		ocr_datasets.py
ocr_mtrics.py		ocr_mtrics.py
preparedata.py		preparedata.py
run_all_checkpoints.py		run_all_checkpoints.py
train_model.py		train_model.py
traininglog.txt		traininglog.txt
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

CNN_LSTM_CTC_Tensorflow

Overview

Dataset

Prerequisite

Train the model.

Inference

About

Releases

Packages

Languages

License

LevinJ/CNN_LSTM_CTC_Tensorflow

Folders and files

Latest commit

History

Repository files navigation

CNN_LSTM_CTC_Tensorflow

Overview

Dataset

Prerequisite

Train the model.

Inference

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages