DecryptZero

Introduction

DecryptZero is a fully open-source OCR (Optical Character Recognition) model for chess tournament notation sheets. It runs on a model trained specifically to read chess players' handwriting (which can get very messy at times!). The model was trained with the open-source deep-text-recognition-benchmark repository (the decrypt-trainer directory in this repo).

Current State

The engine has completed training. It achieves high accuracy on the training and testing sets but is not yet integrated with the algorithms that read full notation sheets. The image preprocessing algorithms are complete, but they may be overhauled in the future for more efficient and accurate results.

What's Next?

Integrate the engine with ocr-easy.py to test on full notation sheets rather than one-word samples.

Usage

Prerequisites

If you have a CUDA-capable NVIDIA GPU, you can install CUDA by following NVIDIA's installation instructions for your operating system and distribution. The programs run much faster on a GPU but still work on a CPU. NVIDIA's list of CUDA-enabled GPUs shows whether your card is supported.
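
Once the dependencies are installed, a quick way to confirm whether the GPU will actually be used is to ask PyTorch, which EasyOCR is built on. The snippet below is a minimal sketch; the helper name cuda_available is hypothetical and not part of this repository.

```python
def cuda_available() -> bool:
    """Return True if PyTorch is installed and reports a usable CUDA device."""
    try:
        import torch  # pulled in as a dependency of EasyOCR
    except ImportError:
        # PyTorch is not installed yet, so nothing can run on the GPU.
        return False
    return bool(torch.cuda.is_available())


if __name__ == "__main__":
    print("CUDA available:", cuda_available())
```

If this prints False on a machine with an NVIDIA GPU, the usual suspects are a missing CUDA installation or a CPU-only PyTorch build.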

Dependencies

The three main libraries needed to run the programs are OpenCV, EasyOCR, and boxdetect. OpenCV processes images before the OCR model reads them so that the results are more accurate. EasyOCR is the OCR library that loads and runs the model, and boxdetect finds and creates bounding boxes around the text, producing the regions of interest (ROIs) that the OCR engine reads.

Main

You can install OpenCV with apt or as a pip package (recommended):

sudo apt-get install python3-opencv

pip install opencv-python

You can install EasyOCR and boxdetect as pip packages:

pip install easyocr

pip install boxdetect

Other

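The program is run from the command line with custom arguments, which are parsed with argparse. argparse is part of the Python 3 standard library, so it normally needs no separate installation. If it is somehow missing from your environment, you can install it with pip:

pip install argparse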

You will need to install imutils, which is used for resizing the output to be visualized properly:

pip install imutils

You will also need to install numpy for proper conversion of data types and matplotlib to visualize results:

pip install numpy matplotlib
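
After running the install commands above, it can help to confirm that every library is importable before going further. The following is a minimal sketch using only the standard library; the names REQUIRED_MODULES and missing_modules are hypothetical, and the list uses import names rather than pip package names (opencv-python is imported as cv2).

```python
from importlib.util import find_spec

# Import names (not pip names): opencv-python is imported as cv2.
REQUIRED_MODULES = ["cv2", "easyocr", "boxdetect", "imutils", "numpy", "matplotlib"]


def missing_modules(modules=REQUIRED_MODULES):
    """Return the names in `modules` that cannot be found for import."""
    return [name for name in modules if find_spec(name) is None]


if __name__ == "__main__":
    missing = missing_modules()
    if missing:
        print("Missing modules:", ", ".join(missing))
    else:
        print("All dependencies found.")
```

find_spec only locates a module without importing it, so the check stays fast even for heavy packages such as EasyOCR.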

Preparing the Model

The model files are quite large, so you will need to install Git Large File Storage (Git LFS) to handle them:

sudo apt-get install git-lfs
git lfs install

Once you have Git LFS set up, clone the repository:

git clone https://codeberg.org/KPLinux/DecryptZero

Now you must move the engine files to the ~/.EasyOCR/ directory in your home folder. However, this directory is sometimes not created until the default English model has been installed.

To create the directory, switch to the DecryptZero/ directory and run easy-test2.py:

cd /path/to/DecryptZero/

python3 easy-test2.py

This command will create the directory and install the default English model.

Now make sure you are in the decrypt-engine directory:

cd /path/to/DecryptZero/decrypt-engine/

Then, move the decryptzero.pth file to ~/.EasyOCR/model/:

mv decryptzero.pth /path/to/.EasyOCR/model/

and the decryptzero.yaml and decryptzero.py files to ~/.EasyOCR/user_network/:

mv decryptzero.yaml decryptzero.py /path/to/.EasyOCR/user_network/
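
The same file placement can be scripted. The sketch below is one possible way to do it, assuming EasyOCR uses its default storage location in the home directory; the function name install_engine is hypothetical, and unlike the mv commands it copies the files so the repository checkout stays intact.

```python
import shutil
from pathlib import Path


def install_engine(engine_dir, easyocr_home=None):
    """Copy the DecryptZero engine files into EasyOCR's model directories."""
    engine_dir = Path(engine_dir)
    # Assumption: EasyOCR uses its default storage location, ~/.EasyOCR.
    home = Path(easyocr_home) if easyocr_home else Path.home() / ".EasyOCR"

    # Destination sub-directory for each engine file.
    targets = {
        "decryptzero.pth": home / "model",
        "decryptzero.yaml": home / "user_network",
        "decryptzero.py": home / "user_network",
    }
    copied = []
    for filename, dest_dir in targets.items():
        dest_dir.mkdir(parents=True, exist_ok=True)  # create it if missing
        copied.append(Path(shutil.copy2(engine_dir / filename, dest_dir)))
    return copied
```

EasyOCR's Reader accepts a recog_network argument that names a custom network, so the installed files would presumably be loaded with something like easyocr.Reader(['en'], recog_network='decryptzero'). Whether ocr-easy.py does exactly this is an assumption.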

Running the Program

The main file is ocr-easy.py.

Currently, the only way to run the program is through the command line interface. A graphical user interface may come in the future.

To run the OCR through the CLI, you will need to pass three arguments:

--image - the path to the image you would like to OCR
--align-template - the path to the template that the image will be aligned with during preprocessing
--box-template - the path to the template that will create the bounding boxes where OCR will be performed
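
For reference, the three arguments above could be declared with argparse roughly as follows. This is a sketch, not the actual contents of ocr-easy.py: the function name build_parser, the help strings, and the choice to mark every argument as required are assumptions.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    """Build a parser for the three arguments described above."""
    parser = argparse.ArgumentParser(description="Run OCR on a chess notation sheet.")
    parser.add_argument("--image", required=True,
                        help="path to the image to OCR")
    parser.add_argument("--align-template", required=True,
                        help="path to the template used to align the image")
    parser.add_argument("--box-template", required=True,
                        help="path to the template used to create bounding boxes")
    return parser


# Parse an explicit argument list instead of sys.argv so the example is repeatable.
args = build_parser().parse_args([
    "--image", "sample/image.png",
    "--align-template", "sample/align-template.png",
    "--box-template", "sample/box-template.png",
])

# argparse turns the dashes in option names into underscores.
print(args.image, args.align_template, args.box_template)
```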

Make sure you are in the DecryptZero/ directory:

cd /path/to/DecryptZero/

Run this command to try the program on the sample image and templates:

python3 ocr-easy.py --image sample/image.png --align-template sample/align-template.png --box-template sample/box-template.png

The output will be the input image and a carbon copy of that image with bounding boxes and the detected text in each bounding box (still WIP). In the command line you will see the model generating what it thinks is the text within each bounding box.

There are currently two template arguments because the two algorithms need different inputs. The alignment algorithm looks for key points on the image and maps them to the template, which must be a carbon copy of the image. The boxing algorithm looks for grid lines on the template, which requires a preprocessed image whose gridlines have been enhanced to be darker. Because the two templates are fundamentally different in nature, the arguments cannot (yet) be combined into a single --template argument.
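
To see why the box template needs darkened gridlines, consider a toy line finder that marks a row as a grid line when most of its pixels are dark. This is a deliberately simplified projection-profile illustration in plain Python, not how boxdetect works internally; the function name find_grid_rows and both thresholds are made up for the example.

```python
def find_grid_rows(image, dark_threshold=128, min_fraction=0.8):
    """Return indices of rows in which most pixels are dark.

    `image` is a list of rows of grayscale values (0 = black, 255 = white).
    """
    rows = []
    for y, row in enumerate(image):
        dark = sum(1 for value in row if value < dark_threshold)
        if dark >= min_fraction * len(row):
            rows.append(y)
    return rows


# A 5x6 toy "sheet": white paper with one horizontal grid line at row 2.
faint = [[255] * 6 for _ in range(5)]
faint[2] = [160] * 6  # faint grey line, lighter than the threshold
dark = [[255] * 6 for _ in range(5)]
dark[2] = [30] * 6    # the same line after darkening

print(find_grid_rows(faint))  # []
print(find_grid_rows(dark))   # [2]
```

The faint line is missed entirely while the darkened one is found, which is the effect the preprocessed box template is meant to achieve.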

Potential Improvements

It would be more efficient if an alignment template/algorithm wasn't necessary to produce accurate results. In the future, an attempt will be made to create a general program that deals with all the intricacies of the sheet in one single scan.
