voter-id-text-extraction

An implementation to extract info from VoterID image and automatically fetching details from electorial website.
Electoral website : https://electoralsearch.in/##resultArea

Getting Started

Run "TextExtractVoterId.py" to extract information from the Voters ID photo.
Run "TextProcessing.py" to extract Voter ID information from textfile and obtain json file.
You will obtain "TextExtract.txt" and "Result.json" from running above two programs.
Before running the below file, edit the path of tesseract and chromedriver according to your system.
Run the "ScrapeVoterDetails.py" to scrape the data from website automatically.
If you receive an error - "TesseractNotFoundError: tesseract is not installed or it's not in your path"

1) Download tesseract and install it. Windows version is available here: "https://github.com/UB-Mannheim/tesseract/wiki"
2) Copy the path of the tesseract install and paste it line of code exact as below.
pytesseract.pytesseract.tesseract_cmd = r"C:\Program Files\Tesseract-OCR\tesseract.exe"

Installation

Use the package manager pip to install required libraries.

pip install numpy
pip install Pillow
pip install selenium
pip install pytesseract
pip install beautifulsoup4
pip install opencv-python

Environment

Python 3.6

Captcha Solver

The captcha is solved using Pytesseract.

Contributing

Please open an issue if you have any trouble or to discuss what you would like to change.

Authors

Ritesh Rajput

contact-info

Feel free to contact me to discuss any issues, questions, or comments.

Email: [email protected]
GitHub: Ritesh Rajput
LinkedIn: Ritesh Rajput

License

This project is licensed under the MIT License - see the LICENSE.md file for details

Name		Name	Last commit message	Last commit date
Latest commit History 19 Commits
src		src
.gitignore		.gitignore
FinalText.txt		FinalText.txt
LICENSE		LICENSE
README.md		README.md
README_gif.gif		README_gif.gif
RemoveNoise.jpg		RemoveNoise.jpg
Result.json		Result.json
Sample.jpg		Sample.jpg
SampleProcess.jpg		SampleProcess.jpg
Screenshot.png		Screenshot.png
TextExtract.txt		TextExtract.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

voter-id-text-extraction

Getting Started

Installation

Environment

Captcha Solver

Contributing

Authors

contact-info

License

About

Uh oh!

Releases

Packages

Languages

License

riteshrajput/voter-id-text-extraction-ocr-pytesseract

Folders and files

Latest commit

History

Repository files navigation

voter-id-text-extraction

Getting Started

Installation

Environment

Captcha Solver

Contributing

Authors

contact-info

License

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages