Python version is now deprecated, please use C++ version from Release
Added 7 missing DLL files.
New release now added support for RTX 3000 series cards
Created by PatchyVideo
Based on CRAFT scene text detector
Translation is done using Google Translate with python package googletrans
You need an NVIDIA GPU no older than GTX 700 series
- Download
textdet_300k.pth
andocr_640k.pth
from release and put them into the project folder - Install NVIDIA Driver
- Install CUDA 10.2
- Install Python3 environment
- Install PyTorch 1.5.1 from offical website
- Run
pip3 install -r requirements.txt
- Run
python main.py --out NUL --verbose --skip_frame 1 --video /path/to/video/file
and have fun
You can specifiy an output srt subtitle file of recognized texts using --out /path/to/out/srt
parameter
You can specifiy an output srt subtitle file of translated texts using --out_translated /path/to/out/srt
parameter
No windows will be shown if --verbose
parameter is not given
--skip_frame 1
controls how many frames to skip in-between two frames used for recognition, higher the value faster the program will run, but the resulting srt file will not be as accurate as setting this value to 0
Use --language
to specifiy which language you want to translate to, list of language codes can be found here
- The current text detector is not very accurate, new model is being trained as you are reading this
- Support Korean
- Build multithreaded C++ version to fully utilized both CPU and GPU
- Integrate into PatchyVideo so people can use it online to enjoy Touhou videos made by people speaking another language