This notebook can search on large and unsearchable (scanned) PDF file to find your keyword.
Supported languages: English and Persian
This notebook has been developed on Google Colab on 08 Dec 2024.
!apt-get install -y tesseract-ocr
!apt-get install -y tesseract-ocr-fas
!pip install pytesseract pdf2image Pillow
!apt-get install -y poppler-utils