Feature Request: Support for Google OCR Integration in Docling #661

Open
BushrHaddad opened this issue Dec 30, 2024 · 1 comment · May be fixed by #662
Labels
enhancement New feature or request

Comments

@BushrHaddad

Feature Request: Add Support for Google OCR

Description
We propose adding a new OCR option, Google OCR, to Docling. It would improve text extraction from text-dense images and support a broader range of languages, including those that the existing OCR options do not cover.

User Need
This feature is aimed at addressing the following user needs:

  • Accurate Text Extraction from Dense Images: Google OCR excels at extracting text from complex image layouts, such as scanned documents and images with a high density of text.
  • Broader Language Support: Google OCR supports many languages, including those not currently handled by the existing OCR options in Docling. This will expand the tool's applicability to a more diverse set of users and use cases.

Requested Feature

Integration of Google OCR:

  • Leverage the Google Vision API for text extraction.
  • Allow users to configure their Google credentials and OCR preferences through the existing options system.
  • Add support for specifying languages using language_hints provided by the Google OCR API.
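
To make the request concrete, here is a minimal sketch of how user options could map onto a Vision API call. It only builds the JSON body for the `images:annotate` REST method and sends nothing over the network. The `GoogleOcrOptions` class, its fields, and `build_annotate_request` are hypothetical names used for illustration; they are not part of Docling's options system or of the linked pull request.

```python
import base64
import json
from dataclasses import dataclass, field
from typing import List, Optional


# Hypothetical options class: the name and fields are illustrative only
# and are NOT part of Docling's actual options system.
@dataclass
class GoogleOcrOptions:
    # Assumed credential mechanism: path to a service-account JSON key file.
    credentials_path: Optional[str] = None
    # Language codes forwarded to the API as language hints.
    language_hints: List[str] = field(default_factory=list)
    # DOCUMENT_TEXT_DETECTION targets dense text; TEXT_DETECTION targets sparse text.
    dense_text: bool = True


def build_annotate_request(image_bytes: bytes, options: GoogleOcrOptions) -> dict:
    """Build the JSON body for the Vision API `images:annotate` REST method."""
    feature_type = "DOCUMENT_TEXT_DETECTION" if options.dense_text else "TEXT_DETECTION"
    request = {
        # The REST API expects raw image bytes as a base64-encoded string.
        "image": {"content": base64.b64encode(image_bytes).decode("ascii")},
        "features": [{"type": feature_type}],
    }
    # Only attach an image context when the user actually supplied hints.
    if options.language_hints:
        request["imageContext"] = {"languageHints": list(options.language_hints)}
    return {"requests": [request]}


if __name__ == "__main__":
    opts = GoogleOcrOptions(language_hints=["ar", "en"])
    body = build_annotate_request(b"placeholder image bytes", opts)
    print(json.dumps(body, indent=2))
```

Keeping the request construction separate from credentials and transport would let the language and feature settings be tested without a Google account. A real integration would more likely use the official client library than hand-built JSON.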
@BushrHaddad BushrHaddad added the enhancement New feature or request label Dec 30, 2024
@BushrHaddad BushrHaddad linked a pull request Dec 30, 2024 that will close this issue
@wjkoh

wjkoh commented Jan 5, 2025

Awesome! I hope the PR will be reviewed soon.
