Feature Request: Support for Google OCR Integration in Docling #661

BushrHaddad · 2024-12-30T10:43:25Z

Feature Request: Add Support for Google OCR

Description
We propose adding Google OCR as a new OCR option in Docling. This would improve text extraction from text-dense images and extend language coverage to languages that the existing OCR options do not support.

User Need
This feature is aimed at addressing the following user needs:

Accurate Text Extraction from Dense Images: Google OCR excels at extracting text from complex image layouts, such as scanned documents and images with a high density of text.
Broader Language Support: Google OCR supports many languages, including some not currently handled by the existing OCR options in Docling. This will expand the tool's applicability to a more diverse set of users and use cases.

Requested Feature

Integration of Google OCR:

Leverage the Google Vision API for text extraction.
Allow users to configure their Google credentials and OCR preferences through the existing options system.
Add support for specifying languages using language_hints provided by the Google OCR API.
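
To make the request concrete, here is a minimal sketch of how user options could map onto a Vision API call. It only builds the JSON body for the REST `images:annotate` endpoint and does not send it. `GoogleOcrOptions`, its fields, and `build_annotate_request` are hypothetical names used for illustration and are not part of Docling's existing options system. The `languageHints` key is the REST spelling of the `language_hints` field mentioned above.

```python
import base64
import json
from dataclasses import dataclass, field
from typing import List


@dataclass
class GoogleOcrOptions:
    """Hypothetical options object; not an actual Docling class."""

    # Language codes forwarded to the API as language hints.
    lang: List[str] = field(default_factory=lambda: ["en"])
    # Assumed field: path to a service-account JSON file (unused in this sketch).
    credentials_path: str = ""
    # Assumed flag: choose the feature type intended for dense document text.
    dense_text: bool = True


def build_annotate_request(image_bytes: bytes, options: GoogleOcrOptions) -> dict:
    """Build the JSON body for POST https://vision.googleapis.com/v1/images:annotate."""
    feature = "DOCUMENT_TEXT_DETECTION" if options.dense_text else "TEXT_DETECTION"
    return {
        "requests": [
            {
                # The REST API expects the image bytes base64-encoded.
                "image": {"content": base64.b64encode(image_bytes).decode("ascii")},
                "features": [{"type": feature}],
                "imageContext": {"languageHints": list(options.lang)},
            }
        ]
    }


if __name__ == "__main__":
    opts = GoogleOcrOptions(lang=["ar", "en"])
    body = build_annotate_request(b"fake image bytes", opts)
    print(json.dumps(body, indent=2))
```

In a real integration the credentials would be used to authorize the request, and the returned text annotations would be mapped back onto Docling's OCR cell structure. Both steps are left out here because they depend on design decisions in the linked PR.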

wjkoh · 2025-01-05T14:58:17Z

Awesome! I hope the PR will be reviewed soon.

BushrHaddad added the "enhancement" label (New feature or request) on Dec 30, 2024

BushrHaddad linked a pull request Dec 30, 2024 that will close this issue

feat: add support for google ocr #662

Open

3 tasks
