Skip to content

Latest commit

 

History

History
22 lines (13 loc) · 1.12 KB

index.md

File metadata and controls

22 lines (13 loc) · 1.12 KB

daiR: OCR with Google Document AI in R

CRAN status

daiR is an R package for Google Document AI, a powerful server-based OCR processor. The package provides a wrapper for the Document AI API and comes with additional tools for output file parsing and text reconstruction.

Requirements

Google Document AI is a paid service that requires a Google Cloud account and a Google Storage bucket. I recommend using Mark Edmondson's googleCloudStorageR package in combination with daiR. See vignettes for more on authentication and setup.

Installation

daiR is not yet on CRAN, but you can install the latest development version from Github:

devtools::install_github("hegghammer/daiR")