Skip to content

DARPA-ASKEM/document_intelligence

Repository files navigation

Scientific Paper OCR Extraction

Build and Publish

Overview

This repository provides a service for Optical Character Recognition (OCR) extraction of tables, equations, and general content from scientific papers. It is designed to process PDFs and extract key scientific components, making it useful for tasks such as data analysis, research, and document automation in academic or industrial contexts.

Contributing

Contributions are welcome! If you want to contribute:

  1. Fork the repository.
  2. Create a feature branch (git checkout -b feature/your-feature).
  3. Commit your changes (git commit -m 'Add new feature').
  4. Push to the branch (git push origin feature/your-feature).
  5. Open a pull request.

License

This project is licensed under the Apache License 2.0.

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published