YOLOv10 Document Segmentation Web-app

Overview

A streamlit app for Document segmentation into different sections. The web-app also performs OCR on the text-based annotated sections, and image-analysis on the image sections in each page of the document. OCR is performed on the text using the tesseract-ocr package. For image analysis, we have used llama-3.2-11b-vision model.

This app uses the YOLOv10x model for document segmentation to annotate various sections of a document such as text-fields, formulae, pictures, list-items,etc. The model uses pretrained weights which may be dowloaded using this colab notebook.

Link for deployed web-application using streamlit.

Name		Name	Last commit message	Last commit date
Latest commit History 52 Commits
thumbnails		thumbnails
Doc_analysis.ipynb		Doc_analysis.ipynb
LICENSE		LICENSE
README.md		README.md
app.py		app.py
packages.txt		packages.txt
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

YOLOv10 Document Segmentation Web-app

Overview

Website Interface and Results

Annotations on document

Sample Document for text extraction and image analysis

Text extracted and image analysis results being displayed on the website

About

Releases

Packages

Languages

License

Cpt-Shaan/YOLO_DocSegmentation

Folders and files

Latest commit

History

Repository files navigation

YOLOv10 Document Segmentation Web-app

Overview

Website Interface and Results

Annotations on document

Sample Document for text extraction and image analysis

Text extracted and image analysis results being displayed on the website

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages