Skip to content

Cpt-Shaan/YOLO_DocSegmentation

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

52 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

YOLOv10 Document Segmentation Web-app

Overview

A streamlit app for Document segmentation into different sections. The web-app also performs OCR on the text-based annotated sections, and image-analysis on the image sections in each page of the document. OCR is performed on the text using the tesseract-ocr package. For image analysis, we have used llama-3.2-11b-vision model.

This app uses the YOLOv10x model for document segmentation to annotate various sections of a document such as text-fields, formulae, pictures, list-items,etc. The model uses pretrained weights which may be dowloaded using this colab notebook.

Link for deployed web-application using streamlit.

Website Interface and Results

Annotations on document

Sample Document for text extraction and image analysis

Text extracted and image analysis results being displayed on the website

About

Document segmentation into different sections

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published