Screenshot Text Extractor & Conversational AI Assistant

Overview

This Python-based application combines Optical Character Recognition (OCR) with a conversational AI assistant, designed to enhance productivity and streamline workflows. The program automatically detects screenshots, extracts text, and interacts with the user by explaining the text and generating relevant questions. Additionally, users can engage in dynamic chat-based conversations with the AI. The program runs efficiently in the background, with a user-friendly interface for seamless interaction.

Features

Automatic Text Extraction: Captures and extracts text from screenshots in real-time using Tesseract OCR.
AI-Powered Explanations: The extracted text is automatically sent to an AI model (via Ollama API), which provides detailed explanations.
Interactive Chat Interface: Users can engage with the AI through a chat interface that supports conversational inputs and generates intelligent responses.
System Tray Integration: The app minimizes to the system tray, allowing it to run unobtrusively in the background.

Key Components

Tesseract OCR: Integrated with Tesseract for accurate text extraction from images or screenshots.
Ollama API: Utilizes the Mistral model to handle dynamic, context-aware conversations, generating responses based on user input and extracted text.
Tkinter & CustomTkinter: Provides a smooth, modern graphical user interface (GUI), offering a polished user experience.
Pystray Integration: Allows the program to minimize to the system tray, making it easily accessible without cluttering the desktop.
Multithreaded Processing: Runs text extraction and AI response generation in the background without interrupting the user experience.

How It Works

Screenshot Detection: The program monitors the clipboard for screenshots.
Text Extraction: Once a screenshot is detected, the program extracts the text using Tesseract OCR.
AI Interaction: The extracted text is automatically sent to the AI, which provides explanations and optionally generates relevant questions.
User Chat: Users can also manually chat with the AI via a clean and intuitive chat interface, allowing for real-time interaction.
Tray Integration: The app runs in the background and can be accessed from the system tray at any time.

Installation & Setup

Install the required dependencies:

pip install pytesseract Pillow pystray customtkinter ollama

Ensure Tesseract-OCR is installed on your system. Set the correct path to the Tesseract executable in the script:
```
pytesseract.pytesseract.tesseract_cmd = r'C:\Program Files\Tesseract-OCR\tesseract.exe'
```
Run the application:
```
python ArtificialTutor.py
```

How to use ?

Steps to Use the Application

Launch the Program: Run the script or double-click the executable file.
Minimize to Tray: Close or minimize the window; the program will continue running in the system tray.
Activate Snipping Tool: Press Shift + Win + S, the screenshots are stored on clipboard
Clipboard Setup: Ensure clipboard history is enabled (Win + V) for automatic text extraction.
Capture Text: Drag over the text you want the AI to process; .
Automatic Text Processing: The program will extract the text and display it, sending it to the AI for an explanation.
AI Response: The AI will provide explanations and generate questions based on the extracted text.
System Tray Access: Click the tray icon to restore the program when needed.

Future Enhancements

Non-Text Image Recognition: Expand the AI’s capabilities to "See" beyond text.
Additional Language Support: Extend the OCR and AI capabilities to handle multiple languages.
Mathematical OCR*: Being able to understand Mathematical expressions and equations and the ability to display it on screen.
Handwritting Recognition

License

This project is licensed under the MIT License, with additional conditions.

You are free to:

Use the software for personal or educational purposes.
Modify and adapt the code for personal projects.

However, you may not:

Redistribute this software, either modified or unmodified, without explicit written permission from the original author.
Use this software, or any derivative works, for commercial purposes without explicit written permission from the original author.

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
icons		icons
ArtificialTutor.py		ArtificialTutor.py
LICENSE		LICENSE
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Screenshot Text Extractor & Conversational AI Assistant

Overview

Features

Key Components

How It Works

Installation & Setup

How to use ?

Steps to Use the Application

Future Enhancements

License

About

Releases 1

Packages

Languages

License

Rii-San/Screenshot-Text-Extractor-Conversational-AI-Assistant

Folders and files

Latest commit

History

Repository files navigation

Screenshot Text Extractor & Conversational AI Assistant

Overview

Features

Key Components

How It Works

Installation & Setup

How to use ?

Steps to Use the Application

Future Enhancements

License

About

Resources

License

Stars

Watchers

Forks

Releases 1

Packages 0

Languages

Packages