-
A project developed for Stranger Hacks hackathon
-
Tagline:
Old questions solution
* Enter url to old exam questions
* It will give you questions and corresponding answers
- Install Requirements:
pip install -r requirements.txt
- Install
tesseract-ocr
- Get OpenAI API Key from here and put it in file
.env
as:
# .env
OPENAI_API_KEY="<API-KEY>"
- Run
python main.py
* home: http://127.0.0.1:8001
* mannual questions: http://127.0.0.1:8001/mannual
* Download pdf from url given
* Convert pdf to images
* Extract text from images using `pytesseract`
* Extract questions from `pytesseract` output using `openai-api`
* Extract answers of questions output using `openai-api`
* Store Answers and put link to question_answers in home page
* openAI API Rate Limit: `RateLimitError: You exceeded your current quota, please check your plan and billing details.`
* `qa.json` format
* using random_id to allow multiple qa with same title
* todo: store to database
{
"random_id":
[
{
"url":<url>,
"title":<title>",
"qa": [
{
"question": "question1",
"answer" : "answer1"
},
{
"question": "question2",
"answer": "answer2"
},
},
{
"url":"<url2>",
},
],
"random_id2":
[
{
"url":"<url>",
"title":"<title>",
"qa": [
{
"question": "question1",
"answer" : "answer1"
},
{
"question": "question2",
"answer": "answer2"
},
},
{
"url":"<url2>",
},
],
],
}
- use database instead of json file for storing data
- option to upload images and pdfs files
- save as soon as each question is extracted or answered
- buy openai api key to remove rate limit