This Python project lets you chat with a chatbot about a PDF you upload and generate a PDF transcript of the conversation. The project is built with Python and the Streamlit framework.
This is a fork of https://github.com/sudan94/chat-pdf-hugginface. I had issues installing from requirements.txt, so I removed the '==version' pins in requirements.txt with:
sed -i 's/\=.*//g' requirements.txt
Then, I added some modules that are needed for app.py that were not in requirements.txt.
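For readers who want to see what the sed command above does to each line, here is a minimal Python sketch of the same substitution. The function name strip_version_pins and the sample package names are hypothetical and used only for illustration; they are not taken from this repository's requirements.txt.

```python
import re

def strip_version_pins(lines):
    """Drop everything from the first '=' to the end of each line,
    mirroring the sed substitution shown above."""
    return [re.sub(r"=.*", "", line) for line in lines]

# Hypothetical entries, for illustration only.
pinned = ["streamlit==1.25.0", "PyPDF2==3.0.1", "langchain"]
print(strip_version_pins(pinned))
```

Note that the pattern removes text starting at the first '=', so a specifier such as pkg>=1.0 would be left as pkg>, which is not a valid requirement. This is fine for files that only use '==' pins.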
A conda environment is used instead of a venv virtual environment.
To run this project, please follow the steps below:
- Clone the repository:
git clone git@github.com:MattLondon101/chat-pdf-hugginface.git
cd chat-pdf-hugginface
- Create and activate a conda virtual environment (optional but recommended):
conda create -n env1 python=3.10
conda activate env1
- Install the dependencies from the requirements.txt file:
pip install -r requirements.txt
- You will need a HUGGINGFACEHUB_API_TOKEN for this next step. To obtain one for free, go to https://huggingface.co/ and sign up for a free account. Then go to Settings > Access Tokens and create a new token. Next, create a file in this directory named .env and enter HUGGINGFACEHUB_API_TOKEN = "token", replacing token with your User Access Token. Save the .env file. The .gitignore file keeps .env out of git operations.
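To illustrate how a token stored this way can be read back, here is a stdlib-only sketch. It is an assumption about the general approach, not a description of what app.py does; many projects use the python-dotenv package (load_dotenv) for this instead. The helper names parse_env and load_token are hypothetical.

```python
import os
from pathlib import Path

def parse_env(text):
    """Parse simple KEY = "value" lines into a dict, skipping blanks and comments."""
    values = {}
    for raw in text.splitlines():
        line = raw.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        # Trim whitespace and surrounding quotes from the value.
        values[key.strip()] = value.strip().strip("\"'")
    return values

def load_token(path=".env", name="HUGGINGFACEHUB_API_TOKEN"):
    """Return the token from the process environment or the .env file, else None."""
    if os.environ.get(name):
        return os.environ[name]
    env_file = Path(path)
    if env_file.exists():
        return parse_env(env_file.read_text()).get(name)
    return None

if __name__ == "__main__":
    token = load_token()
    print("Token found." if token else "No token found; check your .env file.")
```

Running this from the project directory gives a quick way to confirm that the .env file is in place before starting the app.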
Once you have installed the required dependencies, you can run the project using Streamlit, which should have been installed with requirements.txt. Streamlit provides an easy way to create interactive web applications in Python.
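If you are unsure whether the installation step picked up everything, a small check like the following may help. It is only a sketch: the function name missing_modules is hypothetical, and the list contains just streamlit because that is the only module named here; extend it with whatever app.py imports.

```python
import importlib.util

def missing_modules(names):
    """Return the top-level module names that cannot be found in this environment."""
    return [name for name in names if importlib.util.find_spec(name) is None]

# Only "streamlit" is named in this README; add other modules as needed.
required = ["streamlit"]
missing = missing_modules(required)
if missing:
    print("Missing modules:", ", ".join(missing))
else:
    print("All listed modules are importable.")
```

Run it inside the activated conda environment so that it inspects the same interpreter that will run the app.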
To start the application, run the following command:
streamlit run app.py
This will start the Streamlit server and open the application in your default web browser.
.gitit.sh can be used to push updates to the remote repo easily.
Make it executable with:
chmod +x .gitit.sh
Then, to add, commit, and push to remote repo:
./gitit.sh
This project is licensed under the MIT License.