This repo provides a user-friendly web interface for interacting with the Llama-3.2-11B-Vision model, which generates text responses from image and text prompts.
Get a Hugging Face Token
- Sign up for a Hugging Face account at huggingface.co.
- Create an access token in your account settings; you may also need to request access to the Llama-3.2-11B-Vision model, as it is gated.
Project Setup
- Clone the repository:
git clone https://github.com/spacewalk01/llama3.2-vision-webui.git
cd llama3.2-vision-webui
- Install dependencies:
pip install -r requirements.txt
Run the Application
- Start the Gradio interface by running:
python main.py --token Your_Hugging_Face_Token
- Open the local URL printed in the terminal to upload images, enter text prompts, and view the Llama 3.2 Vision model's responses.
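The `--token` flag shown above is presumably used to authenticate with Hugging Face before the model weights are downloaded. As a rough illustration, such argument handling might look like the sketch below; the flag name comes from the command above, but the `HF_TOKEN` environment-variable fallback and the function name are assumptions, not the repo's actual code:

```python
import argparse
import os

def parse_args(argv=None):
    """Parse the command-line options that main.py appears to accept.

    Illustrative reconstruction only; check the repository's actual
    main.py for the real option set.
    """
    parser = argparse.ArgumentParser(description="Llama 3.2 Vision web UI")
    parser.add_argument(
        "--token",
        default=os.environ.get("HF_TOKEN"),  # assumed fallback, not confirmed
        help="Hugging Face access token used to download the model",
    )
    return parser.parse_args(argv)

# The parsed token would then typically be passed to
# huggingface_hub.login(token=...) or to from_pretrained(..., token=...)
# before the Gradio interface starts.
```

A token passed this way stays out of the source code, and the environment-variable fallback lets you avoid typing it on the command line at all.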
License
This project is licensed under the MIT License. See the LICENSE file for details.