-
Notifications
You must be signed in to change notification settings - Fork 402
Open
Description
would it be possible to add support to run gguf moddles like this?
https://huggingface.co/TheBloke/Kunoichi-7B-GGUF
I have been testing, and simply changeing the file extention from .gguf to .bin loads it ok.
but having support for the .gguf extention would help.
image I am using:
ghcr.io/serge-chat/serge:main
debian unstable with all updates installed.
docker docker compose installed and up-to-date.
using chromium from the debian repo.
thank you for considdering my request.
Checklist
- Modify
docker-compose.yml
✓ f408cac Edit - Running GitHub Actions for
docker-compose.yml
✓ Edit - Modify
charts/serge/values.yaml
✓ 105f0fd Edit - Running GitHub Actions for
charts/serge/values.yaml
✓ Edit - Create
scripts/model_loader.py
✓ eadd93c Edit - Running GitHub Actions for
scripts/model_loader.py
✓ Edit - Modify
README.md
✓ 761e0dc Edit - Running GitHub Actions for
README.md
✓ Edit
sweep-aisweep-ai