Skip to content

Commit

Permalink
Add UQFF quant for Mistral Nemo 2407
Browse files Browse the repository at this point in the history
  • Loading branch information
EricLBuehler committed Oct 2, 2024
1 parent ced6fc0 commit c8a9e88
Show file tree
Hide file tree
Showing 2 changed files with 2 additions and 1 deletion.
2 changes: 1 addition & 1 deletion README.md
Original file line number Diff line number Diff line change
Expand Up @@ -105,7 +105,7 @@ Mistal.rs supports several model categories:
- [PagedAttention](docs/PAGED_ATTENTION.md) and continuous batching
- Prefix caching
- [Topology](docs/TOPOLOGY.md): Configure ISQ and device mapping easily
- [UQFF](docs/UQFF.md): The uniquely powerful quantized file format
- [UQFF](docs/UQFF.md): Quantized file format for easy mixing of quants, see some [models](docs/UQFF.md#list-of-models) which have already been converted.
- Speculative Decoding: Mix supported models as the draft model or the target model
- Dynamic LoRA adapter activation with adapter preloading: [examples and docs](docs/ADAPTER_MODELS.md#adapter-model-dynamic-adapter-activation)

Expand Down
1 change: 1 addition & 0 deletions docs/UQFF.md
Original file line number Diff line number Diff line change
Expand Up @@ -176,3 +176,4 @@ Have you created a UQFF model on Hugging Face? If so, please [create an issue](h
| -- | -- | -- |
| Phi 3.5 Mini Instruct | microsoft/Phi-3.5-mini-instruct | [EricB/Phi-3.5-mini-instruct-UQFF](EricB/Phi-3.5-mini-instruct-UQFF) |
| Llama 3.2 Vision | meta-llama/Llama-3.2-11B-Vision-Instruct | [EricB/Llama-3.2-11B-Vision-Instruct-UQFF](https://huggingface.co/EricB/Llama-3.2-11B-Vision-Instruct-UQFF) |
| Mistral Nemo 2407 | mistralai/Mistral-Nemo-Instruct-2407 | [EricB/Mistral-Nemo-Instruct-2407-UQFF](https://huggingface.co/EricB/Mistral-Nemo-Instruct-2407-UQFF) |

0 comments on commit c8a9e88

Please sign in to comment.