Add new entries to model registry #141

Gnurro · 2024-12-11T13:48:57Z

Added models: EuroLLM-9B-Instruct, QwQ-32B-Preview, aya-expanse-32b, Teuken-7B-instruct-research-v0.4 and Teuken-7B-instruct-commercial-v0.4
Teuken models may not work with the default huggingface backend without modification as they use extensive custom transformers code and require trust_remote_code=True.

…instruct-research-v0.4 and Teuken-7B-instruct-commercial-v0.4 entries to the model registry

sherzod-hakimov · 2024-12-11T13:51:33Z

Please then test whether these models run one experiment of any game and exclude the ones that don't run.

Gnurro · 2024-12-12T19:05:33Z

The Teuken models can be run as-is, but require a manual input in terminal each time they are loaded, which happens once for each clemgame. This makes batch-running the entire benchmark a lot more complicated, as the prompt to run remote code has a rather short time limit before it simply fails to load the model. This could be handled by adding handling of trust_remote_code in the HF backend code and model registry, as I've already written here: https://github.com/Gnurro/clembench/blob/hf_trust_remote_code/backends/huggingface_local_api.py
We've so far refrained from running models that require this, as is customary in benchmarking LLMs, so I'll remove the Teuken model entries.

…cial-v0.4 entries from the model registry due to them requiring custom code to run.

Add EuroLLM-9B-Instruct, QwQ-32B-Preview, aya-expanse-32b, Teuken-7B-…

b33d017

…instruct-research-v0.4 and Teuken-7B-instruct-commercial-v0.4 entries to the model registry

Remove Teuken-7B-instruct-research-v0.4 and Teuken-7B-instruct-commer…

b4ae140

…cial-v0.4 entries from the model registry due to them requiring custom code to run.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add new entries to model registry #141

Add new entries to model registry #141

Gnurro commented Dec 11, 2024

sherzod-hakimov commented Dec 11, 2024

Gnurro commented Dec 12, 2024

Add new entries to model registry #141

Are you sure you want to change the base?

Add new entries to model registry #141

Conversation

Gnurro commented Dec 11, 2024

sherzod-hakimov commented Dec 11, 2024

Gnurro commented Dec 12, 2024