Support Mixtral #365

WebCloud · 2024-04-10T12:57:57Z

It seems that Bumblebee cannot load Mixtral-8x7B models (base or instruct). I've checked the files, and in theory it should be able to load the model since it can load Mistral-7B files, but I keep getting:

** (ArgumentError) could not match the class name "MixtralForCausalLM" to any of the supported models, please specify the :module and :architecture options
    (bumblebee 0.5.3) lib/bumblebee.ex:409: Bumblebee.do_load_spec/4
    (bumblebee 0.5.3) lib/bumblebee.ex:578: Bumblebee.maybe_load_model_spec/3
    (bumblebee 0.5.3) lib/bumblebee.ex:566: Bumblebee.load_model/2
    #cell:4lsfbdujnkoinq5p:3: (file)

Both configuration files have the same reference to MixtralForCausalLM, and Mixtral-8x7B has the safetensors files.

Mix.install(
  [
    {:bumblebee, "~> 0.5.3"},
    {:exla, ">= 0.0.0"}
  ],
  config: [nx: [default_backend: EXLA.Backend]]
)

repo = {:hf, "mistralai/Mixtral-8x7B-Instruct-v0.1"}

{:ok, model_info} = Bumblebee.load_model(repo, type: :bf16, backend: EXLA.Backend) # errors
{:ok, tokenizer} = Bumblebee.load_tokenizer(repo)
{:ok, generation_config} = Bumblebee.load_generation_config(repo)


jonatanklosko · 2024-04-10T13:25:10Z

@WebCloud Bumblebee needs implementation for each model type in order to load it. Mixtral is not implemented currently, while Mistral is.
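
To illustrate why the error appears: the class name listed in the repository's config.json has to be matched to a model implementation. The following is a simplified sketch of that kind of lookup, not Bumblebee's actual code. The `ClassLookup` module and its table are hypothetical; only `Bumblebee.Text.Mistral` and `:for_causal_language_modeling` are names taken from the library.

```elixir
defmodule ClassLookup do
  # Hypothetical table: class name from config.json => {module, architecture}.
  # Mistral has an entry; Mixtral does not, so it cannot be resolved.
  @supported %{
    "MistralForCausalLM" => {Bumblebee.Text.Mistral, :for_causal_language_modeling}
  }

  # Returns {:ok, {module, architecture}} for a known class name,
  # or {:error, message} when no implementation is registered.
  def resolve(class_name) do
    case Map.fetch(@supported, class_name) do
      {:ok, entry} ->
        {:ok, entry}

      :error ->
        {:error,
         "could not match the class name #{inspect(class_name)} to any of the supported models"}
    end
  end
end
```

In this sketch, resolving "MixtralForCausalLM" takes the error branch because no entry exists for it, which is the situation the stack trace reports.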

WebCloud · 2024-04-10T18:00:20Z

I see! Well, I am quite fresh on Elixir, but I'd be happy to help however I can.

jonatanklosko · 2024-04-10T18:43:34Z

For an example of adding a model you can see Mistral #264. The corresponding hf/transformers code is modeling_mixtral.py.

However, note that Nx/EXLA doesn't support quantization yet, and the bf16 model is around 100GB, so it is not very practical for running on the GPU at this point.
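
The size figure can be checked with a rough estimate. This sketch assumes about 46.7 billion total parameters for Mixtral-8x7B (a figure not stated in this thread) and 2 bytes per bf16 parameter. The `MemoryEstimate` module is a hypothetical helper, and the result covers the weights only, not activations or the key-value cache.

```elixir
defmodule MemoryEstimate do
  # Bytes needed to store one parameter in each numeric type.
  @bytes_per_param %{f32: 4, bf16: 2, f16: 2}

  # Approximate size of the model weights in gigabytes (10^9 bytes).
  def weights_gb(param_count, type) do
    param_count * Map.fetch!(@bytes_per_param, type) / 1.0e9
  end
end

# Assumed parameter count for Mixtral-8x7B; adjust if the real figure differs.
MemoryEstimate.weights_gb(46_700_000_000, :bf16)
```

Under these assumptions the weights alone come to roughly 93 GB, in line with the "around 100GB" estimate and well beyond the memory of a single typical GPU.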

WebCloud · 2024-04-11T07:17:13Z

Cool! Thanks for the resources!

jonatanklosko changed the title ~~Error whist trying to load hf mistralai/Mixtral-8x7B-Instruct-v0.1~~ Support Mixtral Apr 10, 2024

hwuethrich mentioned this issue May 2, 2024

Add support for Bumblebee functions? brainlid/langchain#88

Open
