Pipeline tries to download all the possible weights even when the dtype is specified: utils/hub.js could not locate file #941

DavidGOrtega · 2024-09-21T09:54:58Z

System Info

transformers v2.17.2
node v18.20.3

Environment/Platform

Description

For some reason, the pipeline still tries to download the other weights even when the dtype is specified. This behavior has some issues:

It is a waste of space and bandwidth (especially for HF).
It prevents you from using repos that have not converted their models to other dtypes, like this one.

file:///***/node_modules/@xenova/transformers/src/utils/hub.js:238
    throw Error(`${message}: "${remoteURL}".`);
          ^

Error: Could not locate file: "https://huggingface.co/intfloat/multilingual-e5-large/resolve/main/onnx/model_quantized.onnx".
    at handleError (file:///***/node_modules/@xenova/transformers/src/utils/hub.js:238:11)
    at getModelFile (file:///***/node_modules/@xenova/transformers/src/utils/hub.js:471:24)
    at process.processTicksAndRejections (node:internal/process/task_queues:95:5)
    at async constructSession (file:///***/node_modules/@xenova/transformers/src/models.js:123:18)
    at async Promise.all (index 1)
    at async XLMRobertaModel.from_pretrained (file:///***/node_modules/@xenova/transformers/src/models.js:793:20)
    at async AutoModel.from_pretrained (file:///***/node_modules/@xenova/transformers/src/models.js:5519:20)
    at async Promise.all (index 1)
    at async loadItems (file:///***/node_modules/@xenova/transformers/src/pipelines.js:3279:5)
    at async pipeline (file:///***/node_modules/@xenova/transformers/src/pipelines.js:3219:21)

Reproduction

import { pipeline, env } from '@xenova/transformers';

const run = async () => {
  const model = 'intfloat/multilingual-e5-large';
  const extractor = await pipeline('feature-extraction', model, { pooling: 'mean', normalize: true,  dtype: 'fp32' });

  const texts = ['Hello world.', 'Example sentence.'];
  const embeddings = await extractor(texts);
  console.log(embeddings, model);
}

run();

xenova · 2024-09-22T01:04:52Z

Hi there 👋 Please note that the dtype option is only available in @huggingface/transformers (Transformers.js v3), not in @xenova/transformers.
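
To make the reply above concrete, here is a minimal sketch of why the stack trace asks for `model_quantized.onnx`. It assumes that v2 selects the weight file from its `quantized` option (which defaults to `true`) and ignores the unknown `dtype` key. `onnxPathV2` is a hypothetical helper written for illustration; it is not a function exported by @xenova/transformers.

```javascript
// Illustrative only: a simplified model of how the v2 loader picks a weight file.
// `onnxPathV2` is a hypothetical name, not part of the library's API.
function onnxPathV2(options = {}) {
  // v2 has no `dtype` option, so a `dtype` key has no effect here.
  // `quantized` defaults to true, which selects the `_quantized` file.
  const quantized = options.quantized ?? true;
  return `onnx/model${quantized ? '_quantized' : ''}.onnx`;
}

console.log(onnxPathV2({ dtype: 'fp32' }));    // onnx/model_quantized.onnx
console.log(onnxPathV2({ quantized: false })); // onnx/model.onnx
```

Under that assumption, passing `quantized: false` in the pipeline options on v2 should request `onnx/model.onnx` instead. Whether this particular repo then loads correctly is not confirmed in this thread.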

tarikakyol · 2024-10-04T06:29:46Z

Thanks for the hard work! I spent a few days before I figured out that there's a newer version of Transformers.js at @huggingface/transformers. It might just be me being slow, but I think this should be a bit more explicit.
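
For readers who land here with the same confusion, the reproduction might look roughly like this once ported to the newer package. This is a sketch, assuming `@huggingface/transformers` (v3) is installed. The `embed` wrapper is a hypothetical name, and `pooling` and `normalize` are passed when calling the extractor rather than to `pipeline()`. Whether this specific model loads is not confirmed in this thread.

```javascript
// Sketch of the reproduction ported to Transformers.js v3.
// `embed` is a hypothetical wrapper, not part of the library.
async function embed(texts, model = 'intfloat/multilingual-e5-large') {
  // Loaded lazily so the v3 package is only required when embed() is called.
  const { pipeline } = await import('@huggingface/transformers');

  // `dtype` is a v3 option; 'fp32' asks for the unquantized weights.
  const extractor = await pipeline('feature-extraction', model, { dtype: 'fp32' });

  // Pooling and normalization are options of the call, not of pipeline().
  return extractor(texts, { pooling: 'mean', normalize: true });
}
```

Calling `await embed(['Hello world.', 'Example sentence.'])` downloads the model on first use, so it needs network access.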

DavidGOrtega added the bug label Sep 21, 2024

DavidGOrtega mentioned this issue Sep 21, 2024

"intfloat / multilingual-e5-large" model not working in Node.js #938
