When running inference on CPU I do get output, but when running on GPU the script runs for over an hour at 100% GPU utilization without ever producing a result. Code below:
```python
from transformers import AutoModelForCausalLM, AutoTokenizer

model_path = "./model"
tokenizer = AutoTokenizer.from_pretrained(model_path)
model = AutoModelForCausalLM.from_pretrained(model_path, device_map="cuda:0")

text = ["What's the function of Aspirin?"]
inputs = tokenizer(text, truncation=True, return_tensors="pt").to("cuda:0")
output = model.generate(inputs=inputs.input_ids, max_new_tokens=128, early_stopping=True)
print(tokenizer.decode(output[0]))
```
I'm hitting the same problem. Have you managed to solve it?