-
Notifications
You must be signed in to change notification settings - Fork 76
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Feature]: Support Deepseek-r1 671B #809
Comments
@xuechendi please check this. |
@Bihan Please run the following and paste the output below. wget https://raw.githubusercontent.com/vllm-project/vllm/main/collect_env.py For security purposes, please feel free to check the contents of collect_env.py before running it.python collect_env.py |
@Bihan , are you using "https://github.com/HabanaAI/vllm-fork/tree/deepseek_r1"? For easy test, you can follow instruction as below: <style> </style>
|
Below is the error I got
Also I had to Is there a way to use it as we do normally with vLLM like |
🚀 The feature, motivation and pitch
8xGaudi2 with 768GB HBM can support Deepseek-r1 671B. The model weights are successfully loaded using vllm habana fork but cannot deploy due to below mentioned error.
Alternatives
No response
Additional context
When trying to serve:
MODEL_ID=deepseek-ai/DeepSeek-R1
vllm serve $MODEL_ID --tensor-parallel-size 8
Before submitting a new issue...
The text was updated successfully, but these errors were encountered: