-
Notifications
You must be signed in to change notification settings - Fork 652
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
bug: Can openllm run on k8s clusters without GPUs? #1078
Comments
Traceback (most recent call last): |
maybe you can try the llamacpp models, but by default vllm requires GPU to be available. |
All models supported by openllm today requires Nvidia GPU or Apple silicon to run. We may add more options in the future, or you can contribute to https://github.com/bentoml/OpenLLM-models |
Describe the bug
I want to run Qwen0.5b on a k8s cluster without GPU, but the service startup has failed so far. Is there any way to support CPU machines
To reproduce
No response
Logs
No response
Environment
only have CPU
System information (Optional)
No response
The text was updated successfully, but these errors were encountered: