Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Helm Chart Lacks Clear Support for Multi-Node vLLM Deployment #50

Open
shohamyamin opened this issue Jan 31, 2025 · 2 comments
Open

Helm Chart Lacks Clear Support for Multi-Node vLLM Deployment #50

shohamyamin opened this issue Jan 31, 2025 · 2 comments

Comments

@shohamyamin
Copy link

The current Helm chart does not explicitly support deploying vLLM across multiple vllm nodes on Kubernetes, or it's unclear how to configure it. Improved documentation or multi-node support is needed for deploying LLM that require multi-node

@ahg-g
Copy link

ahg-g commented Jan 31, 2025

For reference, and if someone would like to add that to helm charts, the LeaderWorkerSet API can be used to deploy multi-node vllm on k8s: https://docs.vllm.ai/en/latest/deployment/frameworks/lws.html

Also check examples on the LWS repo: https://github.com/kubernetes-sigs/lws/tree/main/docs/examples/vllm

@YuhanLiu11
Copy link
Collaborator

Thanks for submitting the issue!

Multi-node deployment should be supported. If you encounter any issues running a multi-node deployment, feel free to let us know.

We will improve the documentation to clarify how to configure multi-node deployment on Kubernetes.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants