Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

does dcgm on the physical host support capturing gpu metrics of kvm virtual machines? #201

Open
lddlww opened this issue Nov 18, 2024 · 1 comment

Comments

@lddlww
Copy link

lddlww commented Nov 18, 2024

when there are many virtual machines installed on the physical host machine. These virtual machines use GPU through graphics card pass-through . I don’t want to install dcgm-exporter on each virtual machine; if I install dcgm-exporter on the host machine, can it capture all GPU metrics of virtual machine? if not,how can i do to solve it?

best wishes

@nikkon-dev
Copy link
Collaborator

@lddlww,

In passthrough mode, the GPU is not visible to the host OS. Therefore, the nv-hostengine (the server-side component of DCGM) must be run within the VM to which the GPU is attached.

The nv-hostengine can be configured to listen on a TCP port, allowing the DCGM exporter on the host to connect to it (provided the TCP port is exposed from the VM). However, this approach has limitations: the DCGM exporter does not currently support connecting to multiple nv-hostengine instances.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants