Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[enhancement]: Ability to rotate through multiple GPUs when present #7634

Open
1 task done
jbkcrash opened this issue Feb 11, 2025 · 0 comments
Open
1 task done

[enhancement]: Ability to rotate through multiple GPUs when present #7634

jbkcrash opened this issue Feb 11, 2025 · 0 comments
Labels
enhancement New feature or request

Comments

@jbkcrash
Copy link

Is there an existing issue for this?

  • I have searched the existing issues

Contact Details

[email protected]

What should this feature add?

I have noticed the enhancement request to use multi-GPU for parallel or concatenated generation like LLMs can easily do; but I offer a slightly different request still using multiple GPUs. The parallel/concatenation would be a nice feature to have if efficient, however in the meantime and possibly even in leu of distributed generation, it would be nice to use multiple GPUs in a round robin nature. This way you are distributing the load and in my case temp between the available GPUs. The same configuration options would exist for indicating the available GPUs, but add a config item that indicates you want the application to use a difference GPU with each invocation would be useful. In my case where I have three identical RTX-3060s it allow them to cool down between multiple invocations and distribute the load more evenly. That is instead of hammering on the same first GPU all the time. I don't think this conflicts with using multiple GPUs in parallel, it is sort of an alternative approach that might have more value for some. VRAM concatenation would be nice to have, don't get me wrong, but for many this new concept would be helpful as well.

Alternatives

Not much more to consider, the application just needs to select the next available GPU in the pool and use it. It would only need to keep track of the previously used id and then move on to the next in order. It could be called round-robin or something similar.

Additional Content

For example of configuration...

multi-gpu-strategy: round-robin | all

@jbkcrash jbkcrash added the enhancement New feature or request label Feb 11, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

1 participant