Running a model on multiple GPUs? How to do it? Does private-gpt allow it easily? #1999
martinenkoEduard started this conversation in General
How do you run a model on multiple GPUs? Can you show a simple example?
What are the restrictions? Do the GPUs need to be identical, or is it possible to mix, for instance, one RTX 3070 and one RTX 3080? How is memory shared across the cards?
Do Mistral and Llama support these features?
Also, can you share a rig configuration with multiple GPUs for local LLM deployment?
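For context on mixing unequal cards: llama.cpp-based backends (which private-gpt can use) expose a `--tensor-split` option that takes per-GPU proportions, so a 3070 (8 GB) and a 3080 (10 GB) can each receive a share of the model layers proportional to their VRAM. A minimal sketch of computing such a split (the helper function name is my own, not part of any library):

```python
# Sketch: proportional tensor split for heterogeneous GPUs.
# Assumption: a llama.cpp-style backend accepts per-GPU proportions
# (e.g. via --tensor-split 0.44,0.56); exact flag handling may differ.
def tensor_split(vram_gb):
    """Return per-GPU fractions proportional to each card's VRAM."""
    total = sum(vram_gb)
    return [round(v / total, 2) for v in vram_gb]

# RTX 3070 (8 GB) + RTX 3080 (10 GB)
split = tensor_split([8, 10])
print(",".join(str(s) for s in split))  # prints "0.44,0.56"
```

The resulting string could then be passed to the backend's tensor-split setting, keeping the larger card responsible for more of the model.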