You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
The main model gets split across as expected across the P40s and the draft model on the 3090. However adding -sm row the main model gets split across all 4 GPUs instead of just the P40s.
First Bad Commit
likely introduced with #10497 that introduced --device and --device-draft
Relevant log output
No response
The text was updated successfully, but these errors were encountered:
Name and Version
Operating systems
Linux
Which llama.cpp modules do you know to be affected?
llama-server
Problem description & steps to reproduce
The new
--device
flag does not work with-sm row
.Devices:
When running with this command:
The main model gets split across as expected across the P40s and the draft model on the 3090. However adding
-sm row
the main model gets split across all 4 GPUs instead of just the P40s.First Bad Commit
likely introduced with #10497 that introduced
--device
and--device-draft
Relevant log output
No response
The text was updated successfully, but these errors were encountered: