CPU vs GPU #100
Is there an option to change this platform to use CUDA or ROCm instead of CPU?

Comments
For now, no. I'm waiting too ... it is far too slow on CPU.
Yeah, CUDA GPU support would be great. That, and leaving the model in VRAM for the duration of the conversation.
So... slow for everyone?
This is a llama.cpp issue, not a Serge one. It may be worth opening this issue on that repo.
llama.cpp is specifically intended to be a CPU-only implementation. They've said before that they won't support GPU. Someone would have to submit a patch so good that they're willing to accept it anyway, or else I suppose this project would have to find a similar, but GPU-based, version of it.
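For anyone who still wants to experiment with GPU offload, the separate llama-cpp-python bindings expose a CUDA path in later builds. A minimal sketch, assuming a CUDA-enabled build of llama-cpp-python and a local model file (the path and layer count below are placeholders, and this is not something Serge supports out of the box):

```python
from llama_cpp import Llama

# n_gpu_layers > 0 offloads that many transformer layers to VRAM,
# which also keeps those weights resident between prompts — the
# "leave the model in VRAM for the conversation" request above.
llm = Llama(
    model_path="./models/7B/model.bin",  # placeholder path
    n_gpu_layers=32,                     # tune to fit your VRAM; 0 = pure CPU
)

out = llm("Q: What runs faster on a GPU? A:", max_tokens=32)
print(out["choices"][0]["text"])
```

Note that the layers only actually land on the GPU if the library was compiled with CUDA/cuBLAS support; a stock CPU-only wheel will, as far as I know, simply fall back to running everything on the CPU.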