Problem
If GPU acceleration is enabled, Jan appears to follow an "all or nothing" strategy: if there is not enough VRAM to hold the model, loading fails completely.
Success Criteria
A much better approach would be graceful degradation: if the model cannot fit into VRAM, activate it on the CPU instead, perhaps with a UI warning notifying the user of what happened. That way the model would still respond, even if more slowly, and small models could be GPU-accelerated while larger ones remain usable.
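As a rough illustration of what the fallback could look like, here is a minimal TypeScript sketch. It assumes a hypothetical loader API: `loadModel`, `nGpuLayers`, and `notifyUser` are illustrative names, not Jan's actual functions or settings.

```ts
// Hypothetical loader API for illustration only; these are not Jan's
// real functions or option names.
declare function loadModel(opts: { modelPath: string; nGpuLayers: number }): Promise<void>;
declare function notifyUser(message: string): void;

async function loadWithCpuFallback(modelPath: string): Promise<void> {
  try {
    // First attempt: offload all layers to the GPU (-1 = "all" in this sketch).
    await loadModel({ modelPath, nGpuLayers: -1 });
  } catch {
    // GPU load failed (e.g. out of VRAM): warn the user and retry on CPU.
    notifyUser("Not enough VRAM for GPU acceleration; running on CPU instead.");
    await loadModel({ modelPath, nGpuLayers: 0 });
  }
}
```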
An ideal approach would be to implement partial model offloading: estimate how many layers can safely be offloaded to VRAM, so the model is accelerated as much as the available hardware allows.
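One plausible heuristic for that estimate is to divide the free VRAM by the approximate per-layer size of the model. The sketch below is an assumption-laden starting point, not a real implementation; `estimateGpuLayers` and the 20% headroom figure are made up for illustration, and a production version would also need to account for the KV cache, context length, and per-backend overhead.

```ts
// Crude estimate of how many layers fit in VRAM. The names and the 20%
// headroom factor are assumptions for illustration only.
function estimateGpuLayers(
  modelBytes: number,    // size of the model weights on disk
  layerCount: number,    // number of transformer layers in the model
  freeVramBytes: number  // free VRAM reported by the driver
): number {
  const bytesPerLayer = modelBytes / layerCount;
  const usable = freeVramBytes * 0.8; // keep ~20% headroom for buffers
  const layers = Math.floor(usable / bytesPerLayer);
  return Math.max(0, Math.min(layers, layerCount));
}

// Example: a ~4 GB quantized model with 32 layers and 3 GB of free VRAM
// gives Math.floor((3 * 0.8) / (4 / 32)) = 19 layers offloaded.
```

The two sketches compose naturally: the loader could first try the estimated layer count, then fall back to CPU-only if even that fails.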
Additional context
I think LMStudio and GPT4All implement partial model offloading, so it is clearly feasible. However, they simply put a slider in the UI and leave it to the user to figure out how many layers fit into VRAM.