Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

bug: Jan is not using GPU #3737

Closed
1 of 3 tasks
majsterkovic opened this issue Sep 28, 2024 · 10 comments
Closed
1 of 3 tasks

bug: Jan is not using GPU #3737

majsterkovic opened this issue Sep 28, 2024 · 10 comments
Assignees
Labels
P0: critical Mission critical type: bug Something isn't working

Comments

@majsterkovic
Copy link

Jan version

0.5.4

Describe the Bug

Jan is not using GPU

Steps to Reproduce

Model: Bielik-11B-v2.3-Instruct.Q8_0.gguf
GPU: NVIDIA RTX 4070 SUPER

Screenshots / Logs

obraz_2024-09-28_015717703

What is your OS?

  • MacOS
  • Windows
  • Linux
@majsterkovic majsterkovic added the type: bug Something isn't working label Sep 28, 2024
@github-project-automation github-project-automation bot moved this to Triage in Menlo Sep 28, 2024
@majsterkovic majsterkovic changed the title bug: [DESCRIPTION] bug: Jan is not using GPU Sep 28, 2024
@chlimouj
Copy link

I'm having a similar issue. I can only use my 4090 through Vulcan, not CUDA.

I also have a Tesla P40 installed, and Jan sees it but can't use it. Of course, the P40 doesn't support Vulcan.

In the past, I've been able to use the P40 and 4090 simultaneously to load large models. I did have some driver issues with a recent upgrade to Win11 (BSOD every few hours). Possibly not fully resolved? Not sure what to look at or what to try.

@imtuyethan
Copy link
Contributor

@louis-jan is this related to #3558

@imtuyethan imtuyethan added the P0: critical Mission critical label Sep 29, 2024
@chlimouj
Copy link

@louis-jan is this related to #3558

In my case, it happens with models I've imported manually as well.

And the models load and run fine on CPU (just slower, obviously)

@dan-menlo dan-menlo removed the status in Menlo Sep 29, 2024
@dan-menlo dan-menlo moved this to Investigating in Menlo Sep 29, 2024
@louis-jan
Copy link
Contributor

Hi @chlimouj Could you please share the log file, and what NGL settings are you using?

@mtomas7
Copy link

mtomas7 commented Oct 3, 2024

Same here. Log file attached.
app - Copy.log

  • First I noticed that GPU Acceleration was OFF in Settings/Advanced. I turned it ON.
  • After that AI Model would not load at all, but I got a message to install additional components for CUDA:
  • Additional Dependencies
    This extension embeds cortex.cpp, a lightweight inference engine written in C++. See https://jan.ai/. Additional dependencies could be installed to run without Cuda Toolkit installation.

NGL was set to default 33 layers.

After that AI model was loaded into VRAM and worked.

@louis-jan
Copy link
Contributor

Hi @chlimouj Yeah, with the fix, you'll need to re-import it for it to work properly. Which means it will retrieve the correct metadata from the model when importing now.

@louis-jan
Copy link
Contributor

Same here. Log file attached. app - Copy.log

  • First I noticed that GPU Acceleration was OFF in Settings/Advanced. I turned it ON.
  • After that AI Model would not load at all, but I got a message to install additional components for CUDA:
  • Additional Dependencies
    This extension embeds cortex.cpp, a lightweight inference engine written in C++. See https://jan.ai/. Additional dependencies could be installed to run without Cuda Toolkit installation.

NGL was set to default 33 layers.

After that AI model was loaded into VRAM and worked.

Yeah, we're working on bundling these dependencies into the app by default. It will increase the app size a bit, but it won't require any additional actions from the user.

@freelerobot
Copy link
Contributor

Will be fixed with #3690

@Fuckingnameless
Copy link

how do you install this fucking cortex extension?

@Fuckingnameless
Copy link

also how to change the models folder?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
P0: critical Mission critical type: bug Something isn't working
Projects
Archived in project
Development

No branches or pull requests

7 participants