
feat: support a local custom cortex engine #3764

Closed

RyanMetcalfeInt8 opened this issue Oct 3, 2024 · 6 comments

Labels: category: providers (Local & remote inference providers), duplicate (This issue or pull request already exists), move to Cortex

Comments

RyanMetcalfeInt8 commented Oct 3, 2024
Hello, I have a custom cortex engine (compiled into an engine.dll). Right now I am able to make use of it in a super hacky way, by replacing the default cortex.llamacpp engine.dll with my custom version. But obviously I don't want to have to do that.

So, I am wondering: what is the correct way to make use of my custom engine? I see in model.json that there is an 'engine' field, which is typically set to 'nitro'. My confusion is that 'nitro' seems equivalent to 'cortex' (which I also use), but I don't see any field that specifies which engine to load (cortex.llamacpp vs. mine).
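For reference, here's roughly what the relevant part of my model.json looks like (trimmed to the fields that matter for this question, with a placeholder model id):

```json
{
  "id": "my-model",
  "engine": "nitro"
}
```

As far as I can tell there's only that single `engine` field, so there's nowhere to say "load my engine.dll instead of cortex.llamacpp's".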

It's very possible that I missed or overlooked something.

Perhaps some feature is also needed at the cortex.cpp layer?

Thanks!
Ryan

github-project-automation bot moved this to Investigating in Menlo on Oct 3, 2024
louis-menlo (Contributor) commented Oct 3, 2024

Hi @RyanMetcalfeInt8, that's awesome! You're right: existing models define nitro as the engine, but the extension actually routes requests to the cortex.cpp server underneath, which runs them on the llama.cpp engine (version 0.5.5). We're working on multi-engine support, which is tracked in #3690.
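To illustrate where #3690 is headed, per-model engine selection could eventually look something like the sketch below. The engine values here are one possible design, not a shipped API:

```json
{
  "id": "my-model",
  "engine": "cortex.llamacpp"
}
```

A custom build would then register under its own name, e.g. "engine": "my-custom-engine", instead of overwriting the default DLL.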

RyanMetcalfeInt8 (Author) commented Oct 3, 2024

Hey @louis-jan, thanks for getting back to me so quickly! Makes sense. I'll subscribe to that issue, and I'm happy to help test new features as they become available. Cheers!

github-project-automation bot moved this from Investigating to Review + QA in Menlo on Oct 3, 2024
louis-menlo (Contributor) commented Oct 3, 2024
Hey @RyanMetcalfeInt8, I think this is a valid issue to track for improving the extension dev experience. I encountered the same issue while working on MLX support. I will reopen it for tracking.

louis-menlo reopened this on Oct 3, 2024
github-project-automation bot moved this from Review + QA to In Progress in Menlo on Oct 3, 2024
freelerobot changed the title from "[Question] How can I make use of my custom cortex engine?" to "feat: support a local custom cortex engine" on Oct 14, 2024
freelerobot (Contributor) commented

Related: #3773

freelerobot added the category: providers (Local & remote inference providers) label and removed the category: local providers label on Oct 17, 2024
imtuyethan (Contributor) commented Dec 10, 2024

github-project-automation bot moved this from In Progress to QA in Menlo on Dec 10, 2024

imtuyethan moved this from QA to Completed in Menlo on Dec 10, 2024

imtuyethan added the duplicate (This issue or pull request already exists) label on Dec 10, 2024