Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

roadmap: Jan Local Engine Management can pick Version and Variant #4128

Open
5 of 7 tasks
imtuyethan opened this issue Nov 26, 2024 · 0 comments
Open
5 of 7 tasks

roadmap: Jan Local Engine Management can pick Version and Variant #4128

imtuyethan opened this issue Nov 26, 2024 · 0 comments
Assignees
Labels
category: providers Local & remote inference providers category: settings Settings, system monitoring components P1: important Important feature / fix
Milestone

Comments

@imtuyethan
Copy link
Contributor

imtuyethan commented Nov 26, 2024

Problem Statement

https://cortex.so/api-reference#tag/engines/get/v1/engines/{name}
Jan needs a UI to manage Cortex.cpp engines based on the following API endpoints:

  • List installed engines
  • Install/Uninstall engines
  • Load/Unload engines
  • Get/Set default engine variant
  • Update engine

Tasklist

Jan Tasks

Cortex Backend Tasks

Feature Idea

https://cortex.so/api-reference#tag/engines/get/v1/engines/{name}

Engine Management

  • Version
  • Update: Update engine to latest version of current default variant
  • Set Default: Ability to set default variant for each engine type
  • Show installed engines with:
    • Engine type (llama-cpp, onnxruntime, tensorrt-llm)
    • Variant name (e.g., mac-arm64, linux-amd64-avx)
  • Only one engine can be loaded at a time
  • Handle errors gracefully
  • Install: Allow installing specific engine variant/version
  • Uninstall: Remove engine variant
@imtuyethan imtuyethan added the type: feature request A new feature label Nov 26, 2024
@github-project-automation github-project-automation bot moved this to Investigating in Jan & Cortex Nov 26, 2024
@imtuyethan imtuyethan added needs designs Needs designs type: epic A major feature or initiative and removed type: feature request A new feature labels Nov 26, 2024
@imtuyethan imtuyethan moved this from Investigating to Planning in Jan & Cortex Nov 26, 2024
@imtuyethan imtuyethan added category: settings Settings, system monitoring components category: providers Local & remote inference providers labels Nov 26, 2024
@imtuyethan imtuyethan added the P1: important Important feature / fix label Nov 28, 2024
@imtuyethan imtuyethan moved this from Planning to Scheduled in Jan & Cortex Nov 28, 2024
@dan-homebrew dan-homebrew changed the title epic: Engine Management epic: Engines Management for llama.cpp to pick Version and Variant Nov 28, 2024
@dan-homebrew dan-homebrew changed the title epic: Engines Management for llama.cpp to pick Version and Variant roadmap: Engines Management for llama.cpp to pick Version and Variant Nov 28, 2024
@dan-homebrew dan-homebrew changed the title roadmap: Engines Management for llama.cpp to pick Version and Variant roadmap: Jan llama.cpp engine can pick Version and Variant Nov 28, 2024
@dan-homebrew dan-homebrew changed the title roadmap: Jan llama.cpp engine can pick Version and Variant roadmap: Jan Local Engine Management can pick Version and Variant Nov 28, 2024
@imtuyethan imtuyethan removed their assignment Nov 29, 2024
@imtuyethan imtuyethan removed needs designs Needs designs type: epic A major feature or initiative labels Nov 29, 2024
@imtuyethan imtuyethan added this to the v0.5.11 milestone Nov 29, 2024
@dan-homebrew dan-homebrew assigned namchuai and unassigned louis-jan Nov 29, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
category: providers Local & remote inference providers category: settings Settings, system monitoring components P1: important Important feature / fix
Projects
Status: In Progress
Development

No branches or pull requests

4 participants