Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

roadmap: Jan refactors /messages, /threads to Cortex #3895

Closed
3 of 6 tasks
dan-menlo opened this issue Oct 28, 2024 · 2 comments
Closed
3 of 6 tasks

roadmap: Jan refactors /messages, /threads to Cortex #3895

dan-menlo opened this issue Oct 28, 2024 · 2 comments
Assignees
Milestone

Comments

@dan-menlo
Copy link
Contributor

dan-menlo commented Oct 28, 2024

Goal

  • Jan should be a thin UI layer
  • Cortex should be an API Platform that has full OpenAI compatibility
  • We need to itemize and shift functionality to Cortex (e.g. Threads, Messages)

From my POV, Jan should only be a UI and "Packaging" layer (i.e. Mobile, Desktop, Server)
All other API functionality and state should shift to Cortex

Tasklist

Frontend

  • Itemize functionality that needs to shift to Cortex

Backend

Administrative

  • Dan/Nicole to link Internal/Admin issues

Product Marketing

Resources

@dan-menlo dan-menlo added this to Menlo Oct 28, 2024
@dan-menlo dan-menlo converted this from a draft issue Oct 28, 2024
@gabrielle-ong
Copy link
Contributor

gabrielle-ong commented Oct 29, 2024

WIP - Preliminary notes from last week with Louis

Model Hub

  • Updated models, delist
  • Jan dogfoods this to display on UI
  • future: eg Cortex remote models list

View RAM, CPU usage

  • requires reimplementing cortex ps for each engine
  • Can be deployed first but currently combines all engines together resulting in inaccurate

Hardware, eg GPU support

Support Tensorrt-llm engines [Prioritised?]

  • Need to define Model artifact, model.yml
  • need to bundle artifacts (needs investigating)

multi-modal - cortex.cpp needs to be able to pull 2 models at the same time

  • Need to define model.yml for the models

Reference discussions:

@dan-menlo dan-menlo changed the title planning: Jan functionality to shift to Cortex planning: Jan refactors APIs and State to Cortex Oct 29, 2024
@dan-menlo dan-menlo changed the title planning: Jan refactors APIs and State to Cortex planning: Jan Refactors APIs and State to Cortex Oct 29, 2024
@dan-menlo dan-menlo changed the title planning: Jan Refactors APIs and State to Cortex planning: Jan refactors APIs and State to Cortex Oct 30, 2024
@dan-menlo dan-menlo moved this from Investigating to Planning in Menlo Nov 1, 2024
@dan-menlo dan-menlo changed the title planning: Jan refactors APIs and State to Cortex roadmap: Jan refactors APIs and State to Cortex Nov 28, 2024
@dan-menlo dan-menlo moved this from Planning to In Progress in Menlo Nov 29, 2024
@dan-menlo dan-menlo changed the title roadmap: Jan refactors APIs and State to Cortex roadmap: Jan refactors /messages, /threads to Cortex Nov 29, 2024
@dan-menlo dan-menlo assigned namchuai and unassigned dan-menlo and louis-jan Nov 29, 2024
@namchuai namchuai moved this from In Progress to Eng Review in Menlo Dec 6, 2024
@namchuai namchuai moved this from Eng Review to QA in Menlo Dec 6, 2024
@namchuai namchuai moved this from QA to Eng Review in Menlo Dec 6, 2024
@namchuai namchuai moved this from Eng Review to QA in Menlo Dec 9, 2024
@namchuai namchuai moved this from QA to In Progress in Menlo Dec 9, 2024
@namchuai namchuai moved this from In Progress to Eng Review in Menlo Dec 9, 2024
@imtuyethan imtuyethan added this to the v0.5.12 milestone Dec 10, 2024
@imtuyethan imtuyethan moved this from Eng Review to QA in Menlo Dec 18, 2024
@imtuyethan
Copy link
Contributor

@imtuyethan imtuyethan moved this from QA to Completed in Menlo Dec 19, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
Archived in project
Development

No branches or pull requests

5 participants