Sprint 26 Planning #1735
Comments
Suggested Jan Epics (as parent issues for Cortex), cc @imtuyethan @dan-homebrew @0xSage
3. Jan & Cortex fixes Vulkan acceleration
4. Jan & Cortex supports Multi GPU
Closing after sprint planning discussion, now on kanban boards
github-project-automation (bot) moved this from Investigating to Review + QA in Jan & Cortex on Nov 29, 2024
1.0.4 scope: https://github.com/janhq/cortex.cpp/milestone/25
Sprint 25 In progress / QA + Release next week
/threads, /messages #1567
Sprint 25 Not yet started + Sprint 26 stories
Jan: Engine Management, Hardware Settings
Jan Bugs that should be solved (requires QA)
Jan Bugs that need investigation (to prioritise)
bug: Mismatch between Jan UI & model configuration options jan#4035
Sub tasks:
Wrong NGL maximum settings (e.g., the NGL limit is 30 layers, but the Jan UI lets the user set it to 100): https://discord.com/channels/1107178041848909847/1306758623325851689/1307651904171540512
bug: Max context length automatically defaults to 4096 tokens #1668
bug: OpenRouter API max token limit too low in UI (1024) compared to model capability (128k) jan#4014
bug: Inference with NVIDIA GPU stop working after resuming from sleep on Linux #1459 - computer going to sleep
bug: Arch Detector Causing Special Characters in GPU Name in settings.json #1140
bug: Cortex Engine crashes - Doesn't start any local models after the latest update #1469
Tokens per second calculation: bug: Tokens per second calculation is wrong. #1099, epic: cortex.cpp Hardware Benchmarking + Backend Infra #985
Need technical help understanding:
Jan x Cortex implements Hardware Detection
Cortex supports new Engines
dylibs and are self-contained #1732
New Features
P0: Model sources - needed for Jan Hub
P0: Jan x Cortex supports Python
Assistants /assistants (Jan status quo equivalent) #1573
QA
Cortex UX Improvements
cortex models update <MODEL_ID> #1060
cortex update does not stop gracefully #1656
Sprint 27 / beyond
Support Ichigo & TTS & STT -> Sprint 27, blocked on Python
RAG
Supports Vision, upstream llama.cpp -> Move to Sprint 27
Unprioritised / Icebox
/model/presets #1185
Jan & Cortex supports Multi GPU -> Create parent Jan issue
Docs / Website: