Sprint 26 Planning #1735

gabrielle-ong · 2024-11-28T05:01:49Z

1.0.4 scope: https://github.com/janhq/cortex.cpp/milestone/25

Sprint 25 In progress / QA + Release next week

Sprint 25 Not yet started + Sprint 26 stories

Jan: Engine Management, Hardware Settings

Jan Bugs that should be solved (requires QA)

Jan Bugs that needs investigation (to prioritise)

bug: Mismatch between Jan UI & model configuration options jan#4035
Sub tasks:
Wrong NGL maximum settings (e.g., NGL number limits at 30 layers but the user can set it to 100 in Jan UI): https://discord.com/channels/1107178041848909847/1306758623325851689/1307651904171540512
bug: Max context length automatically defaults to 4096 tokens #1668
bug: OpenRouter API max token limit too low in UI (1024) compared to model capability (128k) jan#4014
bug: Inference with NVIDIA GPU stop working after resuming from sleep on Linux #1459 - computer going to sleep
bug: Arch Detector Causing Special Characters in GPU Name in settings.json #1140
bug: Cortex Engine crashes - Doesn't start any local models after the latest update #1469
bug: Tokens per second calculation is wrong. #1099, epic: cortex.cpp Hardware Benchmarking + Backend Infra #985 Tokens per second calculation:

Need technical help understanding:

feat: set up HTTPS for Custom Models #1097: is this fixed by proxy?
roadmap: mmap for keeping Model in VRAM when Flash Attention is used #1717

Jan x Cortex implements Hardware Detection

Cortex supports new Engines

New Features

P0: Model sources - needed for Jan Hub

planning: Cortex API supports /model/sources #15

P0: Jan x Cortex supports Python

janhq/jan-internal#8

Assistants

QA

Cortex UX Improvements

Sprint 27 / beyond

Support Ichigo & TTS & STT -> Sprint 27, blocked on Python

Jan Parent issue: roadmap: Jan supports Local Voice Mode w/ Ichijo jan#3488

RAG

planning: Cortex API supports Retrieval Tool #1597

Supports Vision, upstream llama.cpp -> Move to Sprint 27

Unprioritised / Icebox

Jan & Cortex supports Multi GPU -> Create parent Jan issue

planning: prioritize GPUs with CUDA_VISIBLE_DEVICES #1679
Add Multi-GPU Support for LlamaCpp Engine #1391
Fixes Jan multiple GPUs #1458

Docs / Website:

docs: idea: Using Cortex & Jan #1736
Decision: Are we publicly marketing Cortex or focusing on Jan?

The text was updated successfully, but these errors were encountered:

gabrielle-ong · 2024-11-28T10:05:19Z

Suggested Jan Epics (as parent issues for Cortex)

cc @imtuyethan @dan-homebrew @0xSage

3. Jan & Cortex fixes Vulkan acceleration

Need Eng planning / investigation
bug: Failed to Load Model with AMD GPU and Vulkan #1525
bug: AMD RX 7900 XTX Incompatibility Even with Vulkan Mode Enabled jan#3874

4. Jan & Cortex supports Multi GPU

gabrielle-ong · 2024-11-29T10:53:09Z

Closing after sprint planning discussion, now on kanban boards

github-project-automation bot added this to Jan & Cortex Nov 28, 2024

github-project-automation bot moved this to Investigating in Jan & Cortex Nov 28, 2024

gabrielle-ong self-assigned this Nov 28, 2024

gabrielle-ong mentioned this issue Nov 29, 2024

Cortex Braindump (Sprint 25) #1726

Closed

4 tasks

gabrielle-ong closed this as completed Nov 29, 2024

github-project-automation bot moved this from Investigating to Review + QA in Jan & Cortex Nov 29, 2024

gabrielle-ong moved this from Review + QA to Completed in Jan & Cortex Nov 29, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Sprint 26 Planning #1735

Sprint 26 Planning #1735

gabrielle-ong commented Nov 28, 2024 •

edited

Loading

gabrielle-ong commented Nov 28, 2024 •

edited

Loading

gabrielle-ong commented Nov 29, 2024

Sprint 26 Planning #1735

Sprint 26 Planning #1735

Comments

gabrielle-ong commented Nov 28, 2024 • edited Loading

Sprint 25 In progress / QA + Release next week

Sprint 25 Not yet started + Sprint 26 stories

Jan: Engine Management, Hardware Settings

Jan Bugs that should be solved (requires QA)

Jan Bugs that needs investigation (to prioritise)

Need technical help understanding:

Jan x Cortex implements Hardware Detection

Cortex supports new Engines

New Features

P0: Model sources - needed for Jan Hub

P0: Jan x Cortex supports Python

Assistants

QA

Cortex UX Improvements

Sprint 27 / beyond

Support Ichigo & TTS & STT -> Sprint 27, blocked on Python

RAG

Supports Vision, upstream llama.cpp -> Move to Sprint 27

Unprioritised / Icebox

Jan & Cortex supports Multi GPU -> Create parent Jan issue

gabrielle-ong commented Nov 28, 2024 • edited Loading

Suggested Jan Epics (as parent issues for Cortex)

3. Jan & Cortex fixes Vulkan acceleration

4. Jan & Cortex supports Multi GPU

gabrielle-ong commented Nov 29, 2024

gabrielle-ong commented Nov 28, 2024 •

edited

Loading

gabrielle-ong commented Nov 28, 2024 •

edited

Loading