Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[TRACKER] SHARK Studio Roadmap #2172

Open
monorimet opened this issue Sep 27, 2024 · 3 comments
Open

[TRACKER] SHARK Studio Roadmap #2172

monorimet opened this issue Sep 27, 2024 · 3 comments
Assignees

Comments

@monorimet
Copy link
Collaborator

monorimet commented Sep 27, 2024

SHARK Studio Roadmap

This project establishes and tracks a plan for phased releases of the SHARK Studio WebUI.

There are three objectives of this roadmap:

  • Define product features, supported OS, and devices.
  • Scheduling of features.

Product Goal

  • Demonstrate SHARK/IREE ML acceleration framework capabilities
  • Support common generative AI workloads for AMD users, as a standalone application.
  • Example application showing how to integrate with SHARK/IREE inference solutions.

Product Definition

OS: Windows 11 and Ubuntu 24.04
Device Targets:

  • 7900xtx
  • Strix iGPUs
  • MI300x
  • MI325
  • MI250
  • LLVMCPU (x86)
    Serving:
  • shortfin

Stable Diffusion (SDXL, SDXL-Turbo, SD3):
Features:

  • Text to Image: given a textual prompt, generate an image.
  • Custom Checkpoints/Weights: bring your own fine-tuned model weights as .safetensors
  • LoRA Embeddings: bring your own embeddings for generating specific characters, objects, or styles.
  • Output Image Size: 128x128 - 1024x1024, multiples of 8. (model restrictions apply)

Chatbot (LLaMa3):
Features:

  • Text to Text: Have a conversation with text as input.
  • Ability to query user-submitted document.

ML Perf Models (TBD)

Product Architecture

image


Schedule

Feature Release
Device: MI300, OS: Linux, Models: SDXL, LLama3, Features except LoRA, document chatbot input Nov 2024
Device: MI300x, Strix, OS: Win, Linux, Models: SDXL (+Turbo), SD3, Features: All Jan 2025
Device: MI300x, Strix, OS: Win, Linux, Models: SDXL (+Turbo), SD3, LLaMa3, Features: All Mar 2025

Nov. 2024 task tracking:

Task Date Assignees PR
Shortfin SD serving (Python) + SDXL precompiled serving test added to e2eshark Oct. 25 @monorimet nod-ai/shark-ai#265
Sharktank model exports, uploads + pytorch-sourced component testing in e2eshark Nov. 1 @monorimet @saienduri
WebUI (SDXL, LLaMa3) (fixed set of config parameters) + webui test in e2eshark Nov. 8 @gpetters-amd
QA + bugfixes, unknowns Nov. 15 @monorimet @gpetters-amd @pdhirajkumarprasad

Building Blocks

Component Source
CLIP turbine-models (SHARK-ModelDev)
Unet sharktank
VAE turbine-models (SHARK-ModelDev)
Scheduler turbine-models (SHARK-ModelDev)
@gpetters-amd
Copy link
Contributor

I'm happy to jump in on the WebUI and CI stuff once I'm done with the internal work.

@monorimet monorimet changed the title [TRACKER] SHARK Studio Q3 Release [TRACKER] SHARK Studio Roadmap Oct 10, 2024
@kumardeepakamd
Copy link

Super good diagram, schedule and clearly called out tasks with target dates @monorimet !!

@VeteranXT
Copy link

What works here? I heard Shark is broken?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

5 participants