
Support Running Both qwen2vl (VLLM) and showui2b on GPU Server via API in OOTB #55

Open
DongyoungKim2 opened this issue Dec 12, 2024 · 1 comment

@DongyoungKim2

Description:

I am using OOTB, which currently runs qwen2vl via an API and showui2b on the local machine's GPU. I would like to run both showui2b and qwen2vl on a GPU server for improved performance and scalability.

Current Setup:

  • qwen2vl: Runs remotely via an API served by VLLM, which is OpenAI API compatible.
  • showui2b: Runs on the local machine's GPU.
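Since VLLM exposes an OpenAI-compatible endpoint, the qwen2vl side of this setup can be queried over plain HTTP. A minimal stdlib-only sketch is below; the host, port, and model name are assumptions and would need to match your actual VLLM deployment.

```python
# Sketch: POST to a vLLM OpenAI-compatible /v1/chat/completions endpoint.
# The base URL, port, and model name below are placeholders, not OOTB defaults.
import json
import urllib.request


def build_payload(model: str, prompt: str, image_url: str) -> dict:
    """Build an OpenAI-style chat payload with one image + text message."""
    return {
        "model": model,
        "messages": [{
            "role": "user",
            "content": [
                {"type": "image_url", "image_url": {"url": image_url}},
                {"type": "text", "text": prompt},
            ],
        }],
    }


def query_vllm(base_url: str, payload: dict) -> str:
    """Send the payload and return the first completion's text."""
    req = urllib.request.Request(
        f"{base_url}/chat/completions",
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            # vLLM accepts any key unless --api-key is set on the server
            "Authorization": "Bearer EMPTY",
        },
    )
    with urllib.request.urlopen(req) as resp:
        body = json.load(resp)
    return body["choices"][0]["message"]["content"]


if __name__ == "__main__":
    payload = build_payload(
        "Qwen/Qwen2-VL-7B-Instruct",          # assumed model id
        "Describe this screenshot.",
        "https://example.com/screen.png",      # placeholder image URL
    )
    print(query_vllm("http://gpu-server:8000/v1", payload))
```

The same request shape works with the official `openai` Python client by pointing `base_url` at the VLLM server, which is presumably how OOTB already talks to qwen2vl.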

Desired Setup:

  • Both qwen2vl and showui2b should be run on the GPU server via API calls in ootb, instead of relying on the local machine's GPU for showui2b.

Challenges:

  • qwen2vl runs on VLLM, which is OpenAI API compatible, and could potentially be added to OOTB for the desired setup.
  • However, showui2b currently runs inference on the local GPU, and there's no clear way to have OOTB reach a showui2b instance on a GPU server via API.

Proposed Solution:

  1. Integrate a GPU server-based API for showui2b within the OOTB system.
  2. Allow OOTB to call both qwen2vl (via VLLM API) and showui2b (via GPU server API) seamlessly.
  3. Ensure that the server is capable of handling the GPU-intensive operations for both services.
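Step 1 above could be prototyped by wrapping showui2b in a small HTTP service on the GPU server. The sketch below uses only the Python standard library; `run_showui` is a hypothetical placeholder for whatever inference entry point showui2b actually exposes, and the port and response schema are assumptions.

```python
# Sketch: expose showui2b behind a minimal HTTP API on the GPU server.
# run_showui() is a HYPOTHETICAL stand-in for the real model call;
# the route, port, and response fields are assumptions for illustration.
import json
from http.server import BaseHTTPRequestHandler, HTTPServer


def run_showui(prompt: str, image_b64: str) -> dict:
    """Placeholder for the real showui2b GPU inference call."""
    # A real implementation would load the model once at startup and
    # return grounded UI actions; this dummy result only shows the shape.
    return {"action": "click", "coordinate": [0, 0]}


class ShowUIHandler(BaseHTTPRequestHandler):
    def do_POST(self):
        length = int(self.headers.get("Content-Length", 0))
        req = json.loads(self.rfile.read(length) or b"{}")
        result = run_showui(req.get("prompt", ""), req.get("image", ""))
        body = json.dumps(result).encode("utf-8")
        self.send_response(200)
        self.send_header("Content-Type", "application/json")
        self.send_header("Content-Length", str(len(body)))
        self.end_headers()
        self.wfile.write(body)


if __name__ == "__main__":
    # Port 9000 is an arbitrary choice for this sketch.
    HTTPServer(("0.0.0.0", 9000), ShowUIHandler).serve_forever()
```

With something like this in place, OOTB would only need a second base URL in its config: one endpoint for qwen2vl (VLLM) and one for showui2b, removing the local-GPU requirement entirely.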

Additional Context:

  • Integration with VLLM for qwen2vl API calls has been confirmed to be possible.
  • A similar API or service configuration for showui2b is needed to make this setup work.
@h-siyuan
Collaborator

Thank you for your detailed feedback! We'll definitely take it into consideration. This feature is already on our future update list; stay tuned :)
