You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I am using the software ootb, which currently runs qwen2vl on an API and showui2b on the local machine's GPU. However, I would like to run bothshowui2b and qwen2vl on a GPU server for improved performance and scalability.
Current Setup:
qwen2vl: Runs on the API and is powered by VLLM, which is OpenAI API compatible.
showui2b: Currently runs on a local machine’s GPU.
Desired Setup:
Both qwen2vl and showui2b should be run on the GPU server via API calls in ootb, instead of relying on the local machine's GPU for showui2b.
Challenges:
qwen2vl runs on VLLM, which is OpenAI API compatible, and could potentially be added to OOTB for the desired setup.
However, showui2b requires the local GPU for rendering, and there's no clear way to use a GPU server with OOTB to handle showui2b via API.
Proposed Solution:
Integrate a GPU server-based API for showui2b within the OOTB system.
Allow OOTB to call both qwen2vl (via VLLM API) and showui2b (via GPU server API) seamlessly.
Ensure that the server is capable of handling the GPU-intensive operations for both services.
Additional Context:
Integration with VLLM for qwen2vl API calls has been confirmed to be possible.
A similar API or service configuration for showui2b is needed to make this setup work.
The text was updated successfully, but these errors were encountered:
Description:
I am using the software
ootb
, which currently runsqwen2vl
on an API andshowui2b
on the local machine's GPU. However, I would like to run bothshowui2b
andqwen2vl
on a GPU server for improved performance and scalability.Current Setup:
Desired Setup:
qwen2vl
andshowui2b
should be run on the GPU server via API calls inootb
, instead of relying on the local machine's GPU forshowui2b
.Challenges:
qwen2vl
runs on VLLM, which is OpenAI API compatible, and could potentially be added to OOTB for the desired setup.showui2b
via API.Proposed Solution:
showui2b
within the OOTB system.qwen2vl
(via VLLM API) andshowui2b
(via GPU server API) seamlessly.Additional Context:
qwen2vl
API calls has been confirmed to be possible.showui2b
is needed to make this setup work.The text was updated successfully, but these errors were encountered: