GitHub - SALT-NLP/audiolm-inference-server

Problem

If vllm model cannot be import in subprocess (runtime error) try re-install numpy==1.26.4

Model specific problem

# Qwen audio
current implement version of qwen-audio is not work yet. there are need for custom-vllm version

How to run

MODEL_NAME="WillHeld/DiVA-llama-3-v0-8b" uvicorn api_server:app --port 40021
MODEL_NAME="Qwen/Qwen2-Audio-7B-Instruct" GPU_MEMORY_UTILIZATION=0.5 uvicorn api_server:app --port 40020

Step to setup on runpod

git clone this repo
git submodule init
git submodule update
cd thirdparty/vllm
export MAX_JOBS=18
pip install -e .

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
entity		entity
handler		handler
test		test
thirdparty		thirdparty
.dockerignore		.dockerignore
.gitignore		.gitignore
.gitmodules		.gitmodules
README.md		README.md
api_server.py		api_server.py
requirements.txt		requirements.txt
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Problem

Model specific problem

How to run

Step to setup on runpod

About

Releases

Packages

Languages

SALT-NLP/audiolm-inference-server

Folders and files

Latest commit

History

Repository files navigation

Problem

Model specific problem

How to run

Step to setup on runpod

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages