Update README.md
Signed-off-by: Dan Sun <[email protected]>
yuzisun authored Feb 19, 2024
1 parent ea547ac commit 84fd59e
Showing 1 changed file with 3 additions and 1 deletion.
4 changes: 3 additions & 1 deletion docs/modelserving/v1beta1/llm/vllm/README.md
@@ -9,6 +9,8 @@ We have work in progress integrating `vLLM` with `Open Inference Protocol` and K
The LLaMA model can be downloaded from [huggingface](https://huggingface.co/meta-llama/Llama-2-7b) and upload to your cloud storage.

=== "Yaml"


```yaml
kubectl apply -n kserve-test -f - <<EOF
apiVersion: serving.kserve.io/v1beta1
@@ -18,7 +20,7 @@ The LLaMA model can be downloaded from [huggingface](https://huggingface.co/meta
spec:
predictor:
containers:
- args:
- args:
- --port
- "8080"
- --model