diff --git a/docs/modelserving/v1beta1/llm/vllm/README.md b/docs/modelserving/v1beta1/llm/vllm/README.md index 1afc233b9..f8dcfcfbe 100644 --- a/docs/modelserving/v1beta1/llm/vllm/README.md +++ b/docs/modelserving/v1beta1/llm/vllm/README.md @@ -9,6 +9,8 @@ We have work in progress integrating `vLLM` with `Open Inference Protocol` and K The LLaMA model can be downloaded from [huggingface](https://huggingface.co/meta-llama/Llama-2-7b) and upload to your cloud storage. === "Yaml" + + ```yaml kubectl apply -n kserve-test -f - <