Compete Labs

This codelab simulates scenarios where a startup CEO is trying to build a cloud-native intelligent app based on an open-source large language model. In particular, they want to quickly test and compare different cloud providers to find the best price performance.

In this codelab, you will follow a step-by-step guide to experiment with state-of-the-art hardware like Nvidia A100 GPU chips, large language model like Meta Llama 3.1, and software like vLLM. You'll leverage cloud-native technologies like Terraform, Docker, and Linux Bash on major cloud providers such as Azure and AWS.

Prerequisites

Bash (Unix shell) is required to execute commands in this codelab.
Azure Cloud Shell is recommended. (Note: It is also highly recommended to mount a storage account in case of accidental browser closure. Follow instructions here) Alternatively, macOS and Ubuntu are supported.

Steps

Setup Tests

In your lab environment, clone the repository and enter the directory:

git clone https://github.com/Azure-Samples/compete-labs
cd compete-labs

Install dependencies, authenticate, and initialize environments by running the commands below:

source scripts/init.sh

For Azure

export CLOUD=azure
export REGION=eastus2

For AWS

export CLOUD=aws
export REGION=us-west-2

Provision Resources

Provision infrastructure resources like GPU Virtual Machine:

source scripts/resources.sh provision $CLOUD $REGION

Running Tests

Deploying the server

Deploy the LLM-backed inferencing server using Docker:

source scripts/server.sh deploy $CLOUD

Starting the server

Download the Llama 3 8B model from Hugging Face, load it into the GPUs, and start the HTTP server:

source scripts/server.sh start $CLOUD

Testing the server

Send some prompt requests to the HTTP server to test chat completion endpoint:

source scripts/server.sh test $CLOUD

Cleanup Resources

Cleanup infrastructure resources like GPU Virtual Machine:

source scripts/resources.sh cleanup $CLOUD $REGION

Publish Results

Collect and upload test results to Azure Data Explorer. Please always publish results even if you run into issue before reaching this step, this helps us to know which step failed with what error

source scripts/publish.sh $CLOUD

Check out aggregated and visualized test results on the dashboard

Troubleshooting

If you run into issue, please read this troubleshooting doc

Name		Name	Last commit message	Last commit date
Latest commit History 80 Commits
.github		.github
docs		docs
modules		modules
scripts		scripts
.gitignore		.gitignore
CHANGELOG.md		CHANGELOG.md
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE.md		LICENSE.md
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Compete Labs

Prerequisites

Steps

Setup Tests

For Azure

For AWS

Provision Resources

Running Tests

Deploying the server

Starting the server

Testing the server

Cleanup Resources

Publish Results

Troubleshooting

About

Releases

Packages

Contributors 4

Languages

License

Azure-Samples/compete-labs

Folders and files

Latest commit

History

Repository files navigation

Compete Labs

Prerequisites

Steps

Setup Tests

For Azure

For AWS

Provision Resources

Running Tests

Deploying the server

Starting the server

Testing the server

Cleanup Resources

Publish Results

Troubleshooting

About

Resources

License

Code of conduct

Stars

Watchers

Forks

Releases

Packages 0

Contributors 4

Languages

Packages