Cloud Run application utilizing Gradio Framework that demonstrates working with Vertex AI API


Author(s)	Romin Irani

This sample shows a Cloud Run application that uses the Gradio framework as a front end for the Vertex AI API.

Build and Deploy the application to Cloud Run

NOTE: Before you move forward, ensure that you have followed the instructions in SETUP.md. Additionally, ensure that you have cloned this repository and are currently in the chat-gradio folder. This should be your active working directory for the rest of the commands.

To deploy the Gradio App in Cloud Run, we need to perform the following steps:

Your Cloud Run service requires access to two environment variables:
- GCP_PROJECT : This is the Google Cloud Project Id.
- GCP_REGION : This is the region in which you are deploying your Cloud Run service, e.g. us-central1.

These variables are needed because the Vertex AI initialization requires the Google Cloud Project Id and the region. The relevant line from app.py is: `vertexai.init(project=PROJECT_ID, location=LOCATION)`
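
As a minimal sketch of how those two variables might feed the initialization call, the snippet below reads them from the environment and fails early if either is missing. The helper names `resolve_vertex_settings` and `init_vertex` are hypothetical and are not taken from app.py; only the `vertexai.init(project=..., location=...)` call comes from this README.

```python
import os


def resolve_vertex_settings(env=os.environ):
    """Return (project_id, location) read from GCP_PROJECT and GCP_REGION.

    Hypothetical helper for illustration; raises if either variable is unset.
    """
    project_id = env.get("GCP_PROJECT")
    location = env.get("GCP_REGION")
    missing = [
        name
        for name, value in (("GCP_PROJECT", project_id), ("GCP_REGION", location))
        if not value
    ]
    if missing:
        raise RuntimeError("Missing environment variable(s): " + ", ".join(missing))
    return project_id, location


def init_vertex(env=os.environ):
    """Initialize the Vertex AI SDK with the resolved project and region."""
    # Imported lazily so resolve_vertex_settings works without the SDK installed.
    import vertexai

    project_id, location = resolve_vertex_settings(env)
    vertexai.init(project=project_id, location=location)
```

Failing at startup when a variable is missing tends to be easier to diagnose in the Cloud Run logs than an error raised on the first model call.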

In Cloud Shell, execute the following commands:
```
export GCP_PROJECT='<Your Google Cloud Project Id>'  # Change this
export GCP_REGION='us-central1'             # If you change this, make sure region is supported by Model Garden. When in doubt, keep this.
```

Next, build the Docker image for the application and push it to Artifact Registry. This requires one more environment variable, AR_REPO, which holds the name of the Artifact Registry repository. The first gcloud command in this step creates that repository for you.

In Cloud Shell, execute the following commands:

```
export AR_REPO='<REPLACE_WITH_YOUR_AR_REPO_NAME>'  # Change this
export SERVICE_NAME='chat-gradio-app' # This is the name of our Application and Cloud Run service. Change it if you'd like.
gcloud artifacts repositories create "$AR_REPO" --location="$GCP_REGION" --repository-format=Docker
gcloud auth configure-docker "$GCP_REGION-docker.pkg.dev"
gcloud builds submit --tag "$GCP_REGION-docker.pkg.dev/$GCP_PROJECT/$AR_REPO/$SERVICE_NAME"
```
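
The image tag used in the build and deploy commands is assembled from four values: region, project, repository, and service name. The sketch below shows that structure and rejects empty parts. The function name `image_uri` and the sample values are hypothetical, chosen only for illustration.

```python
def image_uri(region: str, project: str, repo: str, service: str) -> str:
    """Compose the Artifact Registry image path used as the --tag / --image value."""
    parts = {"region": region, "project": project, "repo": repo, "service": service}
    empty = [name for name, value in parts.items() if not value]
    if empty:
        # An unset shell variable would silently produce a malformed tag.
        raise ValueError("Empty image URI component(s): " + ", ".join(empty))
    return f"{region}-docker.pkg.dev/{project}/{repo}/{service}"


# Sample values only; substitute your own project and repository names.
print(image_uri("us-central1", "my-project", "my-repo", "chat-gradio-app"))
```

If one of the shell variables is unset when you run `gcloud builds submit`, the tag will contain an empty path segment, which is the failure this check is meant to surface.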

The final step is to deploy the service in Cloud Run with the image that we built and pushed to the Artifact Registry in the previous step:

In Cloud Shell, execute the following command:

```
gcloud run deploy "$SERVICE_NAME" \
  --port=8080 \
  --image="$GCP_REGION-docker.pkg.dev/$GCP_PROJECT/$AR_REPO/$SERVICE_NAME" \
  --allow-unauthenticated \
  --region="$GCP_REGION" \
  --platform=managed \
  --project="$GCP_PROJECT" \
  --set-env-vars="GCP_PROJECT=$GCP_PROJECT,GCP_REGION=$GCP_REGION"
```

On successful deployment, you will be given a URL for the Cloud Run service. Open it in a browser to view the application you just deployed. Select one of the predefined queries, and the application will query the Vertex AI text model and return a response.
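
For orientation, here is a rough sketch of how a Gradio app could be wired to listen on the port passed to `gcloud run deploy` and to offer predefined queries. It is not the contents of app.py: the example queries, the `answer` placeholder, and the helper names are all assumptions, and the real application would call the Vertex AI text model where the placeholder returns a canned string.

```python
import os

# Hypothetical example prompts; the real app defines its own predefined queries.
EXAMPLE_QUERIES = [["What is Cloud Run?"], ["Explain Vertex AI in one sentence."]]


def launch_kwargs(env=os.environ):
    """Gradio launch settings suited to a Cloud Run container."""
    # Cloud Run sets PORT for the container; fall back to 8080 to match --port=8080.
    return {"server_name": "0.0.0.0", "server_port": int(env.get("PORT", "8080"))}


def answer(prompt: str) -> str:
    """Placeholder; the real app would query the Vertex AI text model here."""
    return f"(model response for: {prompt})"


def build_demo():
    """Build a single-textbox Gradio interface with clickable example queries."""
    # Imported lazily so the helpers above work without Gradio installed.
    import gradio as gr

    return gr.Interface(fn=answer, inputs="text", outputs="text", examples=EXAMPLE_QUERIES)


# To serve the app inside the container:
# build_demo().launch(**launch_kwargs())
```

Binding to `0.0.0.0` rather than the default loopback address matters in a container, since Cloud Run routes requests to the container from outside it.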
