switch model to 1.0-pro #15

Open
wants to merge 1 commit into base: main

Conversation

adonisote

Fixes #

Step 8. https://firebase.google.com/codelabs/ai-genkit-rag#7

It fails with the following error:
⨯ Error: Vertex response generation failed: ClientError: [VertexAI.ClientError]: got status: 429 Too Many Requests. {"error":{"code":429,"message":"Quota exceeded for aiplatform.googleapis.com/generate_content_requests_per_minute_per_project_per_base_model with base model: gemini-1.5-flash. Please submit a quota increase request. https://cloud.google.com/vertex-ai/docs/generative-ai/quotas-genai.","status":"RESOURCE_EXHAUSTED"}}
at Generator.throw ()
digest: "764727645"
POST /gemini 500 in 2669ms

After switching to gemini-1.0-pro-preview, it passes. Flash produces the same error on my side, presumably because of the quotas described here: https://cloud.google.com/vertex-ai/generative-ai/docs/quotas
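
One option besides switching models is to retry the request when the quota error appears. The following is a minimal sketch, not code from the codelab: `isQuotaError`, `backoffDelayMs`, and `withQuotaRetry` are hypothetical helper names, the delays are arbitrary defaults, and `call` stands in for whatever function issues the Vertex AI request. The error check is a crude string match against the message quoted above.

```typescript
// Hypothetical helpers; none of these names come from Genkit or the Vertex AI SDK.

/** Crude check: does the error message look like the 429 / RESOURCE_EXHAUSTED failure above? */
function isQuotaError(err: unknown): boolean {
  const message = err instanceof Error ? err.message : String(err);
  return message.includes("429 Too Many Requests") || message.includes("RESOURCE_EXHAUSTED");
}

/** Exponential backoff: baseMs, 2 * baseMs, 4 * baseMs, ... capped at maxMs. */
function backoffDelayMs(attempt: number, baseMs: number = 1000, maxMs: number = 30000): number {
  return Math.min(maxMs, baseMs * Math.pow(2, attempt));
}

/** Default sleep based on setTimeout; a different one can be injected for tests. */
function defaultSleep(ms: number): Promise<void> {
  return new Promise<void>((resolve) => setTimeout(resolve, ms));
}

/**
 * Runs `call`, retrying only on quota errors, up to `maxAttempts` total attempts.
 * Any other error, or a quota error on the final attempt, is rethrown unchanged.
 */
async function withQuotaRetry<T>(
  call: () => Promise<T>,
  maxAttempts: number = 4,
  sleep: (ms: number) => Promise<void> = defaultSleep,
): Promise<T> {
  for (let attempt = 0; ; attempt++) {
    try {
      return await call();
    } catch (err) {
      const isLastAttempt = attempt >= maxAttempts - 1;
      if (!isQuotaError(err) || isLastAttempt) {
        throw err;
      }
      await sleep(backoffDelayMs(attempt));
    }
  }
}
```

Since the quota named in the error is counted per minute, retries spaced a few seconds apart may not be enough when the project is consistently over the limit. In that case a model with a higher quota, or a quota increase request, is the more likely fix.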

alexastrum requested a review from nohe427 July 24, 2024 17:46
@alexastrum
Collaborator

Hi @adonisote,
given that you're hitting the quota limit for 1.5 Pro, I'd suggest trying Gemini 1.5 Flash.

1.0 Pro could also be a good choice, but given that it is an older model, I would consider the 1.5 family first.

@nohe427,
I wonder if it's worthwhile to look into switching to the 1.5 Flash model in this codelab.

@nohe427
Collaborator

nohe427 commented Jul 24, 2024

@alexastrum & @adonisote: I would prefer the 1.5 Flash model instead. Could you please replace the model line with vertexai/gemini-1.5-flash-preview, since this model has a higher quota and is cheaper to run?
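
If the preferred model can still run out of quota, one possible pattern is to try a list of model identifiers in order. This is a sketch under stated assumptions, not the codelab's code: `GenerateFn` and `generateWithFallback` are hypothetical names, and `generate` is a placeholder for whatever function actually calls the model.

```typescript
// Hypothetical types and helper; not part of Genkit or the Vertex AI SDK.

/** Placeholder for the function that sends a prompt to a given model. */
type GenerateFn = (model: string, prompt: string) => Promise<string>;

/**
 * Tries each model identifier in order and returns the first successful result.
 * Only RESOURCE_EXHAUSTED errors trigger a fallback; other errors are rethrown.
 */
async function generateWithFallback(
  models: string[],
  prompt: string,
  generate: GenerateFn,
): Promise<{ model: string; text: string }> {
  let lastError: unknown = new Error("no models configured");
  for (const model of models) {
    try {
      const text = await generate(model, prompt);
      return { model, text };
    } catch (err) {
      const message = err instanceof Error ? err.message : String(err);
      if (!message.includes("RESOURCE_EXHAUSTED")) {
        throw err;
      }
      lastError = err; // quota hit: remember it and move on to the next model
    }
  }
  throw lastError;
}

// Example preference order. The first identifier is the one requested above;
// the second is an assumed spelling for the 1.0 Pro model and should be checked.
const modelPreference: string[] = [
  "vertexai/gemini-1.5-flash-preview",
  "vertexai/gemini-1.0-pro",
];
```

A call would then look like `generateWithFallback(modelPreference, prompt, generate)`, where `generate` wraps the real request.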
