Adds groq inference adapter. #517

swanhtet1992 · 2024-11-24T09:53:04Z

What does this PR do?

This PR adds a groq inference adapter.

Key features implemented:

Chat completion API with streaming support
Distribution template for easy deployment

What it does not support:

Text completion API
Embeddings API
Certain OpenAI features:
- logprobs and top_logprobs
- response_format options

Test Plan

Run the following test command:

pytest -s -v --providers inference=groq llama_stack/providers/tests/inference/ --env groq_API_KEY=<your-api-key>

To test the distribution template:

# Docker
LLAMA_STACK_PORT=5001
docker run -it -p $LLAMA_STACK_PORT:$LLAMA_STACK_PORT \
  llamastack/distribution-groq \
  --port $LLAMA_STACK_PORT \
  --env groq_API_KEY=$groq_API_KEY

# Conda
llama stack build --template groq --image-type conda
llama stack run ./run.yaml \
  --port $LLAMA_STACK_PORT \
  --env groq_API_KEY=$groq_API_KEY

Sources

groq API Documentation

Before submitting

This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
Ran pre-commit to handle lint / formatting issues.
Read the contributor guideline,
Pull Request section
Updated relevant documentation.
Wrote necessary unit or integration tests.

swanhtet1992 and others added 2 commits November 24, 2024 03:50

Adds groq inference adapter

d8d0f46

Merge branch 'meta-llama:main' into groq

7d7d1e6

swanhtet1992 requested review from ashwinb, yanxi0830, hardikjshah, dltn and raghotham as code owners November 24, 2024 09:53

facebook-github-bot added the CLA Signed This label is managed by the Meta Open Source bot. label Nov 24, 2024

Merge branch 'main' into groq

bc427b3

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Adds groq inference adapter. #517

Adds groq inference adapter. #517

swanhtet1992 commented Nov 24, 2024

Adds groq inference adapter. #517

Are you sure you want to change the base?

Adds groq inference adapter. #517

Conversation

swanhtet1992 commented Nov 24, 2024

What does this PR do?

Test Plan

Sources

Before submitting