Hi @danielchalef, I'm proposing this PR since I'm currently using this feature in my own project. It may be useful to users running open-source models, or to users who use an Anthropic LLM but would like to use OpenAI embeddings.
The goal is to give the option of configuring a separate embeddings client. In my case, for example, I use Llama 2 70B Chat through an OpenAI-compatible endpoint, but such endpoints often only implement the LLM part, not embeddings (I use Anyscale Endpoints, but the same problem applies to users hosting their own open-source model behind an OpenAI-compatible endpoint, e.g. with vLLM).
With this PR, if the embeddings client is disabled (the default), the LLM endpoint is used for embeddings, as is currently the case.
If you enable the embeddings client, you can configure an endpoint that is used specifically for embeddings (when not using local embeddings); for now, only the OpenAI service option is supported. This allows me, for example, to use the open-source model for intents and summaries, and the OpenAI API for embeddings.
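To illustrate the setup described above, here is a rough sketch of what the configuration could look like; the key names below are illustrative only, not necessarily the exact ones introduced in this PR:

```yaml
llm:
  service: "openai"
  # OpenAI-compatible endpoint serving an open-source model (LLM only, no embeddings)
  openai_endpoint: "https://api.endpoints.anyscale.com/v1"
  model: "meta-llama/Llama-2-70b-chat-hf"

embeddings_client:
  # false by default: embeddings fall back to the LLM endpoint, as today
  enabled: true
  # only the OpenAI service option is supported for now
  service: "openai"
  openai_endpoint: "https://api.openai.com/v1"
```

With `enabled: false`, behavior is unchanged from the current code; with `enabled: true`, embedding requests go to the dedicated endpoint instead of the LLM one.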
The tests pass locally (except the Anthropic ones, since I don't have a key), and I added some new tests for this use case as well. It's also working on Render with the embeddings client disabled or enabled.
Let me know if you think it would be worth merging this feature into the main repo (this is only the second time I've written Go, so please also tell me if I should rework or rewrite some parts).
I could also open a separate PR to allow customising the intent prompt when using open-source models, if you think that would be useful.