Support Independent Configuration of Embedding and Rerank Models from Vendors like Voyage, Cohere, Jina, and BGE in RAG Applications #1816

Open
kevintsai1202 opened this issue Nov 25, 2024 · 2 comments

Comments

@kevintsai1202

In RAG applications, embedding and rerank models are just as important as LLMs. Currently, the most effective options include those from Voyage, Cohere, Jina, and BGE. Could you consider adding dedicated classes and independent configuration for the embedding and rerank models provided by these vendors?

At the moment, we can only configure these models through the OpenAI-compatible setup. However, OpenAI does not provide a rerank model, so we have to call the vendor's rerank API ourselves using RestClient.
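To illustrate the workaround, here is a minimal sketch of the rerank step in plain Java. All names (`RerankSketch`, `RerankClient`, `RerankResult`) are hypothetical and not part of Spring AI. The response shape, a list of document indexes with relevance scores, is an assumption modelled on how vendor rerank endpoints commonly respond. The HTTP call itself is hidden behind an interface, so a RestClient-based implementation could be plugged in; a stub is used here to keep the example runnable.

```java
import java.util.ArrayList;
import java.util.Comparator;
import java.util.List;

public class RerankSketch {

    /** One entry of an (assumed) rerank response: position of the document in the request and its score. */
    record RerankResult(int index, double relevanceScore) {}

    /** Hypothetical seam for the vendor call; in a Spring application this could wrap a RestClient POST. */
    interface RerankClient {
        List<RerankResult> rerank(String query, List<String> documents);
    }

    /** Reorders the retrieved documents by descending relevance score and keeps the best topN. */
    static List<String> rerank(RerankClient client, String query, List<String> retrieved, int topN) {
        // Copy into a mutable list, since the client may return an immutable one.
        List<RerankResult> results = new ArrayList<>(client.rerank(query, retrieved));
        results.sort(Comparator.comparingDouble(RerankResult::relevanceScore).reversed());
        return results.stream()
                .limit(topN)
                .map(r -> retrieved.get(r.index()))
                .toList();
    }

    public static void main(String[] args) {
        // Stub standing in for a real rerank API; the scores are made up for the example.
        RerankClient stub = (query, docs) -> List.of(
                new RerankResult(0, 0.12),
                new RerankResult(1, 0.91),
                new RerankResult(2, 0.47));

        List<String> retrieved = List.of("doc A", "doc B", "doc C");
        System.out.println(rerank(stub, "example query", retrieved, 2)); // prints [doc B, doc C]
    }
}
```

Keeping the vendor call behind a small interface means the reordering logic stays the same whichever provider is used.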

@ThomasVitale
Contributor

@kevintsai1202 thanks for reporting this. You might be interested in #1811, where we are tracking upcoming RAG-related features including implementation for reranking and Cohere support.

About embedding models, you can customise the EmbeddingModel object when you configure your instance of VectorStore or VectorStoreDocumentRetriever. Would that help with your use case?

@kevintsai1202
Author

Thank you. I noticed that Part 2 includes Cohere support, so it would make sense to integrate other embedding model providers as well. Additionally, the rerank model is applied in the post-retrieval stage. Which modules or interfaces are currently available for implementing this?
