MedBot - AI-Powered Chat API with Memory and Document Retrieval

Description

MedBot is an experimental microservices-based chat API project. It explores integrating persistent memory, document retrieval capabilities, and Generative AI within a containerized architecture.

Features

Conversational Memory: The API maintains context across interactions, enabling it to deliver better responses.
Document Integration: It uses Retrieval-Augmented Generation (RAG) techniques, the API ensures that responses are better contextualized, by providing relevant information from documents as context to the LLM using prompt engineering.
Microservices Architecture: Built with modular services for flexibility and potential scalability.

Technologies used:

Docker
Kubernetes
FastAPI
Langchain
Postgres
Redis
Google Gemini APIs
NextJs

Architecture

The architecture comprises five pods, each hosting specific components:

Redis: Utilized for storing the chat history.
Postgres: Serves as a vector database and database for authentication.
Authentication Service: Manages Authentication for the microservices.
Vector Database Management Service: Manages the addition and removal of data from the vector database.
Query Preprocessing Service: Acts as the chat service's entry point. This service performs multiple functions:
- Fetches the chat history for the ongoing session.
- Summarizes the chat history using LLM.
- Uses the latest query and the chat summary to generate a single consolidated query using LLM.
- The newly generated query is then passed to question answer service.
- Once the question answer service returns a response, the new message exchange between AI and the user is saved to redis.
- Finally the response is returned back to the client.
Question and Answer Service: When received a query from query preprocessing service:
- Similarity search is performed on the query.
- The relevant content is fetched from the vector database.
- The document is then passed to LLM as a context with the original query, The generated response from the LLM is passed back to the query proprocessing service.

Folder Structure

The project is structured as follows:

.
├── README.md
├── auth
│   ├── Dockerfile
│   ├── requirements.txt
│   └── src
│       ├── app.py
│       ├── database
│       ├── routes
│       └── security.py
├── frontend
│   ├── Dockerfile
│   ├── README.md
│   ├── api
│   │   └── build-axios-client.ts
│   ├── app
│   │   ├── auth
│   │   ├── favicon.ico
│   │   ├── globals.css
│   │   ├── layout.tsx
│   │   ├── loading.tsx
│   │   └── page.tsx
│   ├── components
│   │   ├── login-form.tsx
│   │   ├── mode-toggle.tsx
│   │   ├── navbar.tsx
│   │   ├── register-form.tsx
│   │   ├── theme-provider.tsx
│   │   └── ui
│   ├── components.json
│   ├── lib
│   │   └── utils.ts
│   ├── next-env.d.ts
│   ├── next.config.mjs
│   ├── node_modules
│   ├── package-lock.json
│   ├── package.json
│   ├── postcss.config.js
│   ├── public
│   ├── tailwind.config.ts
│   └── tsconfig.json
├── infra
│   └── k8s
│       ├── auth-db-manager.yaml
│       ├── auth-manager.yaml
│       ├── frontend-manager.yaml
│       ├── ingress-service.yaml
│       ├── init-sql.yaml
│       ├── postgres-manager.yaml
│       ├── query-preprocessing-manager.yaml
│       ├── question_anwer_manager.yaml
│       ├── rag_uploader_manager.yaml
│       ├── redis-manager.yaml
│       └── secrets.yaml
├── query_preprocessing
│   ├── Dockerfile
│   ├── requirements.txt
│   └── src
│       ├── __init__.py
│       ├── app.py
│       ├── chat_summary_manager.py
│       └── redis_manager.py
├── question_answer
│   ├── Dockerfile
│   ├── requirements.txt
│   ├── src
│   │   ├── __init__.py
│   │   ├── app.py
│   │   └── query_manager.py
│   └── wait-for-postgres.sh
├── rag_uploader
│   ├── Dockerfile
│   ├── requirements.txt
│   └── src
│       ├── __init__.py
│       ├── app.py
│       └── vector_store_manager.py
├── skaffold
└── skaffold.yaml

Getting Started

To set up and run this project locally, follow these steps:

Prerequisites:
- Ensure Docker and Kubernetes are installed on your system.
Set Up Skaffold:
- Install and configure Skaffold according to its documentation.

Configuration:

Create a secrets.yaml file in the infra/k8s directory with your Google API key:

apiVersion: v1
 kind: Secret
 metadata:
    name: medbot-secret
 type: Opaque
 stringData:
    GOOGLE_API_KEY: <your-api-key>
    CLAUDE_API_KEY: <your-api-key>
    GROQ_API_KEY: <your-api-key>
    TAVILY_AI_API_KEY: <your-api-key>
    SECRET_KEY: <secret-to-sign-jwt>
    ALGORITHM: <algo-to-use-for-jwt>
    ACCESS_TOKEN_EXPIRE_MINUTES: <access-token-expiry-time>

Replace my docker hub id with your Docker Hub ID in skaffold.yaml, query-preprocessing-manager.yaml, question_answer_manager.yaml, and rag_uploader_manager.yaml. For example:
```
prathameshkurve/query-preprocessing => <your-name>/query-processing
```

Linux

Install minikube:
- Go to following link to download minikube
```
https://minikube.sigs.k8s.io/docs/start/
```
Install skaffold:
- To install skaffold click on following link
```
https://skaffold.dev/docs/install/
```

Run the following commands:

Start minikube

minikube start
minikube addons enable ingress
minikube addons enable ingress-dns

Host File Entry:
- Run following commands
  1. To fetch ip address run the following command
```
minikube ip
```
  1. Copy your ip address
```
<your-ip-address>
```
  1. Run following command
```
sudo nano /etc/hosts
```
  1. Add this to your host file
```
<your-ip-address> medbot.xyz
```
  1. Save file and exit and go to your project directory
```
ctrl/cmd + x && y
```
Start the Application:
- Navigate to the root directory of the project and run:
```
skaffold dev
```
Access the Services:
- The frontend can be accessed from medbot.xyz.
- The chat API can be accessed at medbot.xyz/api/chat/docs.
- Document upload service is available at medbot.xyz/api/rag/docs.

Windows

Host File Entry:
- Add the following entry to your host file to route local requests:
```
127.0.0.1 medbot.xyz
```
Start the Application:
- Kubernetes Ingress Controller is required for networking. It can be installed and setup with the following command:
```
kubectl apply -f https://raw.githubusercontent.com/kubernetes/ingress-nginx/main/deploy/static/provider/cloud/deploy.yaml
```
- Navigate to the root directory of the project and run:
```
skaffold dev
```
Access the Services:
- The frontend can be accessed from medbot.xyz.
- The chat API can be accessed at medbot.xyz/api/chat/docs.
- Document upload service is available at medbot.xyz/api/rag/docs.

Note

This is an ongoing development project. Its purpose is to explore AI-powered chat API capabilities.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

MedBot - AI-Powered Chat API with Memory and Document Retrieval

Description

Features

Technologies used:

Architecture

Folder Structure

Getting Started

Note

About

Releases

Packages

Contributors 5

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 197 Commits
auth		auth
frontend		frontend
infra/k8s		infra/k8s
query_preprocessing		query_preprocessing
question_answer		question_answer
rag_uploader		rag_uploader
.envrc		.envrc
.gitignore		.gitignore
README.md		README.md
flake.lock		flake.lock
flake.nix		flake.nix
skaffold.yaml		skaffold.yaml

consumer-ai-lab/medbot

Folders and files

Latest commit

History

Repository files navigation

MedBot - AI-Powered Chat API with Memory and Document Retrieval

Description

Features

Technologies used:

Architecture

Folder Structure

Getting Started

Note

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 5

Languages

Packages