advanced_rag_team_twain

Branches

main : for data science development

deployment-path : for deployment

Setup

Set up environment and packages

$ python -m venv ragenv

$ source ragenv/bin/activate

$ pip install -r requirements.txt

Download dataset

# Make sure you have git-lfs installed (https://git-lfs.com)

$ git lfs install

$ git clone https://huggingface.co/datasets/neural-bridge/rag-dataset-12000

Set up Ollama

Download Ollama (see https://ollama.com/)

Download Llama3 $ ollama pull llama3

While we recommend using Llama3 via Ollama, if you wish to use HuggingFace models, please set your HuggingFace API token. os.environ['HUGGINGFACEHUB_API_TOKEN'] = 'YOUR TOKEN'
Build system and evaluate

python eval.py This will chunk the data, build a Chroma vector store (if using Chroma), and evaluate the RAG system. In the main function, predefined default options for Chroma Dense Retrieval, ES Sparse Retrieval, and ES Dense Retrieval are set up.

Walk through examples and demos given in presentation in PresentationExamples.ipynb

Name		Name	Last commit message	Last commit date
Latest commit History 46 Commits
config		config
evals		evals
script		script
utils		utils
viz		viz
.gitignore		.gitignore
CreateDocuments.py		CreateDocuments.py
PresentationExamples.ipynb		PresentationExamples.ipynb
RAG_utils.py		RAG_utils.py
README.md		README.md
eval.py		eval.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

advanced_rag_team_twain

Branches

Setup

About

Releases

Packages

Contributors 3

Languages

NU-MSAI-Practicum/advanced_rag_team_twain

Folders and files

Latest commit

History

Repository files navigation

advanced_rag_team_twain

Branches

Setup

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Languages

Packages