RAG Monitoring

What is RAG Monitoring?

An AMP for collecting and visualizing monitoring data, in this case for RAG applications.

Overview of the AMP

The RAG Monitoring feature tracks the quality and health of Retrieval-Augmented Generation (RAG) applications using a curated set of pre-built metrics and evaluators. It uses Large Language Models (LLMs) as judges to compute quality scores, which are aggregated and visualized in a dashboard that shows both summary metrics and trends over time.
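
To make the LLM-as-judge idea concrete, here is a minimal sketch. It is not the AMP's actual evaluator: the prompt wording, the 1 to 5 scale, and the names `build_judge_prompt` and `parse_score` are assumptions made for illustration, and the call to the LLM itself is left out.

```python
import re
from typing import Optional

# Hypothetical prompt template; the AMP's real evaluator prompts may differ.
JUDGE_TEMPLATE = (
    "You are grading a RAG answer for {metric}.\n"
    "Question: {question}\n"
    "Context: {context}\n"
    "Answer: {answer}\n"
    "Reply with 'Score: N' where N is an integer from 1 to 5."
)


def build_judge_prompt(metric: str, question: str, context: str, answer: str) -> str:
    """Fill the template for one metric such as 'faithfulness' or 'relevance'."""
    return JUDGE_TEMPLATE.format(
        metric=metric, question=question, context=context, answer=answer
    )


def parse_score(reply: str) -> Optional[int]:
    """Extract the integer score from a judge reply; None if absent or out of range."""
    match = re.search(r"Score:\s*(\d+)", reply)
    if match is None:
        return None
    score = int(match.group(1))
    return score if 1 <= score <= 5 else None


prompt = build_judge_prompt(
    "faithfulness",
    question="What does the dashboard show?",
    context="The dashboard provides summary metrics and trends over time.",
    answer="It shows summary metrics and trends.",
)
# A judge reply would normally come from the LLM; this one is hard-coded.
example_score = parse_score("Score: 4")
```

Returning `None` for a malformed or out-of-range reply keeps unparseable judge output out of the aggregated scores instead of silently counting it.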

Our list of pre-built metrics is regularly updated. Currently, the following metrics are available:

Input/Output Word Count Statistics
User Feedback
Context and Final Response Quality
Auto-Evaluation Metrics (e.g., relevance, comprehensiveness, toxicity, faithfulness, maliciousness)
Raw Input, Output, and Context Data
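
As an illustration of the first metric in the list, word count statistics over a batch of inputs could be computed roughly as follows. Splitting on whitespace and the function name `word_count_stats` are assumptions; the AMP may count words differently.

```python
from statistics import mean, median


def word_count_stats(texts):
    """Return min, max, mean, and median word counts for a list of strings."""
    # Assumption: a "word" is any whitespace-separated token.
    counts = [len(text.split()) for text in texts]
    if not counts:
        raise ValueError("texts must not be empty")
    return {
        "min": min(counts),
        "max": max(counts),
        "mean": mean(counts),
        "median": median(counts),
    }


inputs = ["What is RAG?", "How do I enable the models in Bedrock?"]
stats = word_count_stats(inputs)
```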

Pre-requisites

This AMP requires AWS for access to both the LLM and the embedding model. Before using the AMP, make sure the following models are enabled and accessible in AWS Bedrock.

For the LLM, enable the following model in AWS Bedrock:
- Llama3 70b Instruct V1 (meta.llama3-70b-instruct-v1:0)
For embeddings, enable the following model in AWS Bedrock:
- Cohere English Embedding v3 (cohere.embed-english-v3)
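
One way to sanity-check the region before deploying is to ask Bedrock which foundation models it lists. The sketch below assumes a boto3 `bedrock` client is passed in; `missing_models` is a hypothetical helper, not part of the AMP. Note that a listed model is only offered in the region; whether access has been granted to your account still has to be confirmed in the Bedrock console.

```python
def missing_models(bedrock_client, required_ids):
    """Return the required model IDs that Bedrock does not list for the client's region."""
    response = bedrock_client.list_foundation_models()
    listed = {summary["modelId"] for summary in response["modelSummaries"]}
    return [model_id for model_id in required_ids if model_id not in listed]


# With real credentials the client would be created like this (requires boto3):
#   import boto3
#   client = boto3.client("bedrock", region_name="us-west-2")
#   print(missing_models(client, ["meta.llama3-70b-instruct-v1:0"]))
```

The embedding model can be checked the same way by adding its ID to the list.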

Common Issues

Both the Llama and Cohere models must be enabled in AWS Bedrock. If either model is not enabled, the AMP will not function as expected.

Installation

Follow the standard instructions for installing this AMP into your CML workspace. The "File Name" to use is catalog-entry.yaml.

Find the catalog entry in the AMP catalog.
Start the AMP by selecting Configure & Deploy.

To configure the project, provide the following environment variables:

AWS_DEFAULT_REGION - defaults to us-west-2
AWS_ACCESS_KEY_ID
AWS_SECRET_ACCESS_KEY
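
A small pre-flight check can catch a missing variable before the AMP starts. This is a sketch only; `load_aws_settings` is a hypothetical helper, and the only behavior taken from this README is the `us-west-2` default region.

```python
import os

REQUIRED_VARS = ("AWS_ACCESS_KEY_ID", "AWS_SECRET_ACCESS_KEY")
DEFAULT_REGION = "us-west-2"  # documented default for AWS_DEFAULT_REGION


def load_aws_settings(env=None):
    """Read the three variables, applying the default region when it is unset."""
    env = os.environ if env is None else env
    missing = [name for name in REQUIRED_VARS if not env.get(name)]
    if missing:
        raise RuntimeError("Missing environment variables: " + ", ".join(missing))
    settings = {name: env[name] for name in REQUIRED_VARS}
    settings["AWS_DEFAULT_REGION"] = env.get("AWS_DEFAULT_REGION") or DEFAULT_REGION
    return settings


# Example with an explicit mapping instead of the real environment.
settings = load_aws_settings(
    {"AWS_ACCESS_KEY_ID": "example-key-id", "AWS_SECRET_ACCESS_KEY": "example-secret"}
)
```

Calling `load_aws_settings()` with no argument reads from the process environment instead.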

Getting Started with the RAG Monitoring Application

Set Up Your Data Source

Create a collection and upload your knowledge base documents. Documents will be vectorized and stored as embeddings. You can set up multiple collections, each with its own chat interface and monitoring dashboard.
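
To show what storing documents as embeddings makes possible, the sketch below ranks stored vectors by cosine similarity to a query vector. The three-dimensional vectors and file names are made up for illustration; in the AMP the vectors come from the embedding model, and the actual vector store and retrieval logic may differ.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two non-zero vectors of equal length."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def top_k(query_vec, collection, k=2):
    """Return the k document names whose vectors are most similar to query_vec."""
    ranked = sorted(
        collection,
        key=lambda name: cosine_similarity(query_vec, collection[name]),
        reverse=True,
    )
    return ranked[:k]


# Toy collection: document name -> hand-written embedding (illustrative only).
collection = {
    "install.md": [0.9, 0.1, 0.0],
    "metrics.md": [0.1, 0.9, 0.2],
    "license.md": [0.0, 0.1, 0.9],
}
query = [0.2, 0.8, 0.1]
results = top_k(query, collection)
```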

Start Chatting

Use the Chat Application to interact with your documents.

Provide Feedback

Rate responses instantly using thumbs-up/down feedback. Text feedback will be available soon.
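
Thumbs-up/down votes reduce naturally to a single rate. The sketch below assumes each response carries `True`, `False`, or `None` (no feedback); this encoding and the name `thumbs_up_rate` are illustrative, not the AMP's storage format.

```python
def thumbs_up_rate(feedback):
    """Fraction of rated responses marked thumbs-up; None if nothing was rated."""
    rated = [vote for vote in feedback if vote is not None]
    if not rated:
        return None
    return sum(1 for vote in rated if vote) / len(rated)


# True = thumbs-up, False = thumbs-down, None = no feedback given.
votes = [True, True, False, None, True]
rate = thumbs_up_rate(votes)
```

Unrated responses are excluded from the denominator so that low participation does not read as negative feedback.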

Track Metrics

Visit the monitoring dashboard to view real-time quality metrics for your RAG application. Metrics are summarized and tracked over time.
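
Summarizing a metric over time amounts to grouping scores by period and averaging each group. The following sketch uses daily buckets and made-up sample records; the bucket size and the name `daily_means` are assumptions, not the dashboard's actual logic.

```python
from collections import defaultdict
from datetime import datetime
from statistics import mean


def daily_means(records):
    """Group (timestamp, score) pairs by calendar day and average each group."""
    by_day = defaultdict(list)
    for timestamp, score in records:
        by_day[timestamp.date().isoformat()].append(score)
    return {day: mean(scores) for day, scores in sorted(by_day.items())}


# Sample data for illustration only.
records = [
    (datetime(2024, 6, 1, 9, 30), 4),
    (datetime(2024, 6, 1, 15, 0), 2),
    (datetime(2024, 6, 2, 11, 45), 5),
]
trend = daily_means(records)
```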

Dive Deeper

Access raw input, output, and context data for detailed analysis.

Local Development

You can run the AMP locally using Docker.

Set the AWS_DEFAULT_REGION, AWS_ACCESS_KEY_ID and AWS_SECRET_ACCESS_KEY environment variables as above.
Run ./local.sh

This builds and runs the AMP. The script builds the AMP image only once. To force a rebuild, set the BUILD environment variable to true:

BUILD=true ./local.sh

This is useful when you are changing the AMP and want the running instance to reflect your changes.

The Fine Print

IMPORTANT: Please read the following before proceeding. This AMP includes or otherwise depends on certain third party software packages. Information about such third party software packages is made available in the notice file associated with this AMP. By configuring and launching this AMP, you will cause such third party software packages to be downloaded and installed into your environment, in some instances, from third parties' websites. For each third party software package, please see the notice file and the applicable websites for more information, including the applicable license terms.

If you do not wish to download and install the third party software packages, do not configure, launch or otherwise use this AMP. By configuring, launching or otherwise using the AMP, you acknowledge the foregoing statement and agree that Cloudera is not responsible or liable in any way for the third party software packages.
