Release v0.5.5
Release Notes
-
Added Dataset Support:
-
Support for
LongBench-write
quality evaluation of long text generation #136 -
Automatic downloading of
punkt_tab.zip
fromnltk
#140 -
Support for RAG evaluation #127:
- Support for embeddings/reranker evaluation: Integration of
MTEB
(Massive Text Embedding Benchmark) andCMTEB
(Chinese Massive Text Embedding Benchmark), supporting tasks such as retrieval and reranking - Support for end-to-end RAG evaluation: Integration of the
ragas
framework, supporting automatic generation of evaluation datasets and evaluation based on judge models
- Support for embeddings/reranker evaluation: Integration of
-
Documentation Updates:
-
Updated dependencies:
nltk>=3.9
androuge-score>=0.1.0
#145, #143