Skip to content
This repository has been archived by the owner on Sep 24, 2024. It is now read-only.

New huggingface eval for the summarization use case with rouge, meteor, and bertscore #100

Merged
merged 15 commits into from
Jun 13, 2024

Conversation

aittalam
Copy link
Member

@aittalam aittalam commented Jun 12, 2024

What's changing

Added a new evaluate huggingface entrypoint which supports evaluation of local and remote models (seq2seq, causal, openai, vllm, llamafile) and loading datasets / saving results on s3.

How to test it

lm-buddy evaluate huggingface --config examples/configs/evaluation/hf_evaluate_config.yaml

Related Jira Ticket

https://mzai.atlassian.net/browse/MZPLATFORM-78

Additional notes for reviewers

I know we discussed messaging to mzai-platform's backend directly from lm-buddy jobs.
I am 100% in favor of it, I just wanted to keep this PR independent from messaging and I will create a new one to be tested together with the updated mzai-platform code.

examples/configs/evaluation/hf_evaluate_config.yaml Outdated Show resolved Hide resolved
src/lm_buddy/buddy.py Show resolved Hide resolved
src/lm_buddy/configs/jobs/hf_evaluate.py Show resolved Hide resolved
src/lm_buddy/jobs/evaluation/hf_evaluate.py Outdated Show resolved Hide resolved
Copy link
Contributor

@binaryaaron binaryaaron left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

mostly responding / adding on to @veekaybee's review; will look more later. thanks!

examples/configs/evaluation/hf_evaluate_config.yaml Outdated Show resolved Hide resolved
src/lm_buddy/jobs/evaluation/metrics.py Outdated Show resolved Hide resolved
src/lm_buddy/jobs/model_clients.py Show resolved Hide resolved
src/lm_buddy/jobs/evaluation/hf_evaluate.py Outdated Show resolved Hide resolved
src/lm_buddy/buddy.py Show resolved Hide resolved
src/lm_buddy/configs/vllm.py Show resolved Hide resolved
Copy link
Member

@veekaybee veekaybee left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks for addressing these and for this PR!

@aittalam aittalam merged commit bc94aaa into main Jun 13, 2024
3 of 4 checks passed
@aittalam aittalam deleted the davide/hf-eval branch June 13, 2024 13:52
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants