StableLM Alpha Testing Issue #1

Closed
MarkSchmidty opened this issue Apr 20, 2023 · 2 comments
Comments

MarkSchmidty commented Apr 20, 2023

@lhl could you shed some light on your testing regimen in this StableLM issue? Stability-AI/StableLM#30

The results seem unbelievably poor. It's shocking that a 7B GPT-NeoX trained on 800B tokens would score below GPT-2 700M and well below GPT-J, even if it is an early training checkpoint.

MarkSchmidty (Author) commented

Thanks!

lhl (Contributor) commented Apr 20, 2023

See https://github.com/AUGMXNT/llm-experiments/blob/main/01-lm-eval.md to get an idea of how it's done. Feel free to run the tests yourself. You'll want to use hf-causal as the model.
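For anyone trying to reproduce this, here is a rough sketch of what such a run might look like. It assumes the linked notes use EleutherAI's lm-evaluation-harness and its `main.py` entry point. The model ID `stabilityai/stablelm-base-alpha-7b`, the task names, and the helper `build_lm_eval_command` are illustrative assumptions, not details taken from this thread. The helper only assembles the command line, it does not launch the evaluation.

```python
import shlex


def build_lm_eval_command(pretrained, tasks, device="cuda:0", script="main.py"):
    """Assemble an argv list for an lm-evaluation-harness run.

    Hypothetical helper: the flag names follow the harness CLI as of
    spring 2023, but check them against the version you have installed.
    """
    if not tasks:
        raise ValueError("at least one task is required")
    return [
        "python", script,
        "--model", "hf-causal",                       # model type named in the comment above
        "--model_args", f"pretrained={pretrained}",   # Hugging Face model ID (assumed)
        "--tasks", ",".join(tasks),                   # comma-separated task list (assumed tasks)
        "--device", device,
    ]


cmd = build_lm_eval_command(
    "stabilityai/stablelm-base-alpha-7b",
    ["lambada_openai", "hellaswag"],
)
# Print the command so it can be pasted into a shell inside the harness checkout.
print(shlex.join(cmd))
```

Running the same command against a GPT-2 or GPT-J checkpoint, with only the `pretrained=` value changed, would give a like-for-like comparison on identical tasks.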
