StableLM Alpha Testing Issue #1

MarkSchmidty · 2023-04-20T06:52:02Z

@lhl could you shed some light on your testing regimen in this StableLM issue? Stability-AI/StableLM#30

The results seem unbelievably poor. It's shocking that a 7B GPT-NeoX trained on 800B tokens would score below GPT-2 700M and well below GPT-J, even if it is an early training checkpoint.

MarkSchmidty · 2023-04-20T07:30:26Z

Thanks!

lhl · 2023-04-20T07:30:36Z

You can see https://github.com/AUGMXNT/llm-experiments/blob/main/01-lm-eval.md to get an idea of how it's done. Feel free to test yourself. You'll want to use hf-causal as the model.

MarkSchmidty closed this as completed Apr 20, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

StableLM Alpha Testing Issue #1

StableLM Alpha Testing Issue #1

MarkSchmidty commented Apr 20, 2023 •

edited

Loading

MarkSchmidty commented Apr 20, 2023

lhl commented Apr 20, 2023

StableLM Alpha Testing Issue #1

StableLM Alpha Testing Issue #1

Comments

MarkSchmidty commented Apr 20, 2023 • edited Loading

MarkSchmidty commented Apr 20, 2023

lhl commented Apr 20, 2023

MarkSchmidty commented Apr 20, 2023 •

edited

Loading