-
Notifications
You must be signed in to change notification settings - Fork 27
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Performance of StarCoder on HumanEvalFixDocs #21
Comments
A few things are different in the command we ran: We use Overall, yeah the commit format on the pretrained StarCoder works really well. On the regular HumanEvalFix, StarCoder + Commit Format also outperforms OctoCoder, see the below Table from Appendix G. The problem of the commit format is that it does not work well for code synthesis or explanation. |
This is helpful. Thanks! I feel this deserves a mention in Table-2 itself then :) Could you also share the script you use to obtain https://huggingface.co/datasets/bigcode/evaluation/blob/main/starcoder/humanevalfixdocs/commit_format/evaluation_humanevalfixdocspy_starcoder_temp02.json? Thanks! |
Sure it would be:
|
With this script, I observe a pass@1 score of 60.1.
|
CC: @Muennighoff |
You're right, it seems the result in the paper is too low. I reran it & got the below:
I will update the paper soon. Thanks a lot for noting this! |
Thanks @Muennighoff :) |
Thanks @Muennighoff, this is very helpful! :) |
With StarCoder, I am observing a pass@1 score of 58.9 instead of 43.5 as reported in the OctoCoder paper.
Script used:
Results:
CC: @Muennighoff
The text was updated successfully, but these errors were encountered: