Request in title. Love your work, keep it up!! :)
Hi @Belzedar94, thanks for your interest! Yes, we do plan to have a live leaderboard for ScienceAgentBench. Please stay tuned!
Merge pull request #2 from btyu/docker
12eeaf0
Merge code for dockerized evaluation
Looking forward to it!! Also curious to see how the new o1 model performs on it :)