Release BigCodeBench v0.1.7
Fixes:
- The ground-truth pass rate was previously computed incorrectly.
- Passing RAM limits raised errors because the values were set as floats.
- User permissions were not set up correctly in the Evaluate Docker.
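
For illustration, the float RAM-limit issue above can be avoided by converting the limit to a whole number of bytes before it is applied. This is a minimal sketch, not the BigCodeBench code: the helper name `limit_to_bytes` and the choice of MiB as the input unit are assumptions made for this example.

```python
import math

MIB = 1024 * 1024  # bytes per mebibyte


def limit_to_bytes(limit_mib):
    """Convert a RAM limit in MiB (int or float) to an integer byte count.

    Hypothetical helper for illustration; the name and the MiB unit are
    assumptions, not taken from BigCodeBench.
    """
    if not math.isfinite(limit_mib) or limit_mib < 0:
        raise ValueError(f"invalid RAM limit: {limit_mib!r}")
    # int() truncates any fractional byte, so the result is always an int.
    return int(limit_mib * MIB)
```

The integer result is then suitable for APIs that are documented to take integer limits, such as Python's `resource.setrlimit`.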
Features:
- --check-gt-only now prints the pass rate when it finishes.