-
Notifications
You must be signed in to change notification settings - Fork 1
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
rui/ds1000 #44
base: main
Are you sure you want to change the base?
rui/ds1000 #44
Conversation
@rays1024 Any updates on this? btw, the title for your PR is kind of confusing. If you are working on two different things (length calculator and ds-1000 dataset), you should have two separate branches and two separate PRs accordingly. |
Sorry I have been pretty busy with projects and assignments, but I will work on the issue and try to have an update by Friday. I'll also make a new PR for length_calculator as well. Thanks for the reminder! |
I see, thanks for the update. Let me know if anything changes. |
…t llama-based model uses empty string as tokenizer_eos_token
update branch to latest main branch
Updating this pull request's name to "DS1000 dataset" to avoid confusion. Updates on length_calculator will be made on another branch/PR |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Haven't tested the actual functionality but the executors and datasets look good in general.
Made some comments, can you fix the raised issue?
Thanks!
DS_1000/ds1000.jsonl
Outdated
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
please put this file in the data
directory
DS_1000/README.md
Outdated
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
We don't want DS-1000's license and readme files in our repo
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I'll delete the license, but the readme file is the instructions for evaluation and creating jsonl file.
DS_1000/ds1000.py
Outdated
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Is this part of the testing harness for DS-1000?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Yes, this script with the zip file are needed for evaluation. The DS-1000 evaluation requires having the original data so that the problems could be loaded into a DS1000Problem class. This class is then used for evaluation.
DS_1000/ds1000_data.zip
Outdated
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
zip files should not be checked in (i.e., committed to online repo) unless there is a strong reason in doing so
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I can work on removing the need for the original problem data, but these data in the zip file are indeed needed for evaluation.
DS_1000/preprocess.py
Outdated
for key in keys: | ||
new_dict[key] = dictionary[key] | ||
|
||
with open("ds1000.jsonl", 'a') as file: |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Why is this appending lines (i.e., using open mode a
)?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Each iteration of the for loop will append one dictionary as a new line to the jsonl file, which is why I used 'a' here. If other ways are commonly used, I can definitely change that!
length_calculator.py
Outdated
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
We should probably put this file in analysis
or utils
or preprocessing
…to rui/ds-1000 Merging with Yilun's branch to run evaluation on DS1000
Using jsonargparse==4.15.0 would resolve the previous problems. Still working on fixing the directory problem when evaluating 208 or more problems. |
Error with torch tensor occurred when evaluating incoder-1b, same as this discussion. I added incoder-1b to the if statement at line 159 of seq2seq_model.py to fix this issue |
fixed result saving issue by adding a safe executing wrapper in the DS1000Executor class. The wrapper uses execute in safe_execution_util as a template. |
No description provided.