-
Notifications
You must be signed in to change notification settings - Fork 27
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Megatron-LM fine-tuning: No such file or directory model_optim_rng.pt #14
Comments
Hmm do you have
|
i run now ,but i did not how to merge many chekpoint and convert to hugging face format, can you help me? |
This is the script for merging & converting: octopack/training/convert_large.sh Line 4 in 4f0e261
Let me know if it does not work for you |
yes, but i should git clone which branch , is this bigcode-project/Megatron-LM#40? , when i use mft branch , it did not run. |
Yeah that one; It is already merged into main so you can probably also use the main branch |
thx |
i have a question, i find that humaneval python@1 value reduced a lot after fintune. |
Yeah that's why we only fine-tune for few steps, e.g. OctoCoder is only fine-tuned for 2M tokens. |
|
What script are you using to evaluate it? That may explain the small difference. |
First,I convert this checkpoint to hf format, then using greedy decoding to evaluate. |
Yeah that looks correct. I think for pass@1 HumanEval StarCoder is evaluated using temperature=0.2. Also I would set n_samples=20. |
You have to shard it into multiple files when saving it |
How to shard it into multiple files? Is there any code to refer to? |
i want use Megatron-LM fine-tuning, but i run the process, get. a error : No such file or directory model_optim_rng.pt model_optim_rng.pt
The text was updated successfully, but these errors were encountered: