minicpm-v full-parameter fine-tuning error #677
Comments
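My guess is that a problem in the encoding of the visual part causes one dimension to go missing. Since I am not familiar with the training logic of the swift framework, I reimplemented full-parameter fine-tuning of minicpm-v on top of the huggingface trainer + deepspeed, and it runs successfully in my environment. Reference MR: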
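I hit a similar error when fine-tuning cogvlm. It happened to occur in a fairly small validation loop, so I was able to track down the offending data. The attachment reproduces the error. Settings: Error: Traceback (most recent call last):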
I can reproduce it. I'll fix it.
So does full-parameter fine-tuning work properly now?
Describe the bug
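The error occurs after a few training steps. The same dataset trains qwen-vl-chat without problems. The training launch command is as follows: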
Your hardware and system info
Additional context