-
Notifications
You must be signed in to change notification settings - Fork 52
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[QA] InternEvo能否load预训练llama2的参数 #268
Comments
|
@zigzagcai 帮忙看看 |
支持的
|
我可以复现这个报错,已经在这个PR修复:#276 这个bug的原因在于在早期版本的InternEvo LLaMA实现中,ffn w2和w3的层与meta发布的LLaMA反了。在后来与meta LLaMA对齐之后,load func没有同步更新导致。 |
描述问题
InternEvo能否load预训练llama2的参数,再继续预训练,用hf的格式还是原始的格式
The text was updated successfully, but these errors were encountered: