-
Notifications
You must be signed in to change notification settings - Fork 52
Issues: InternLM/InternEvo
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
[QA] 训练到一半Nan grad norm occurs, please check it
question
Further information is requested
#375
opened Nov 24, 2024 by
wang-benqiang
[QA] Does internEvo support loongtrain selective checkpoint++?
question
Further information is requested
#372
opened Nov 20, 2024 by
wplf
[QA] 如何进行单卡微调的,需要调整那些设置
question
Further information is requested
#354
opened Oct 25, 2024 by
OkGuai
[QA] loong train 支持packed_sample_into_one=false吗
question
Further information is requested
#346
opened Sep 27, 2024 by
Lzhang-hub
[Feature] MoE模型里稠密层和专家层zero和并行的解耦
enhancement
New feature or request
#325
opened Sep 11, 2024 by
sunpengsdu
1 task
[Feature] 不使用memory pool
enhancement
New feature or request
#324
opened Sep 11, 2024 by
sunpengsdu
1 task
[QA] check import system var at the start of training
question
Further information is requested
#300
opened Aug 17, 2024 by
sunpengsdu
[Feature] how to finetuning lora
enhancement
New feature or request
#286
opened Jul 26, 2024 by
wen020
1 task
[Bug] RuntimeError: [3] is setting up NCCL communicator and retrieving ncclUniqueId from [0] via c10d key-value store by key '0', but store->get('0') got error: Socket Timeout
bug
Something isn't working
#285
opened Jul 25, 2024 by
kkscilife
[Bug] 仅支持了GShard模式的MoE模型转huggingface
bug
Something isn't working
#279
opened Jul 16, 2024 by
Cerberous
[QA] InternEvo能否load预训练llama2的参数
question
Further information is requested
#268
opened Jul 2, 2024 by
JunZhan2000
[QA] Internevo是否支持tied_embedding?
question
Further information is requested
#267
opened Jul 1, 2024 by
Cerberous
[Bug] 使用internevo训练,转换成hf模型用opencompass测试时候有一定概率会nan
bug
Something isn't working
#266
opened Jul 1, 2024 by
Cerberous
[Bug] 好像没有把internevo的MoE权重转换成huggingface版本的脚本?
bug
Something isn't working
#262
opened Jun 27, 2024 by
Cerberous
[QA] Internevo这个框架里面MoE支持expert parallel嘛?
question
Further information is requested
#253
opened Jun 18, 2024 by
Cerberous
[QA] 用Internevo已经训练出来了一个7B模型,如何用这个internevo权重跑MoE?
question
Further information is requested
#251
opened Jun 17, 2024 by
Cerberous
[Bug] AssertionError: Only flash cross entropy support parallel_output
bug
Something isn't working
#245
opened Jun 6, 2024 by
wen020
Add tool for data cleaning
question
Further information is requested
#241
opened May 31, 2024 by
www516717402
[QA] 关于使用张量并行或流水线并行的模型切分与合并问题
question
Further information is requested
#176
opened Apr 2, 2024 by
BaiBlanc
[Feature] Should we remove other dependency of flashattention?
enhancement
New feature or request
#164
opened Apr 1, 2024 by
sunpengsdu
1 task
Previous Next
ProTip!
Type g i on any issue or pull request to go back to the issue listing page.