-
Notifications
You must be signed in to change notification settings - Fork 386
Issues: modelscope/ms-swift
Fine-tuning best practices for qwen2.5-72b-instruct and qwen2...
#2064
opened Sep 18, 2024 by
Jintao-Huang
Open
19
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
华为910B上推理Qwen2.5报错RuntimeError: call aclnnIndex failed, detail:EZ9999: Inner Error!
#2517
opened Nov 27, 2024 by
guihonghao
Error occurs in lazy tokenize: unsupported operand type(s) for /: 'list' and 'int'
#2512
opened Nov 26, 2024 by
gaussiangit
Does it support fine-tuning the PaliGemma model for object detection and segmentation?
#2508
opened Nov 26, 2024 by
WangRongsheng
lora 微调的模型使用--resume_from_checkpoint参数,继续训练报显存不足;不使用--resume_from_checkpoint参数可以正常训练
#2505
opened Nov 26, 2024 by
xyz515
swift 部署的模型,调用的时候支持。client.beta.chat.completions.parse 这个方法吗
#2504
opened Nov 26, 2024 by
caiji2019-cai
[WARNING:swift] Current length of row(2210) is larger than the max_length(2048), deleted.
#2501
opened Nov 25, 2024 by
ep0p
A Problem with parameters --sequence_parallel_size with torch==2.5.1
#2493
opened Nov 23, 2024 by
Gary-code
readme里面多节点训练的例子好像有问题,master节点的master_addr设置成127.0.0.1会报错,设置成IP就没问题
#2485
opened Nov 21, 2024 by
tppppppppp
Previous Next
ProTip!
What’s not been updated in a month: updated:<2024-10-27.