Training Log 2022-06-23 #39
zh-zheng
announced in
Training Logs 训练日志
-
That is not a good explanation. Has the model reached its bottleneck in this setting? Would it be better to use a lower learning rate? Is the model becoming unstable? When this happens, how can we test the hypothesis? @zh-zheng
-
CPM-Live Training Log (June 23)
Time: June 23, 2022, 16:00
Recorder: @zh-zheng
Loss
Completed Data
Average Grad Norm
Progress
Comment
Well, the loss is still increasing... If you don't know what happened, see yesterday's log. It seems that our model is a little confused by the new data. Looking at the loss curve, when do you think the loss will start to decrease? 🤔