You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
It would be nice to have the possibility to resume the training from a checkpoint. For on-policy algorithm we can just load the model and restart the optimization; for an off-policy method we can choose to restart with the old buffer, if it has been saved, or to restart while collecting a bunch of trajectories with the trained policy and restart from there.
This should be a global utility.
The text was updated successfully, but these errors were encountered:
It would be nice to have the possibility to resume the training from a checkpoint. For on-policy algorithm we can just load the model and restart the optimization; for an off-policy method we can choose to restart with the old buffer, if it has been saved, or to restart while collecting a bunch of trajectories with the trained policy and restart from there.
This should be a global utility.
The text was updated successfully, but these errors were encountered: