You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
It would be convenient if OptimizationState included a searchdirection and/or a step width. I can usually pull these quantities out of the original state, but it would be nice to have them in a well-defined location.
For simple gradient descent, the search direction would be the negative gradient, but e.g. for LBFGS it would be the direction obtained from the internal quasi-Hessian. Knowing the search direction allows to really drill down into the linesearch, see, e.g., the plots in the README of GRAPELinesearchAnalysis.jl. In principle, I can infer the search direction as the difference in u from the previous iteration, but then I don't know the step width (how far the optimizer went in the direction of the search direction). Knowing both quantities directly from the solver is useful for actually checking that the update is in the search direction, but if that's guaranteed, only one of them would be sufficient.
The text was updated successfully, but these errors were encountered:
I can usually pull these quantities out of the original state
Well, I thought I could, but it turns out the state is woefully inadequate, at least for some optimizers. I'm dumping state in the callback in an optimization with NLopt.LD_LBFGS() and seeing this:
It would be convenient if OptimizationState included a
searchdirection
and/or a step width. I can usually pull these quantities out of theoriginal
state, but it would be nice to have them in a well-defined location.For simple gradient descent, the search direction would be the negative gradient, but e.g. for LBFGS it would be the direction obtained from the internal quasi-Hessian. Knowing the search direction allows to really drill down into the linesearch, see, e.g., the plots in the README of
GRAPELinesearchAnalysis.jl
. In principle, I can infer the search direction as the difference inu
from the previous iteration, but then I don't know the step width (how far the optimizer went in the direction of the search direction). Knowing both quantities directly from the solver is useful for actually checking that the update is in the search direction, but if that's guaranteed, only one of them would be sufficient.The text was updated successfully, but these errors were encountered: