Add test cases, disable flatness check in HZ #180

timholy · 2024-10-22T13:30:28Z

This may cause problems of its own, but for the time being it's better
than the status quo.

Fixes #173
Closes #174
Fixes #175

I am pretty sure that without the flatness check, I've seen HZ iterate until the `linesearchmax` limit is hit, which is rather inefficient. The test case added here comes nowhere close to hitting that limit, though. So until we have a better test case, we should probably disable the flatness check.

Note: this PR should probably not be squash-merged; the three commits are crafted to be independent.

The flatness check in HagerZhang was intended to prevent excessive iteration in cases where the objective function is flat to within numerical precision. This test is a poor attempt at capturing the issue. HZ takes more iterations than any other linesearch algorithm, but it's still a far cry from real-world cases I've seen where the linesearch exits from the iteration limit.
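One way to picture "flat to within numerical precision", as a minimal sketch rather than the check HZ actually implements: treat two objective values as indistinguishable when they differ by no more than a few units of floating-point spacing. The helper name `isflat`, the `factor` keyword, and the toy objective are all assumptions made for illustration.

```julia
# Hypothetical helper (not part of LineSearches.jl): are two objective values
# indistinguishable at the working precision?
isflat(ϕa::T, ϕb::T; factor = 2) where {T<:AbstractFloat} =
    abs(ϕa - ϕb) <= factor * eps(max(abs(ϕa), abs(ϕb)))

# Toy objective whose variation along the search direction is below roundoff.
ϕ(α) = 1.0 + 1e-20 * α

isflat(ϕ(0.0), ϕ(1.0))    # true: the 1e-20 term is lost when added to 1.0
isflat(1.0, 1.0 + 1e-8)   # false: this difference is far above eps(1.0)
```

In a region like this, a line search that keeps looking for a measurable decrease has nothing to find, which is the situation the flatness check was meant to cut short.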

timholy · 2024-10-22T13:41:11Z

Aha, it looks like CI caught a case where the new flatness test (without the flatness check) spent 32 iterations in a flat basin. That's a little closer to the behavior I've seen.

I'll leave this open and wait for some feedback.

timholy · 2024-10-22T13:46:18Z

Oh interesting, the StrongWolfe algorithm is affected too. With this diff:

```diff
diff --git a/test/issues.jl b/test/issues.jl
index 7b40677..b37a851 100644
--- a/test/issues.jl
+++ b/test/issues.jl
@@ -57,14 +57,15 @@ end
                BackTracking(; cache=cache), BackTracking(; order=2, cache=cache) )

     n = 0
-    while n < 10
+    while n < 1000
         ϕ, dϕ, ϕdϕ = makeϕdϕ(randn(2))
         ϕ0, dϕ0 = ϕdϕ(0)
         dϕ0 < -eps(abs(ϕ0)) || continue    # any "slope" is just roundoff error, but we want roundoff that looks like descent
         n += 1
         for ls in lsalgs
             res = ls(ϕ, dϕ, ϕdϕ, 1.0, ϕ(0.0), dϕ(0.0))
-            @test length(cache.alphas) < 10   # really should be < 5
+            length(cache.alphas) >= 10 && println(typeof(ls), ": ", length(cache.alphas))
+            # @test length(cache.alphas) < 10   # really should be < 5
         end
     end
 end
```
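The filter `dϕ0 < -eps(abs(ϕ0)) || continue` keeps only trials whose initial slope is negative and exceeds, in magnitude, the floating-point spacing at `ϕ0`. A minimal illustration with made-up numbers follows; the helper name `looks_like_descent` is hypothetical, and `makeϕdϕ` from the test file is not reproduced here.

```julia
# Hypothetical helper mirroring the filter used in the test loop.
looks_like_descent(ϕ0, dϕ0) = dϕ0 < -eps(abs(ϕ0))

ϕ0 = 1000.0                      # eps(1000.0) is 2.0^-43, roughly 1.1e-13

looks_like_descent(ϕ0, -1e-12)   # true: negative and larger than the spacing at ϕ0
looks_like_descent(ϕ0, -1e-14)   # false: smaller in magnitude than eps(abs(ϕ0))
looks_like_descent(ϕ0,  1e-12)   # false: positive slope, so not a descent direction
```

Because `eps` scales with the magnitude of its argument, the threshold tightens or loosens automatically with the size of `ϕ0`.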

I get the following printout:

HagerZhang{Float64, Base.RefValue{Bool}}: 10
StrongWolfe{Float64}: 32
HagerZhang{Float64, Base.RefValue{Bool}}: 17
StrongWolfe{Float64}: 32
StrongWolfe{Float64}: 32
HagerZhang{Float64, Base.RefValue{Bool}}: 40
HagerZhang{Float64, Base.RefValue{Bool}}: 15
HagerZhang{Float64, Base.RefValue{Bool}}: 13
HagerZhang{Float64, Base.RefValue{Bool}}: 16
HagerZhang{Float64, Base.RefValue{Bool}}: 33
StrongWolfe{Float64}: 32
HagerZhang{Float64, Base.RefValue{Bool}}: 11
HagerZhang{Float64, Base.RefValue{Bool}}: 35
HagerZhang{Float64, Base.RefValue{Bool}}: 16
HagerZhang{Float64, Base.RefValue{Bool}}: 13
StrongWolfe{Float64}: 32
HagerZhang{Float64, Base.RefValue{Bool}}: 35
HagerZhang{Float64, Base.RefValue{Bool}}: 10
StrongWolfe{Float64}: 32
HagerZhang{Float64, Base.RefValue{Bool}}: 11
HagerZhang{Float64, Base.RefValue{Bool}}: 12
StrongWolfe{Float64}: 32
HagerZhang{Float64, Base.RefValue{Bool}}: 11
HagerZhang{Float64, Base.RefValue{Bool}}: 18
HagerZhang{Float64, Base.RefValue{Bool}}: 15
HagerZhang{Float64, Base.RefValue{Bool}}: 14
StrongWolfe{Float64}: 32
HagerZhang{Float64, Base.RefValue{Bool}}: 14
StrongWolfe{Float64}: 32
BackTracking{Float64, Int64}: 11
BackTracking{Float64, Int64}: 12
StrongWolfe{Float64}: 32
HagerZhang{Float64, Base.RefValue{Bool}}: 11
HagerZhang{Float64, Base.RefValue{Bool}}: 19
HagerZhang{Float64, Base.RefValue{Bool}}: 10
HagerZhang{Float64, Base.RefValue{Bool}}: 12
HagerZhang{Float64, Base.RefValue{Bool}}: 10
HagerZhang{Float64, Base.RefValue{Bool}}: 14
StrongWolfe{Float64}: 32
HagerZhang{Float64, Base.RefValue{Bool}}: 40
HagerZhang{Float64, Base.RefValue{Bool}}: 20
HagerZhang{Float64, Base.RefValue{Bool}}: 14
HagerZhang{Float64, Base.RefValue{Bool}}: 16
HagerZhang{Float64, Base.RefValue{Bool}}: 11
HagerZhang{Float64, Base.RefValue{Bool}}: 11
HagerZhang{Float64, Base.RefValue{Bool}}: 23
HagerZhang{Float64, Base.RefValue{Bool}}: 10
HagerZhang{Float64, Base.RefValue{Bool}}: 10
HagerZhang{Float64, Base.RefValue{Bool}}: 15
HagerZhang{Float64, Base.RefValue{Bool}}: 16
StrongWolfe{Float64}: 11

timholy · 2024-10-22T14:06:19Z

In trying this locally, I got this many passes (out of 1000 attempts):

HZ: 963
StrongWolfe: 988
MoreThuente: 1000
BackTracking (both variants): 999

timholy · 2024-10-22T14:18:46Z

OK, this was more successful in capturing the purpose of the flatness check than I initially thought. It's probably best to hold off on merging this until there is an actual fix.

This follows the notation in the paper and is much more readable. It also introduces an external parameter, `epsilonk`, that "solves" the flatness problem (though it requires external specification).

ChrisRackauckas · 2024-10-29T09:43:03Z

Are there any ideas from here that @avikpal should consider for the newer LineSearch.jl?

devmotion · 2024-12-03T20:05:53Z

What's the status of this PR? I ran into #157 and it seems that issue would be fixed by the rewrite in this PR.

timholy added 3 commits October 22, 2024 06:31

Add failing test cases for flatness check

ba7217c

Disable the flatness check in HZ

c043db6

This may cause problems of its own, but for the time being it's better than the status quo. Fixes #173 Closes #174 Fixes #175

timholy force-pushed the teh/flatness branch from 045e126 to c043db6 on October 22, 2024 13:36

timholy added 2 commits October 22, 2024 08:55

More exhaustive

2c98734

Try this

c5043bb

timholy added 2 commits October 22, 2024 09:08

debug

a278334

More lenient

15be1b1

timholy marked this pull request as draft October 22, 2024 14:18

timholy mentioned this pull request Oct 24, 2024

Issues affecting changes in line search methods JuliaNLSolvers/Optim.jl#1104

timholy added 2 commits October 24, 2024 08:36

WIP: new HZ algorithm

875b2ca

This follows the notation in the paper and is much more readable. It also introduces an external parameter, `epsilonk`, that "solves" the flatness problem (though it requires external specification).

add missing eltype definitions

92c46a6
