-
-
Notifications
You must be signed in to change notification settings - Fork 22
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
How to handle long compile times? #153
Comments
One thing we could try is adding |
Yeah, I guess that is the simplest option. Won't it hurt the SciML μs-hunting cases? |
maybe @ChrisRackauckas has some advice here |
Make a flamegraph of the compile times on a standard case to see what actually matters. See for example SciML/DifferentialEquations.jl#786 (comment) . That will tell you what is taking all of the compile time and should be focused on. Removing specialization doesn't necessarily make compilation faster. Sometimes, the fact that it can remove inference at a level can actually make it worse, so check and see if it's inference time or llvm time.
That's fine. If we need something separate we can do SimpleOptimization.jl. We already have SimpleDiffEq.jl and SimpleNonlienarSolve.jl which are specailized implementation for things like GPU kernels and ultra-small problems on the same interface. If we need a SimpleOptimization.jl for a SimpleGradientDecent and SimpleAdam then I wouldn't be too surprised. So if you go and make Optimisers the best thing for neural networks we won't mind and we can just make sure Optimization.jl has multiple options with the right trade-offs. |
Is there some known remedy for explosive compile times when updating deeply nested models?
I did this to prevent spurious stalls with NaiveGAflux, but maybe there is a better way.
The text was updated successfully, but these errors were encountered: