
Merging TuringBenchmarking into DynamicPPL #715

Open

yebai opened this issue Nov 11, 2024 · 4 comments

@yebai
Member

yebai commented Nov 11, 2024

DynamicPPL's tools for profiling and debugging models are growing.

TuringBenchmarking is very lightweight, and its scope overlaps with the growing utilities inside DynamicPPL, so it makes sense to consider merging the two. This benchmarking code would also receive better maintenance under DynamicPPL.

Extra dependencies such as ReverseDiff, Zygote, and PrettyTables can be removed or managed via weak dependencies (package extensions).
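
For illustration, a minimal sketch of what the weak-dependency route could look like in DynamicPPL's Project.toml (package names taken from the comment above; the UUIDs and extension module names below are placeholders, not the registered values):

```toml
# Hypothetical Project.toml excerpt: heavy optional packages become weak deps,
# and the corresponding extension code only loads when the user imports them.
[weakdeps]
ReverseDiff = "00000000-0000-0000-0000-000000000001"   # placeholder UUID
Zygote = "00000000-0000-0000-0000-000000000002"        # placeholder UUID
PrettyTables = "00000000-0000-0000-0000-000000000003"  # placeholder UUID

[extensions]
DynamicPPLReverseDiffExt = "ReverseDiff"
DynamicPPLZygoteExt = "Zygote"
DynamicPPLPrettyTablesExt = "PrettyTables"
```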

@penelopeysm
Member

penelopeysm commented Nov 11, 2024

I had an unsubstantiated opinion on Slack that this should be a separate repo. The main reason is to avoid the DPPL repo itself becoming a home for all sorts of things: while we might not have much 'utility' code right now, it can grow in the future, and it's harder to remove code than to add it.

I can see the point about maintainability being harder – for example having to keep the repo up to date with DPPL version releases. One way to keep us on track with this is the docs repo – if we have a doc page explaining these debugging / profiling tools (which we really should have!), and DPPL is bumped in the docs environment, then it will force us to keep the profiling repo up to date as well.

(the docs repo is actually doing a good job of forcing us to iron out stuff like Bijectors right now!)

@torfjelde
Member

Also agree with @penelopeysm here 👍

@yebai
Member Author

yebai commented Nov 12, 2024

We don't generally consider the amount of code when deciding whether to create a new repo or package. As long as the code is organised and readable, it is okay.

In reality, IIRC, the TuringBenchmarking repo was created from some small utilities that @torfjelde used in his personal workflows. Then, it was rushed into a package for a winter school so that people could use it following Julia's Pkg workflow.

The code in TuringBenchmarking should have been hosted in DynamicPPL from the beginning, in retrospect. Thus, this issue.

EDIT: My primary criteria when deciding on new packages are isolating complexity and encouraging reusability and stability. For example, developing and debugging MCMC samplers is often very challenging, so we decided to isolate them from the extra complexity of DynamicPPL.

@torfjelde
Member

> Then, it was rushed into a package for a winter school so that people could use it following Julia's Pkg workflow.

This is not strictly true. It was somewhat rushed into having its first release, but it was a module from the get-go, made so because I thought the initial approach I had been taking, i.e. putting benchmarking in benchmarks/ in DynamicPPL, wasn't a good idea. This is of course separate from adding a separate DynamicPPL.BenchmarkingTools submodule or whatever, but I just wanted to clarify the original motivation behind TuringBenchmarking.jl 👍

> The code in TuringBenchmarking should have been hosted in DynamicPPL from the beginning, in retrospect. Thus, this issue.

I guess the question is exactly how this is to be done? Do you mean to make a new submodule of DPPL that contains benchmarking tools, e.g. DynamicPPL.BenchmarkTools?

My original motivation for making the benchmarking a separate package (and also why I'm still of the opinion that this is the way to go) is:

  1. It allows us to add dependencies much more freely. For example, having several of the default AD backends loaded already means the user can just pass adbackend=[...] instead of writing all the using statements themselves (see the sketch after this list).
  2. We can add more extensive benchmarking than just logdensity computations, e.g. benchmarking a model with a SamplingContext containing a sampler from Turing.jl; @mhauru and I also briefly discussed the idea of maybe adding benchmarking of AbstractMCMC.step for samplers too.
  3. TuringBenchmarking.jl currently also holds some code for running benchmarks on Stan models, which is used for comparison against other PPLs. We can't add this to DPPL.
  4. Different release schedules.
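
As a rough illustration of point 1, here is a sketch of the kind of usage TuringBenchmarking.jl enables. The function and keyword names (benchmark_model, adbackends) and the symbol-style backend specifiers are assumptions based on the package's README and may differ between versions; the model is just a toy example.

```julia
# Sketch only: `benchmark_model` and `adbackends` are assumed names and may
# differ across TuringBenchmarking.jl versions; newer versions may expect
# ADTypes objects (e.g. AutoForwardDiff()) instead of symbols.
using Turing, TuringBenchmarking  # no `using ForwardDiff`, `using ReverseDiff`, etc. needed

@model function demo(x)
    μ ~ Normal(0, 1)
    x ~ Normal(μ, 1)
end

model = demo(1.5)

# The user only selects the backends; TuringBenchmarking already carries them
# as hard dependencies, so nothing extra has to be loaded.
results = benchmark_model(model; adbackends = [:forwarddiff, :reversediff])
```

The design point being made is that the package can afford heavyweight hard dependencies (the AD backends) precisely because it is separate from DynamicPPL.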
