Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Support complex composed models by preventing name collisions #252

Closed
seabbs opened this issue Jun 3, 2024 · 11 comments · Fixed by #296
Closed

Support complex composed models by preventing name collisions #252

seabbs opened this issue Jun 3, 2024 · 11 comments · Fixed by #296
Assignees
Labels

Comments

@seabbs
Copy link
Collaborator

seabbs commented Jun 3, 2024

At the moment no matter what we do you can end up with situations where models can have name conclusions (i.e when they use a lot of latent models.

What should we do about this? #249 introduces a partial solution for observation models (that uses a prefix based on the named assigned by the user - partial as the uniqueness of this name is not enforced).

We also see this problem more generally (for example if ascertainment models are stacked and if multiple latent models are combined)

We have a few options as I see it.

  • Enforce prefix for all submodel calls so that a parameter name becomes dependent on its call stack
  • Introduce the pattern that all calls to a latent model should have a clear (and documented prefix) - potentially as in the above solution based on the type of the dispatching struct.

The latter seems preferable to me as variables names will be shorter and easier to reason on. There may also be other options.

@seabbs seabbs added the EpiAware label Jun 3, 2024
@SamuelBrand1
Copy link
Collaborator

SamuelBrand1 commented Jun 4, 2024

The latter seems preferable to me as variables names will be shorter and easier to reason on. There may also be other options.

Enforce prefix for all submodel calls so that a parameter name becomes dependent on its call stack

I agree here.

@seabbs
Copy link
Collaborator Author

seabbs commented Jun 4, 2024

The problem with this proposed approach is what if you have the following:

Acercertainment(Ascertainment(...))

Here both would have the same prefix and so collide. you could argue that this is not a pattern we want to support and they should rather write

Ascertainment(model, CombineLatentModels(Ascr1, ascr2))

This would make sense but would be hard to steer people towards potentially, may have implications for hierarchical modelling, and would mean that utility helper functions for constructing common ascertainment models etc couldn't be used in these settings (without modification).

@SamuelBrand1
Copy link
Collaborator

Wouldn't the approach we are thinking off parse the top example as having a prefix like asc_asc... which is obviously ugly but distinguishable?

@seabbs
Copy link
Collaborator Author

seabbs commented Jun 4, 2024

Wouldn't the approach we are thinking off parse the top example as having a prefix like asc_asc... which is obviously ugly but distinguishable?

So not if we made the prefix only on the latent model vs both the latent and obs model (which is what I was suggesting). If it applies to all models then at that point you might as well have everything prefix because you will have very inconsistent behaviour depending on which structs you use.

@SamuelBrand1
Copy link
Collaborator

In my head I was agreeing to the call stack suggestion... and C&Ped the wrong bit. Is the main downside that its really ugly?

@seabbs
Copy link
Collaborator Author

seabbs commented Jun 4, 2024

In my head I was agreeing to the call stack suggestion... and C&Ped the wrong bit. Is the main downside that its really ugly?

Yes really ugly and really tedious to navigate over for people manipulating the posterior output. It also makes it very hard to write code for multiple models.

@seabbs
Copy link
Collaborator Author

seabbs commented Jun 11, 2024

Like CombineLatentModels, ConcatLatentModels (see #270) also has this problem.

@seabbs
Copy link
Collaborator Author

seabbs commented Jun 12, 2024

Another option we could take here that would be less automated but maybe more informative is make defining the prefix part of the struct. This could look something like:

Ascertainment(NegativeBinomialError(), RW(), prefix = "IFR")

This could be nice as it lets the user set very clear names for things but it could also maybe cause issues as the models will error a few times (due to name collisions) and the user will need to know what to do (which maybe isn't the most fun way. to interact with a package). Some of this could be mitigated if we make a default prefix for these complex operators?

@SamuelBrand1
Copy link
Collaborator

SamuelBrand1 commented Jun 12, 2024

Yes! Instant reaction is that this is the best idea.

I think the kind of users who'd want to construct a sufficiently complex model that naming collisions come into play, probably have their own namespace ideas for the model components.

This feels like a natural solution. And the defaults should be simple to select?

@seabbs
Copy link
Collaborator Author

seabbs commented Jun 12, 2024

I like that it is 1. Simple and 2. Should be adaptable to all the issues we have identified above with automated strategies.

@seabbs seabbs self-assigned this Jun 13, 2024
@seabbs
Copy link
Collaborator Author

seabbs commented Jun 13, 2024

I've started looking at implementing this here: https://github.com/CDCgov/Rt-without-renewal/blob/issue252/EpiAware/src/EpiLatentModels/manipulators/CombineLatentModels.jl

One uncertainty is if we should let the user specify the prefix prefix (i.e here Combine) or not.

A slightly annoying issue is you can't have an empty prefix ("" will add "." on everything) so this can't be turned off as a feature.

Otherwise work to do here is:

  • Fix ConcatLatentModels constructor
  • Add prefixes for Ascertainment
  • Check where else prefixes may be needed.

Something this really does highlight is how much the named tuple returns of everything don't work this approach (as the returns now have a different name to the variables in the model).

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

Successfully merging a pull request may close this issue.

2 participants