Explicit model specification #68
Conversation
This is really great.

I do think we may as well standardise the inputs for the models, i.e. `n_blocks` instead of `n_affine`, `n_spline`, `n_additive`, and also replace `z2_equivar_spline` with `z2_equivar`.

This does mean that the models `affine_spline` and `spline_affine` would no longer be particularly useful, and I'm sure this is why you didn't already do this, but personally I don't mind that one bit, since the new scheme is a massive improvement.

Let's discuss this on Slack.
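To illustrate the proposal, here is a minimal sketch of what the standardised scheme might look like. The key names `n_blocks` and `z2_equivar` come from the comment above; everything else (the dict layout, the `layer` key) is a hypothetical illustration, not the project's actual format:

```python
# Hypothetical sketch: every coupling-layer type takes the same
# standardised keys, instead of per-model names like
# n_affine / n_spline / n_additive.
affine_spec = {"layer": "affine", "n_blocks": 4, "z2_equivar": True}
spline_spec = {"layer": "spline", "n_blocks": 2, "z2_equivar": True}

# Mapping an old-style key onto the shared name:
legacy = {"n_affine": 4}
standardised = {"n_blocks": legacy["n_affine"]}
```

The point is that once all layer types accept the same keys, the layers themselves compose freely and preset combinations like `affine_spline` become unnecessary.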
Force-pushed from 1b487f0 to 21c0af3
I removed the preset models, slightly changed the naming conventions, and then standardised the layer inputs; I think this makes the specification of flows flexible. It makes specifying the "standard" flow a bit more verbose, but I think it's worth it.

It might be clearer if we made `sequential` go into `layers` and then renamed `core` to `feed_forward_network`, keeping it as a single-class module. We don't need to decide yet.

I changed the equivar bit of RQS because it looked like it was using an undefined variable; can you check that I changed it to the correct variable? Also, I changed the comments because I thought we were trying to enforce C(-v) == C(v), not anti-symmetry, but maybe I'm missing something here?
Force-pushed from 21c0af3 to 76cb8ea ("…for much more flexibility")
This is ready for review. By the way, should we add an epsilon to our batch norm? If the standard deviation of the input is really small, this can be unstable.
Yep, good spot.
Ah, the typo was in the affine layers, thanks for spotting. We do want the coupling layers to be odd, but exp(-s) needs to be even, hence replacing s -> |s|.
Yeah, you're right. I removed it when I was getting rid of all the unused rubbish, since if we've got a tiny standard deviation there's probably something else that's gone wrong, but it's probably sensible to keep it.
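For reference, the usual fix is a small constant added to the denominator so a near-constant batch cannot divide by zero. A minimal sketch (the function name and epsilon value are assumptions, not the project's actual implementation):

```python
import numpy as np

def batch_norm(x, eps=1e-5):
    # Dividing by (std + eps) keeps the normalisation stable even
    # when the batch standard deviation is tiny.
    mean = x.mean(axis=0)
    std = x.std(axis=0)
    return (x - mean) / (std + eps)

# A near-constant batch would otherwise divide by ~0:
x = np.full((10, 2), 3.0)
out = batch_norm(x)
```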
I think we can now add batch normalisation and global rescaling layers as layers in their own right, rather than including them as part of the three existing models. I'm going to do this, because batch normalisation and rescaling are definitely things we would like to be able to toggle on and off easily.
OK, I've done this, but the problem is that we can't have independent instances of these layer actions, which means I can only include one. I guess this comes back to the old stumbling block: reportengine is specifically designed not to re-calculate anything. Do you think there is a relatively painless workaround?
No, you can have as many layers as you want; the instances shouldn't be identical.
Like I can have (not a valid input, but hopefully you get the point):
That last build was taking ages and looked like it was stuck at the testing stage. I'm just double-checking that the tests still run in a reasonable time.
For future reference:
This problem was occurring when the model contained multiple layers which only used default parameters. If the user overrides one or more defaults, the layers are no longer duplicates and everything works as expected. |
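The behaviour described above can be mimicked with plain memoisation. This is an analogy using `functools.lru_cache`, not reportengine's real API: requests with identical arguments come back as the same cached object, and overriding any default makes the instances distinct again:

```python
from functools import lru_cache

@lru_cache(maxsize=None)
def make_layer(scale=1.0):
    # Stand-in for a resource built by a memoising framework:
    # identical arguments return the *same* object.
    return {"scale": scale}

a = make_layer()           # all defaults
b = make_layer()           # same defaults -> same cached instance
c = make_layer(scale=2.0)  # overridden default -> fresh instance
assert a is b
assert a is not c
```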
Is this done? Shall I double-check your changes?
Just writing a test to catch the case where layers wrongly share parameters.
If you do like:
anvil/tests/test_models.py (outdated)
raise LayersNotIndependentError(
    "Parameters are being shared amongst layers that should be independent."
)
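A self-contained sketch of what such a test could look like. `LayersNotIndependentError` is redefined here so the snippet runs on its own, and `check_layers_independent` is a hypothetical checker, not the project's actual function (in a real pytest suite you would wrap the call in `pytest.raises`):

```python
class LayersNotIndependentError(Exception):
    """Stand-in redefinition so the sketch is self-contained."""

def check_layers_independent(layers):
    # Hypothetical checker: raise if any parameter *object* appears
    # in more than one layer (identity, not value, is what matters).
    seen = set()
    for layer in layers:
        for param in layer:
            if id(param) in seen:
                raise LayersNotIndependentError(
                    "Parameters are being shared amongst layers that "
                    "should be independent."
                )
            seen.add(id(param))

shared = [0.5]                    # one parameter object
layers = [[shared], [shared]]     # wrongly shared between two layers
try:
    check_layers_independent(layers)
    caught = False
except LayersNotIndependentError:
    caught = True
```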
Was there a reason you couldn't expect an assertion error?
No.
It looks slightly circular, but I think I was careful to catch the infinite recursive loops.
The idea is that `sequential_model` allows the user to specify exactly which layers they want, in any order. See the new runcard I added for reference. I believe this means you can train all the models we talk about in the paper (and any others you can think of). It also means you can specify models where not all layers have the same "shared" parameters.

I was also careful to retain backwards compatibility, but we should check this more thoroughly.
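A toy sketch of the idea (the constructor names and the list-of-pairs spec are hypothetical illustrations, not the actual `sequential_model` signature): the user lists layers in the exact order they want, and each entry gets its own parameters, so nothing is forced to be shared:

```python
# Hypothetical layer constructors standing in for the real ones.
def affine_layer(n_blocks=2):
    return {"type": "affine", "n_blocks": n_blocks}

def spline_layer(n_blocks=1):
    return {"type": "spline", "n_blocks": n_blocks}

def sequential_model(spec):
    # spec is an ordered list of (constructor, kwargs) pairs, so any
    # ordering and mixture of layers can be requested.
    return [build(**kwargs) for build, kwargs in spec]

model = sequential_model([
    (affine_layer, {"n_blocks": 4}),
    (spline_layer, {}),
    (affine_layer, {"n_blocks": 2}),  # per-layer params need not match
])
```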