DropPath implementation #2119
-
Hey @rwightman, continuing our discussion in #2118: the paper says this is done "independently for each sample". As for the second question, (as far as I understand) equation 5 says this scaling is only applied at inference/testing time, not during training (equation 2 is used for the training forward pass), and even in equation 5 it's a multiplication by p_l, not a division. Can you please help me understand this part?

pytorch-image-models/timm/layers/drop.py Line 166 in 492947d
-
@IsmaelElsharkawi better to divide at train time so the next layer gets consistent activation stats than muck around at test time :)
And note, you can see a note in a TF impl of this that accompanied the original EfficientNet code; they called it drop connect (which conflicted with another paper's name)
https://github.com/tensorflow/tpu/blob/master/models/official/efficientnet/utils.py#L276-L291
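For context, here is a minimal sketch of the train-time-scaling approach being described (assumed shape handling; see `timm/layers/drop.py` for the actual implementation). One Bernoulli draw is made per sample, and the kept paths are divided by `keep_prob` during training so the expected output matches the input, which is why no rescaling is needed at inference:

```python
import torch

def drop_path(x: torch.Tensor, drop_prob: float = 0.0, training: bool = False) -> torch.Tensor:
    """Stochastic depth: randomly zero entire residual paths, per sample."""
    if drop_prob == 0.0 or not training:
        # Inference (or no drop): identity, no multiplication by keep_prob needed
        # because the scaling was already folded in at train time.
        return x
    keep_prob = 1.0 - drop_prob
    # One Bernoulli sample per batch element, broadcast over all other dims.
    shape = (x.shape[0],) + (1,) * (x.ndim - 1)
    mask = x.new_empty(shape).bernoulli_(keep_prob)
    # Dividing by keep_prob here keeps E[output] == x, so downstream layers
    # see consistent activation statistics between train and eval.
    return x * mask / keep_prob
```

With `drop_prob=0.5`, surviving samples are scaled by 2 and dropped samples are zeroed, so the batch mean stays roughly unchanged in expectation.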