This repository has been archived by the owner on Nov 1, 2024. It is now read-only.

Question about the COMPUTE_PRECISE_BN? #50

Open
gooners1886 opened this issue Sep 6, 2018 · 0 comments


gooners1886 commented Sep 6, 2018

Hello,
In the update_bn_stats_gpu function there is this call:
workspace.FeedBlob(
    'gpu_{}/'.format(i) + bn_layer + '_bn_rm',
    np.array(self._meanX_dict[bn_layer], dtype=np.float32),
)

The mean (meanX) over 200 * batch_size * num_gpu training samples is computed and then written over the blob bn_layer + '_bn_rm'.
So why not use the running mean accumulated during training?
Why is the mean computed when COMPUTE_PRECISE_BN is enabled more precise?
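
To make the question concrete, here is a minimal NumPy sketch of the two estimates being compared. It is not the repository's code: the function names, the momentum value of 0.9, and the simulated drift of the activation mean are all assumptions chosen for illustration. It only shows how an exponential moving average accumulated while the weights are still changing can differ from a plain average taken over a fixed number of batches after the weights are frozen.

```python
import numpy as np


def ema_running_mean(batch_means, momentum=0.9):
    """Exponential moving average of per-batch means.

    Assumed update rule: running = momentum * running + (1 - momentum) * batch.
    """
    running = 0.0
    for m in batch_means:
        running = momentum * running + (1.0 - momentum) * m
    return running


def precise_mean(batch_means):
    """Plain (unweighted) average of per-batch means."""
    return float(np.mean(batch_means))


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    final_mean = 1.0  # hypothetical activation mean under the final weights

    # During training the weights change, so the activation mean drifts.
    drift = np.linspace(0.0, final_mean, 1000)
    train_batch_means = drift + rng.normal(0.0, 0.1, size=drift.shape)

    # After training: 200 batches evaluated with the weights held fixed.
    frozen_batch_means = final_mean + rng.normal(0.0, 0.1, size=200)

    print("EMA during training :", ema_running_mean(train_batch_means))
    print("Average, fixed model:", precise_mean(frozen_batch_means))
```

In this toy setup the moving average weights only the most recent batches and mixes in statistics from earlier weights, while the plain average uses all 200 batches from the same fixed model. Whether that is the actual motivation behind COMPUTE_PRECISE_BN is exactly what I am asking.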
