Question about the COMPUTE_PRECISE_BN? #50

gooners1886 · 2018-09-06T13:33:33Z

Hello,
In the update_bn_stats_gpu function, there is this call:
workspace.FeedBlob(
'gpu{}/'.format(i) + bn_layer + '_bn_rm',
np.array(self._meanX_dict[bn_layer], dtype=np.float32),

Here meanX is computed over 200 * batch_size * num_gpu training samples and then written over the contents of the bn_layer + '_bn_rm' blob.
So why not use the running mean accumulated during training?
Why is the mean computed when COMPUTE_PRECISE_BN is switched on more precise?
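
To make the comparison concrete, here is a minimal NumPy sketch of the two estimates I am asking about. It is not the repository's code: the helper names `ema_running_mean` and `averaged_mean`, the momentum value of 0.9, and the synthetic per-batch means are all assumptions made for illustration. The first helper mimics a running mean updated once per training iteration, and the second averages the per-batch means with equal weight, which is my understanding of what the 200-iteration pass does.

```python
import numpy as np


def ema_running_mean(batch_means, momentum=0.9):
    """Exponential moving average of per-batch means (hypothetical helper).

    Recent batches receive the most weight; older batches decay geometrically.
    """
    running = np.zeros_like(batch_means[0])
    for m in batch_means:
        running = momentum * running + (1.0 - momentum) * m
    return running


def averaged_mean(batch_means):
    """Equal-weight average of per-batch means (hypothetical helper)."""
    return np.mean(np.stack(batch_means), axis=0)


# Synthetic stand-in for activations: 200 iterations, one mean per channel.
# The sizes and the distribution are assumptions, not values from the repository.
rng = np.random.default_rng(0)
num_iters, batch_size, channels = 200, 8, 4
batch_means = [
    rng.normal(loc=1.0, scale=2.0, size=(batch_size, channels)).mean(axis=0)
    for _ in range(num_iters)
]

# Cast to float32, as in the FeedBlob call quoted above.
precise_rm = averaged_mean(batch_means).astype(np.float32)
running_rm = ema_running_mean(batch_means).astype(np.float32)

print("equal-weight mean:", precise_rm)
print("moving-average mean:", running_rm)
```

In this sketch the moving average depends mostly on the last few batches, while the equal-weight mean uses all 200 batches evenly. In real training the moving average is also accumulated while the weights are still changing, which this toy example does not model.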
