Missing Convolution Subsampling? #1

fmac2000 · 2023-09-28T17:51:00Z

Hi Lucas,
I'm looking over the code and I believe you have missed the two convolution subsampling layers in conformer.py,

4.1.1. NON-STREAMING MODELS The model has two convolution layers at the bottom which provide 4 times temporal-dimension reduction for the input sequences. The rest of the layers are a stack of Conformer models. We explore 0.6B model size which is extensively studied in the previous works. The model contains 24 layers of Conformer models.

If you'd like I can create a pull request and implement this for you now.
Thanks - If I've misunderstood the paper, please call me out! 😅

lucasnewman · 2023-09-28T18:03:29Z

@fmac2000 Yes, please, a PR would be great! I was aware of them in the paper but I skipped them for simplicity and downsampled in the feature extractor to get it off the ground. I would love to have it reflect the paper as closely as possible though!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Missing Convolution Subsampling? #1

Missing Convolution Subsampling? #1

fmac2000 commented Sep 28, 2023

lucasnewman commented Sep 28, 2023

Missing Convolution Subsampling? #1

Missing Convolution Subsampling? #1

Comments

fmac2000 commented Sep 28, 2023

lucasnewman commented Sep 28, 2023