Implementation questions #4

Open
gofysuil opened this issue Feb 27, 2019 · 2 comments

Hi Joseph,

Thanks for posting this notebook on implementing WaveNet for pure time-series forecasting. It helped me a lot in understanding the model. However, I have the following questions about the implementation in the notebook:

  1. Why is there no activation function applied to the Conv1D layers? It looks as though all of the model's non-linearity comes from the last fully connected layer.
  2. Should the dilation_rates be (2, 4, 8, ...) according to the diagram?
  3. Since only the last 14 output values are used to compute the loss, can we truncate the input sequence (encoding_interval) to the width of the receptive field (128) without affecting training?

Thanks!
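
For question 3, the receptive field of a stack of causal dilated convolutions can be checked with a few lines of arithmetic. This is only a sketch: the kernel size of 2 and the dilation rates 1, 2, 4, ..., 64 are assumptions chosen because they reproduce the 128 quoted above, not values copied from the notebook, and both function names are made up for illustration.

```python
def receptive_field(kernel_size, dilation_rates):
    """Input steps that can influence one output step of a stack of
    causal dilated convolutions with stride 1."""
    return 1 + sum((kernel_size - 1) * d for d in dilation_rates)


def min_input_length(kernel_size, dilation_rates, n_outputs):
    """Shortest input window for which each of the last n_outputs steps
    still sees its full receptive field."""
    return receptive_field(kernel_size, dilation_rates) + n_outputs - 1


# Assumed configuration: kernel size 2, dilation rates 1, 2, 4, ..., 64.
dilations = [2 ** i for i in range(7)]
print(receptive_field(2, dilations))       # 128
print(min_input_length(2, dilations, 14))  # 141
```

Under these assumptions, a window of exactly 128 steps gives only the final output step its full context. If all 14 scored outputs come from the same forward pass, the earliest of them looks back 13 steps further, so the window would need about 128 + 13 = 141 steps for the truncation to leave those outputs unchanged.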

gofysuil commented Mar 1, 2019

To answer 1: In the full implementation ("TS_Seq2Seq_Conv_Full"), gated activations are included in the dilated Conv1D layers.
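
For reference, the gated activation unit described in the WaveNet paper multiplies a tanh "filter" branch by a sigmoid "gate" branch, element by element. The sketch below shows only that combination step on plain Python lists. It assumes the two convolution outputs have already been computed, and the function name is hypothetical; the code in the full notebook may be organised differently.

```python
import math


def gated_activation(filter_out, gate_out):
    """Element-wise tanh(filter) * sigmoid(gate), as in the WaveNet gated unit.

    filter_out and gate_out stand in for the outputs of two separate
    dilated convolutions applied to the same input (assumed, not computed here).
    """
    def sigmoid(v):
        return 1.0 / (1.0 + math.exp(-v))

    return [math.tanh(f) * sigmoid(g) for f, g in zip(filter_out, gate_out)]


# A gate input of 0 gives sigmoid(0) = 0.5, so the tanh branch is halved.
print(gated_activation([0.0, 1.0], [0.0, 0.0]))
```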

gofysuil commented Mar 1, 2019

To answer 2: I had misunderstood dilation_rate in ConvNets and confused it with strides. I found a good explanation of this topic here: https://theblog.github.io/post/convolution-in-autoregressive-neural-networks/
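
The difference can be seen with a small hand-rolled causal convolution. This is an illustrative sketch in plain Python, not the Keras Conv1D implementation; the function name and the toy input are made up. Dilation spaces out the taps the kernel reads while still producing one output per input step, whereas a stride skips output positions and shortens the sequence.

```python
def causal_conv1d(x, weights, dilation=1, stride=1):
    """Causal 1-D convolution over a list of floats, zero-padded on the left.

    weights[-1] multiplies the current step; each earlier weight multiplies
    the step `dilation` positions further back.
    """
    k = len(weights)
    out = []
    for t in range(0, len(x), stride):
        acc = 0.0
        for j in range(k):
            idx = t - j * dilation
            if idx >= 0:  # positions before the start count as zero
                acc += weights[k - 1 - j] * x[idx]
        out.append(acc)
    return out


x = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0, 7.0, 8.0]
w = [1.0, 1.0]

# Dilation 2: each output adds x[t] and x[t - 2]; the length stays 8.
print(causal_conv1d(x, w, dilation=2))  # [1.0, 2.0, 4.0, 6.0, 8.0, 10.0, 12.0, 14.0]

# Stride 2: each output adds x[t] and x[t - 1], but only every second t is kept.
print(causal_conv1d(x, w, stride=2))    # [1.0, 5.0, 9.0, 13.0]
```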
