Compute more embedding stats (using stats_to_compute.md)
Compute phoneme stats (using stats_to_compute.md)
Plot an example of alignement using the groundtruth mapping
Make an animation of the second embedding projection
Compute the receptive field of the model
Refractor jitter computation with the index version and compare it with the sequential one (e.g. [0, 1, -1, 0] with 0.12 to not be 0 and 0.5 to be 1 or -1)
Add options in configuration to record quantized and one hot in audio preprocessing
Handle LibriSpeech dataset
Handle the flow_wavenet and clarinet as possible decoders in vq_vae_wavenet decoder
Add a description of each sub module in the README?
Replace logging with loguru

Provide feedback

Saved searches