Skip to content

Conversation

@tijsmaas
Copy link

@tijsmaas tijsmaas commented Jun 1, 2020

I came across a typo in the implementation of wavenet and stochastic wavenet gating mechanism. Originally wavenet uses a GTU: tanh(x)*sigm(x), however in the code the gating is x*tanh(x). Haven't tested it but likely this explains some of the instability encountered during training.
A bugfix and fixes of missing imports (+updated pytorch definitions) are in this pull request. Either way, cool work.

tijsmaas added 2 commits June 2, 2020 00:47
The gate was initialized as x*tanh(x) due to a typo. This should have been sigm(x)*tanh(x). My commit also includes missing imports.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant