Bugfix for the gating mechanism + update imports #1

tijsmaas · 2020-06-01T23:09:24Z

I came across a typo in the implementation of wavenet and stochastic wavenet gating mechanism. Originally wavenet uses a GTU: tanh(x)*sigm(x), however in the code the gating is x*tanh(x). Haven't tested it but likely this explains some of the instability encountered during training.
A bugfix and fixes of missing imports (+updated pytorch definitions) are in this pull request. Either way, cool work.

The gate was initialized as x*tanh(x) due to a typo. This should have been sigm(x)*tanh(x). My commit also includes missing imports.

tijsmaas added 2 commits June 2, 2020 00:47

Fix gate bug in wavenet & swavenet

43acb0e

The gate was initialized as x*tanh(x) due to a typo. This should have been sigm(x)*tanh(x). My commit also includes missing imports.

Updated imports & Pytorch version

4f34fc7

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Bugfix for the gating mechanism + update imports #1

Bugfix for the gating mechanism + update imports #1

Uh oh!

tijsmaas commented Jun 1, 2020 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Bugfix for the gating mechanism + update imports #1

Are you sure you want to change the base?

Bugfix for the gating mechanism + update imports #1

Uh oh!

Conversation

tijsmaas commented Jun 1, 2020 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

tijsmaas commented Jun 1, 2020 •

edited

Loading