Hi,
Well done on this amazing work and thank you so much for putting this on github and sharing.
My apologies if this is a silly question.
I have been struggling to figure out where i am going wrong.
I am trying to recreate the results on the Electricity data set.
The training runs perfectly, however up till about epoch 12 or 13. At this point i get a 'nan' loss.
Please help me understand where i am going wrong.
thank you