Increasing the training rate to ~1 makes the program output all zeros. The program either does not converge on the right values or converges very slowly.
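To see how sensitive convergence can be to the step size, here is a minimal sketch. It is not the program in question: it assumes plain gradient descent on a hypothetical one-parameter loss `(w - target)^2`, and the function name `run_gradient_descent` and its parameters are made up for illustration. It only shows the general pattern that a rate that is too small converges slowly, while a rate near or above 1 can oscillate or diverge.

```python
def run_gradient_descent(learning_rate, steps=50, start=0.0, target=3.0):
    """Minimize the toy loss (w - target)**2 and return the final w.

    Hypothetical stand-in for a real training loop; the loss, the
    starting point, and the step count are assumptions for illustration.
    """
    w = start
    for _ in range(steps):
        gradient = 2.0 * (w - target)  # derivative of (w - target)**2
        w -= learning_rate * gradient  # standard gradient descent update
    return w


if __name__ == "__main__":
    # Compare how far from the target each rate ends up after 50 steps.
    for rate in (0.01, 0.4, 1.0, 1.1):
        final_w = run_gradient_descent(rate)
        print(f"rate={rate}: final w={final_w}, error={abs(final_w - 3.0)}")
```

On this toy loss each update multiplies the error by `1 - 2 * learning_rate`, so a rate of exactly 1 flips the sign of the error on every step and never settles, and anything larger grows without bound. A real network behaves less cleanly, but sweeping a few rates like this is a cheap way to check whether the rate alone explains the behaviour.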