Why is training on the MNIST dataset with training functions so slow, even though I am training on a GPU?