Are there any speed benchmarks comparing the GPU implementation with other popular methods of training AIs, such as MuZero? Also, would there be any advantage in pairing this with MuZero?