It would be interesting to benchmark Temml against WikiTexVC.
I developed an evaluation framework, https://github.com/MaRDI4NFDI/mathpipe, almost ten years ago, and I'm a bit afraid of how much effort it would take to update it.
Maybe we could cooperate on extending https://temml.org/docs/en/comparison instead?