@adityasoni9998 has an analysis of some places where the reward signal is failing and we could improve those.