Look at the distribution of metrics over time, use honest-ml to generate distributions, test distributions to see if they are the same.