Read this paper: https://www.nature.com/articles/s41524-023-01180-8
Consider add-on techniques such as conformal prediction as a point of comparison, for example by comparing the degree of miscalibration each method leaves.
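
For concreteness, here is a minimal sketch of what such a comparison could look like. It is not the paper's method. It assumes a regression model that outputs a mean and a predicted standard deviation per sample, uses split conformal prediction with normalized residuals as the add-on, and measures miscalibration as the absolute gap between empirical and nominal coverage. The function names (`conformal_scale`, `coverage`, `compare_miscalibration`) and the synthetic, deliberately overconfident data are hypothetical and for illustration only.

```python
import math
from statistics import NormalDist

import numpy as np


def conformal_scale(y_cal, mu_cal, sigma_cal, alpha):
    """Split-conformal quantile of the normalized residuals |y - mu| / sigma."""
    scores = np.abs(y_cal - mu_cal) / sigma_cal
    n = len(scores)
    # Finite-sample corrected rank; capped at n so the index stays in range.
    k = min(n, math.ceil((n + 1) * (1 - alpha)))
    return float(np.sort(scores)[k - 1])


def coverage(y, mu, sigma, scale):
    """Fraction of targets that fall inside mu +/- scale * sigma."""
    return float(np.mean(np.abs(y - mu) <= scale * sigma))


def compare_miscalibration(y_cal, mu_cal, sigma_cal,
                           y_test, mu_test, sigma_test, alpha=0.1):
    """Return |empirical coverage - nominal coverage| for both interval types."""
    nominal = 1 - alpha
    # Raw intervals: trust the predicted sigma and use the Gaussian z-score.
    z = NormalDist().inv_cdf(1 - alpha / 2)
    # Conformal intervals: rescale sigma using held-out calibration data.
    q = conformal_scale(y_cal, mu_cal, sigma_cal, alpha)
    return {
        "gaussian": abs(coverage(y_test, mu_test, sigma_test, z) - nominal),
        "conformal": abs(coverage(y_test, mu_test, sigma_test, q) - nominal),
    }


# Synthetic demo (assumption): the model's sigma is too small by a factor of 2.
rng = np.random.default_rng(0)
n = 4000
mu = rng.normal(size=n)
sigma_true = rng.uniform(0.5, 1.5, size=n)
y = mu + sigma_true * rng.normal(size=n)
sigma_pred = 0.5 * sigma_true

half = n // 2
result = compare_miscalibration(
    y[:half], mu[:half], sigma_pred[:half],
    y[half:], mu[half:], sigma_pred[half:],
)
print(result)
```

In this toy setup the conformal intervals should land much closer to the nominal 90% coverage than the raw Gaussian intervals. A fuller comparison would repeat this over several values of `alpha` to trace out a calibration curve.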