- [ ] Are the fairness experiments sufficient? or are they too shallow? - [ ] Why are sections not numerated? - [ ] Experiment of detecting distribution shift does not align with the narrative. - [ ] Rephrase table 4