- Load in finetuned model
- Generate a bunch of text
- Check how many layers were skipped
- Put generated text into an inspect-style prompt asking for coherence (<coherence_score> </coherence_score>)
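
A rough sketch of the last two steps, assuming the grader replies with a number wrapped in the `<coherence_score>` tag. The template wording, the 1-10 scale, the function names, and the idea that skipped layers are available as one boolean per layer are all assumptions for illustration; only the tag name comes from the notes above.

```python
import re
from typing import Optional, Sequence

# Hypothetical grading template. The wording and the 1-10 scale are
# assumptions; only the <coherence_score> tag is taken from the notes.
COHERENCE_TEMPLATE = (
    "Rate how coherent the following text is on a scale from 1 to 10.\n"
    "Reply with the score inside <coherence_score> </coherence_score> tags.\n\n"
    "Text:\n{text}\n"
)

# Matches an integer or decimal score between the opening and closing tag.
_SCORE_RE = re.compile(
    r"<coherence_score>\s*(\d+(?:\.\d+)?)\s*</coherence_score>"
)


def build_coherence_prompt(text: str) -> str:
    """Wrap one generated sample in the grading prompt."""
    return COHERENCE_TEMPLATE.format(text=text)


def parse_coherence_score(reply: str) -> Optional[float]:
    """Return the score from a grader reply, or None if no tag is found."""
    match = _SCORE_RE.search(reply)
    return float(match.group(1)) if match else None


def count_skipped(skip_flags: Sequence[bool]) -> int:
    """Count skipped layers, assuming one boolean flag per layer."""
    return sum(1 for skipped in skip_flags if skipped)
```

Loading the model, generating the text, and sending the prompt to a grader are left out here, since the notes do not say which library or API is used for them. Returning `None` for a reply without a tag lets malformed grader outputs be counted separately rather than treated as a score.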