We currently evaluate Speech2IPA models only with Phoneme Error Rate (PER) and Weighted Phonetic Error Rate/Feature Error Rate (WPER/FER) (see metrics.py). Neither metric quantifies alignment quality. Ye et al. have code that measures it with the Boundary Loss (BL) metric. It would be good to evaluate our models with this metric as well.
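
To make the request concrete, here is a minimal sketch of what a boundary-based alignment metric might look like. This is not Ye et al.'s BL implementation, and their exact definition may differ. The function name `mean_boundary_error`, the choice of seconds as the unit, and the assumption that predicted and reference phonemes are already paired one-to-one are all hypothetical, chosen only for illustration.

```python
from typing import Sequence


def mean_boundary_error(
    pred_boundaries: Sequence[float],
    ref_boundaries: Sequence[float],
) -> float:
    """Mean absolute difference between paired boundary timestamps.

    Hypothetical helper, NOT the BL metric from Ye et al.
    Assumes both sequences are in seconds and already paired one-to-one
    (i.e. the i-th predicted boundary corresponds to the i-th reference one).
    """
    if len(pred_boundaries) != len(ref_boundaries):
        raise ValueError("boundary sequences must have the same length")
    if not ref_boundaries:
        # No boundaries to compare, so there is no error to report.
        return 0.0
    total = sum(abs(p - r) for p, r in zip(pred_boundaries, ref_boundaries))
    return total / len(ref_boundaries)
```

A function with this shape could sit next to the existing PER and WPER/FER functions in metrics.py. The real metric should be taken from Ye et al.'s code rather than from this sketch.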