Well, there are several ways to compare the AUC different for binary classifier, such as DeLong test, wheather there are similar method to compare AUC for multiple classifier? or i should split multiclass model into several one vs all components and compare each pair?