Open
Description
- Additional metrics to test
  - [high priority] BERTScore-sentence, MNLI, {RoBERTa, DeBERTa}, Entail - Contradict, top-k and top-p
  - [medium] BERTScore-original, DeBERTa -- to see how much language models impact it
  - [low] BERTScore-sentence, cosine, DeBERTa -- again, to see the impact of language models
- How to print results into Google Sheets
  - Done: Add pretty print EvalBase#4
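
One possible way to get results into Google Sheets without touching any API is to print them as tab-separated text, which Sheets splits into cells when pasted. The sketch below is only an illustration: `results_to_tsv`, the nested-dict layout, the metric names, and the column names are all hypothetical and not taken from EvalBase, and the numbers are placeholders, not real scores.

```python
import csv
import io


def results_to_tsv(results, float_format="{:.3f}"):
    """Flatten {metric_name: {column: value}} into tab-separated text.

    Hypothetical helper: the input layout is an assumption, not EvalBase's
    actual result format.
    """
    # Collect column names in first-seen order so the header is stable.
    columns = []
    for scores in results.values():
        for col in scores:
            if col not in columns:
                columns.append(col)

    buffer = io.StringIO()
    writer = csv.writer(buffer, delimiter="\t", lineterminator="\n")
    writer.writerow(["metric"] + columns)
    for metric, scores in results.items():
        row = [metric]
        for col in columns:
            value = scores.get(col, "")  # leave the cell empty if missing
            if isinstance(value, float):
                value = float_format.format(value)
            row.append(value)
        writer.writerow(row)
    return buffer.getvalue()


if __name__ == "__main__":
    # Placeholder values for illustration only, not measured results.
    example = {
        "bertscore-sentence-mnli-roberta": {"pearson": 0.5, "spearman": 0.25},
        "bertscore-sentence-mnli-deberta": {"pearson": 0.75},
    }
    print(results_to_tsv(example))
```

Copying the printed block and pasting it into a sheet should give one row per metric variant and one column per score.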