The EthioEmo dataset split into 60:30:10 train, test, and dev, respectively.
This dataset is available also in Hugging Face
| Emotion | Amharic_emotion | Amharic_intensity | Oromo_emotion | Somali_emotion | Tigrinya_emotion |
|---|---|---|---|---|---|
| Anger | 1977 | 2364 | 1077 | 546 | 912 |
| Disgust | 2105 | 3134 | 926 | 801 | 2175 |
| Fear | 185 | 239 | 209 | 504 | 235 |
| Sadness | 1253 | 2013 | 509 | 652 | 984 |
| Joy | 918 | 1468 | 1821 | 991 | 690 |
| Surprise | 260 | 287 | 225 | 295 | 605 |
If the given text is out of the given six basic emotions, the text has 0 0 0 0 0 0 column values under Anger, Disgust, Fear, Sadness, Joy, and Surprise emotion columns.
To cite the paper and dataset, use the the following paper
@inproceedings{belay-etal-2025-evaluating,
title = "Evaluating the Capabilities of Large Language Models for Multi-label Emotion Understanding",
author = "Belay, Tadesse Destaw and Azime, Israel Abebe and Ayele, Abinew Ali and Sidorov, Grigori and
Klakow, Dietrich and Slusallek, Philip and Kolesnikova, Olga and Yimam, Seid Muhie",
booktitle = "Proceedings of the 31st International Conference on Computational Linguistics",
month = jan,
year = "2025",
address = "Abu Dhabi, UAE",
publisher = "Association for Computational Linguistics",
url = "https://aclanthology.org/2025.coling-main.237/",
pages = "3523--3540"
}

