Using BERT for emotion analysis – A novel, resource constrained emotions classification approach with test on a Hungarian media corpus
Datasets and code for the paper named in the title.
bert.py processes texts from etl_*.tsv
gridsearch_*.py fits 100 optimized models and saves the results in easily parseable JSON files
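To illustrate the "fit many models, save the results as JSON" pattern described above, here is a minimal sketch using only the Python standard library. It is not the code in gridsearch_*.py: the function names `run_trials` and `best_trial`, the record keys (`trial`, `params`, `score`), and the placeholder parameter `C` are all hypothetical, and the "fit" is a random number standing in for a real model evaluation. The actual JSON layout written by the repo's scripts may differ.

```python
import json
import random
from pathlib import Path


def run_trials(n_trials, out_path, seed=0):
    """Run n_trials placeholder fits and write one JSON record per trial."""
    rng = random.Random(seed)
    results = []
    for trial in range(n_trials):
        # Placeholder for a real model fit: the parameter and the score
        # are randomly generated, not taken from the paper.
        params = {"C": rng.choice([0.1, 1.0, 10.0])}
        score = rng.random()
        results.append({"trial": trial, "params": params, "score": score})
    # A single JSON array keeps the file readable with json.loads alone.
    Path(out_path).write_text(json.dumps(results, indent=2), encoding="utf-8")
    return results


def best_trial(path):
    """Read the JSON file back and return the record with the highest score."""
    records = json.loads(Path(path).read_text(encoding="utf-8"))
    return max(records, key=lambda record: record["score"])
```

Calling `run_trials(100, "results.json")` and then `best_trial("results.json")` would return the highest-scoring record under this assumed schema; the file name is only an example.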
Files included in the repo:
etl_*.tsv files are prepared corpora for extracting BERT-embeddings
featuresfinal_*.npy are numpy files with BERT-embeddings, labels_*.npy are corresponding labels
v1[023]_corpus.tsv files are conventionally preprocessed, tf-idf weighted document-term matrices without the "document" column, v1[023]_labels.tsv files are corresponding labels
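Since the v1[023]_corpus.tsv matrices and the v1[023]_labels.tsv files are stored separately, they have to be paired row by row before training. The sketch below shows one way to do that with the standard library. It rests on assumptions not confirmed by the repo: that both files are tab-separated, that each starts with a header row (controlled by the hypothetical `has_header` flag), that rows are in the same order in both files, and that the label sits in the last column of the labels file. The function name `load_matrix_with_labels` is illustrative only.

```python
import csv


def load_matrix_with_labels(corpus_path, labels_path, has_header=True):
    """Read a tab-separated document-term matrix and its row-aligned labels."""
    with open(corpus_path, newline="", encoding="utf-8") as f:
        rows = list(csv.reader(f, delimiter="\t"))
    with open(labels_path, newline="", encoding="utf-8") as f:
        label_rows = list(csv.reader(f, delimiter="\t"))

    terms = None
    if has_header:
        # Assumption: the first row of each file holds column names.
        terms = rows.pop(0)
        label_rows.pop(0)

    # tf-idf weights are parsed as floats; labels are kept as strings.
    matrix = [[float(value) for value in row] for row in rows]
    labels = [row[-1] for row in label_rows]

    if len(matrix) != len(labels):
        raise ValueError(
            f"{len(matrix)} documents but {len(labels)} labels: files are not aligned"
        )
    return terms, matrix, labels
```

The length check is there because nothing in the files themselves ties a label to a document other than row position, so a mismatch is worth failing on early.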