Skip to content

Conversation

@erip
Copy link
Owner

@erip erip commented Dec 3, 2020

Closes #5

Currently computes weighted F1 score across the entire test set.

TODO: add confusion matrix, but there are some issues with PyTorch-Lightning reducing CMs...

@erip
Copy link
Owner Author

erip commented Dec 3, 2020

cc @kylebgorman

@kylebgorman
Copy link

Looks good to me. Tag accuracy would also be good; F1 is not helpful for the non-chunk-ing-type tasks. Some folks report whole-sentence accuracy too.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Prediction/generation support

3 participants