Conversation
@davidslater The poisoning scenario adds this metric automatically, right in the code. Is this an acceptable way to make it available to other scenarios through the config/metric/task section?
I like adding it to the config part of instrumentation. |
@davidslater So far I've added (besides per-class accuracy) a confusion matrix, and precision and recall for each class. Two questions: Will you check that I'm computing the right thing (I described my understanding in the comments) and that the output is in a sufficiently useful format (dict, array, etc.)? Can you think of any other metrics or statistics that would be nice to have?
davidslater
left a comment
Let's move the metrics to task, as they take in the paired y and y_pred as inputs.
armory/metrics/statistical.py
Outdated
    if y_pred.ndim == 2:
        y_pred = np.argmax(y_pred, axis=1)
    N = len(np.unique(y))
If y_pred is 2D, you can use that to derive N. (Or at least check to ensure that they match).
I may be misunderstanding, but N is the number of classes, not the total number of items. Hence length of np.unique(y) and not length of y. I don't think we can assume every class will show up in y_pred. For that matter it seems a little risky to assume they will all be present in y.
If y_pred is 2D, then it outputs either logits or probability distributions over the set of predicted classes, so you can do N = y_pred.shape[1].
If y_pred is 1D, however, that doesn't work.
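A minimal sketch of the suggestion, assuming y_pred is a 2D array of per-class scores (logits or probabilities). The function name and structure here are illustrative, not Armory's actual implementation:

```python
import numpy as np

def confusion_matrix(y, y_pred):
    """Confusion matrix C where C[i, j] counts items of true class i
    predicted as class j. Assumes y_pred is 2D: one row of per-class
    scores (logits or probabilities) per example."""
    N = y_pred.shape[1]  # number of classes, from the score dimension
    y_pred = np.argmax(y_pred, axis=1)
    C = np.zeros((N, N), dtype=int)
    for true, pred in zip(y, y_pred):
        C[true, pred] += 1
    return C
```

Deriving N from the score dimension means classes absent from a particular batch still get their (all-zero) rows and columns.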
I think this implicitly assumes that the classes are all integers from 0 to N - 1. However, if y has missing classes, then there will be some misalignment.
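A small illustration of the misalignment (hypothetical data, just to show the failure mode):

```python
import numpy as np

# If class 1 never appears in y, len(np.unique(y)) undercounts the
# classes, and the remaining labels no longer line up with the rows
# of an N x N matrix.
y = np.array([0, 2, 2])        # class 1 is absent
N = len(np.unique(y))          # gives 2, but labels run up to 2
assert N == 2
assert y.max() == 2            # label 2 would index outside a 2x2 matrix
```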
Oh right, of course. Is it true that Armory scenarios will always have a 2D y_pred? Or it just depends on how the meter and probes are set up? So far the only source of a 1D y_pred I've encountered is my own unit tests, but I can expand those to 2D and then get N the way you described.
Right now it's dependent on the underlying model, unfortunately.
Well, after all, if a class is totally absent from y, its row in the matrix would be all zeros, since no items actually belong to it. So maybe what I need to do is make this a dictionary after all and key it with class labels, so if one is missing, it will at least be clear which rows correspond to which classes. Alternatively, I could add a row of zeros at the index of each missing class label, but that only works for missing labels less than the greatest non-missing label.
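The dictionary idea could look something like this (a hypothetical helper, assuming y and y_pred are 1D label arrays):

```python
import numpy as np

def confusion_counts(y, y_pred):
    """Confusion counts keyed by (true_label, predicted_label), so a
    missing class is simply absent from the keys rather than causing
    a silent row misalignment in a dense N x N matrix."""
    counts = {}
    for true, pred in zip(y, y_pred):
        key = (int(true), int(pred))
        counts[key] = counts.get(key, 0) + 1
    return counts
```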
Let's just have the function assume that y_pred is 2D (and add that to the docstring). Other things can be handled by the user.
armory/metrics/statistical.py
Outdated
    total_selected = C[:, class_].sum()
    precision = tp / total_selected

    # recall: true positives / number of actual items in class_
per-class recall is the exact same as per-class accuracy, which I didn't realize till now. Is it still useful to have two separate per_class_accuracy and per_class_precision_and_recall functions?
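The equivalence can be checked numerically. For class k, recall is TP_k divided by the number of items truly in class k, which is exactly accuracy restricted to those items. An illustrative sketch (made-up data, not the repo's code):

```python
import numpy as np

y = np.array([0, 0, 1, 1, 1, 2])
y_pred = np.array([0, 1, 1, 1, 0, 2])

for k in np.unique(y):
    mask = (y == k)
    recall = np.sum(y_pred[mask] == k) / mask.sum()   # TP_k / actual count of class k
    per_class_acc = np.mean(y_pred[mask] == y[mask])  # accuracy on class-k items only
    assert recall == per_class_acc
```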
See my two recent comments. Beyond that, I think what needs to be done is:
For #492