
Add harmonization of metrics to Metric API#9

Merged
MiroDudik merged 10 commits into fairlearn:master from MiroDudik:more-metrics-API
May 29, 2020

Conversation

@MiroDudik
Member

No description provided.

Signed-off-by: Miro Dudik <mdudik@gmail.com>
@MiroDudik MiroDudik requested review from riedgar-ms and romanlutz May 4, 2020 18:26
api/METRICS.md Outdated
_Alternative proposals_:
* `MeanLoss(<loss>)`
* `OverallLoss(<loss>)`
* the object `<loss>` doubles as (1) the loss evaluator and (2) the moment implementing the objective
Member

Can we foresee any situation where clubbing them together will hurt us in the future?
Why not just `LossMoment(<loss>)`? Then it's clear that the moment takes the loss evaluator. With `OverallLoss` and the others you mention, it's still not entirely clear that it's a moment (at least not from the name).

Member Author

One use case where this might bite us is if we want to add support for various kinds of sample weights, e.g.:

* balanced loss (the re-weighting happens at the level of the moment, but not at the level of the loss evaluator)
* counterfactual loss (this style of re-weighting corrects for partial observability in the data; again, it applies only at the moment level, not at the loss level)

I would prefer calling it `SomethingLoss` to better match what we do for the classification moments, where I think we will simply call them `<Metric>` rather than `<Metric>Moment`. Also, I don't expect the user to ever specify this objective explicitly (it would be automatically derived from the BGL constraint object).
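The split being discussed could be sketched like this (all class names here are illustrative, not the final API): the loss evaluator only scores predictions, while the moment owns aggregation and any moment-level re-weighting.

```python
import numpy as np

class SquareLoss:
    """Illustrative loss evaluator: scores predictions elementwise."""
    def eval(self, y_true, y_pred):
        return (np.asarray(y_true, dtype=float) - np.asarray(y_pred, dtype=float)) ** 2

class MeanLoss:
    """Illustrative moment/objective wrapping a loss evaluator.

    Re-weighting (e.g. balanced or counterfactual weights) happens here,
    at the moment level; the loss evaluator itself stays weight-agnostic.
    """
    def __init__(self, loss):
        self.loss = loss

    def value(self, y_true, y_pred, sample_weight=None):
        losses = self.loss.eval(y_true, y_pred)
        if sample_weight is None:
            return float(np.mean(losses))
        w = np.asarray(sample_weight, dtype=float)
        return float(np.sum(w * losses) / np.sum(w))

objective = MeanLoss(SquareLoss())
print(objective.value([0.0, 1.0], [0.5, 0.0]))           # unweighted mean
print(objective.value([0.0, 1.0], [0.5, 0.0], [1, 3]))   # moment-level weights
```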

Member

The special case you mention could still be handled fairly simply with if-else, so I'm not too worried.

Hmm, then how do we distinguish between loss evaluator and loss moment in terms of names?

Member Author

It occurred to me that we may need a special loss object if we ever want to implement demographic parity for regression, and I just heard that somebody will be working on that this summer, so we definitely need both a loss evaluator object and a loss moment object.

I don't think it's necessarily a problem that we have moments called `BoundedGroupLoss` and `MeanLoss`, and loss evaluators called `SquareLoss`, `AbsoluteLoss`, and `ZeroOneLoss`. But if you feel strongly about this, we could maybe rename the evaluators to `SquareLossEvaluator` etc.?
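The two-object naming scheme can be sketched as a plain loss evaluator plus a function computing the worst per-group loss, which is the quantity a `BoundedGroupLoss`-style moment would bound (names are illustrative, not the final API).

```python
import numpy as np

def absolute_loss(y_true, y_pred):
    # Illustrative evaluator, in the spirit of the proposed AbsoluteLoss.
    return np.abs(np.asarray(y_true, dtype=float) - np.asarray(y_pred, dtype=float))

def worst_group_loss(loss, y_true, y_pred, groups):
    """Largest per-group mean loss -- what a BoundedGroupLoss-style moment
    would constrain to be at most epsilon (names illustrative)."""
    y_true, y_pred, groups = map(np.asarray, (y_true, y_pred, groups))
    return max(
        float(np.mean(loss(y_true[groups == g], y_pred[groups == g])))
        for g in np.unique(groups)
    )

print(worst_group_loss(absolute_loss, [0, 1, 2, 3], [0, 0, 2, 0], ["a", "a", "b", "b"]))
```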

|`mean_absolute_error` | G,D,R,Max | class, reg | sklearn | class only: `error_rate` |
|`mean_squared_error`| G,Max | prob, reg | sklearn | - |
|`r2_score`| G,Min | reg | sklearn | - |
|`_mean_overprediction` | G | class, reg | | - |
Member

When do you think you'll have an alternative to these underscores?

Member Author

I'd like to ask the community what their thoughts are given that we're doing somewhat non-standard things here.
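For reference, one plausible reading of `_mean_overprediction` from the table above (illustrative; not necessarily the exact fairlearn implementation) is the mean positive part of the residual:

```python
import numpy as np

def mean_overprediction(y_true, y_pred):
    """One plausible definition: mean positive part of (y_pred - y_true).
    An underprediction counterpart would use the negative part analogously."""
    resid = np.asarray(y_pred, dtype=float) - np.asarray(y_true, dtype=float)
    return float(np.mean(np.clip(resid, 0.0, None)))

# Residuals are [1, -1, 0]; only the overprediction of 1 counts.
print(mean_overprediction([1, 1, 1], [2, 0, 1]))
```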

_r_ &sdot; _metric_(_a_) - _metric_(\*) &le; &epsilon;, <br>
_r_ &sdot; _metric_(\*) - _metric_(_a_) &le; &epsilon;.

* `DemographicParity` and `EqualizedOdds` have the same calling convention as `<Metric>Parity`
Member

Does this mean that we should be able to run direct comparisons between things in metrics and things in moments to make sure they agree that they're calculating the same thing?

Member Author

Not quite, but almost... there's a small discrepancy between metrics and moments:

* `<metric>_difference` and `<metric>_ratio` look at the difference / ratio between the max and the min
* `<Metric>Parity` looks at the max difference / ratio between any group and the overall metric

I think the final solution would be to add a flag `relative_to_overall` to `<metric>_difference` and `<metric>_ratio`: when the flag is `True`, they match the functionality of the moments; when `False`, they keep the current functionality. I'm not sure we need that in this proposal, but I will definitely put it in the user guide.
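The proposed flag could behave roughly like this sketch (the function name and signature are illustrative, not the final API):

```python
import numpy as np

def metric_difference(values_by_group, overall, relative_to_overall=False):
    """Sketch of the proposed flag.

    relative_to_overall=False: max group value minus min group value
    relative_to_overall=True:  largest absolute gap between any group and
                               the overall metric, matching the moments
    """
    vals = np.asarray(list(values_by_group.values()), dtype=float)
    if relative_to_overall:
        return float(np.max(np.abs(vals - overall)))
    return float(np.max(vals) - np.min(vals))

by_group = {"a": 0.75, "b": 0.25}
print(metric_difference(by_group, overall=0.5))                            # 0.5 (max - min)
print(metric_difference(by_group, overall=0.5, relative_to_overall=True))  # 0.25 (vs. overall)
```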

Member

Great catch, Richard. I think this definitely needs to be documented well.

@riedgar-ms
Member

Given that this is actually already largely implemented, can we finalise this, and get it merged?

MiroDudik added 2 commits May 25, 2020 21:54
Signed-off-by: Miro Dudik <mdudik@gmail.com>
Signed-off-by: Miro Dudik <mdudik@gmail.com>
Member

@riedgar-ms left a comment

A few points I'm not quite sure about.

Signed-off-by: Miro Dudik <mdudik@gmail.com>
* For each _base metric_, we provide the list of predefined derived metrics, using D, R, Min, Max to refer to the transformations from the table above, and G to refer to `<metric>_group_summary`. We follow these rules:
  * always provide G (except for demographic parity and equalized odds, which do not make sense as group-level metrics)
  * provide D and R for confusion-matrix metrics
  * provide Min for score functions (worst-case score)
Member

I can't think of any such cases, but are there scores people could come up with that would work the other way round? Or would you just consider them losses then?
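The G and Min transformations from the quoted rules could look roughly like this (a sketch assuming a dict-based summary layout; not the exact fairlearn API):

```python
import numpy as np
from sklearn.metrics import accuracy_score

def group_summary(metric, y_true, y_pred, sensitive_features):
    """Sketch of the G transformation: evaluate a base metric overall and
    per sensitive-feature group (dict layout is illustrative)."""
    sf = np.asarray(sensitive_features)
    y_true, y_pred = np.asarray(y_true), np.asarray(y_pred)
    return {
        "overall": metric(y_true, y_pred),
        "by_group": {g: metric(y_true[sf == g], y_pred[sf == g])
                     for g in np.unique(sf)},
    }

def group_min(metric, y_true, y_pred, sensitive_features):
    """Sketch of the Min transformation: worst-case group score,
    the derived metric proposed for score functions."""
    summary = group_summary(metric, y_true, y_pred, sensitive_features)
    return min(summary["by_group"].values())

summary = group_summary(accuracy_score, [1, 0, 1, 0], [1, 0, 0, 0], ["a", "a", "b", "b"])
print(summary)
```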

Member

@romanlutz left a comment

Looks great. We should get to this work ASAP to avoid having more releases before we finalize it. If you need me to put some time into any of this, please reach out.

