Conversation
Signed-off-by: Miro Dudik <mdudik@gmail.com>
I've been thinking a bit about this over the weekend, and there are two slightly different sets of changes required. The first is at the level of the entire dashboard JSON: I think that we should merge the […] The second is the trickier one, and involves the contents of those dictionaries in the […]
```python
{
    "prediction_type": "binary_classification" or "probabilistic_binary_classification" or "regression",
```
Any reason why we're omitting multiclass classification?
Because we don't have any support for it yet, but we can definitely add other prediction types in the future.
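A minimal sketch of how that restriction might be enforced at load time; the constant and function names here are illustrative, not part of any existing fairlearn API:

```python
# Hypothetical validator for the currently supported prediction types.
# "multiclass_classification" could be added to this set later without
# changing the rest of the schema.
SUPPORTED_PREDICTION_TYPES = {
    "binary_classification",
    "probabilistic_binary_classification",
    "regression",
}

def validate_prediction_type(dashboard_dict):
    """Raise if the dashboard dictionary declares an unsupported prediction type."""
    prediction_type = dashboard_dict.get("prediction_type")
    if prediction_type not in SUPPORTED_PREDICTION_TYPES:
        raise ValueError(
            f"Unsupported prediction_type: {prediction_type!r}; "
            f"expected one of {sorted(SUPPORTED_PREDICTION_TYPES)}"
        )
    return prediction_type
```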
| "name": "y_true", | ||
| "values": [0, 1, 1, 1, 0], | ||
| }, | ||
| "sample_weight": { |
This is user-provided, not the one set within ExponentiatedGradient, right?
If so, perhaps it's worth documenting this with a short comment
Will do. This is just an example of an array that we may want to pass to the metrics, since many metrics accept this kind of argument.
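To illustrate how a user-provided `sample_weight` array would be threaded through to a metric: `weighted_accuracy` below is a self-contained stand-in for any metric that accepts a `sample_weight` keyword, as many sklearn and fairlearn metrics do. It is a sketch, not an existing function.

```python
# Illustrative only: a metric with the common sample_weight signature.
def weighted_accuracy(y_true, y_pred, sample_weight=None):
    """Fraction of correct predictions, weighted by the user-provided weights."""
    if sample_weight is None:
        sample_weight = [1.0] * len(y_true)
    total = sum(sample_weight)
    hits = sum(w for t, p, w in zip(y_true, y_pred, sample_weight) if t == p)
    return hits / total

y_true = [0, 1, 1, 1, 0]
y_pred = [0, 1, 0, 1, 0]
# User-provided weights, not the ones set internally by ExponentiatedGradient.
sample_weight = [1.0, 2.0, 1.0, 2.0, 1.0]
```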
| "sensitive_feature gender" : { # an example feature | ||
| "name": "gender", | ||
| "values": [0, 1, 0, 0, 2], | ||
| "value_names": ["female", "male", "non-binary"], |
Should there be a 'type' field in here, so things like 'prediction' and 'sensitive_feature' don't have to go into the key?
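For concreteness, the alternative being suggested might look like this, with the role moved out of the key and into a `type` field (field names here are illustrative, not settled):

```python
# Sketch of the 'type' field alternative: keys stay free-form, and the
# role of each entry is declared explicitly.
entry_with_type_field = {
    "gender": {
        "type": "sensitive_feature",
        "name": "gender",
        "values": [0, 1, 0, 0, 2],
        "value_names": ["female", "male", "non-binary"],
    },
}
```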
```python
},
"cache": [
    {
        "function": string,  # python function name; we could either limit to fairlearn.metrics
```
Use fully qualified names for sure.
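A fully qualified name in the `"function"` field can be resolved at load time with `importlib`. A minimal sketch, using a stdlib function (`statistics.mean`) as a stand-in for something like `fairlearn.metrics.<...>`:

```python
import importlib

def resolve_function(qualified_name):
    """Resolve a fully qualified function name like 'pkg.module.func'."""
    module_name, _, attr = qualified_name.rpartition(".")
    module = importlib.import_module(module_name)
    return getattr(module, attr)

# Stand-in example; a dashboard would pass the name stored in "function".
mean = resolve_function("statistics.mean")
```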
| "return_value": { | ||
| "overall": 0.11, | ||
| "by_group": { | ||
| "keys": [0, 1, 2], |
Are the 'keys' necessary if we require all categoricals to be integer-encoded?
| "<array_key>" : { # the keys can be arbitrary strings; not sure we need to force any convention, but see examples below | ||
| "name": string, # the name of a feature would be the feature name, of a prediction vector would be the model name | ||
| "values": number[], | ||
| "value_names": string[], # an optional field to encode categorical data |
Presumably we also specify that extra keys (e.g. inserted by AzureML) are to be preserved.
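Putting the fragments above together, a complete instance of the proposed schema might look as follows. The outer keys, the metric name, and the numbers are illustrative; extra keys (e.g. ones inserted by AzureML) would simply be preserved alongside these.

```python
# Illustrative full example of the proposed dashboard dictionary.
example_dashboard_dict = {
    "prediction_type": "binary_classification",
    "true_y": {  # outer keys are arbitrary strings
        "name": "y_true",
        "values": [0, 1, 1, 1, 0],
    },
    "sample_weight": {  # user-provided, not set within ExponentiatedGradient
        "name": "sample_weight",
        "values": [1.0, 2.0, 1.0, 2.0, 1.0],
    },
    "sensitive_feature gender": {  # an example feature
        "name": "gender",
        "values": [0, 1, 0, 0, 2],
        "value_names": ["female", "male", "non-binary"],
    },
    "cache": [
        {
            "function": "fairlearn.metrics.selection_rate",  # fully qualified
            "return_value": {
                "overall": 0.11,  # made-up value
                "by_group": {
                    "keys": [0, 1, 2],
                },
            },
        },
    ],
}
```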
This is something I've started cogitating on again, in the context of the AzureML (now MLFlow) integration. We do want to enable composability, but also avoid saving lots of copies of […]

Then again, we also need an API which allows users to 'mess around in a notebook' without having to set up a bunch of prerequisites. A small example of this is how the dashboard doesn't require model and sensitive feature names, but will generate them itself if invoked without them. In contrast […]