Implement R-Pol policy risk score and ROC AUC score 

Some datasets cannot be evaluated using the currently used scores PEHE or ENoRMSE, because

1. No ground truth is available (e.g. the Jobs dataset from Lalonde)
2. The classes are imbalanced and binary (e.g. the Twins dataset) 

Thus, we need more scores for comprehensive evaluation. Especially the policy risk used, for example, by [Shalit et al](https://arxiv.org/pdf/1606.03976.pdf). Also, the ROC-curve or the area-under-the-curve (AUC) of the ROC-Curve should be used in binary cases like the wins dataset. 




Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Implement R-Pol policy risk score and ROC AUC score #9

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Uh oh!

Implement R-Pol policy risk score and ROC AUC score #9

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions