
negate the result and prefix the metric name for error/loss metrics #278

Merged: 6 commits from issue/268 into master on Apr 6, 2021

Conversation

@sebhrusen (Collaborator) commented on Apr 1, 2021:

#268

The result and metric columns are changed only when the metric represents an error (a short sketch follows the example below):

  • the metric name is prefixed with neg_;
  • the result is negated.

Example:

Summing up scores for current run:
                  id         task          framework constraint fold    result       metric   mode version params               app_version                  utc  duration  training_duration  predict_duration models_count       seed       acc  auc    balacc   logloss      mae        r2     rmse
0  openml.org/t/3913          kc2  constantpredictor       test    0   0.50000          auc  local  0.23.2         dev [issue/268, b990e93]  2021-04-01T15:52:54       0.2                0.0               0.0            1  933769621  0.792453  0.5  0.500000  0.510714      NaN       NaN      NaN
1  openml.org/t/3913          kc2  constantpredictor       test    1   0.50000          auc  local  0.23.2         dev [issue/268, b990e93]  2021-04-01T15:52:54       0.1                0.0               0.0            1  933769622  0.792453  0.5  0.500000  0.510714      NaN       NaN      NaN
2    openml.org/t/59         iris  constantpredictor       test    0  -1.09861  neg_logloss  local  0.23.2         dev [issue/268, b990e93]  2021-04-01T15:52:54       0.0                0.0               0.0            1  933769621  0.333333  NaN  0.333333  1.098610      NaN       NaN      NaN
3    openml.org/t/59         iris  constantpredictor       test    1  -1.09861  neg_logloss  local  0.23.2         dev [issue/268, b990e93]  2021-04-01T15:52:54       0.0                0.0               0.0            1  933769622  0.333333  NaN  0.333333  1.098610      NaN       NaN      NaN
4  openml.org/t/2295  cholesterol  constantpredictor       test    0 -45.68970     neg_rmse  local  0.23.2         dev [issue/268, b990e93]  2021-04-01T15:52:54       0.0                0.0               0.0            1  933769621       NaN  NaN       NaN       NaN  35.6774 -0.077562  45.6897
5  openml.org/t/2295  cholesterol  constantpredictor       test    1 -55.00410     neg_rmse  local  0.23.2         dev [issue/268, b990e93]  2021-04-01T15:52:54       0.0                0.0               0.0            1  933769622       NaN  NaN       NaN       NaN  45.3871 -0.049133  55.0041
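
As a sketch of the rule above (the helper name normalize_result and the set of error metrics are illustrative assumptions, not the PR's actual code):

# Illustrative sketch: for error/loss metrics (lower is better), prefix
# the name with neg_ and negate the value, so that a higher result is
# always better across the results table.
ERROR_METRICS = {"logloss", "mae", "mse", "rmse"}  # assumed set, for illustration

def normalize_result(metric, value):
    if metric in ERROR_METRICS:
        return "neg_" + metric, -value
    return metric, value

print(normalize_result("auc", 0.5))       # ('auc', 0.5)
print(normalize_result("rmse", 45.6897))  # ('neg_rmse', -45.6897)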

@sebhrusen (Collaborator, Author) commented:

@Innixma does this look reasonable?

@sebhrusen requested a review from @PGijsbers on April 1, 2021.
@PGijsbers (Collaborator) left a comment:

Prefixing neg_ is probably more sensible (since scikit-learn does it), so I agree with that choice even though it takes more space. Other than that, I tested it (with the constant predictor) and it looks good.
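
For context, scikit-learn's convention can be checked in a few lines (an aside, not code from this PR): its loss-based scorers are exposed under neg_-prefixed names and return negated values, which reproduces the iris numbers in the table above:

# scikit-learn exposes loss metrics as negated, neg_-prefixed scorers,
# so a larger score is always better for model selection.
from sklearn.datasets import load_iris
from sklearn.dummy import DummyClassifier
from sklearn.metrics import get_scorer

X, y = load_iris(return_X_y=True)
clf = DummyClassifier(strategy="prior").fit(X, y)  # a constant predictor
print(get_scorer("neg_log_loss")(clf, X, y))  # ~ -1.0986, matching the iris rows above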

@sebhrusen merged commit f7b68eb into master on April 6, 2021, and deleted the issue/268 branch.
@Innixma (Collaborator) left a comment:

Apologies for the late response, I was on PTO.

Looks good to me; I left a comment about improving long-term code quality and extensibility.

def auc(self):
"""Array Under (ROC) Curve, computed on probabilities, not on predictions"""
Collaborator:

nit: area instead of array

@sebhrusen (Collaborator, Author):

oops! will fix

return float(r2_score(self.truth, self.predictions))


def higher_is_better(metric):
Collaborator:

This seems a bit hacky. Better to have either a dictionary mapping or metrics as classes (see AutoGluon for an example).

@sebhrusen (Collaborator, Author):

I can't disagree with you: it IS a bit hacky.
Ideally, there should be a class for each metric. It's probably something I'll do at some point, to support custom metrics or other customizations in a more satisfying way than what was done in #141.
If there's demand for it, I'll do it.
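
For illustration, a minimal sketch of the class-based alternative discussed above (the Metric class and registry are assumptions loosely inspired by AutoGluon's scorer design, not code from this PR):

# Hypothetical sketch: metrics as objects carrying their own direction,
# replacing a string-based higher_is_better() lookup.
from dataclasses import dataclass
from typing import Callable

from sklearn.metrics import log_loss, roc_auc_score

@dataclass(frozen=True)
class Metric:
    name: str
    fn: Callable                 # fn(truth, predictions) -> float
    greater_is_better: bool

    def result(self, truth, predictions):
        """Return (reported name, reported value), negating losses."""
        value = float(self.fn(truth, predictions))
        if self.greater_is_better:
            return self.name, value
        return "neg_" + self.name, -value

# A registry would make adding custom metrics a one-line change:
METRICS = {
    "auc": Metric("auc", roc_auc_score, greater_is_better=True),
    "logloss": Metric("logloss", log_loss, greater_is_better=False),
}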
