
Corrected f_beta computation #4183

Merged · 7 commits merged into Lightning-AI:master on Oct 21, 2020

Conversation

@abhinavg97 (Contributor) commented Oct 16, 2020

Added METRIC_EPS to the denominator of the computation to avoid NaN values in the f_beta score.

What does this PR do?

Gives the correct f_beta score and prevents NaN outputs for multilabel classification with average='macro'.

Fixes #4187
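
For context, the fix has this shape: add a small epsilon to each denominator so that 0/0 cases produce 0 instead of NaN. A minimal sketch, assuming the METRIC_EPS constant named in this PR (the free-standing helper below is illustrative; the actual change lives in Fbeta.compute):

import torch

METRIC_EPS = 1e-6  # assumed value of the small constant

def fbeta_from_counts(true_positives, predicted_positives, actual_positives, beta=1.0):
    # With METRIC_EPS in the denominators, a class with no predicted or
    # actual positives scores 0 rather than NaN, so a macro average stays finite.
    precision = true_positives.float() / (predicted_positives + METRIC_EPS)
    recall = true_positives.float() / (actual_positives + METRIC_EPS)
    return (1 + beta ** 2) * precision * recall / (beta ** 2 * precision + recall + METRIC_EPS)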

Before submitting

  • Was this discussed/approved via a GitHub issue? (not needed for typos and docs improvements)
  • Did you read the contributor guideline, Pull Request section?
  • Did you make sure your PR does only one thing, instead of bundling different changes together? Otherwise, we ask you to create a separate PR for every change.
  • Did you make sure to update the documentation with your changes?
  • Did you write any new necessary tests?
  • Did you verify new and existing tests pass locally with your changes?
  • If you made a notable change (that affects users), did you update the CHANGELOG?

PR review

Anyone in the community is free to review the PR once the tests have passed.
If we didn't discuss your PR in GitHub issues, there's a high chance it will not be merged.

Did you have fun?

Make sure you had fun coding 🙃 👍

Added METRIC_EPS in the denominator to avoid nan values in f_beta score.
@pep8speaks commented Oct 16, 2020

Hello @abhinavg97! Thanks for updating this PR.

Line 91:121: E501 line too long (121 > 120 characters)

Comment last updated at 2020-10-21 08:07:39 UTC

@mergify mergify bot requested a review from a team October 16, 2020 03:34
Made changes flake8 compliant
@codecov bot commented Oct 16, 2020

Codecov Report

Merging #4183 into master will decrease coverage by 1%.
The diff coverage is 100%.

@@           Coverage Diff           @@
##           master   #4183    +/-   ##
=======================================
- Coverage      90%     89%    -1%     
=======================================
  Files         103     103            
  Lines        7863    8313   +450     
=======================================
+ Hits         7064    7403   +339     
- Misses        799     910   +111     

@abhinavg97 abhinavg97 changed the title Update f_beta.py Corrected f_beta computation by adding METRIC_EPS Oct 16, 2020
@abhinavg97 abhinavg97 changed the title Corrected f_beta computation by adding METRIC_EPS Corrected f_beta computation by adding METRIC_EPS in denominator Oct 16, 2020
@Borda (Member) left a comment

I think adding this epsilon offset to everything is not good; we should check where there are zeros and add it only there, or if we know that some are zero, just return the output...
@justusschock @SkafteNicki
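
A sketch of the selective alternative being suggested, substituting only where the denominator is zero (the helper name and shape are illustrative):

import torch

def safe_divide(num, denom):
    # Divide elementwise, returning 0 wherever denom == 0 instead of NaN/inf,
    # and leaving every other entry exactly num / denom (no global offset).
    zero_mask = denom == 0
    result = num.float() / torch.where(zero_mask, torch.ones_like(denom), denom).float()
    return torch.where(zero_mask, torch.zeros_like(result), result)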

@Borda Borda added bug Something isn't working Metrics labels Oct 16, 2020
@mergify mergify bot requested a review from a team October 16, 2020 07:12
@SkafteNicki (Member)

We have this function (still used in the functional metrics) that takes care of this:
https://github.com/PyTorchLightning/pytorch-lightning/blob/130de22fd75253330320473d8081fd60698c3d64/pytorch_lightning/metrics/functional/reduction.py#L40-L78
We should probably reuse it in the new class API.
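
For reference, a simplified sketch of what such a class-reduce helper does; this is not the verbatim source at the link above:

import torch

METRIC_EPS = 1e-6  # assumed constant

def class_reduce(num, denom, weights, class_reduction="none"):
    # Reduce per-class scores num / denom while guarding against 0/0.
    if class_reduction == "micro":
        return torch.sum(num).float() / (torch.sum(denom) + METRIC_EPS)
    fraction = num.float() / (denom + METRIC_EPS)  # eps only matters where denom == 0
    if class_reduction == "macro":
        return torch.mean(fraction)
    if class_reduction == "weighted":
        return torch.sum(fraction * (weights.float() / torch.sum(weights)))
    return fraction  # "none": return the per-class scores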

Makes use of class_reduce for macro f_beta computation to avoid nans
Made flake8 compliant
@abhinavg97 abhinavg97 requested review from Borda and removed request for a team October 16, 2020 09:13
@mergify mergify bot requested a review from a team October 16, 2020 09:13
@SkafteNicki (Member)

@abhinavg97 is it correct that this happens when both recall and precision are 0 for a particular class?
Could we add a test case for this?

@abhinavg97 (Contributor, Author)

abhinavg97 commented Oct 16, 2020

Yes, that's right.

Here is an example:

import torch
from pytorch_lightning.metrics import Fbeta

predictions = torch.Tensor([[0, 0], [0, 1]])
labels = torch.Tensor([[0, 1], [1, 0]])

# every prediction disagrees with its label, so each class has
# zero true positives and precision = recall = 0/0
f_beta = Fbeta(num_classes=len(labels[0]), average='macro', multilabel=True)

f_beta(predictions, labels)

> tensor(nan)

Expected: tensor(0.)

Yeah I think we can add a test for this. Is it okay if I add the above case as a test?

@Borda (Member)

Borda commented Oct 16, 2020

can we add a test verifying that these changes fix the behavior seen on current master...?

@SkafteNicki (Member)

@abhinavg97 can you add the following test case to tests/metrics/classification/inputs and afterwards add it to the corresponding parametrization of the f-beta tests:

# Generate a multilabel edge case where nothing matches (scores are undefined)
__temp_preds = torch.randint(high=2, size=(NUM_BATCHES, BATCH_SIZE, NUM_CLASSES))
__temp_target = abs(__temp_preds - 1)

_multilabel_inputs_no_match = Input(
    preds=__temp_preds,
    target=__temp_target
)

@abhinavg97 (Contributor, Author)

@abhinavg97 can you add the following test case to tests/metrics/classification/inputs and afterwards add it to the corresponding parametrization of the f-beta tests:

# Generate a multilabel edge case where nothing matches (scores are undefined)
__temp_preds = torch.randint(high=2, size=(NUM_BATCHES, BATCH_SIZE, NUM_CLASSES))
__temp_target = abs(__temp_preds - 1)

_multilabel_inputs_no_match = Input(
    preds=__temp_preds,
    target=__temp_target
)

Yes sure, will do. Thanks for the pointer to the input location :)
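
To make the exchange concrete, a hypothetical sketch of how such an Input case plugs into a parametrized f-beta test; the constants, decorator fields, and test body are illustrative, not the exact ones in the test suite:

import pytest
import torch
from collections import namedtuple

from pytorch_lightning.metrics import Fbeta

Input = namedtuple("Input", ["preds", "target"])
NUM_BATCHES, BATCH_SIZE, NUM_CLASSES = 2, 4, 3  # assumed test constants

__temp_preds = torch.randint(high=2, size=(NUM_BATCHES, BATCH_SIZE, NUM_CLASSES))
__temp_target = abs(__temp_preds - 1)
_multilabel_inputs_no_match = Input(preds=__temp_preds, target=__temp_target)

@pytest.mark.parametrize(
    "preds, target",
    [(_multilabel_inputs_no_match.preds, _multilabel_inputs_no_match.target)],
)
def test_fbeta_no_match_is_finite(preds, target):
    # nothing matches, so every class hits the 0/0 case;
    # the macro score should be a finite 0, not NaN
    f_beta = Fbeta(num_classes=NUM_CLASSES, average='macro', multilabel=True)
    for p, t in zip(preds, target):  # feed one batch at a time
        f_beta(p, t)
    assert not torch.isnan(f_beta.compute())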

@abhinavg97 abhinavg97 changed the title Corrected f_beta computation by adding METRIC_EPS in denominator Corrected f_beta computation Oct 19, 2020
@edenlightning edenlightning added this to the 1.0.3 milestone Oct 19, 2020
@SkafteNicki (Member) left a comment

LGTM

@mergify mergify bot requested a review from a team October 21, 2020 07:48
@tchaton (Contributor) left a comment

Great PR! Thanks!

@@ -124,9 +125,11 @@ def compute(self):
precision = self.true_positives.sum().float() / (self.predicted_positives.sum() + METRIC_EPS)
A Member commented:

do we really want to keep our metric systematically imprecise by adding some offset?

A Member replied:

we can move towards the class_reduce function, which explicitly handles NaNs, when we unify the class-based metrics and functional metrics.

The Contributor (Author) replied:

Yeah, makes sense. I just checked and the tests pass without METRIC_EPS. Pushing the update now.

@Borda Borda added the ready PRs ready to be merged label Oct 21, 2020
@SkafteNicki SkafteNicki merged commit 5d1583d into Lightning-AI:master Oct 21, 2020
Labels: bug (Something isn't working), ready (PRs ready to be merged)
Linked issue: F beta with macro computation outputs nans for certain inputs (#4187)
7 participants