
Sprinkle M1 with comments on what the evaluation means #550

Merged
merged 9 commits into INRIA:main from sprinkle_M1 on Feb 11, 2022

Conversation

ArturoAmorQ
Collaborator

Partially addresses #530.

This PR adds comments on the score method in M1 (see the sketch after the list below):

  • the first time the method is used
  • the first time we use a test set for scoring
  • in the pipeline video, where care has to be taken on interpreting the result naively

@ArturoAmorQ changed the title from "Sprinkle m1" to "Sprinkle M1 with comments on what the evaluation means" on Jan 24, 2022
ArturoAmorQ and others added 3 commits February 10, 2022 07:40
Co-authored-by: Olivier Grisel <olivier.grisel@ensta.org>
Co-authored-by: Olivier Grisel <olivier.grisel@ensta.org>
Co-authored-by: Olivier Grisel <olivier.grisel@ensta.org>
@ArturoAmorQ
Collaborator Author

Thanks for the comments @ogrisel !

# But, can this evaluation be trusted, or is it too good to be true?
# This result means that the model makes a correct _prediction_ for
# approximately 82 samples out of 100. But, can a model _predict_ something
# that it already saw? In other words, can this evaluation be trusted, or is it
lesteve
Collaborator

I think I know what you are trying to say (that we are measuring the accuracy on the training data and that it is kind of cheating) but I find the wording super confusing ...

In particular: "can a model predict something that it already saw?" I would answer "yes, why not? Sorry, is this a trick question?"

I think you probably mean "can this really be called prediction when we are learning and predicting from the same data" but I can't find a good wording that convinces me.

I kind of think the next section, on the train-test split, already explains this kind of thing, so I would keep it short, maybe something like this:

Note that here we used the same data to learn and evaluate our model, so can this evaluation be trusted, or is it too good to be true?

Co-authored-by: Loïc Estève <loic.esteve@ymail.com>
@lesteve
Collaborator

lesteve commented Feb 11, 2022

I tried hard to refrain from tweaking the wording, but I did not manage to ...

Thanks, merging this one!

@lesteve lesteve merged commit 4e126c7 into INRIA:main Feb 11, 2022
github-actions bot pushed a commit that referenced this pull request Feb 11, 2022
Co-authored-by: Olivier Grisel <olivier.grisel@ensta.org>
Co-authored-by: Loïc Estève <loic.esteve@ymail.com> (commit 4e126c7)
@ArturoAmorQ ArturoAmorQ deleted the sprinkle_M1 branch March 11, 2022 13:21