Add ImageClassificationEvaluator #173

Merged: 12 commits merged into huggingface:main on Jul 5, 2022

Conversation

@fxmarty (Contributor) commented Jul 4, 2022

Let's put #167, which requires some more work, on hold and try to merge this one first instead.

Image classification is a much simpler task, and the pipeline for evaluation works out of the box.

I did some refactoring to avoid copying code, following a suggestion from @ola13.

@HuggingFaceDocBuilderDev commented Jul 4, 2022

The documentation is not available anymore as the PR was closed or merged.

@ola13 (Contributor) left a comment

Thanks a lot for looking into it @fxmarty, some minor comments inline :)

src/evaluate/evaluator/__init__.py (outdated; resolved)
src/evaluate/evaluator/__init__.py (resolved)
metric = self.prepare_metric(metric)

references = data[label_column]
predictions = self._compute_predictions(pipe, data[input_column], label_mapping=label_mapping)
@ola13 (Contributor) commented Jul 5, 2022

Given the current refactoring, it might be more readable to do _compute_predictions inline (same for image_classification; splitting it into a separate function may have been a suboptimal idea from the get-go).
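(Illustrative sketch, not code from this PR: one way such inlining could look, reusing the names from the snippet above and assuming the pipeline returns one dict with a "label" key per example.)

```python
# Hypothetical inlined prediction step; assumes each pipeline output is a dict
# with a "label" key and that label_mapping maps pipeline labels to dataset ids.
pipe_output = pipe(data[input_column])
predictions = [
    label_mapping[element["label"]] if label_mapping is not None else element["label"]
    for element in pipe_output
]
```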

@fxmarty (Contributor, Author) commented Jul 5, 2022

That's a good point; let me know if the changes I made in fbf66a8 are fine.

@ola13 (Contributor) left a comment

Thanks @fxmarty, looks great! Accepting the PR; however, it looks like some tests are failing (problems with imports?). Let's make sure these are resolved before merging.

@fxmarty force-pushed the add-image-classification-evaluator branch from 722a7ec to c88815a on July 5, 2022, 10:06
@fxmarty (Contributor, Author) commented Jul 5, 2022

@ola13 Should be fine now, we needed a rebase :)

@ola13 merged commit 27ea232 into huggingface:main on Jul 5, 2022
@lvwerra (Member) left a comment

Looks good, just one question: why do we need a bigger machine for the tests? Thanks for adding this ❤️

@@ -8,7 +8,7 @@ jobs:
    working_directory: ~/evaluate
    docker:
      - image: cimg/python:3.7
-   resource_class: medium
+   resource_class: large
Member commented:

Why is the large class needed? Because of the evaluator/trainer test?

@fxmarty (Contributor, Author) commented Jul 6, 2022

Yes, it is because of the evaluator/trainer test. I don't understand why; running the tests locally, I was using at most 800 MB of RAM for the evaluator/trainer test.

With medium: https://app.circleci.com/pipelines/github/huggingface/evaluate/468/workflows/5b57ecc8-abd1-4fdb-a1fb-db655510fc60/jobs/1409/resources
With large: https://app.circleci.com/pipelines/github/huggingface/evaluate/479/workflows/7f837ed5-bb94-4c60-a26d-9ba0bfae3c93/jobs/1442/resources

The models in the parity tests are 45 MB and 18 MB. The "beans" dataset looks to be <200 MB ( https://huggingface.co/datasets/beans/tree/main/data ). sst2 should be < 10 MB. So I am not sure what the issue is.

The test can be run locally to check memory usage: `pytest tests/test_trainer_evaluator_parity.py`
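(Side note, not part of the PR: one way to capture the peak memory of that local run using only the standard library on a Unix-like system; ru_maxrss units differ by platform.)

```python
# Rough sketch: run the parity test in a child process and report its peak RSS.
import resource
import subprocess

subprocess.run(["pytest", "tests/test_trainer_evaluator_parity.py"], check=True)
peak = resource.getrusage(resource.RUSAGE_CHILDREN).ru_maxrss
print(f"peak RSS of child processes: {peak}")  # kilobytes on Linux, bytes on macOS
```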

@@ -37,7 +37,7 @@ def setUp(self):
         )

     def tearDown(self):
-        shutil.rmtree(self.dir_path, onerror=onerror)
+        shutil.rmtree(self.dir_path, ignore_errors=True)
Member commented:

Does that mean that on Windows the folder will just not be removed in that case?

Contributor (Author) commented:

On Windows, read-only files (typically the .git contents) will not be removed. I remember I still had issues with the onerror handler from https://stackoverflow.com/a/2656405 . Would you prefer to roll back to an error handler that makes sure read-only files are deleted? I agree it is cleaner in case anybody runs the tests on Windows.
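(For reference, a minimal sketch of the error-handler approach from the linked StackOverflow answer: clear the read-only flag and retry, so .git contents also get removed on Windows.)

```python
import os
import stat
import shutil


def onerror(func, path, exc_info):
    """rmtree error handler: if removal failed because the file is read-only
    (typical for .git contents on Windows), make it writable and retry."""
    if not os.access(path, os.W_OK):
        os.chmod(path, stat.S_IWRITE)
        func(path)
    else:
        raise


# usage, as in the line the diff above replaced:
# shutil.rmtree(self.dir_path, onerror=onerror)
```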



try:
    from transformers import FeatureExtractionMixin, Pipeline, PreTrainedModel, TFPreTrainedModel
@lvwerra (Member) commented Jul 7, 2022

I think FeatureExtractionMixin was only added in transformers==4.17.0. I think we need to update setup.py to install the right version (the tests were failing for me locally even when installing the evaluator extra).

huggingface/transformers@b5c6fde

Could you confirm @fxmarty?
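(Two hedged sketches, not taken from the PR, of how this could be addressed: keep the import guarded so evaluate still imports without a recent transformers, and pin the minimum version in setup.py. The flag name and the extra name below are assumptions.)

```python
# Sketch of a guarded import: if transformers is missing, or too old to expose
# FeatureExtractionMixin (added in 4.17.0), fall back gracefully instead of
# crashing at import time.
try:
    from transformers import FeatureExtractionMixin, Pipeline, PreTrainedModel, TFPreTrainedModel

    TRANSFORMERS_AVAILABLE = True
except ImportError:
    TRANSFORMERS_AVAILABLE = False

# Hypothetical setup.py pin for the evaluator extra:
# extras["evaluator"] = ["transformers>=4.17.0"]
```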
