wrong test acc because redundant data in ddp mode #4732

xiadingZ · 2020-11-18T07:23:49Z

If I have 499 videos to test set, in ddp model, It will load 512 videos to test, maybe by copying videos to match the batch size. But it will cause wrong test accuracy. Now I need to save each videos' predictions and calculate acc by my own. Is there any way to solve this problem?

Borda · 2020-11-18T08:40:04Z

@xiadingZ mind sharing some code example or Colab or how do you know that your test loaded more videos, what is you batch size?

xiadingZ · 2020-11-18T08:44:49Z

I write a test_flist to load video data, it only has 499 line video info. And in test_epoch_end, I write all video's prediction score to disk, with video id. Then I load these predictions, it has 512 videos, some of videos have same id. my batch size is 8, use ddp mode on multi-gpu

this ls my test dataloader

        dataset = VideoDataset(self.hparams, mode='val', transform=transform)
        return DataLoader(dataset, batch_size=self.batch_size,
                          num_workers=self.num_workers, pin_memory=True)

SkafteNicki · 2020-11-18T10:26:51Z

Duplicate of #2398
The short answer is that DistributedSampler adds additional samples to even the load over all processes. This will cause a slight bias in the metric value. Currently, you would need to run the testing on single gpu, until we support uneven inputs in ddp (#3325)

Borda · 2020-11-18T10:36:02Z

closing in favor of #2398 so pls continue the thread there 🐰

xiadingZ added feature Is an improvement or enhancement help wanted Open to be worked on labels Nov 18, 2020

Borda added the information needed label Nov 18, 2020

Borda added duplicate This issue or pull request already exists and removed information needed labels Nov 18, 2020

Borda closed this as completed Nov 18, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

wrong test acc because redundant data in ddp mode #4732

wrong test acc because redundant data in ddp mode #4732

xiadingZ commented Nov 18, 2020

Borda commented Nov 18, 2020

xiadingZ commented Nov 18, 2020 •

edited

Loading

SkafteNicki commented Nov 18, 2020

Borda commented Nov 18, 2020

wrong test acc because redundant data in ddp mode #4732

wrong test acc because redundant data in ddp mode #4732

Comments

xiadingZ commented Nov 18, 2020

Borda commented Nov 18, 2020

xiadingZ commented Nov 18, 2020 • edited Loading

SkafteNicki commented Nov 18, 2020

Borda commented Nov 18, 2020

xiadingZ commented Nov 18, 2020 •

edited

Loading