Refactor Offline-ER to work with collate_fn
#390
Conversation
Coverage report
Note: Coverage evolution is disabled because this PR targets a different branch.
Diff coverage details:
src/renate/memory/buffer.py
src/renate/utils/pytorch.py
src/renate/updaters/learner.py
src/renate/updaters/experimental/offline_er.py
    Args:
        dataset_lengths: The lengths of the different datasets.
        batch_sizes: Batch sizes used for specific datasets.
        complete_dataset_iteration: Provide an index to indicate over which dataset to fully
            iterate.
Possibly rename?
suggestions?
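For context, a hypothetical usage sketch of the sampler with the arguments from the docstring above; the constructor signature and import path are assumptions and may not match the implementation exactly.

```python
# Hypothetical usage sketch; signature and import path are assumptions.
import torch
from torch.utils.data import ConcatDataset, DataLoader, TensorDataset

from renate.utils.pytorch import ConcatRandomSampler

ds_new = TensorDataset(torch.randn(100, 4))    # "new" data, length 100
ds_memory = TensorDataset(torch.randn(30, 4))  # memory buffer data, length 30

sampler = ConcatRandomSampler(
    dataset_lengths=[100, 30],     # lengths of the concatenated datasets
    batch_sizes=[8, 2],            # per-dataset batch sizes
    complete_dataset_iteration=1,  # iterate fully over dataset index 1
)
# Each yielded batch contains 8 indices into ds_new and 2 indices into ds_memory,
# all expressed in the index space of the concatenated dataset.
loader = DataLoader(ConcatDataset([ds_new, ds_memory]), batch_sampler=sampler)
```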
            else num_batches[self.complete_dataset_iteration]
        )

    def __iter__(self) -> Iterator[List[int]]:
Can you add comments about the exact logic?
            # Zip case: flatten one batch from each sub-sampler into a single batch.
            yield [j for i in samples for j in i]
        else:
            # Iterate fully over the selected dataset; draw from the others via next().
            iterators = [iter(sampler) for sampler in self.subset_samplers]
            for s in iterators[self.complete_dataset_iteration]:
Is this optimized? Nested for-loops for each batch seems like a lot.
There is no nested loop for each batch; it is a single loop over each iterator. In case 1 this is hidden inside zip, but that also loops over each iterator and calls next.
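For readers following along, a simplified sketch of the two iteration modes being discussed; this is an illustration of the control flow, not the exact implementation in the PR.

```python
from typing import Iterator, List


def iterate_batches(subset_samplers, complete_dataset_iteration=None) -> Iterator[List[int]]:
    """Sketch: each element of subset_samplers yields lists of indices (one batch)."""
    if complete_dataset_iteration is None:
        # Case 1: zip stops at the shortest sampler, so one pass over each iterator.
        for samples in zip(*subset_samplers):
            yield [idx for batch in samples for idx in batch]
    else:
        # Case 2: the selected dataset drives the loop; the other iterators are
        # advanced with next() and restarted when they are exhausted.
        iterators = [iter(s) for s in subset_samplers]
        for main_batch in iterators[complete_dataset_iteration]:
            batch = list(main_batch)
            for i, it in enumerate(iterators):
                if i == complete_dataset_iteration:
                    continue
                try:
                    batch.extend(next(it))
                except StopIteration:
                    iterators[i] = iter(subset_samplers[i])
                    batch.extend(next(iterators[i]))
            yield batch
```

With subset samplers yielding [[0, 1], [2, 3]] and [[7], [8], [9]], the None case produces two batches, while complete_dataset_iteration=1 produces three.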
Can you check this works with distributed training? That uses something like a DistributedSampler
which also modifies the data to sample from.
src/renate/utils/pytorch.py
Outdated
            data_start_idx = data_end_idx
        self.length = (
            min(num_batches)
            if complete_dataset_iteration is None
Why not self.complete_dataset_iteration here?
done
src/renate/utils/pytorch.py
Outdated
@@ -156,3 +156,76 @@ def complementary_indices(num_outputs: int, valid_classes: Set[int]) -> List[int]:
        valid_classes: A set of integers of valid classes.
    """
    return [class_idx for class_idx in range(num_outputs) if class_idx not in valid_classes]

class ConcatRandomSampler(BatchSampler):
Why inherit from BatchSampler?
changed to Sampler
            # Shard this dataset's index range across replicas by rank.
            start_idx = data_start_idx + round(dataset_length / num_replicas * rank)
            end_idx = data_start_idx + round(dataset_length / num_replicas * (rank + 1))
            subset_sampler = BatchSampler(
                SubsetRandomSampler(list(range(start_idx, end_idx)), generator),
Why BatchSampler of SubsetRandomSampler?
BatchSampler creates batches; SubsetRandomSampler creates random ints from the provided list (List[int] vs int).
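A minimal sketch of that composition using the standard PyTorch classes: SubsetRandomSampler yields single indices from the given list, and wrapping it in BatchSampler groups them into lists of batch_size indices.

```python
import torch
from torch.utils.data import BatchSampler, SubsetRandomSampler

generator = torch.Generator().manual_seed(0)
# Yields the integers 10..19 one at a time, in random order.
index_sampler = SubsetRandomSampler(list(range(10, 20)), generator=generator)
# Groups those integers into lists of 4; incomplete trailing batches are dropped.
subset_sampler = BatchSampler(index_sampler, batch_size=4, drop_last=True)

print(list(subset_sampler))  # e.g. [[14, 11, 19, 17], [12, 15, 13, 10]]
```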

@pytest.mark.parametrize(
    "complete_dataset_iteration,expected_batches", [[None, 2], [0, 7], [1, 5], [2, 2]]
For None, batches is 2 because 20 // 8 = 2?
Yes, it is identical to the [2, 2] case.
So a drop_last is implicit?
Yes, improved the doc.
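For reference, the batch-count arithmetic behind this thread written out as a small sketch; only the 20-sample, batch-size-8 dataset is confirmed above, so the helper below is purely illustrative.

```python
def num_batches(dataset_length: int, batch_size: int) -> int:
    # Incomplete trailing batches are dropped, i.e. an implicit drop_last=True.
    return dataset_length // batch_size

# The dataset at index 2 has 20 samples drawn with batch size 8:
assert num_batches(20, 8) == 2
# With complete_dataset_iteration=None the sampler stops at the shortest dataset,
# so the expected number of batches is min() over all per-dataset counts, here also 2.
```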
@@ -11,6 +11,7 @@
from renate.memory.buffer import ReservoirBuffer
from renate.utils import pytorch
from renate.utils.pytorch import (
Is it possible to add a DistributedSampler to a test?
I've added a unit test for the distributed case instead.
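As an aside, one way such a test could look; this is a sketch based on the rank-sharding formula in the diff above, not necessarily the test that was actually added.

```python
def shard_range(data_start_idx: int, dataset_length: int, num_replicas: int, rank: int):
    # Mirrors the start_idx/end_idx computation from the diff above.
    start = data_start_idx + round(dataset_length / num_replicas * rank)
    end = data_start_idx + round(dataset_length / num_replicas * (rank + 1))
    return list(range(start, end))


def test_shards_are_disjoint_and_cover_the_dataset():
    shards = [shard_range(0, 100, num_replicas=4, rank=r) for r in range(4)]
    flat = [idx for shard in shards for idx in shard]
    assert sorted(flat) == list(range(100))           # no index is lost or duplicated
    assert all(len(shard) == 25 for shard in shards)  # equal split across replicas
```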
Offline-ER currently applies collate_fn individually to the new data and the memory data. This change applies the collate function to the entire batch instead.
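An illustrative sketch of that behavioral difference using PyTorch's default_collate; the actual Offline-ER batch structure in renate is more involved, so the dictionaries below are just placeholders.

```python
# Assumes a recent PyTorch version that exposes default_collate under torch.utils.data.
from torch.utils.data import default_collate

new_points = [{"x": 1}, {"x": 2}]  # points from the current dataset
memory_points = [{"x": 3}]         # points drawn from the memory buffer

# Before: the collate function is applied separately to each part.
old_style = (default_collate(new_points), default_collate(memory_points))

# After: the parts are combined first and collated as one batch.
new_style = default_collate(new_points + memory_points)
```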
By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.