[Feature] Add Stack transform #2567 (base: main)
Conversation
🔗 Helpful links: 🧪 see artifacts and rendered test results at hud.pytorch.org/pr/pytorch/rl/2567. Note: links to docs will display an error until the docs builds have completed.

❗ There is 1 currently active SEV; if your PR is affected, please view it.

❌ As of commit f443812 with merge base 408cf7d: 6 new failures and 18 unrelated failures (flaky, or already broken on trunk). 👉 Rebase onto the `viable/strict` branch to avoid the broken-trunk failures.

This comment was automatically generated by Dr. CI and updates every 15 minutes.
Looks like there is a minor bug if I try to use this on …
Thanks for this, long-awaited feature!
Just left a couple of comments on the default dim and the test set.
Force-pushed from dc5cceb to 23f7e1b.
Force-pushed from 23f7e1b to f443812.
Thanks, this is superb!
I'd like to discuss the inverse transform: would it make sense, in the inverse, to get an entry (from the `input_spec`) and unbind it?
For example: you have a single action with a leading dim of 2 and map it to `("agent0", "action")` and `("agent1", "action")`. The spec seen from the outside is the stack of the 2 specs (as it is for entries processed in `forward`).
Would that make sense?
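A minimal sketch of what that inverse behavior could look like, written by hand with `tensordict` primitives; the key names, batch size, and stacking dim here are assumptions for illustration, not the PR's API:

```python
import torch
from tensordict import TensorDict

# Externally, a single stacked "action" entry with a leading dim of 2.
td = TensorDict({"action": torch.randn(2, 4)}, batch_size=[])
out_keys_inv = [("agent0", "action"), ("agent1", "action")]

# The inverse unbinds along the stacking dim and writes per-agent entries.
for value, key in zip(td["action"].unbind(0), out_keys_inv):
    td.set(key, value)
td = td.exclude("action")  # out-of-place removal of the stacked entry
assert td["agent0", "action"].shape == (4,)
```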
```python
forward = _call

def _inv_call(self, tensordict: TensorDictBase) -> TensorDictBase:
    values = torch.split(tensordict[self.in_keys_inv[0]], 1, dim=self.dim)
```
does this work too if the key isn't there?
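For context, this is generic `tensordict` behavior rather than anything PR-specific: indexing a missing key raises, while `get` with a default does not, so the line above would fail on a tensordict that lacks the key:

```python
from tensordict import TensorDict

td = TensorDict({}, batch_size=[])
try:
    td["action"]  # indexing a missing key raises a KeyError
except KeyError:
    pass
assert td.get("action", default=None) is None  # get() can return a default instead
```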
Maybe use `unbind` instead, to avoid calling `squeeze` afterwards?
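A quick standalone illustration of the suggestion (plain `torch`, nothing PR-specific): `unbind` yields the already-squeezed slices that `split(..., 1)` only gives after a `squeeze`:

```python
import torch

x = torch.randn(2, 4)
# split(x, 1, dim=0) returns two (1, 4) views; each needs a squeeze
a, b = (v.squeeze(0) for v in torch.split(x, 1, dim=0))
# unbind(dim=0) returns two (4,) views directly
a2, b2 = torch.unbind(x, dim=0)
assert torch.equal(a, a2) and torch.equal(b, b2)
```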
```python
values = torch.split(tensordict[self.in_keys_inv[0]], 1, dim=self.dim)
for value, out_key_inv in _zip_strict(values, self.out_keys_inv):
    tensordict.set(out_key_inv, value.squeeze(self.dim))
tensordict.exclude(self.in_keys_inv[0], inplace=True)
```
maybe we don't want to do that inplace? Is there a specific reason to use that?
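For reference, the two behaviors side by side (generic `tensordict` semantics): with `inplace=True` the input tensordict itself loses the key, while the default returns a filtered copy and leaves the input intact:

```python
import torch
from tensordict import TensorDict

td = TensorDict({"a": torch.zeros(3), "b": torch.ones(3)}, batch_size=[3])

td2 = td.exclude("a")  # out-of-place: td still has "a"
assert "a" in td.keys() and "a" not in td2.keys()

td.exclude("a", inplace=True)  # inplace: td itself is mutated
assert "a" not in td.keys()
```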
Description
Adds a transform that stacks tensors and specs from different keys of a tensordict into a common key.
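To make the description concrete, here is the forward operation the transform performs, written out by hand with `torch.stack`; the key names and the stacking dim are assumptions for illustration (see the inverse-transform discussion above for the reverse direction):

```python
import torch
from tensordict import TensorDict

td = TensorDict(
    {("agent0", "obs"): torch.randn(4), ("agent1", "obs"): torch.randn(4)},
    batch_size=[],
)
# Forward: stack the per-agent entries into one common key with a new leading dim.
td.set("obs", torch.stack([td["agent0", "obs"], td["agent1", "obs"]], dim=0))
assert td["obs"].shape == (2, 4)
```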
Motivation and Context
Closes #2566.