[RLlib] Upgrade to gymnasium 1.0.0 (ale_py 0.10.1, mujoco 3.2.4, pettingzoo 1.24.3 supersuit 3.9.3). #45328
Conversation
@@ -3,7 +3,6 @@
# Environment adapters.
# ---------------------
# Atari
-gymnasium==0.28.1
Since gymnasium is already part of the main Ray requirements.txt file, we won't need this here anymore.
cc: @pseudo-rnd-thoughts @jkterry1
…ade_gymnasium_to_1_0_0a1
Signed-off-by: Sven Mika <sven@anyscale.io>
rllib/env/single_agent_env_runner.py
Outdated
@@ -249,6 +249,8 @@ def _sample_timesteps(
    observation=obs[env_index],
    infos=infos[env_index],
)
+self._was_terminated = [False for _ in range(self.num_envs)]
This is the completely new auto-reset logic of gymnasium 1.0: the sub-env only gets reset upon the next(!) step call (with a fake reward of 0.0, term/trunc guaranteed to be False, and the obs/infos being the reset obs/infos).
This is actually good for us, as we should always do the env-to-module connector pass (even after the last timestep, with the terminal obs in the Episodes list) to make sure the user - in case they are writing to the episode - gets a chance to also alter the final obs.
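A minimal sketch of this next-step autoreset behavior (assuming gymnasium>=1.0.0; the env id, step count, and the `was_done` bookkeeping are illustrative only and merely mirror the spirit of the `_was_terminated` flags above):

```python
import gymnasium as gym
import numpy as np

# Vectorized env with gymnasium>=1.0.0 semantics: a terminated/truncated
# sub-env is only reset on the NEXT step() call.
envs = gym.make_vec("CartPole-v1", num_envs=2)
obs, infos = envs.reset(seed=0)

# Bookkeeping similar in spirit to the `self._was_terminated` list above.
was_done = np.zeros(envs.num_envs, dtype=bool)

for _ in range(200):
    actions = envs.action_space.sample()
    obs, rewards, terminations, truncations, infos = envs.step(actions)

    for i in range(envs.num_envs):
        if was_done[i]:
            # This step only carried the reset obs for sub-env i:
            # a dummy reward of 0.0 and terminated/truncated both False.
            assert rewards[i] == 0.0
            assert not terminations[i] and not truncations[i]
        # If sub-env i just finished, obs[i] is the real terminal observation;
        # its reset obs arrives with the next step() call.
        was_done[i] = terminations[i] or truncations[i]
```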
LGTM.
rllib/env/single_agent_env_runner.py
Outdated
@@ -88,7 +88,7 @@ def __init__(self, config: AlgorithmConfig, **kwargs):
# actually hold the spaces for a single env, but for boxes the
# shape is (1, 1) which brings a problem with the action dists.
# shape=(1,) is expected.
-module_spec.action_space = self.env.envs[0].action_space
+module_spec.action_space = self.env.single_action_space
Sweet. This is now gone.
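For illustration, a hedged sketch of why reaching into `envs[0]` is no longer needed: the vector env's `single_action_space` already has the per-sub-env shape (the env id and `num_envs` here are arbitrary choices, not taken from this PR):

```python
import gymnasium as gym

envs = gym.make_vec("Pendulum-v1", num_envs=4)

# Batched space of the vector env: one leading axis per sub-env.
print(envs.action_space.shape)         # (4, 1)

# Per-sub-env space -- the shape=(1,) that the action dists expect.
print(envs.single_action_space.shape)  # (1,)
```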
eps += 1

episodes[env_index].add_env_step(
    infos[env_index].pop("final_observation"),
Okay, i.e. with gymnasium>=1.0.0, the final_observation is gone and instead a regular observation will be returned?
Correct, the final observation is returned as the actual obs. You only get the reset obs on the next(!) call to step, together with a dummy reward of 0.0.
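To make the contrast concrete, a small sketch under gymnasium>=1.0.0 (env id, loop length, and the `terminal_obs` dict are illustrative; the pre-1.0 access pattern is quoted in the comment only for comparison):

```python
import gymnasium as gym

envs = gym.make_vec("CartPole-v1", num_envs=2)
obs, infos = envs.reset(seed=0)

# gymnasium < 1.0 (old pattern, for contrast): the step that ended an episode
# already returned the reset obs, so the terminal obs lived in the infos dict:
#     terminal_obs = infos["final_observation"][i]
#
# gymnasium >= 1.0: the terminating step returns the terminal obs directly.
terminal_obs = {}
for _ in range(500):
    obs, rewards, terms, truncs, infos = envs.step(envs.action_space.sample())
    for i in range(envs.num_envs):
        if terms[i] or truncs[i]:
            # obs[i] IS the final observation of this episode; the reset obs
            # only arrives on the next step() call (with reward 0.0).
            terminal_obs[i] = obs[i]
```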
…to upgrade_gymnasium_to_1_0_0a1 # Conflicts: # rllib/env/single_agent_env_runner.py
…ade_gymnasium_to_1_0_0a1
@mattip https://anaconda.org/conda-forge/gymnasium has been updated to v1.0.0
Hey @pseudo-rnd-thoughts, thanks for offering your help. Will do! Thus far, this has been a smoother ride than I thought (at least after I re-picked up this PR two days ago). Looks like all the tests are passing now and I also ran PPO+Pong, which learnt as well as with gymnasium==0.28.1. This is all looking very good.
Hey @mattip, yes, this should be merged today/tomorrow. Just waiting for the last tests to run through (it's set to auto-merge). Just fixed the last breaking one (SingleAgentEnvRunner); the rest looks fine.
Great, thanks!
Reverting since this broke release tests and is blocking release.
…ingzoo 1.24.3 supersuit 3.9.3). (ray-project#45328)
….4, pettingzoo 1.24.3 supersuit 3.9.3)." (ray-project#48297) Reverts ray-project#45328
Upgrade RLlib to gymnasium 1.0.0.
Reason:
Why are these changes needed?
Related issue number
Checks
- I've signed off every commit (by using the -s flag, i.e., git commit -s) in this PR.
- I've run scripts/format.sh to lint the changes in this PR.
- If I've added a new method in Tune, I've added it in doc/source/tune/api/ under the corresponding .rst file.