[RLlib] MultiAgentEnv API enhancements (related to defining obs-/action spaces for agents). #47830

sven1977 · 2024-09-26T17:22:29Z

MultiAgentEnv API enhancements (related to defining obs-/action spaces for agents).

The MultiAgentEnv API gets the following face-lift:

The MultiAgentEnv will have a new get_observation_space(agent_id=...) method, which is the only path that RLlib will use to get an agent's space.
We remove/deprecate other methods to simplify the MultiAgentEnv class a bit. E.g. observation_space_sample() .
If users want (b/c they have a lot of agents, some of which share the same spaces), users can now override this method in MultiAgentEnv and always return the correct (single-agent) space for the given agent_id. This relieves one from having to include all agent IDs in the agent-to-space dict. In fact, you don't even have to define such a dict anymore at all.
Alternatively to overriding get_observation_space(), one can define self.observation_spaces (plural!) to be a dict mapping AgentIDs to individual spaces (<- currently the only supported mechanism).
Alternatively to overriding get_observation_space(), one can define self.observation_space (singular!) to be a single observation space that applies to all agents in the env (<- "lazy" mechanism for when all agents have the same space anyways).

^ Exact same for action spaces.

On top of that, to make things more explicit and more similar to PettingZoo, we should also require users to provide the two attributes in their MultiAgentEnv

self.agents = [list of agents currently in the episode]
self.possible_agents = [list of all possible agent that can ever appear in any episode]

The method self.get_agent_ids() will be deprecated soon (replaced by self.agents or self.possible_agents).
The private attribute self._agent_ids will be deprecated soon (replaced by self.agents or self.possible_agents).

Why are these changes needed?

Related issue number

Checks

I've signed off every commit(by using the -s flag, i.e., git commit -s) in this PR.
I've run scripts/format.sh to lint the changes in this PR.
I've included any doc changes needed for https://docs.ray.io/en/master/.
- I've added any new APIs to the API Reference. For example, if I added a
  method in Tune, I've added it in doc/source/tune/api/ under the
  corresponding .rst file.
I've made sure the tests are passing. Note that there might be a few flaky tests, see the recent failures at https://flakey-tests.ray.io/
Testing Strategy
- Unit tests
- Release tests
- This PR is not tested :(

Signed-off-by: sven1977 <svenmika1977@gmail.com>

…i_agent_env_space_enhancements

Signed-off-by: sven1977 <svenmika1977@gmail.com>

simonsays1980

LGTM. We should test, if checkpointing runs error-free with these changes. I don't see an obvious problem, but let's quickly check this.

simonsays1980 · 2024-09-27T09:34:27Z

rllib/env/multi_agent_env.py

-    def render(self) -> None:
-        """Tries to render the environment."""
+    @property
+    def num_agents(self) -> int:


simonsays1980 · 2024-09-27T09:35:00Z

rllib/env/multi_agent_env.py

-        # By default, do nothing.
-        pass
+    @property
+    def max_num_agents(self) -> int:


This might also help in the sampling from MultiAgentEpisodeBuffers

Signed-off-by: sven1977 <svenmika1977@gmail.com>

…i_agent_env_space_enhancements

Signed-off-by: sven1977 <svenmika1977@gmail.com>

…i_agent_env_space_enhancements

Signed-off-by: sven1977 <svenmika1977@gmail.com>

…i_agent_env_space_enhancements

Signed-off-by: sven1977 <svenmika1977@gmail.com>

…on spaces for agents). (ray-project#47830) Signed-off-by: ujjawal-khare <ujjawal.khare@dream11.com>

sven1977 added 4 commits September 26, 2024 14:34

wip

5f039ee

Signed-off-by: sven1977 <svenmika1977@gmail.com>

Merge branch 'master' of https://github.com/ray-project/ray into mult…

a21f564

…i_agent_env_space_enhancements

wip: need to fix AlgorithmConfig.get_multi_agent_setup (total mess)

675f6c9

Signed-off-by: sven1977 <svenmika1977@gmail.com>

wip

4cbd06f

Signed-off-by: sven1977 <svenmika1977@gmail.com>

sven1977 requested review from ArturNiederfahrenhorst and simonsays1980 as code owners September 26, 2024 17:22

sven1977 assigned simonsays1980 Sep 26, 2024

simonsays1980 approved these changes Sep 27, 2024

View reviewed changes

merge

b717fbd

Signed-off-by: sven1977 <svenmika1977@gmail.com>

sven1977 enabled auto-merge (squash) September 27, 2024 10:41

github-actions bot disabled auto-merge September 27, 2024 10:41

github-actions bot added the go add ONLY when ready to merge, run all tests label Sep 27, 2024

sven1977 added 4 commits September 27, 2024 14:06

fixes

79b4ab8

Signed-off-by: sven1977 <svenmika1977@gmail.com>

wip

3fccec0

Signed-off-by: sven1977 <svenmika1977@gmail.com>

wip

a694b2f

Signed-off-by: sven1977 <svenmika1977@gmail.com>

Merge branch 'master' of https://github.com/ray-project/ray into mult…

b1d3e4e

…i_agent_env_space_enhancements

sven1977 enabled auto-merge (squash) September 27, 2024 16:07

wip

9d022d0

Signed-off-by: sven1977 <svenmika1977@gmail.com>

github-actions bot disabled auto-merge September 28, 2024 12:25

sven1977 added 2 commits September 28, 2024 14:30

Merge branch 'master' of https://github.com/ray-project/ray into mult…

1248596

…i_agent_env_space_enhancements

wip

379cdfa

Signed-off-by: sven1977 <svenmika1977@gmail.com>

sven1977 enabled auto-merge (squash) September 28, 2024 13:22

sven1977 added 2 commits September 28, 2024 19:27

Merge branch 'master' of https://github.com/ray-project/ray into mult…

465bc33

…i_agent_env_space_enhancements

fix

0328778

Signed-off-by: sven1977 <svenmika1977@gmail.com>

github-actions bot disabled auto-merge September 28, 2024 17:32

wip

f0be4e8

Signed-off-by: sven1977 <svenmika1977@gmail.com>

sven1977 enabled auto-merge (squash) September 28, 2024 18:48

sven1977 merged commit e07594e into ray-project:master Sep 28, 2024
6 checks passed

sven1977 added rllib RLlib related issues rllib-multi-agent An RLlib multi-agent related problem. labels Sep 29, 2024

sven1977 added rllib-env rllib env related issues rllib-newstack labels Sep 29, 2024

sven1977 deleted the multi_agent_env_space_enhancements branch September 29, 2024 09:41

ujjawal-khare pushed a commit to ujjawal-khare-27/ray that referenced this pull request Oct 15, 2024

[RLlib] MultiAgentEnv API enhancements (related to defining obs-/acti…

af426db

…on spaces for agents). (ray-project#47830) Signed-off-by: ujjawal-khare <ujjawal.khare@dream11.com>

ujjawal-khare pushed a commit to ujjawal-khare-27/ray that referenced this pull request Oct 15, 2024

[RLlib] MultiAgentEnv API enhancements (related to defining obs-/acti…

20c564f

…on spaces for agents). (ray-project#47830) Signed-off-by: ujjawal-khare <ujjawal.khare@dream11.com>

ujjawal-khare pushed a commit to ujjawal-khare-27/ray that referenced this pull request Oct 15, 2024

[RLlib] MultiAgentEnv API enhancements (related to defining obs-/acti…

7589f14

…on spaces for agents). (ray-project#47830) Signed-off-by: ujjawal-khare <ujjawal.khare@dream11.com>

ujjawal-khare pushed a commit to ujjawal-khare-27/ray that referenced this pull request Oct 15, 2024

[RLlib] MultiAgentEnv API enhancements (related to defining obs-/acti…

07321a0

…on spaces for agents). (ray-project#47830) Signed-off-by: ujjawal-khare <ujjawal.khare@dream11.com>

ujjawal-khare pushed a commit to ujjawal-khare-27/ray that referenced this pull request Oct 15, 2024

[RLlib] MultiAgentEnv API enhancements (related to defining obs-/acti…

1c9ef51

…on spaces for agents). (ray-project#47830) Signed-off-by: ujjawal-khare <ujjawal.khare@dream11.com>

ujjawal-khare pushed a commit to ujjawal-khare-27/ray that referenced this pull request Oct 15, 2024

[RLlib] MultiAgentEnv API enhancements (related to defining obs-/acti…

eeb18e5

…on spaces for agents). (ray-project#47830) Signed-off-by: ujjawal-khare <ujjawal.khare@dream11.com>

ujjawal-khare pushed a commit to ujjawal-khare-27/ray that referenced this pull request Oct 15, 2024

[RLlib] MultiAgentEnv API enhancements (related to defining obs-/acti…

993452c

…on spaces for agents). (ray-project#47830) Signed-off-by: ujjawal-khare <ujjawal.khare@dream11.com>

ujjawal-khare pushed a commit to ujjawal-khare-27/ray that referenced this pull request Oct 15, 2024

[RLlib] MultiAgentEnv API enhancements (related to defining obs-/acti…

1ac860f

…on spaces for agents). (ray-project#47830) Signed-off-by: ujjawal-khare <ujjawal.khare@dream11.com>

ujjawal-khare pushed a commit to ujjawal-khare-27/ray that referenced this pull request Oct 15, 2024

[RLlib] MultiAgentEnv API enhancements (related to defining obs-/acti…

f4a1d5c

…on spaces for agents). (ray-project#47830) Signed-off-by: ujjawal-khare <ujjawal.khare@dream11.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[RLlib] MultiAgentEnv API enhancements (related to defining obs-/action spaces for agents). #47830

[RLlib] MultiAgentEnv API enhancements (related to defining obs-/action spaces for agents). #47830

sven1977 commented Sep 26, 2024 •

edited

Loading

simonsays1980 left a comment

simonsays1980 Sep 27, 2024

simonsays1980 Sep 27, 2024

[RLlib] MultiAgentEnv API enhancements (related to defining obs-/action spaces for agents). #47830

[RLlib] MultiAgentEnv API enhancements (related to defining obs-/action spaces for agents). #47830

Conversation

sven1977 commented Sep 26, 2024 • edited Loading

Why are these changes needed?

Related issue number

Checks

simonsays1980 left a comment

Choose a reason for hiding this comment

simonsays1980 Sep 27, 2024

Choose a reason for hiding this comment

simonsays1980 Sep 27, 2024

Choose a reason for hiding this comment

sven1977 commented Sep 26, 2024 •

edited

Loading