
[RLlib] Fix gym.Wrapper problem for MA Envs. #8314

Conversation

@sven1977 (Contributor) commented May 4, 2020

When a MultiAgentEnv is wrapped in a gym Wrapper, RLlib incorrectly preprocesses its observations and crashes due to a Space mismatch. This PR makes sure that RLlib is always aware of the actual underlying (wrapped) env.

Closes #8303
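
For context, a minimal sketch of the failure mode (the env and wrapper names here are hypothetical, not code from this PR, and it assumes the ca.-2020 gym API): wrapping a MultiAgentEnv in a standard gym.Wrapper hides the MultiAgentEnv type behind the wrapper, so a type check on the outermost env fails and RLlib falls back to single-agent observation preprocessing.

```python
# Hypothetical repro sketch (old gym API); not code from this PR.
import gym
from ray.rllib.env.multi_agent_env import MultiAgentEnv


class TwoAgentEnv(MultiAgentEnv):
    """A trivial two-agent env returning dict observations/rewards."""

    def __init__(self):
        self.observation_space = gym.spaces.Discrete(5)
        self.action_space = gym.spaces.Discrete(2)
        # Attributes gym.Wrapper copies from the wrapped env.
        self.reward_range = (-float("inf"), float("inf"))
        self.metadata = {}

    def reset(self):
        return {"agent_0": 0, "agent_1": 0}

    def step(self, action_dict):
        obs = {"agent_0": 1, "agent_1": 1}
        rewards = {"agent_0": 1.0, "agent_1": 1.0}
        dones = {"__all__": True}
        return obs, rewards, dones, {}


class HalveRewards(gym.RewardWrapper):
    """A reward wrapper as a user might naively apply it."""

    def reward(self, reward):
        # `reward` is a per-agent dict here, not a float.
        return {agent: r / 2.0 for agent, r in reward.items()}


env = HalveRewards(TwoAgentEnv())
# The wrapper is not itself a MultiAgentEnv, so RLlib (before this PR)
# treated it as a single-agent gym.Env and crashed on the dict observations.
print(isinstance(env, MultiAgentEnv))  # -> False
```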

  • I've run scripts/format.sh to lint the changes in this PR.
  • I've included any doc changes needed for https://docs.ray.io/en/latest/.
  • I've made sure the tests are passing. Note that there might be a few flaky tests; see the recent failure rates at https://ray-travis-tracker.herokuapp.com/.
  • Testing Strategy
    • Unit tests
    • Release tests
    • This PR is not tested (please justify below)

@AmplabJenkins commented

Can one of the admins verify this patch?

@ericl (Contributor) commented May 4, 2020

You can't use a reward wrapper around a MultiAgentEnv though, right? I don't think this is a valid use case.

@AmplabJenkins commented

Test PASSed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/Ray-PRB/25536/

@sven1977 (Contributor, Author) commented May 7, 2020

@ericl It would be intuitive, though. Can we merge this either way? We don't have to officially support it, but it would help a lot of users who are simply doing this for convenience and expect it to work fine (it does, after all).

@ericl (Contributor) commented May 7, 2020

I'm concerned this is hacky behavior, though. It doesn't really make sense to crawl the inheritance chain looking for a subclass signature. Also, there are clear workarounds using non-gym wrappers.

I think it would be asking for trouble to start half-supporting gym wrappers for multi-agent envs, which were never intended to be compatible anyway.
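
One such workaround, sketched here (class and parameter names are hypothetical, not from this PR): instead of a gym.RewardWrapper, write a thin delegating wrapper that is itself a MultiAgentEnv subclass, so RLlib sees the correct type without crawling the inheritance chain.

```python
# Hypothetical workaround sketch; not code from this PR.
from ray.rllib.env.multi_agent_env import MultiAgentEnv


class ScaleRewards(MultiAgentEnv):
    """Delegating wrapper that stays a MultiAgentEnv subclass."""

    def __init__(self, env, scale=0.5):
        self.env = env
        self.scale = scale

    def reset(self):
        return self.env.reset()

    def step(self, action_dict):
        obs, rewards, dones, infos = self.env.step(action_dict)
        # Apply the reward transformation per agent.
        rewards = {agent: r * self.scale for agent, r in rewards.items()}
        return obs, rewards, dones, infos
```

With this pattern, isinstance(ScaleRewards(env), MultiAgentEnv) is True, so RLlib's multi-agent code path is taken without any special handling of gym wrappers.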

Successfully merging this pull request may close these issues:

  • rllib: Using gym.RewardWrapper around MultiAgentEnv cause observation mismatch with observation_space