[RLlib] Add correct terminated and truncated batch sizes on zero-length episodes #46721

Mark2000 · 2024-07-20T05:30:51Z

Why are these changes needed?

AddColumnsFromEpisodesToTrainBatch assumes that each sa_episode has length ≥ 1. When a zero-length episode is passed to the connector, the items added to the terminateds and truncateds columns are incorrectly sized: a single item is added where the result should be an empty list. This case generally doesn't come up, but custom connectors that modify episode lengths (e.g. for semi-MDP type problems) can produce zero-length episodes.
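The sizing problem described above can be reproduced without RLlib. The sketch below uses a hypothetical `FakeEpisode` stand-in (not an RLlib class) that only provides `__len__` and `is_terminated`. The unguarded expression is the one visible in the diff context of this PR; the empty-list fallback in `guarded_flags` is an assumption based on the description above, since the `else` branch is not shown in the quoted hunk.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class FakeEpisode:
    """Hypothetical stand-in for a single-agent episode; not an RLlib class."""

    num_steps: int
    is_terminated: bool = False

    def __len__(self) -> int:
        return self.num_steps


def unguarded_flags(episode: FakeEpisode) -> List[bool]:
    # Expression from the diff context. For len(episode) == 0,
    # [False] * -1 is an empty list, so exactly one flag is still produced.
    return [False] * (len(episode) - 1) + [episode.is_terminated]


def guarded_flags(episode: FakeEpisode) -> List[bool]:
    # Assumed shape of the fix: fall back to an empty list for empty episodes.
    return unguarded_flags(episode) if len(episode) > 0 else []


empty = FakeEpisode(num_steps=0)
print(len(unguarded_flags(empty)))  # one flag for zero timesteps
print(len(guarded_flags(empty)))    # no flags for zero timesteps
```

For episodes of length ≥ 1 both functions return the same list, so the guard only changes behavior in the zero-length case.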

Related issue number

Didn't open issue, but I can if necessary.

Checks

- I've signed off every commit (by using the -s flag, i.e., git commit -s) in this PR.
- I've run scripts/format.sh to lint the changes in this PR.
- I've included any doc changes needed for https://docs.ray.io/en/master/.
  - I've added any new APIs to the API Reference. For example, if I added a
    method in Tune, I've added it in doc/source/tune/api/ under the
    corresponding .rst file.
- I've made sure the tests are passing. Note that there might be a few flaky tests, see the recent failures at https://flakey-tests.ray.io/
- Testing Strategy
  - Unit tests
  - Release tests
  - This PR is not tested :(

Signed-off-by: Mark Stephenson <mark2000stephenson@gmail.com>

Mark2000 · 2024-07-24T23:03:40Z

Bumping this!

Mark2000 · 2024-07-29T17:01:06Z

@sven1977 Could you review when you're able to?

Mark2000 · 2024-09-03T17:44:05Z

Sorry to keep bugging you @sven1977 @simonsays1980, but is there any way this could get merged in?

simonsays1980 · 2024-09-11T17:16:17Z

rllib/connectors/learner/add_columns_from_episodes_to_train_batch.py

@@ -100,6 +100,8 @@ def __call__(
                    Columns.TERMINATEDS,
                    items_to_add=(
                        [False] * (len(sa_episode) - 1) + [sa_episode.is_terminated]
+                        if len(sa_episode) > 0


We should probably remove any zero-length episodes in the loop here. They cannot be used for learning anyway.

Leave this to @sven1977 to decide.

@sven1977 could we give this a go? I think the change is fine as it is. The other option is to remove zero-length episodes entirely, as they offer nothing to learn from. Zero-length episodes could also occur if an agent in a multi-agent scenario received only the initial observation and nothing else.

sven1977

Thanks for the fix @Mark2000. LGTM.

A slightly more defensive fix would probably be to filter out empty episodes earlier in the connector pipeline, but that might confuse code that does draw information from them.

Our built-in EnvRunners never return empty episodes, which is why this problem never came up.
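The filtering alternative raised in the review could look roughly like the sketch below. It is only an illustration: `drop_empty_episodes` is a hypothetical helper, not part of RLlib, and it relies on nothing more than the episodes supporting `len()`. Plain lists stand in for episodes in the usage lines.

```python
from typing import List, Sized, TypeVar

E = TypeVar("E", bound=Sized)


def drop_empty_episodes(episodes: List[E]) -> List[E]:
    """Return only the episodes that contain at least one timestep."""
    return [episode for episode in episodes if len(episode) > 0]


# Plain lists stand in for episodes here; any object with __len__ works.
episodes = [[0.1, 0.2], [], [0.3]]
kept = drop_empty_episodes(episodes)
print(len(kept))
```

As noted in the review, dropping episodes this early would hide them from any later connector piece that still reads information from empty episodes, which is the trade-off against guarding only the terminateds and truncateds columns.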

sven1977 · 2024-09-25T13:29:12Z

auto-merge enabled ...

Add correct terminated and truncated batch sizes on zero-length episodes

1d29c33

Signed-off-by: Mark Stephenson <mark2000stephenson@gmail.com>

Mark2000 requested review from sven1977, ArturNiederfahrenhorst and simonsays1980 as code owners July 20, 2024 05:30

Update add_columns_from_episodes_to_train_batch.py

6d54957

Signed-off-by: Mark Stephenson <mark2000stephenson@gmail.com>

Mark2000 changed the title ~~Add correct terminated and truncated batch sizes on zero-length episodes~~ [RLlib] Add correct terminated and truncated batch sizes on zero-length episodes Jul 24, 2024

anyscalesam added the triage (Needs triage, e.g. priority, bug/not-bug, and owning component) and rllib (RLlib related issues) labels Aug 12, 2024

Mark2000 mentioned this pull request Sep 6, 2024

[RLlib] AddColumnsFromEpisodesToTrainBatch assumes episode length >= 1 #47542

Closed

simonsays1980 added the rllib-connectorv2 (Connector related issues) label Sep 11, 2024

simonsays1980 reviewed Sep 11, 2024

sven1977 approved these changes Sep 25, 2024

sven1977 enabled auto-merge (squash) September 25, 2024 13:29

github-actions bot added the go (add ONLY when ready to merge, run all tests) label Sep 25, 2024

sven1977 merged commit 4a5207d into ray-project:master Sep 25, 2024
7 checks passed

ujjawal-khare pushed a commit to ujjawal-khare-27/ray that referenced this pull request Oct 15, 2024

[RLlib] Add correct terminated and truncated batch sizes on zero-leng…

e5708b4

…th episodes (ray-project#46721) Signed-off-by: ujjawal-khare <ujjawal.khare@dream11.com>
