Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

CI test linux://rllib:learning_tests_pendulum_ppo is flaky #47434

Closed
can-anyscale opened this issue Aug 30, 2024 · 13 comments
Closed

CI test linux://rllib:learning_tests_pendulum_ppo is flaky #47434

can-anyscale opened this issue Aug 30, 2024 · 13 comments
Assignees
Labels
bug Something that is supposed to be working; but isn't ci-test flaky-tracker Issue created via Flaky Test Tracker https://flaky-tests.ray.io/ P1 Issue that should be fixed within a few weeks ray-test-bot Issues managed by OSS test policy rllib RLlib related issues stability

Comments

@can-anyscale
Copy link
Collaborator

CI test linux://rllib:learning_tests_pendulum_ppo is consistently_failing. Recent failures:
- https://buildkite.com/ray-project/postmerge/builds/6087#0191a512-f573-4d83-999e-fe176135ac78
- https://buildkite.com/ray-project/postmerge/builds/6087#0191a4e8-13f4-45a1-9943-c5f6a7b5d80d

DataCaseName-linux://rllib:learning_tests_pendulum_ppo-END
Managed by OSS Test Policy

@can-anyscale can-anyscale added bug Something that is supposed to be working; but isn't ci-test flaky-tracker Issue created via Flaky Test Tracker https://flaky-tests.ray.io/ ray-test-bot Issues managed by OSS test policy rllib RLlib related issues stability triage Needs triage (eg: priority, bug/not-bug, and owning component) weekly-release-blocker Issues that will be blocking Ray weekly releases labels Aug 30, 2024
@can-anyscale
Copy link
Collaborator Author

Blamed commit: a967a35 found by bisect job https://buildkite.com/ray-project/release-tests-bisect/builds/1510

@can-anyscale
Copy link
Collaborator Author

@can-anyscale
Copy link
Collaborator Author

@can-anyscale can-anyscale changed the title CI test linux://rllib:learning_tests_pendulum_ppo is consistently_failing CI test linux://rllib:learning_tests_pendulum_ppo is flaky Sep 2, 2024
@can-anyscale can-anyscale reopened this Sep 2, 2024
@can-anyscale
Copy link
Collaborator Author

@can-anyscale can-anyscale removed the weekly-release-blocker Issues that will be blocking Ray weekly releases label Sep 4, 2024
@can-anyscale
Copy link
Collaborator Author

@can-anyscale can-anyscale changed the title CI test linux://rllib:learning_tests_pendulum_ppo is flaky CI test linux://rllib:learning_tests_pendulum_ppo is consistently_failing Sep 6, 2024
@can-anyscale can-anyscale reopened this Sep 6, 2024
@can-anyscale
Copy link
Collaborator Author

Blamed commit: d8a85c5 found by bisect job https://buildkite.com/ray-project/release-tests-bisect/builds/1531

@can-anyscale
Copy link
Collaborator Author

@can-anyscale can-anyscale changed the title CI test linux://rllib:learning_tests_pendulum_ppo is consistently_failing CI test linux://rllib:learning_tests_pendulum_ppo is flaky Sep 6, 2024
@can-anyscale can-anyscale reopened this Sep 6, 2024
@simonsays1980 simonsays1980 added P0 Issues that should be fixed in short order and removed triage Needs triage (eg: priority, bug/not-bug, and owning component) labels Sep 11, 2024
@can-anyscale
Copy link
Collaborator Author

@sven1977 sven1977 added P1 Issue that should be fixed within a few weeks and removed P0 Issues that should be fixed in short order labels Sep 16, 2024
@sven1977
Copy link
Contributor

Downprio'd to P1. This instability should get fixed with doing batch shuffling for PPO.
PR about to be merged (more testing pending).
#47458

@can-anyscale
Copy link
Collaborator Author

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something that is supposed to be working; but isn't ci-test flaky-tracker Issue created via Flaky Test Tracker https://flaky-tests.ray.io/ P1 Issue that should be fixed within a few weeks ray-test-bot Issues managed by OSS test policy rllib RLlib related issues stability
Projects
None yet
Development

No branches or pull requests

3 participants