-
It looks like CRPO can be adapted to algorithms like SAC-Lag, has anyone tried this? |
Beta Was this translation helpful? Give feedback.
Answered by
Gaiejj
Aug 6, 2023
Replies: 1 comment
-
CRPO's off-policy version is currently under development. We also welcome community contributions. Feel free to contribute by submitting pull requests to enhance safe reinforcement learning. |
Beta Was this translation helpful? Give feedback.
0 replies
Answer selected by
Zarzard
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
CRPO's off-policy version is currently under development. We also welcome community contributions. Feel free to contribute by submitting pull requests to enhance safe reinforcement learning.