Clean single-file implementation of offline RL algorithms in JAX
reinforcement-learning flax cql single-file jax awac iql offline-rl offline-reinforcement-learning d4rl decision-transformer td3bc
Updated Dec 1, 2024 - Python