Flax Implementation of DreamerV3 on Crafter

TODO

Implement Curiosity Replay (https://arxiv.org/abs/2306.15934)

Modifications to the Original Implementation

Layer normalization: We use the default epsilon value for layer normalization from Flax.
GRU: We adopt the default GRU implementation from Flax.
Adam: We use the default epsilon value for Adam from Optax.
Policy optimizer: We employ a single optimizer for the policy.
DynamicScale: We use the default DynamicScale for FP16 training from Flax.

Installation

pip install --upgrade setuptools==65.5.0 wheel==0.38.4
pip install --upgrade "jax[cuda12_pip]" -f https://storage.googleapis.com/jax-releases/jax_cuda_releases.html
pip install -r requirements.txt
pip install -e .

Training

python train.py --exp_name [exp_name] --seed [seed]
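
To repeat the command above over several seeds, a small launcher along these lines may help. It is only a sketch: it uses the two flags shown above, while the seed values 0 to 9 and the per-seed experiment naming scheme are assumptions made for the example.

```python
import subprocess
import sys


def build_commands(exp_name, seeds):
    """Build one train.py invocation per seed using --exp_name and --seed."""
    return [
        [
            sys.executable,
            "train.py",
            "--exp_name",
            f"{exp_name}_seed{seed}",  # hypothetical naming scheme
            "--seed",
            str(seed),
        ]
        for seed in seeds
    ]


def run_all(commands):
    """Run the commands one after another, stopping at the first failure."""
    for cmd in commands:
        subprocess.run(cmd, check=True)


# Hypothetical experiment name and seed range.
commands = build_commands("crafter", range(10))
for cmd in commands:
    print(" ".join(cmd))

# run_all(commands)  # uncomment to actually launch the runs sequentially
```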

Result

score (10 seeds): 17.65 ± 2.29
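
For readers reproducing a number of this form, the sketch below shows one way to compute it. It assumes that "score" refers to the Crafter score, defined in the Crafter benchmark as exp(mean(ln(1 + s_i))) - 1 over the per-achievement success rates s_i in percent, and that the value after ± is the sample standard deviation across seeds; the README does not state either point. The input values are placeholders, not results from this repository.

```python
import math
import statistics


def crafter_score(success_rates_percent):
    """Crafter score: exp(mean(ln(1 + s_i))) - 1, with s_i in percent."""
    logs = [math.log(1.0 + s) for s in success_rates_percent]
    return math.exp(sum(logs) / len(logs)) - 1.0


def summarize(per_seed_scores):
    """Return the mean and sample standard deviation across seeds."""
    mean = statistics.mean(per_seed_scores)
    std = statistics.stdev(per_seed_scores)
    return mean, std


# Placeholder per-seed scores, for illustration only.
example_scores = [10.0, 12.0, 14.0]
mean, std = summarize(example_scores)
print(f"score ({len(example_scores)} seeds): {mean:.2f} ± {std:.2f}")
```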


symoon11/dreamerv3-flax
