A Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures (IMPALA) implemented in pytorch
$ sudo apt install python3.7
$ pip install torch
$ pip install tensorboard
-
Edit hyperparameters in
main.py
-
Train the model
$ python train.py
- Logs will be collected in specified folder. You can use
tensorboard
to view them in a browser
$ tensorboard --logdir ./logs/
- Test the model
$ python test.py
Example
$ python test.py -pp ./models/IMPALA_RacecarBulletEnv-v0_400.pt -hd 32 -en RacecarBulletEnv-v0 -ne 10 -el 1000 -ld ./logs/
- [IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures by Espeholt, Soyer, Munos et al.] (https://arxiv.org/pdf/1802.01561.pdf)
- Fix OSError
- Add batched updates
- Add tensorboard logging
- Test performance
- Fix performance collapse issues
- Add comand line argument support