Skip to content

Latest commit

 

History

History
14 lines (9 loc) · 900 Bytes

README.md

File metadata and controls

14 lines (9 loc) · 900 Bytes

DRL-Tutorial

Useful Tools

Summarize

Index Algorithm Authors Publication Code Classification Features Detailed
1⃣️ Temporal Difference (TD) Learning Tesauro, G. Communications of the ACM / Cornerstone TBD TBD
2⃣️ Q-Learning Watkins, C. J. et al. Machine Learning / / Q Table TBD
3⃣️ Deep Q-Networks (DQN) Mnih, V. et al. Nature PyTorch TBD Introducing the deep network into Q-learning TBD