Reinforcement Learning for Super Mario Bros using DQN

Modified to work with gym-super-mario-bros

NOTE: This is an unofficial fork of the original code published as dqn-mario.

使用PyTorch實作DQN演算法，並訓練super-mario-bros，整體架構參考openai/baselines。

Warning：訓練DQN請開足夠的記憶體，Replay Buffer以預設值1000000為例至少會使用約8G的記憶體.

使用8顆cpu在GCP上跑16個小時，RAM開24G非常足夠，但很難收斂，無法穩定過關。訓練的影像預設位置在/video/mario/。

Name		Name	Last commit message	Last commit date
Latest commit History 45 Commits
common		common
deepq		deepq
img		img
.gitignore		.gitignore
README.md		README.md
train_mario.py		train_mario.py