This project represents the implementation of the popular reinforcement learning algorithm called Deep-Q Learning introduced by Google Deep Mind. The reference to the paper is given on the following link:Human-Level Control through Deep Reinforcement Learning
The implementation consists of:
- Deep Q-Learning
- Using experience replay memory
- Using two separate deep convolutional neural networks: Q and Target-Q network
- Python 2.7 or Python 3.3+
- gym
- OpenCV2
- TensorFlow 0.11.0
To train and test a model for CartPole:
$ python cartpole_dqn.py
To train and test a model for Pong:
$ python pong_dqn_conv.py
To train and test a model for Breakout:
$ python breakout_dqn_conv.py
[1] Accumulated reward, Q values and average error values obtained for CartPole environment:
[2] Accumulated reward, Q values and average error values obtained for Pong environment:
[3] Accumulated reward, Q values and average error values obtained for Breakout environment:
MIT licence.