Play Atari Pong with REINFORCE and Deep Q-Learning

Simple implementations of two fundamental reinforcement learning algorithms, REINFORCE and Deep Q-Learning, that learn to play Pong. Instead of using pixels, the agents perceive the environment through simple hand-crafted features.
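To illustrate what a hand-crafted feature vector could look like, here is a minimal sketch. It is not taken from this repository: the function name `make_features`, the choice of features (ball position, ball velocity, paddle position) and the scaling constants are all assumptions made for illustration.

```python
import numpy as np

# Hypothetical playfield size, used only to scale values to roughly [0, 1].
FIELD_WIDTH = 160.0
FIELD_HEIGHT = 210.0


def make_features(ball_xy, prev_ball_xy, paddle_y):
    """Build a small feature vector from object positions (illustrative only).

    ball_xy, prev_ball_xy: (x, y) of the ball in the current and previous frame.
    paddle_y: vertical position of the agent's paddle.
    """
    bx, by = ball_xy
    px, py = prev_ball_xy
    return np.array(
        [
            bx / FIELD_WIDTH,          # ball x position
            by / FIELD_HEIGHT,         # ball y position
            (bx - px) / FIELD_WIDTH,   # ball x velocity (per frame)
            (by - py) / FIELD_HEIGHT,  # ball y velocity (per frame)
            paddle_y / FIELD_HEIGHT,   # paddle y position
        ],
        dtype=np.float32,
    )
```

A vector like this is far smaller than a raw frame, which is why a small network may be enough to learn from it.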

Limitations:

The implementations can only learn to play Pong, but could be adapted to other similar games.
No optimization has been done - both algorithms could be massively parallelized.

Train a neural network with the REINFORCE algorithm.

To watch a pre-trained agent in action, just type:

python pgm_evaluate.py

To start the training from scratch, just type:

python pgm_train.py

Result after 5000 batches of 10 episodes each: the agent beats the computer-controlled player most of the time.
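For readers new to REINFORCE, the core of the update can be sketched as follows. This is a generic NumPy illustration, not the code in `pgm_train.py`: the function names, the discount factor and the return normalization are assumptions.

```python
import numpy as np


def discounted_returns(rewards, gamma=0.99):
    """Compute G_t = r_t + gamma * G_{t+1} for every step of one episode."""
    returns = np.zeros(len(rewards), dtype=np.float64)
    running = 0.0
    for t in reversed(range(len(rewards))):
        running = rewards[t] + gamma * running
        returns[t] = running
    return returns


def reinforce_loss(log_probs, returns):
    """Surrogate loss whose gradient is the REINFORCE policy-gradient estimate.

    log_probs: log pi(a_t | s_t) for the actions that were actually taken.
    returns:   discounted returns G_t for the same time steps.
    """
    returns = np.asarray(returns, dtype=np.float64)
    # Normalizing the returns is a common variance-reduction step
    # (an assumption here, not necessarily what this repository does).
    advantages = (returns - returns.mean()) / (returns.std() + 1e-8)
    return -np.mean(np.asarray(log_probs, dtype=np.float64) * advantages)
```

In a batched setup such as the one described above, the loss would typically be averaged over all steps of the episodes in a batch before taking one gradient step.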

Train a Deep Q-Network.

To watch a pre-trained agent in action, just type:

python dqn_evaluate.py

To start the training from scratch, just type:

papermill --log-output dqn_train.ipynb dqn_results.ipynb

Result after 10000 episodes: the agent wins most games with a nearly perfect score.
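The two building blocks of Deep Q-Learning, the temporal-difference target and epsilon-greedy exploration, can be sketched as below. This is a generic NumPy illustration rather than the contents of `dqn_train.ipynb`: the function names and the default discount factor are assumptions.

```python
import numpy as np


def td_targets(rewards, next_q_values, dones, gamma=0.99):
    """Compute y = r + gamma * max_a' Q(s', a'), with y = r on terminal steps.

    rewards:       shape (batch,)
    next_q_values: shape (batch, n_actions), Q-values for the next states,
                   usually produced by a separate target network
    dones:         shape (batch,), True where the episode ended
    """
    rewards = np.asarray(rewards, dtype=np.float64)
    dones = np.asarray(dones, dtype=np.float64)
    max_next_q = np.max(np.asarray(next_q_values, dtype=np.float64), axis=1)
    return rewards + gamma * (1.0 - dones) * max_next_q


def epsilon_greedy(q_values, epsilon, rng):
    """Pick a random action with probability epsilon, else the greedy one."""
    if rng.random() < epsilon:
        return int(rng.integers(len(q_values)))
    return int(np.argmax(q_values))
```

The network is then trained to move `Q(s, a)` for the action taken toward the target `y`, for example with a squared or Huber loss.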
