This is a naive DQN (Deep Q-Network) example implemented in Torch7, intended as a starting point for future research.
The environment is based on Kaixhin's rlenvs, and the model uses the method from "Increasing the Action Gap" (http://arxiv.org/abs/1512.04860).
Implemented by Yannis M. Assael (yannisassael.com)
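
To give a feel for the action-gap idea referenced above, here is a minimal sketch of the advantage learning (AL) target from that paper. It starts from the standard Bellman target and subtracts alpha times the gap between the best Q-value in the current state and the Q-value of the action actually taken. This is an illustration in plain Python, not the Torch7 code of this repository. The function name, argument names, and default values of gamma and alpha are assumptions made for the example, and which operator from the paper this implementation uses is not stated here.

```python
def advantage_learning_target(q_s, action, reward, q_next,
                              gamma=0.99, alpha=0.9, terminal=False):
    """Advantage learning target for a single transition (illustrative).

    q_s      -- Q-values for every action in the current state
    action   -- index of the action that was taken
    reward   -- reward received for the transition
    q_next   -- Q-values for every action in the next state
    gamma    -- discount factor (assumed default)
    alpha    -- action-gap coefficient in [0, 1) (assumed default)
    terminal -- True if the next state ends the episode
    """
    # Standard Bellman target: r + gamma * max_b Q(s', b); no bootstrap at episode end.
    bellman = reward if terminal else reward + gamma * max(q_next)
    # Gap between the greedy value and the value of the chosen action (>= 0).
    action_gap = max(q_s) - q_s[action]
    # Penalise non-greedy actions so their values fall further below the best one.
    return bellman - alpha * action_gap


if __name__ == "__main__":
    q_s, q_next = [1.0, 2.0], [0.5, 3.0]
    # Non-greedy action 0 is pushed below the plain Bellman target.
    print(advantage_learning_target(q_s, 0, 1.0, q_next, gamma=0.5, alpha=0.5))
    # Greedy action 1 keeps the plain Bellman target.
    print(advantage_learning_target(q_s, 1, 1.0, q_next, gamma=0.5, alpha=0.5))
```

With alpha set to 0 this reduces to the ordinary DQN target, so the coefficient controls how strongly suboptimal actions are pushed down.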