Implementation of the DeepQ algorithm with Double Q.
DeepQ is an extension of standard Q-learning.
With DeepQ, rather than being stored in a table, Q-values are approximated by a neural network. This lets the agent generalize its Q-value estimates to unseen states and handle continuous state spaces, which a table cannot represent.
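As a minimal sketch of what this can look like (the class name, layer sizes, and activation choices here are illustrative, not necessarily this repo's actual architecture), a small feed-forward network maps a state vector to one Q-value per discrete action:

```python
import torch
import torch.nn as nn

class QNetwork(nn.Module):
    """Maps a state vector to one Q-value per discrete action."""

    def __init__(self, state_dim: int, n_actions: int, hidden: int = 64):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(state_dim, hidden),
            nn.ReLU(),
            nn.Linear(hidden, hidden),
            nn.ReLU(),
            nn.Linear(hidden, n_actions),  # one output per action
        )

    def forward(self, state: torch.Tensor) -> torch.Tensor:
        return self.net(state)
```

The greedy action for a state is then `q_net(state).argmax(dim=-1)`, with exploration typically handled by epsilon-greedy sampling on top.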
DeepQ also uses experience replay: at every step the agent stores the transition (state, action, reward, next state) in a memory buffer and trains on random samples drawn from it, which breaks the correlation between consecutive updates.
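A uniform replay buffer along these lines (names and capacity are illustrative) stores transitions and hands back random minibatches for training:

```python
import random
from collections import deque

class ReplayBuffer:
    """Fixed-size buffer of (state, action, reward, next_state, done) tuples."""

    def __init__(self, capacity: int = 100_000):
        self.buffer = deque(maxlen=capacity)  # oldest transitions evicted first

    def push(self, state, action, reward, next_state, done):
        self.buffer.append((state, action, reward, next_state, done))

    def sample(self, batch_size: int):
        # Uniform random sampling decorrelates consecutive transitions.
        return random.sample(self.buffer, batch_size)

    def __len__(self):
        return len(self.buffer)
```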
Double Q-learning is implemented on top of this: the target, i.e. the estimate of expected future rewards, is computed by a separate target network whose weights are intermittently copied over from the 'online' network that makes the predictions. The slowly changing target network gives learning a more stable target to pursue, and having the online network select the next action while the target network evaluates it reduces Q-learning's overestimation bias.
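A sketch of the Double-DQN target computation under these assumptions (it reuses the illustrative `QNetwork` above; `GAMMA`, the batch shapes, and the hard-update schedule are placeholders, not this repo's actual settings):

```python
import copy
import torch

GAMMA = 0.99  # discount factor (illustrative value)

q_online = QNetwork(state_dim=4, n_actions=2)  # network being trained
q_target = copy.deepcopy(q_online)             # frozen copy used for targets

def double_q_targets(rewards, next_states, dones):
    """Double-DQN target: the online net selects the next action,
    the target net evaluates it."""
    with torch.no_grad():
        best_actions = q_online(next_states).argmax(dim=1, keepdim=True)
        next_q = q_target(next_states).gather(1, best_actions).squeeze(1)
        # dones is a float tensor of 0/1 flags; terminal states bootstrap nothing
        return rewards + GAMMA * (1.0 - dones) * next_q

def sync_target():
    """Periodic hard update: copy online weights into the target network."""
    q_target.load_state_dict(q_online.state_dict())
```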
See the experiments folder for example implementations.

TODO:
- Prioritized replay
- Dueling Q
- Soft updates
- More environments