Skip to content

This project aims to replicate figure 3, 4 and 5 from the Richard Sutton’s 1988 paper “Learning to Predict by the Methods of Temporal Differences.”

Notifications You must be signed in to change notification settings

NouiliKh/TD-learning-in-random-walk-environment

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

TD learning in random walk environment (RL)

This project aims to replicate figure 3, 4 and 5 from the Richard Sutton’s 1988 paper “Learning to Predict by the Methods of Temporal Differences.”

Getting Started

These instructions will get you a copy of the project up and running on your local machine.

Prerequisites

In order to get this prpject running on our machine, you need to have

Python v3.6
Jupyter-notebook

Installing

After setting up the environmnet, you need to install the used python libraries in this project

 pip3 install -r requirements.txt

Run the project

using Jupyter-notebook, open TD.ipynb file.

More about the project

alt text alt text alt text alt text alt text

About

This project aims to replicate figure 3, 4 and 5 from the Richard Sutton’s 1988 paper “Learning to Predict by the Methods of Temporal Differences.”

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published