gail-carracing!

Everytime forward Reinforcement Learning(RL) is not feasible for all of the problems due to the complexity involved in the designing of the reward function. In those circumstances, Inverse Reinforcement Learning(IRL) is the game changer. Imitation learning technique is part of it and it showed wonderful results on some of the problems.

In this project, I created an agent that tries to imitate the expert and learns the path navigation in the process. Thanks to openAI-Gym simulator for providing such a wonderful platform for creating the dynamics of the environment.

The project is divided into two steps

Triaining the expert using Proximal Policy Optimization(PPO) algorithm
Train the agent using the expert trajectories from the step1 by utilizing GAN architecture.

Design:

Results:

More details can be found in the report.

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
algo		algo
logs/1637125916.7221706		logs/1637125916.7221706
network_models		network_models
param		param
saved_model		saved_model
Project Report.pdf		Project Report.pdf
README.md		README.md
environment.py		environment.py
expert_trajectories.py		expert_trajectories.py
gail_test.py		gail_test.py
gail_training.py		gail_training.py
readme		readme

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

gail-carracing!

About

Releases

Packages

Languages

venkatrebba/gail-carracing

Folders and files

Latest commit

History

Repository files navigation

gail-carracing!

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages