Skip to content

TD3 and PPO implementation -- Final project for the course ELEC-E8125 Reinforcement Learning at Aalto University

Notifications You must be signed in to change notification settings

spetravic/Reinforcement_Learning-Project

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

9 Commits
 
 
 
 
 
 
 
 

Repository files navigation

ELEC-E8125 Reinforcement Learning - Final Project

Part 1

Implementation of Twin Delayed Deep Deterministic Policy Gradient (TD3) and Proximal Policy Optimization (PPO) for continuous control tasks in the InvertedPendulumBulletEnv-v0 and HalfCheetahBulletEnv-v0 environments.

Part 2

Implementation of TD3 with behavioral cloning (TD3+BC) for offline reinforcement learning.

About

TD3 and PPO implementation -- Final project for the course ELEC-E8125 Reinforcement Learning at Aalto University

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published