DDPG & TD3 Implementation with PyBullet

This repository contains a Deep Deterministic Policy Gradient (DDPG) and Twin Delayed DDPG implementation for reinforcement learning environments provided by PyBullet.

Structure

  • utils.py: Contains utility classes and functions such as ScheduledNoise and ReplayBuffer (a rough sketch of these follows this list).
  • models.py: Contains neural network architectures for the Actor and Critic.
  • ddpg.py: Contains the implementation of the DDPG agent (DDPGAgent).
  • TD3.py: Contains the implementation of the TD3 agent (TD3Agent).
  • train.py: The main script that contains the training loop.
  • env_config.py: Configuration file specifying different environments available for training.
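
For orientation, below is a minimal, self-contained sketch of the two utilities named above, written the way such helpers are commonly implemented. It is only an illustration of the idea; the actual ScheduledNoise and ReplayBuffer classes in utils.py may use different interfaces and internals.

    import random
    from collections import deque

    import numpy as np

    class ReplayBuffer:
        # Fixed-size FIFO store of (obs, action, reward, next_obs, done) transitions.
        def __init__(self, capacity=1_000_000):
            self.buffer = deque(maxlen=capacity)

        def add(self, transition):
            self.buffer.append(transition)

        def sample(self, batch_size):
            batch = random.sample(self.buffer, batch_size)
            return tuple(np.array(field) for field in zip(*batch))

    class ScheduledNoise:
        # Gaussian exploration noise whose standard deviation decays linearly over training.
        def __init__(self, start_std=0.2, end_std=0.05, decay_steps=100_000):
            self.start_std, self.end_std, self.decay_steps = start_std, end_std, decay_steps
            self.steps = 0

        def sample(self, action_dim):
            frac = min(self.steps / self.decay_steps, 1.0)
            std = self.start_std + frac * (self.end_std - self.start_std)
            self.steps += 1
            return np.random.normal(0.0, std, size=action_dim)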

Environment Configuration

The env_config.py file provides a dictionary of environment names and their respective string identifiers. By default, it includes environments like:

  • HalfCheetah: HalfCheetahBulletEnv-v0
  • Hopper: HopperBulletEnv-v0
  • Walker2D: Walker2DBulletEnv-v0
  • Humanoid: HumanoidBulletEnv-v0

You can easily extend this list by adding more environments to the env_config.py file. To train on a specific environment, set the env_name variable in train.py to the desired environment name from the dictionary.
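
For illustration, env_config.py might look roughly like the sketch below. The dictionary name ENVIRONMENTS is a placeholder here, not necessarily the name used in the file.

    # Hypothetical layout of env_config.py; the real dictionary name may differ.
    ENVIRONMENTS = {
        "HalfCheetah": "HalfCheetahBulletEnv-v0",
        "Hopper": "HopperBulletEnv-v0",
        "Walker2D": "Walker2DBulletEnv-v0",
        "Humanoid": "HumanoidBulletEnv-v0",
    }

    # In train.py, env_name would then be set to one of these identifiers, e.g.:
    # env_name = ENVIRONMENTS["HalfCheetah"]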

Setup

  1. Install required libraries:

    pip install pybullet jax jaxlib gym dm-haiku optax tensorboardX
  2. Clone this repository:

    git clone https://github.com/FayElhassan/DDPG_IMP_PYBULLET
  3. Navigate to the directory:

    cd DDPG_IMP_PYBULLET

Training

To train the agent, run:

python train.py

By default, this trains the agent on the HalfCheetahBulletEnv-v0 environment from PyBullet (or whichever environment env_name is set to in train.py) and logs metrics with TensorBoard.
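
The training flow roughly follows the usual gym loop sketched below. The DDPGAgent constructor and method names in this sketch (select_action, store, update) are assumptions for illustration, not the repository's exact API.

    import gym
    import pybullet_envs  # noqa: F401 -- registers the *BulletEnv-v0 environments with gym

    from ddpg import DDPGAgent  # or: from TD3 import TD3Agent

    env = gym.make("HalfCheetahBulletEnv-v0")
    agent = DDPGAgent(env.observation_space.shape[0], env.action_space.shape[0])  # assumed signature

    for episode in range(1000):
        obs, episode_reward, done = env.reset(), 0.0, False
        while not done:
            action = agent.select_action(obs)                   # assumed method name
            next_obs, reward, done, _ = env.step(action)
            agent.store(obs, action, reward, next_obs, done)    # assumed method name
            agent.update()                                      # assumed method name
            obs, episode_reward = next_obs, episode_reward + reward
        print(f"episode {episode}: reward {episode_reward:.1f}")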

Visualization

Metrics such as reward, actor loss, critic loss, and noise standard deviation are logged using TensorBoard. You can visualize them by:

  1. Installing TensorBoard:

    pip install tensorboard
  2. Launching TensorBoard:

    tensorboard --logdir=./runs

Then, navigate to the URL provided (typically http://localhost:6006/) in your web browser.
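
The metrics themselves are written with tensorboardX from the training loop. A minimal sketch of how such logging is typically done (the tag and variable names here are illustrative, not necessarily those used in train.py):

    from tensorboardX import SummaryWriter

    writer = SummaryWriter(logdir="./runs")  # the same directory passed to --logdir above

    # Called periodically from the training loop; the scalar variables are placeholders.
    writer.add_scalar("reward", episode_reward, episode)
    writer.add_scalar("actor_loss", actor_loss, step)
    writer.add_scalar("critic_loss", critic_loss, step)
    writer.add_scalar("noise_std", noise_std, step)

    writer.close()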
