Gymnasium<-WERDNA RL GYM->PPO
- Fix observation space
To Run Training
train.py
train_advanced.py #recommended
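For orientation, the training entry point roughly amounts to the following, assuming the agents are trained with stable-baselines3 PPO on a registered Gymnasium environment (the environment id, hyperparameters, and save path below are illustrative placeholders, not the scripts' actual contents):

# Minimal PPO training sketch (assumes stable-baselines3; names are placeholders).
import gymnasium as gym
from stable_baselines3 import PPO

env = gym.make("werdna_advanced-v0")              # hypothetical environment id
model = PPO(
    "MlpPolicy",
    env,
    ent_coef=0.01,                                # mirrors the "ec" config key
    tensorboard_log="logs/",
    device="cpu",
    verbose=1,
)
model.learn(total_timesteps=500_000)
model.save("results/werdna_advanced_v2")          # hypothetical save path
env.close()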
To Run Test Model
test_model.py
test_model_advanced.py #recommended
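As a reference, evaluating a trained agent follows the usual load-and-rollout pattern, again assuming stable-baselines3 PPO (file and environment names are placeholders):

# Sketch of a single evaluation episode with a saved PPO agent (names are placeholders).
import gymnasium as gym
from stable_baselines3 import PPO

env = gym.make("werdna_advanced-v0")
model = PPO.load("results/werdna_advanced_v2")

obs, info = env.reset()
done = False
total_reward = 0.0
while not done:
    action, _ = model.predict(obs, deterministic=True)
    obs, reward, terminated, truncated, info = env.step(action)
    total_reward += reward
    done = terminated or truncated
print("episode reward:", total_reward)
env.close()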
To Run a Simple Teleoperation with a Trained Agent
model_teleop.py
model_teleop_advanced.py #recommended
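A hedged sketch of what policy-driven teleoperation can look like, assuming a PyBullet backend with a GUI connection and stable-baselines3 PPO; the target_x attribute is purely hypothetical and stands in for whatever command interface the actual environment exposes:

# Teleoperation sketch: the operator nudges a command from the keyboard while
# the trained policy keeps producing the low-level actions (names are placeholders).
import gymnasium as gym
import pybullet as p
from stable_baselines3 import PPO

env = gym.make("werdna_advanced-v0")
model = PPO.load("results/werdna_advanced_v2")

obs, info = env.reset()
while True:
    keys = p.getKeyboardEvents()                      # requires a GUI connection
    if keys.get(p.B3G_UP_ARROW, 0) & p.KEY_IS_DOWN:
        env.unwrapped.target_x += 0.01                # hypothetical command attribute
    if keys.get(p.B3G_DOWN_ARROW, 0) & p.KEY_IS_DOWN:
        env.unwrapped.target_x -= 0.01
    action, _ = model.predict(obs, deterministic=True)
    obs, reward, terminated, truncated, info = env.step(action)
    if terminated or truncated:
        obs, info = env.reset()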
TensorBoard Viewing
To view the training logs in TensorBoard, simply run:
tensorboard --logdir logs/xxx
Adding a Custom Environment
You can simply add one more environment under the env directory. Once added, make sure to update the train.py or train_advanced.py script to include your environment. To run the training script with it, also add a <custom_configuration>.yaml under the config directory.
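For illustration, a new environment module under env might look roughly like the following (class name, spaces, and reward are placeholders to adapt):

# Skeleton of a custom Gymnasium environment (shapes and reward are placeholders).
import gymnasium as gym
import numpy as np
from gymnasium import spaces

class WerdnaCustomEnv(gym.Env):
    def __init__(self, connect_type="DIRECT"):
        super().__init__()
        # Observation/action sizes are placeholders; match them to your robot.
        self.observation_space = spaces.Box(-np.inf, np.inf, shape=(8,), dtype=np.float32)
        self.action_space = spaces.Box(-1.0, 1.0, shape=(2,), dtype=np.float32)

    def reset(self, seed=None, options=None):
        super().reset(seed=seed)
        return np.zeros(8, dtype=np.float32), {}

    def step(self, action):
        obs = np.zeros(8, dtype=np.float32)
        reward = 0.0                       # replace with the weighted bias terms
        terminated, truncated = False, False
        return obs, reward, terminated, truncated, {}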
Configuration for Advanced Training
The configuration file specifies:
- Robot Model: Path to the URDF file
- Environment: Custom environment name
- Connect_Type: The simulation connection mode (e.g. DIRECT)
- Device: Whether to train on CPU or GPU
- Biases: The types of reward biases to prioritize during training
- ec: The entropy coefficient. Since the agents are trained using PPO, the entropy coefficient stabilizes training and encourages exploration, helping avoid overfitting/underfitting or overtrained/undertrained conditions
- Filename: The name of the trained agent's file
- Timesteps: Number of timesteps per training session
- Record Video: Whether to record video in the kernel_pca_evaluation script; the recorded video will be saved to the video directory

An example configuration:
robot_model: "models/werdna_revised_bullet.urdf"
environment: "werdna_advanced"
connect_type: "DIRECT"
device: "cpu"
biases:
  r_bias: 0.0
  p_bias: 0.3
  y_bias: 0.25
  dR_bias: 0.0
  dP_bias: 0.2
  dY_bias: 0.0
  x_bias: 0.25
  v_bias: 0.0
ec: 0.01
filename: "werdna_advanced_v2"
timesteps: 500000
record_video: false
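For reference, a sketch of how train_advanced.py might consume such a file (the loading code and the config path are assumptions, not the script's actual contents):

# Sketch of reading the advanced configuration (assumes PyYAML; path is hypothetical).
import yaml

with open("config/werdna_advanced.yaml") as f:
    cfg = yaml.safe_load(f)

print(cfg["robot_model"])        # "models/werdna_revised_bullet.urdf"
print(cfg["biases"]["p_bias"])   # 0.3
print(cfg["ec"])                 # entropy coefficient, passed to PPO as ent_coef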
Results
The results (i.e. the trained agent) are saved in the results directory, under a new subdirectory named after the environment's name and the specified biases.