Adding updates for Quadrotor #166

svsawant · 2024-09-25T11:53:22Z

Changes:

Alternate control interface (attitude interface)
New dynamics formulation
Corresponding configs and models for controllers

Federico-PizarroBejarano

It looks good but I am having trouble understanding what is an addition and what is a bug fix. It seems like a lot of the changes have impacts on the other environments. Thus, I have two main comments:

I would like to better understand how these changes affect the other environments, with some guarantee that they either fix or do not change their behaviour.
I think you've made a lot of changes to the pre-process function. I am not sure why, but I think we need to standardize how it is used.

In terms of linting, you need to run the pre-commit hooks. We should all install them so that they run on every commit. You have also commented out a lot of code. For merging to main I think there should be essentially no commented-out code: either the code is useful and integrated, or it is obsolete and deleted. Happy to discuss edge cases.

safe_control_gym/controllers/lqr/ilqr.py

Federico-PizarroBejarano · 2024-09-25T13:39:46Z

safe_control_gym/envs/benchmark_env.py

-        processed_action = self._preprocess_control(action)
-        return processed_action
+        # processed_action = self._preprocess_control(action)
+        return action


This seems like a serious breaking change that would dramatically change the behaviour of the other environments. Am I misunderstanding something?

Yes, this will change the behaviour. The main reason is that the attitude controller needs to run at a higher frequency, so it was made part of the preprocess function. The intent is to run "preprocess_action" at the same frequency as PyBullet (pyb) inside advance_simulation.
The cartpole environment was updated accordingly.

I am not sure if y'all discussed this yet, but we should talk about the best way to change the preprocess function to accommodate the attitude controller. I don't understand what the impacts of this change are yet.
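
To make the frequency argument in this thread concrete, here is a minimal sketch of the pattern being described: the outer `step` receives a high-level command, and the inner attitude loop is evaluated once per physics sub-step inside `advance_simulation`. Everything here is an assumption for illustration (the `ToyPitchEnv` class, the 1-D pitch dynamics, the gains, and both frequencies); it is not the code in this PR.

```python
import numpy as np

PYB_FREQ = 1000  # hypothetical physics frequency in Hz
CTRL_FREQ = 50   # hypothetical outer-controller frequency in Hz


class ToyPitchEnv:
    """Toy 1-D rotational system with unit inertia: state is (pitch, pitch rate)."""

    def __init__(self):
        self.dt = 1.0 / PYB_FREQ
        self.theta = 0.0
        self.omega = 0.0
        self.preprocess_calls = 0

    def preprocess_action(self, theta_cmd):
        """Inner attitude loop: PD law mapping a commanded pitch to a torque."""
        self.preprocess_calls += 1
        kp, kd = 40.0, 12.0  # illustrative gains, not tuned for any real drone
        return kp * (theta_cmd - self.theta) - kd * self.omega

    def advance_simulation(self, theta_cmd):
        """Run all physics sub-steps for one outer control step."""
        for _ in range(PYB_FREQ // CTRL_FREQ):
            # The preprocessing runs at the physics rate, using the latest state.
            torque = self.preprocess_action(theta_cmd)
            self.omega += self.dt * torque
            self.theta += self.dt * self.omega

    def step(self, theta_cmd):
        """Outer step: the raw command is passed through, not preprocessed here."""
        self.advance_simulation(theta_cmd)
        return np.array([self.theta, self.omega])


env = ToyPitchEnv()
for _ in range(100):  # 100 outer steps, i.e. 2 s of simulated time
    state = env.step(0.2)
```

In this sketch the preprocessing is called `PYB_FREQ // CTRL_FREQ` times per outer step instead of once, which is the behavioural change under discussion. An environment whose preprocessing is stateless and cheap would see the same result either way, while one that clips, logs, or injects disturbances in that function would not.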

safe_control_gym/envs/gym_control/cartpole.py

examples/rl/train_rl_model.sh

Federico-PizarroBejarano · 2024-09-25T15:54:34Z

safe_control_gym/envs/gym_pybullet_drones/base_aviary.py

                     Check the only line in this method and `_update_and_store_kinematic_information()`
                     to understand its format.
        '''
        state = np.hstack([
            self.pos[nth_drone, :], self.quat[nth_drone, :],
            self.rpy[nth_drone, :], self.vel[nth_drone, :],
-            self.ang_v[nth_drone, :], self.last_clipped_action[nth_drone, :]
+            self.ang_v[nth_drone, :], self.rpy_rates[nth_drone, :], self.last_clipped_action[nth_drone, :]


What is the difference? Does this affect the other systems? Was it a bug beforehand?
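
One way to reason about the "does this affect the other systems" question is to look at how the stacked vector's layout shifts. The sketch below assumes the usual sizes (3 for position, 4 for quaternion, 3 each for rpy, velocity, angular velocity and rpy rates, 4 for the last action); those sizes and the filler values are assumptions, not taken from the PR.

```python
import numpy as np

# Hypothetical per-drone quantities, filled with distinct values so that
# each block can be recognised after stacking.
pos = np.full(3, 1.0)
quat = np.full(4, 2.0)
rpy = np.full(3, 3.0)
vel = np.full(3, 4.0)
ang_v = np.full(3, 5.0)
rpy_rates = np.full(3, 6.0)
last_clipped_action = np.full(4, 7.0)

# Layout before the change (removed line in the diff).
old_state = np.hstack([pos, quat, rpy, vel, ang_v, last_clipped_action])

# Layout after the change (added line in the diff).
new_state = np.hstack([pos, quat, rpy, vel, ang_v, rpy_rates, last_clipped_action])

# A consumer that reads the action by fixed position from the old layout.
old_action_slice = slice(16, 20)
print(old_state[old_action_slice])  # the action block in the old layout
print(new_state[old_action_slice])  # rpy rates plus one action entry in the new layout
```

Under these assumptions, entries 0 to 15 are unchanged, so code that only reads position, attitude, or velocities by index is unaffected. Anything that reads the last action by fixed index, or checks the vector length, would need updating.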

Federico-PizarroBejarano · 2024-09-25T15:56:57Z

safe_control_gym/envs/gym_pybullet_drones/base_aviary.py

+        pos = pos + self.PYB_TIMESTEP * vel
+        rpy = rpy + self.PYB_TIMESTEP * rpy_rates
+        # Set PyBullet's state.
+        p.resetBasePositionAndOrientation(self.DRONE_IDS[nth_drone],


The drone state is set in several places. Didn't you create a function for this? Could it be used to reduce this repeated code?

That's from the original code. Not sure why it got highlighted.

Fair enough. It seems you have added a function that could be used to clean up the code a lot, though. It would be nice to replace all those chunks with your new _set_pybullet_information function.
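
As a rough illustration of the refactor suggested here, the sketch below puts the explicit Euler update from the diff and the state write behind one function, so every call site shares the same code path. The names `integrate_kinematics`, `RecordingBackend`, and `step_and_store` are hypothetical, and the backend is a stand-in that only records calls. The signature of `_set_pybullet_information` is not shown in this conversation, so it is deliberately not reproduced.

```python
import numpy as np


def integrate_kinematics(pos, rpy, vel, rpy_rates, dt):
    """Explicit Euler step, mirroring the two update lines in the diff."""
    return pos + dt * vel, rpy + dt * rpy_rates


class RecordingBackend:
    """Stand-in for the simulator: it only records what it is asked to set."""

    def __init__(self):
        self.calls = []

    def set_pose(self, drone_id, pos, rpy):
        self.calls.append((drone_id, pos.copy(), rpy.copy()))


def step_and_store(backend, drone_id, pos, rpy, vel, rpy_rates, dt):
    """Single place that integrates the state and pushes it to the backend."""
    pos, rpy = integrate_kinematics(pos, rpy, vel, rpy_rates, dt)
    backend.set_pose(drone_id, pos, rpy)
    return pos, rpy


backend = RecordingBackend()
pos, rpy = step_and_store(
    backend,
    drone_id=0,
    pos=np.zeros(3),
    rpy=np.zeros(3),
    vel=np.array([1.0, 0.0, 0.0]),
    rpy_rates=np.array([0.0, 0.0, 2.0]),
    dt=0.001,
)
```

With a single write path like this, a change to how state is pushed to the simulator only has to be made once, and a recording stand-in makes it straightforward to unit-test the integration without starting a physics server.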

safe_control_gym/envs/gym_pybullet_drones/base_aviary.py

safe_control_gym/envs/gym_pybullet_drones/quadrotor.py

middleyuan added 30 commits July 10, 2023 20:37

edit bash file with correct arg name

84830df

add another host in gpmpc_hpo.sh

e69e048

change to new dir in gpmpc_hpo.sh

097e1c2

1. fix a small bug 2. add test_train_gpmpc_cartpole

405dcea

add a hpo parallelism test

549ff3e

saving before runing hpo

81b5602

I think the bug is that it reaches thee goal in the first step.

a5ad5f2

1. PPO configs. 2. Make cartpole init states harder. 3. First version…

ce4d75e

… of JSRL on PPO.

Re-organize a bit (file name, remove __init__.py in test folders).

b40566c

1. HPO strategies. 2. test on hpo for ppo. 3. another way to save che…

23f571d

…ckpoint in ppo.py. 4. Boolean var in ppo_sampler.

update gitignore

802edb6

change configs

02d1c33

update bash for hpo on gpmpc

20d3a7f

add prior arg in gpmpc_sampler

ad96f6f

1. HPO effort evaluations. 2. Bash file for hpo strategy evalution.

5318c25

update dependencies

924d3b3

add the freedom to choose between random sampler and TPE sampler.

14ae2aa

1. add strategy 5. 2. add unit test accordingly.

c0b1b34

1. prior configs. 2. update eval.py, sen.sh, and .gitifonore.

f5c3a5a

gpmpc hpo strategy study

0e1248a

refactor the code

a0feec7

1. hpo on sac. 2. add activation arg in sac and fix a small bug.

bd39347

fix typos

4342b2a

change to two jobs

1e1f7cf

change num of repetitions to make sure it at least has same num of sa…

b087c87

…mples as s2.

Merge branch 'hpo-on-ppo' into hpo-on-sac

59b4220

reduce the budget

4c22c86

toy example

fe02a65

consider 4 version of noisy functions.

3d33487

include var study

714a76d

middleyuan and others added 21 commits September 16, 2024 14:21

Merge branch 'benchmark' of https://github.com/middleyuan/safe-contro…

101002f

…l-gym into benchmark

1. update dependencies. 2. distinguish discrete and categorical hyper…

951a7b2

…parameters.

1. fix type issue in hpo. 2. change space to log scale for learning r…

228c07a

…ate.

Merge branch 'benchmark' of github.com:middleyuan/safe-control-gym in…

67f09fc

…to benchmark

set done_on_max_steps to False.

472c2f3

Merge branch 'benchmark' of github.com:middleyuan/safe-control-gym in…

7aa3940

…to benchmark

rollback quad ref to only x-z ref

af8f1b1

small fixes to ppo, sac and td3

fb75194

Merge remote-tracking branch 'origin/benchmark' into benchmark

781505d

1. avoid evaluation during hpo for ppo. 2. Save trials and plot for V…

35696db

…izier. 3. improve parallel computes for Vizier.

Merge branch 'benchmark' of https://github.com/middleyuan/safe-contro…

5972ea8

…l-gym into benchmark

improve logging during hpo.

017d9d3

change database to SQLite.

930b63d

Prepare for deploying on cluster: 1. add weights & Biases. 2. improve…

68bcb87

… logging. 3. change PPO search space. 4. make cost parameters independent. 5. save intermediate results during HPO. 6. two packages tested on iLQR, GPMPC, and PPO.

Merge remote-tracking branch 'middleyuan/benchmark' into developmental

b8094e3

small fix 1

f75a85a

Auto stash before merge of "middleyuan/benchmark" and "developmental"

fcc5bae

fixes for config files, etc.

f11c62d

updated models for safe explorer ppo

a1e1572

fix 2

a523715

fix 3

7530482

Federico-PizarroBejarano requested review from Federico-PizarroBejarano and adamhall September 25, 2024 13:20

Federico-PizarroBejarano assigned svsawant Sep 25, 2024

Federico-PizarroBejarano added the enhancement New feature or request label Sep 25, 2024

Federico-PizarroBejarano requested changes Sep 25, 2024


svsawant and others added 3 commits September 28, 2024 10:25

gitignore update

f716ce9

WIP on developmental

5828716

add ilqr hardware compatibility code; add traj stet reset

f4cfce8

Federico-PizarroBejarano mentioned this pull request Oct 28, 2024

Example Using PPO (Or any Controller) With Real Crazyfile Drone #163

Closed


svsawant commented Sep 25, 2024

Federico-PizarroBejarano left a comment

Federico-PizarroBejarano Sep 25, 2024

svsawant Sep 29, 2024

Federico-PizarroBejarano Oct 8, 2024

Federico-PizarroBejarano Sep 25, 2024

Federico-PizarroBejarano Sep 25, 2024

svsawant Sep 30, 2024

Federico-PizarroBejarano Oct 8, 2024
