[gym] Add symmetry surrogate loss in rllib custom ppo #780

mwulfman · 2024-04-30T08:06:24Z

This PR adds the possibility to add the surrogate loss on the mirrored observation-action training data. Naming can be changed if desired as we could name it also "symmetry_surrogate_loss" for instance.

duburcqa · 2024-04-30T14:31:46Z

I'm in favour of changing the name of the option as you suggested. It would be nice if you could update to documentation to explain what it does and give the source of the article introducing this approach. Thank you !

mwulfman · 2024-04-30T20:36:15Z

I've renamed the loss to symmetry_surrogate_loss and added some documentation to review. Should I add the reference article for CAPS as well ?

duburcqa · 2024-04-30T20:49:37Z

If you are motivated, then it would be nice yes ! You could even add mine in addition that is giving more insights regarding the interesting properties that emerges and how to tune the parameters.

…entation

mwulfman · 2024-04-30T21:19:27Z

I have updated the documentation to add the mentioned articles.

* [core] Make 'jiminy::Model' and 'jiminy::Robot' copyable. (#791) * [core] Add stairs terrain (#797) * [core] Define position, velocity and effort limits at motor level. (#798) * [core] Implement velocity limits at motor-level. (#798) * [core] Allow direct access to all motors / sensors attached to a robot. (#798) * [core] Remove spring-damper position bounds dynamics to always use constraints. (#798) * [core] Remove damper velocity bounds dynamics. (#798) * [core] Define motor armature option at motor-side. (#798) * [core] Remove dedicated robot attributes for position, velocity and effort limits. (#798) * [core] Remove inefficient motor / sensor convenience helpers of 'Robot'. (#798) * [core] Log total joint effort instead of confusing motor effort on joint side. (#798) * [core] Enable attaching encoders either on motor or joint side. (#798) * [core] Mechanical reduction is now fully supported. (#798) * [core] Fix adding flexibility frame on root joint of fixed-base robot. (#798) * [core] Move 'enable(Velocity|Effort)Limit' options from AbstractMotor to SimpleMotor. (#798) * [core] Add velocity-torque slope when approaching maximum torque. (#798) * [core] Keep track of motor efforts both on motor and joint sides. (#801) * [core] Effort sensors now measure motor effort before transmission. (#801) * [core|python] Various cleanup. (#790) * [python] Introduce first-class trajectory object. (#790) * [python/log] Fix trajectory and sensor data extraction and replay from log. (#800) * [python/plot] Get around tabbed figure hanging for 'matplotlib>=3.8'. (#795) * [python/viewer] Add missing documentation. (#790) * [python/replay] Make specifying 'robot' optional in most helpers. (#800) * [gym/common] Add generic stacked quantity wrapper. (#783) * [gym/common] Do NOT pass 'truncated' argument to 'compute_reward'. (#783) * [gym/common] More generic quantity batching. (#783) * [gym/common] Pass 'info' in argument of 'has_terminated' for consistency. (#783) * [gym/common] Remove BaseJiminyEnv 'enforce_bounded_spaces' option by lack of use-case. (#783) * [gym/common] Add masked quantity. (#784) * [gym/common] Add average odometry velocity quantity. (#784) * [gym/common] Introduce reward objects. (#784) * [gym/common] Add locomotion rewards. (#784) * [gym/common] Add base reward mixture class to ease pipeline instantiation. (#786) * [gym/common] Add basic survive reward. (#786) * [gym/common] Increase timeout ratio from 10 to 15 to avoid Mac OS CI failure. (#786) * [gym/common] Add compose reward pipeline wrapper. (#787) * [gym/common] Support specifying reward in pipeline config. (#787) * [gym/common] Add simulation vs trajectory quantity evaluation mode. (#791) * [gym/common] Expose trajectory database in quantity manager. (#792) * [gym/common] Expose interpolate mode for trajectories. (#792) * [gym/common] Add support of trajectory reference in reward computation. (#792) * [gym/common] Enable specifying reference trajectories in pipeline config. (#792) * [gym/common] Get around some memory alignment issue with Boost < 1.78. (#794) * [gym/zoo] Update Atlas learning environment. (#800) * [gym/zoo] Update all robotic environments so that random actions do not trigger truncation. (#801) * [gym/rllib] Add symmetry surrogate loss in rllib custom ppo (#780) * [gym/rllib] Fix checkpoint save location. (#788) * [gym/rllib] Update checkpoint_dir so it is dependant on iteration number (#789) * [misc] Update install instructions on windows. (#787)

mwulfman added 4 commits April 30, 2024 23:18

Add mirror surrogate loss in rllib custom ppo

15a2b90

rename mirror_surrogate_loss -> symmetry_surrogate_loss and add docum…

3cb52ef

…entation

update doc

2b139f9

Update documentation to add more references

fccd03c

mwulfman force-pushed the add-mirror-surrogate branch from eb6f7e4 to fccd03c Compare April 30, 2024 21:18

mwulfman changed the title ~~[gym] Add mirror surrogate loss in rllib custom ppo~~ [gym] Add symmetry surrogate loss in rllib custom ppo Apr 30, 2024

duburcqa merged commit 80b5c44 into duburcqa:dev May 1, 2024
32 checks passed

duburcqa mentioned this pull request May 24, 2024

Jiminy release 1.8.6 #802

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[gym] Add symmetry surrogate loss in rllib custom ppo #780

[gym] Add symmetry surrogate loss in rllib custom ppo #780

mwulfman commented Apr 30, 2024 •

edited

Loading

duburcqa commented Apr 30, 2024

mwulfman commented Apr 30, 2024

duburcqa commented Apr 30, 2024

mwulfman commented Apr 30, 2024

[gym] Add symmetry surrogate loss in rllib custom ppo #780

[gym] Add symmetry surrogate loss in rllib custom ppo #780

Conversation

mwulfman commented Apr 30, 2024 • edited Loading

duburcqa commented Apr 30, 2024

mwulfman commented Apr 30, 2024

duburcqa commented Apr 30, 2024

mwulfman commented Apr 30, 2024

mwulfman commented Apr 30, 2024 •

edited

Loading