chore: uniform sac network sizes #145

limbryan · 2023-03-17T20:32:56Z

This PR makes the Deep RL algorithms more uniform. It lets the SAC, DIAYN and DADS implementations use separate policy (actor) and critic architectures, as the TD3 implementation already does. This gives more flexibility and is sometimes needed for comparison with QD algorithms, which do not necessarily have a critic: the critic may need a larger network while the actor can stay smaller.

This PR changes:

hidden_layer_sizes is replaced by policy_hidden_layer_size and critic_hidden_layer_size for SAC, DIAYN and DADS
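
To illustrate what the split allows, here is a minimal sketch, not the actual QDax code. SketchSacConfig, mlp_layer_shapes and count_parameters are hypothetical names made up for this example, and the observation and action dimensions are arbitrary; only the two field names come from this PR.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class SketchSacConfig:
    """Hypothetical stand-in for a SAC-style config (not the QDax class)."""

    # A single hidden_layer_sizes field used to size both networks;
    # the two fields below let the actor and the critic differ.
    policy_hidden_layer_size: Tuple[int, ...] = (256, 256)
    critic_hidden_layer_size: Tuple[int, ...] = (256, 256)


def mlp_layer_shapes(
    in_dim: int, hidden: Tuple[int, ...], out_dim: int
) -> List[Tuple[int, int]]:
    """Return the (fan_in, fan_out) shape of each dense layer of an MLP."""
    dims = (in_dim, *hidden, out_dim)
    return list(zip(dims[:-1], dims[1:]))


def count_parameters(shapes: List[Tuple[int, int]]) -> int:
    """Count weights plus biases over all dense layers."""
    return sum(fan_in * fan_out + fan_out for fan_in, fan_out in shapes)


# Small actor, larger critic (sizes chosen only for illustration).
config = SketchSacConfig(
    policy_hidden_layer_size=(64, 64),
    critic_hidden_layer_size=(256, 256),
)
obs_dim, action_dim = 8, 2  # arbitrary illustrative dimensions

# Assumption: the policy outputs a mean and a log-std per action dimension,
# and the critic maps (observation, action) to a scalar value.
policy_shapes = mlp_layer_shapes(
    obs_dim, config.policy_hidden_layer_size, 2 * action_dim
)
critic_shapes = mlp_layer_shapes(
    obs_dim + action_dim, config.critic_hidden_layer_size, 1
)

print("policy parameters:", count_parameters(policy_shapes))
print("critic parameters:", count_parameters(critic_shapes))
```

With one shared field, shrinking the actor to match a QD policy would also shrink the critic; with two fields the critic keeps its capacity.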

Checks

a clear description of the PR has been added
sufficient tests have been written
relevant section added to the documentation (N/A)
example notebook added to the repo (N/A)
clean docstrings and comments have been written (N/A)
if any issue/observation has been discovered, a new issue has been opened (N/A)

Future improvements

For DADS and DIAYN, the discriminator or dynamics architecture could also be exposed. For now it is set to the critic architecture.
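
One possible shape for that follow-up, as a sketch only: an optional field that falls back to the critic architecture when unset, which would keep the current behaviour as the default. SketchDiaynConfig, discriminator_hidden_layer_size and resolved_discriminator_size are hypothetical names and are not part of this PR or of QDax.

```python
from dataclasses import dataclass
from typing import Optional, Tuple


@dataclass
class SketchDiaynConfig:
    """Hypothetical stand-in for a DIAYN-style config (not the QDax class)."""

    policy_hidden_layer_size: Tuple[int, ...] = (256, 256)
    critic_hidden_layer_size: Tuple[int, ...] = (256, 256)
    # Hypothetical future field: None means "reuse the critic architecture".
    discriminator_hidden_layer_size: Optional[Tuple[int, ...]] = None

    def resolved_discriminator_size(self) -> Tuple[int, ...]:
        """Return the discriminator sizes, defaulting to the critic sizes."""
        if self.discriminator_hidden_layer_size is None:
            return self.critic_hidden_layer_size
        return self.discriminator_hidden_layer_size


# Default: the discriminator follows the critic, as described above.
default_config = SketchDiaynConfig(critic_hidden_layer_size=(128, 128))
print(default_config.resolved_discriminator_size())

# Explicit override: the discriminator gets its own architecture.
custom_config = SketchDiaynConfig(discriminator_hidden_layer_size=(32, 32))
print(custom_config.resolved_discriminator_size())
```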

codecov-commenter · 2023-03-17T21:14:07Z

Codecov Report

Merging #145 (cad8b1f) into develop (6fa19e7) will increase coverage by 0.01%.
The diff coverage is 100.00%.

@@             Coverage Diff             @@
##           develop     #145      +/-   ##
===========================================
+ Coverage    92.28%   92.29%   +0.01%     
===========================================
  Files          116      116              
  Lines         6763     6772       +9     
===========================================
+ Hits          6241     6250       +9     
  Misses         522      522

Impacted Files	Coverage Δ
qdax/baselines/dads.py	`97.00% <ø> (ø)`
qdax/baselines/diayn.py	`93.07% <ø> (ø)`
qdax/core/neuroevolution/networks/dads_networks.py	`94.02% <ø> (ø)`
...dax/core/neuroevolution/networks/diayn_networks.py	`100.00% <ø> (ø)`
qdax/core/neuroevolution/networks/sac_networks.py	`100.00% <ø> (ø)`
qdax/baselines/sac.py	`94.37% <100.00%> (+0.03%)`	⬆️
qdax/baselines/sac_pbt.py	`96.49% <100.00%> (+0.03%)`	⬆️
tests/baselines_test/dads_smerl_test.py	`97.18% <100.00%> (+0.04%)`	⬆️
tests/baselines_test/dads_test.py	`96.96% <100.00%> (+0.04%)`	⬆️
tests/baselines_test/diayn_smerl_test.py	`97.01% <100.00%> (+0.04%)`	⬆️
... and 4 more

felixchalumeau

Notebooks might need to be updated as well.

felixchalumeau

Looks good 👍

limbryan added 2 commits March 17, 2023 20:26

separate actor and critic architecture definition for all sac related…

d4a38e9

… networks. dads and diayn too

pre-commit fixes

1df3706

limbryan changed the base branch from main to develop March 17, 2023 20:33

limbryan requested a review from felixchalumeau March 19, 2023 16:56

felixchalumeau requested changes Mar 28, 2023

made the changes also in the example notebooks

cad8b1f

limbryan requested a review from felixchalumeau March 28, 2023 17:58

felixchalumeau approved these changes Mar 29, 2023

limbryan merged commit 6f78a4a into develop Mar 29, 2023

limbryan deleted the chore/uniform_sac_network_sizes branch March 30, 2023 01:19