[Feature request] Allow different network architectures for off-policy actor/critic #113

araffin · 2020-07-20T08:27:21Z

Currently, for on-policy algorithm we can specify net_arch=[dict(pi=[64], vf=[64])]

The idea would be to allow the same (expect the shared part which adds too much complexity) for off-policy algorithms:
net_arch=[dict(pi=[64], qf=[64])].

This should be fairly simple to implement.

The text was updated successfully, but these errors were encountered:

araffin added enhancement New feature or request help wanted Help from contributors is welcomed labels Jul 20, 2020

araffin changed the title ~~[New Feature] Allow different network architectures for off-policy actor/critic~~ [Feature request] Allow different network architectures for off-policy actor/critic Jul 20, 2020

araffin mentioned this issue Oct 11, 2020

Add custom arch for off-policy actor/critic networks #182

Merged

13 tasks

araffin removed the help wanted Help from contributors is welcomed label Oct 11, 2020

araffin closed this as completed in #182 Oct 13, 2020

araffin mentioned this issue Oct 13, 2020

Roadmap to Stable-Baselines3 V1.0 #1

Closed

42 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Feature request] Allow different network architectures for off-policy actor/critic #113

[Feature request] Allow different network architectures for off-policy actor/critic #113

araffin commented Jul 20, 2020

[Feature request] Allow different network architectures for off-policy actor/critic #113

[Feature request] Allow different network architectures for off-policy actor/critic #113

Comments

araffin commented Jul 20, 2020