
[Retiarii] Policy-based RL Strategy #3650

Merged (7 commits into microsoft:master on May 25, 2021)

Conversation


@ultmaster (Contributor) commented May 17, 2021

This PR adds a family of RL strategies based on tianshou. The default built-in algorithm is PPO; a usage sketch follows the TODO list below.

TODOs:

  • logging
  • tests
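
For orientation, a minimal sketch of how the new strategy might be invoked. The class name ``PolicyBasedRL`` is inferred from the PR title and the exact import path is an assumption; of the arguments shown, only ``policy_fn`` and ``asynchronous`` are confirmed by the docstring quoted in the review below:

    # Hypothetical usage sketch; ``PolicyBasedRL`` and the import path are
    # assumptions based on this PR's title, not a verified API.
    import nni.retiarii.strategy as strategy

    # Use the built-in PPO policy; pass policy_fn=... to supply a custom
    # tianshou policy, and asynchronous=False to make each collection step
    # wait for every environment.
    rl = strategy.PolicyBasedRL(asynchronous=True)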

@ultmaster ultmaster added the NAS label May 17, 2021
@ultmaster ultmaster self-assigned this May 17, 2021
@ultmaster ultmaster marked this pull request as ready for review May 20, 2021 05:43
Takes ``ModelEvaluationEnv`` as input and returns a policy. See ``_default_policy_fn`` for an example.
asynchronous : bool
If true, in each step, the collector won't wait for all the envs to complete.
This should generally not affect the result, but might affect the efficiency. Note that slightly more trials
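
A rough sketch of what a custom ``policy_fn`` could return, using tianshou's standard discrete-action nets. This is not the PR's ``_default_policy_fn``; the gym-style space attributes used here are assumptions, and real observations from ``ModelEvaluationEnv`` may need a custom preprocessor:

    import torch
    from tianshou.policy import PPOPolicy
    from tianshou.utils.net.common import Net
    from tianshou.utils.net.discrete import Actor, Critic

    def my_policy_fn(env):
        # Assumed: env exposes gym-style observation_space/action_space.
        net = Net(env.observation_space.shape, hidden_sizes=[64, 64])
        actor = Actor(net, env.action_space.n)
        critic = Critic(net)
        # The set union dedupes the parameters of the shared preprocess net.
        optim = torch.optim.Adam(
            set(actor.parameters()) | set(critic.parameters()), lr=1e-4)
        # Categorical distribution over discrete architecture choices.
        return PPOPolicy(actor, critic, optim, torch.distributions.Categorical)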
Contributor

I don't understand: why does asynchronous mode not affect the result?

ultmaster (Contributor, Author)

Synchronous doesn't mean single-process sampling; both synchronous and asynchronous modes have parallelism. "Asynchronous" adds a mechanism that gives up on an environment within a step when it hasn't finished yet.

Refer to https://tianshou.readthedocs.io/en/master/tutorials/cheatsheet.html#parallel-sampling if you're interested. It's a bit involved, and I don't think I can make it clear here in a few words.
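
As a concrete illustration of the two modes from that cheatsheet, a sketch using tianshou's vectorized environments (CartPole is a stand-in task here, not this PR's NAS environment):

    import gym
    from tianshou.env import SubprocVectorEnv

    # Eight parallel copies of a toy environment.
    env_fns = [lambda: gym.make('CartPole-v0') for _ in range(8)]

    # Synchronous: every step waits for all 8 environments to finish.
    sync_venv = SubprocVectorEnv(env_fns)

    # Asynchronous: a step returns once 4 environments finish (or 0.2 s pass),
    # giving up on the stragglers for that step.
    async_venv = SubprocVectorEnv(env_fns, wait_num=4, timeout=0.2)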

@ultmaster ultmaster merged commit 122b5b8 into microsoft:master May 25, 2021