feature(zjow): add wandb logger features; fix relative bugs for wandb online logger #579

zjowowen · 2023-02-10T07:34:37Z

Description

Add and fix wandb logger features for rendering videos and logging information during training; tested with the TD3, DDPG, and SAC algorithms.
Fix related bugs in the wandb online logger.
Copy the changes to the wandb offline logger.


ding/framework/middleware/functional/collector.py

ding/policy/ddpg.py

ding/framework/middleware/functional/logger.py


codecov · 2023-02-28T11:54:03Z

Codecov Report

Merging #579 (bb35f90) into main (275141b) will decrease coverage by 0.39%.
The diff coverage is 10.99%.

❗ Current head bb35f90 differs from pull request most recent head 6f49d0a. Consider uploading reports for the commit 6f49d0a to get more accurate results

@@            Coverage Diff             @@
##             main     #579      +/-   ##
==========================================
- Coverage   83.34%   82.96%   -0.39%     
==========================================
  Files         569      570       +1     
  Lines       46819    47013     +194     
==========================================
- Hits        39022    39004      -18     
- Misses       7797     8009     +212
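
As a quick sanity check on the figures above, the following minimal sketch recomputes the deltas from the reported counts. It assumes that lines equal hits plus misses (which holds for the numbers shown) and that coverage is hits divided by lines; how Codecov rounds the percentages is not stated in the report, so the printed ratios may differ from the table in the last digit. The dictionary layout and the `coverage_ratio` helper are illustrative names, not part of Codecov.

```python
# Counts copied from the coverage diff above.
base = {"lines": 46819, "hits": 39022, "misses": 7797}  # main (275141b)
head = {"lines": 47013, "hits": 39004, "misses": 8009}  # #579 (bb35f90)


def coverage_ratio(report):
    """Fraction of lines hit (assumed definition; rounding rules are not shown in the report)."""
    return report["hits"] / report["lines"]


# Each column of the diff should be internally consistent.
for report in (base, head):
    assert report["hits"] + report["misses"] == report["lines"]

# Per-row change between main and the PR head.
delta = {key: head[key] - base[key] for key in base}
print(delta)
print(f"{coverage_ratio(base):.4f} -> {coverage_ratio(head):.4f}")
```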

| Flag | Coverage Δ | |
|---|---|---|
| unittests | `82.96% <10.99%> (-0.39%)` | ⬇️ |

Flags with carried forward coverage won't be shown.

| Impacted Files | Coverage Δ | |
|---|---|---|
| ding/bonus/__init__.py | `0.00% <0.00%> (ø)` | |
| ding/bonus/config.py | `0.00% <0.00%> (ø)` | |
| ding/bonus/ppof.py | `0.00% <0.00%> (ø)` | |
| ding/bonus/td3.py | `0.00% <0.00%> (ø)` | |
| ding/framework/middleware/functional/ctx_helper.py | `41.17% <ø> (ø)` | |
| ding/framework/middleware/functional/logger.py | `22.44% <13.63%> (-1.28%)` | ⬇️ |
| ding/envs/env_manager/base_env_manager.py | `87.91% <40.00%> (-0.82%)` | ⬇️ |
| ding/policy/ddpg.py | `82.71% <47.36%> (-5.13%)` | ⬇️ |
| ding/envs/env/ding_env_wrapper.py | `84.07% <50.00%> (+4.31%)` | ⬆️ |
| ding/framework/middleware/functional/evaluator.py | `41.99% <50.00%> (-0.12%)` | ⬇️ |
| ... and 6 more | | |

... and 8 files with indirect coverage changes


ding/bonus/ppof.py

ding/bonus/td3.py

ding/policy/ddpg.py


PaParaZz1 · 2023-03-09T07:12:06Z

ding/framework/middleware/functional/collector.py

@@ -1,6 +1,7 @@
 from typing import TYPE_CHECKING, Callable, List, Tuple, Any
 from easydict import EasyDict
 from functools import reduce
+import numpy as np


Remove this import.

PaParaZz1 · 2023-03-09T07:12:15Z

ding/framework/middleware/functional/evaluator.py

@@ -257,7 +257,7 @@ def _evaluate(ctx: Union["OnlineRLContext", "OfflineRLContext"]):
                eval_monitor.update_video(env.ready_imgs)
                eval_monitor.update_output(inference_output)
            output = [v for v in inference_output.values()]
-            action = [to_ndarray(v['action']) for v in output]  # TBD
+            action = np.array([to_ndarray(v['action']) for v in output])  # TBD


the same problem

ding/bonus/config.py

ding/bonus/ppof.py

PaParaZz1 · 2023-03-09T07:20:15Z

ding/policy/td3.py

+
+    def monitor_vars(self) -> List[str]:
+        variables = ["q_value", "target q_value", "loss", "lr", "entropy", "target_q_value", "td_error"]
+        return variables


Return the list directly.
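
A minimal sketch of what this suggestion could look like. `ExamplePolicy` is a hypothetical stand-in for the policy class in `ding/policy/td3.py`; only the `monitor_vars` signature and the list entries are taken from the diff above, copied verbatim.

```python
from typing import List


class ExamplePolicy:  # hypothetical stand-in, not the actual DI-engine policy class
    def monitor_vars(self) -> List[str]:
        # Return the list literal directly instead of binding it to a
        # temporary `variables` name first. Entries are copied verbatim
        # from the diff under review.
        return ["q_value", "target q_value", "loss", "lr", "entropy", "target_q_value", "td_error"]


print(ExamplePolicy().monitor_vars())
```

The behavior is unchanged; the method is just one statement shorter.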

PaParaZz1 · 2023-03-09T07:21:25Z

ding/bonus/td3.py

+    wandb_url: str
+
+
+class TD3:


Rename to `OffPolicyAgent`.


zjowowen and others added 12 commits November 4, 2022 14:04

td3 fix

e571f50

Merge branch 'opendilab:main' into benchmark-2

a614e3f

Add benchmark config file.

9060c53

Merge branch 'opendilab:main' into benchmark-2

731a2ad

add main

82a4944

fix

ad616ff

fix

f1aba9c

add feature to wandb;fix bugs

448daa1

merge main

1e18f25

format code

8de9b9e

remove files.

f36bec8

polish code

e5ec188

zjowowen added the bug, enhancement, P2, and refactor labels Feb 10, 2023

zjowowen self-assigned this Feb 10, 2023

zjowowen changed the title ~~feature(zjow): Add wandb logger features; fix relative bugs for wandb online logger~~ feature(zjow): add wandb logger features; fix relative bugs for wandb online logger Feb 10, 2023

PaParaZz1 mentioned this pull request Feb 12, 2023

Roadmap for DI-engine #548

Open

PaParaZz1 removed the P2 label Feb 13, 2023

Merge branch 'main' of https://github.com/zjowowen/DI-engine into benchmark-3

46f64e6

PaParaZz1 requested changes Feb 22, 2023


ding/framework/middleware/functional/collector.py (outdated, resolved)

ding/policy/ddpg.py (resolved)

ding/policy/ddpg.py (outdated, resolved)

ding/framework/middleware/functional/logger.py (resolved)

PaParaZz1 removed the bug label Feb 23, 2023

zjowowen added 3 commits February 24, 2023 17:53

Merge branch 'main' of https://github.com/zjowowen/DI-engine into benchmark-3

e520359

fix td3 policy

6a9a565

Add td3

0222c04

zjowowen added 3 commits February 28, 2023 20:42

Add td3 env

929776b

Add td3 env

4fba3b9

polish code

0257ae9