feature(zjow): add qgpo policy for new DI-engine pipeline #757

zjowowen · 2023-12-07T08:21:02Z

Add qgpo policy for new DI-engine pipeline based on the implementation of repo https://github.com/ChenDRAG/CEP-energy-guided-diffusion.

ding/example/qgpo.py

ding/model/common/encoder.py

ding/policy/qgpo.py

ding/torch_utils/network/activation.py

dizoo/d4rl/config/halfcheetah_medium_expert_qgpo_config.py

codecov · 2023-12-27T14:05:10Z

Codecov Report

Attention: 739 lines in your changes are missing coverage. Please review.

Comparison is base (acd23e5) 76.82% compared to head (8002cd6) 75.80%.
Report is 3 commits behind head on main.

❗ Current head 8002cd6 differs from pull request most recent head e28b1c5. Consider uploading reports for the commit e28b1c5 to get more accurate results

Files	Patch %	Lines
...ng/torch_utils/diffusion_SDE/dpm_solver_pytorch.py	6.35%	427 Missing ⚠️
ding/model/template/qgpo.py	20.49%	128 Missing ⚠️
ding/policy/qgpo.py	24.09%	63 Missing ⚠️
ding/framework/middleware/functional/evaluator.py	1.61%	61 Missing ⚠️
.../framework/middleware/functional/data_processor.py	10.71%	50 Missing ⚠️
ding/torch_utils/network/res_block.py	27.27%	8 Missing ⚠️
ding/framework/middleware/functional/logger.py	0.00%	2 Missing ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##             main     #757      +/-   ##
==========================================
- Coverage   76.82%   75.80%   -1.03%     
==========================================
  Files         676      679       +3     
  Lines       54328    55272     +944     
==========================================
+ Hits        41737    41897     +160     
- Misses      12591    13375     +784

Flag	Coverage Δ
unittests	`75.80% <12.95%> (-1.03%)`	⬇️

Flags with carried forward coverage won't be shown. Click here to find out more.

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

ding/example/qgpo.py

ding/policy/qgpo.py

dizoo/d4rl/config/halfcheetah_medium_expert_qgpo_config.py

ding/torch_utils/network/res_block.py

ding/policy/qgpo.py

ding/model/common/encoder.py

ding/model/template/qgpo.py

ding/example/qgpo.py

ding/policy/qgpo.py

zjowowen · 2024-01-03T07:05:04Z

ding/policy/qgpo.py

+                        self.qt_update_momentum * param.data + (1 - self.qt_update_momentum) * target_param.data
+                    )
+
+                q0_loss = q0_loss.detach().cpu().numpy()


replace detach() by item()

ding/policy/qgpo.py

zjowowen · 2024-02-04T08:13:59Z

ding/framework/middleware/functional/logger.py

@@ -611,6 +611,9 @@ def _plot(ctx: "OfflineRLContext"):
            )

        if ctx.eval_value != -np.inf:
+            if hasattr(ctx, "info_for_logging"):
+                info_for_logging.update(ctx.info_for_logging)


add comment for it

zjowowen added 15 commits October 16, 2023 15:23

Add CEP

0e8c8b3

Add CEP

0b40be5

Merge branch 'main' of https://github.com/zjowowen/DI-engine into CEP

8ca8495

Add halfcheetah

ab04888

Add halfcheetah

11563ac

add d4rl envs

b5e6774

change setup.py

94bc61a

polish code

b8bc493

change config

48837e9

fix lr bug

e5d0b32

polish code for qgpo

0f20c91

polish code for qgpo

504a108

merge from main

b251a2d

polish code

a36d301

polish code

9263965

zjowowen added the algo Add new algorithm or improve old one label Dec 7, 2023

polish code

177f44a

PaParaZz1 mentioned this pull request Dec 11, 2023

Roadmap for DI-engine #548

Open

PaParaZz1 requested changes Dec 20, 2023

View reviewed changes

zjowowen added 7 commits December 26, 2023 15:29

Merge branch 'main' of https://github.com/zjowowen/DI-engine into CEP-pr

5cddcbe

polish code

3203918

polish code

824a5f1

polish code

b544200

Merge branch 'main' of https://github.com/zjowowen/DI-engine into CEP-pr

36022c8

polish code

62b4f84

polish code

a45f43b

PaParaZz1 changed the title ~~feature(zjow): Add qgpo policy for new DI-engine pipeline~~ feature(zjow): add qgpo policy for new DI-engine pipeline Dec 27, 2023

zjowowen added 2 commits December 27, 2023 21:25

polish code

ca59da9

Merge branch 'main' of https://github.com/zjowowen/DI-engine into CEP-pr

4a45e24

PaParaZz1 requested changes Dec 28, 2023

View reviewed changes

zjowowen added 4 commits December 29, 2023 22:06

polish code

8370185

Merge branch 'main' of https://github.com/zjowowen/DI-engine into CEP-pr

a2d03b8

add hopper walker2d qgpo config

62d8957

add doc

1763141