API improvement for paddle.nn.layer.state_dict: usability improvement #64358
Conversation
Your PR was submitted successfully. Thank you for contributing to this open-source project!
y = model(x)
y.backward()
st = model.state_dict(keep_vars=False)
detached_from_graph = (
This comparison is still hard to follow; suggest simply:
has_grad = (st["linear.weight"].grad == model.linear.weight.grad).all() and (st["linear.bias"].grad == model.linear.bias.grad).all() and st["model_buffer"].grad == model.model_buffer.grad
self.assertEqual(has_grad, False)
Since .grad returns None after detach, change it to this instead:
has_grad = (
(st["linear.weight"].grad is not None)
or (st["linear.bias"].grad is not None)
or (st["model_buffer"].grad is not None)
)
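The behavior behind this suggestion can be checked against PyTorch's analogous keep_vars parameter (a sketch of the general detach/.grad semantics, not Paddle's implementation):

```python
import torch

model = torch.nn.Linear(3, 1)
loss = model(torch.randn(2, 3)).sum()
loss.backward()

# keep_vars=False: entries are detached tensors, so .grad is None
# even though the live parameters still hold gradients.
st = model.state_dict(keep_vars=False)
assert st["weight"].grad is None
assert model.weight.grad is not None

# keep_vars=True: the live parameters are returned, so .grad survives.
st_live = model.state_dict(keep_vars=True)
assert st_live["weight"].grad is not None
```

This is why an `is not None` check on `.grad`, rather than an element-wise comparison, is the direct way to test that the returned tensors left the graph.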
y = model(x)
y.backward()
st = model.state_dict()
detached_from_graph = (
This comparison is still hard to follow; suggest simply:
has_grad = (st["linear.weight"].grad == model.linear.weight.grad).all() and (st["linear.bias"].grad == model.linear.bias.grad).all() and st["model_buffer"].grad == model.model_buffer.grad
self.assertEqual(has_grad, True)
Fixed.
y = model(x)
y.backward()
st = model.state_dict(keep_vars=False)
has_grad = (
Shouldn't this be:
(st["linear.weight"].grad is not None)
and (st["linear.bias"].grad is not None)
and (st["model_buffer"].grad is not None)
instead?
Fixed.
LGTM
@@ -1871,6 +1871,7 @@ def _state_dict_impl(
    structured_name_prefix="",
    include_non_persistable_buffer=False,
    use_hook=True,
    keep_vars=True,
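The added parameter can be read as a flag that gates detaching inside the state-dict builder. A minimal sketch of that pattern (hypothetical helper name, illustrated with PyTorch tensors rather than Paddle's actual code):

```python
import torch

def state_dict_impl(named_params, keep_vars=True):
    """Hypothetical helper: return live tensors when keep_vars is True,
    detached copies (no .grad, no graph) when it is False."""
    return {
        name: (p if keep_vars else p.detach())
        for name, p in named_params
    }

w = torch.nn.Parameter(torch.ones(2))
(w * 3).sum().backward()

live = state_dict_impl([("w", w)], keep_vars=True)
frozen = state_dict_impl([("w", w)], keep_vars=False)
assert live["w"] is w            # same object, grad preserved
assert frozen["w"].grad is None  # detached view, no grad
```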
If the default were False like PyTorch's (i.e., saving detached tensors), would that cause any problems?
Before this change, the original parameters were saved without being detached. Wouldn't changing the default to False be incompatible with existing code?
> Before this change, the original parameters were saved without being detached. Wouldn't changing the default to False be incompatible with existing code?

OK, please also submit a follow-up PR updating the documentation.
PR Category
User Experience
PR Types
Improvements
Description
Add a keep_vars parameter with default True; when it is False, the tensors in the returned state_dict are detached from the computation graph.
Note the parameter's default is the opposite of torch's: https://pytorch.org/docs/stable/generated/torch.nn.Module.html#torch.nn.Module.state_dict
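The default-value difference called out above can be demonstrated against torch itself, where keep_vars defaults to False (a sketch using PyTorch; the Paddle API described in this PR defaults the other way):

```python
import torch

model = torch.nn.Linear(2, 2)
model(torch.ones(1, 2)).sum().backward()

# torch's default is keep_vars=False: entries are detached copies
# with no gradient attached.
default_st = model.state_dict()
assert default_st["weight"].grad is None

# Paddle's proposed default (keep_vars=True) corresponds to torch's
# non-default call: the live parameter objects themselves are returned.
live_st = model.state_dict(keep_vars=True)
assert live_st["weight"] is model.weight
```

So Paddle callers who want torch-like, detached-from-graph snapshots must pass keep_vars=False explicitly, while existing code that relied on receiving the original parameters keeps working unchanged.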