Fix cuda kernel launch of grid sampler #33100

wanghaoshuang · 2021-05-25T06:17:26Z

PR types

Bug fixes

PR changes

OPs

Describe

Fix cuda kernel launch of grid sampler

Fix #29066

paddle-bot-old · 2021-05-25T06:17:29Z

Thanks for your contribution!
Please wait for the result of CI firstly. See Paddle CI Manual for details.

jerrywgz

建议增加一个输入较大的单测case

wanghaoshuang · 2021-05-31T01:45:39Z

@Avin0323

关于benchmark CI failing的说明

对于inputs:

grid (Variable) - dtype: float32, shape: [4, 256, 246, 2]
x (Variable) - dtype: float32, shape: [4, 1, 128, 128]

在该PR之前， block_size = 4256246 / 512, grid_size = 512, block_size超过了max_thread_num, 所以计算结果实际是错误的。
在该PR修改之后， block_size = 512, grid_size=4256246 / 512, 计算结果正确，benchmark耗时增加。

benchmark部分超时log如下：

2021-05-26 13:02:59 [check_op_benchmark_result.py:80] [INFO] ------ OP: grid_sample_3 (forward) ------
2021-05-26 13:02:59 [check_op_benchmark_result.py:82] [INFO] GPU time change: 7.08968% (develop: 0.0168862 -> PR: 0.0180833)
2021-05-26 13:02:59 [check_op_benchmark_result.py:84] [INFO] Total time change: -2.24423% (develop: 0.0382667 -> PR: 0.0374079)
2021-05-26 13:02:59 [check_op_benchmark_result.py:85] [INFO] backward: False
2021-05-26 13:02:59 [check_op_benchmark_result.py:86] [INFO] parameters:
2021-05-26 13:02:59 [check_op_benchmark_result.py:88] [INFO] 	grid (Variable) - dtype: float32, shape: [4, 256, 246, 2]
2021-05-26 13:02:59 [check_op_benchmark_result.py:88] [INFO] 	x (Variable) - dtype: float32, shape: [4, 1, 128, 128]
2021-05-26 13:02:59 [check_op_benchmark_result.py:88] [INFO] 	align_corners (bool): False
2021-05-26 13:02:59 [check_op_benchmark_result.py:88] [INFO] 	mode (string): bilinear
2021-05-26 13:02:59 [check_op_benchmark_result.py:88] [INFO] 	out_shape (list): [4, 1, 256, 256]
2021-05-26 13:02:59 [check_op_benchmark_result.py:88] [INFO] 	padding_mode (string): reflection

zhangting2020 · 2021-05-31T02:57:31Z

python/paddle/fluid/tests/unittests/test_grid_sampler_op.py

+        self.mode = "bilinear"
+
+    def test_check_grad_normal(self):
+        pass


large input导致慢的原因是单测框架的期望梯度算的较慢吧？有测过大概需要多久吗

如果已经用了skip_check_grad_ci，下面259～260就不需要写了

是的，单测框架的期望梯度算的较慢。单个case跑了20min，还没有完成。

已删除259~260

zhangting2020

LGTM for skip_check_grad_ci

jerrywgz

LGTM

Fix cuda kernel lanch of grid sampler

124493d

wanghaoshuang requested a review from jerrywgz May 25, 2021 06:22

jerrywgz reviewed May 25, 2021

View reviewed changes

Add large inputs for unitest of grid_sampler

9cdb343

jerrywgz previously approved these changes May 27, 2021

View reviewed changes

wanghaoshuang requested a review from zhangting2020 May 31, 2021 02:29

zhangting2020 reviewed May 31, 2021

View reviewed changes

wanghaoshuang dismissed jerrywgz’s stale review via 53903ae May 31, 2021 03:25

wanghaoshuang force-pushed the fix_grid_sampler branch from 53903ae to 47dfe55 Compare May 31, 2021 03:29

zhangting2020 previously approved these changes May 31, 2021

View reviewed changes

wanghaoshuang mentioned this pull request May 31, 2021

grid_sample的gpu和cpu版本结果不一致 #33216

Closed

Remove unused lines

1175e54

wanghaoshuang dismissed zhangting2020’s stale review via 1175e54 May 31, 2021 06:42

wanghaoshuang force-pushed the fix_grid_sampler branch from 47dfe55 to 1175e54 Compare May 31, 2021 06:42

jerrywgz approved these changes May 31, 2021

View reviewed changes

zhangting2020 approved these changes May 31, 2021

View reviewed changes

chalsliu approved these changes May 31, 2021

View reviewed changes

wanghaoshuang merged commit f61e6ee into PaddlePaddle:develop May 31, 2021

wanghaoshuang added a commit to wanghaoshuang/Paddle that referenced this pull request May 31, 2021

Fix cuda kernel launch of grid sampler (PaddlePaddle#33100)

5b69044

XiaoguangHu01 pushed a commit that referenced this pull request Jun 1, 2021

Fix cuda kernel launch of grid sampler (#33100) (#33232)

8a5a45f

wanghaoshuang deleted the fix_grid_sampler branch May 20, 2022 03:54

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix cuda kernel launch of grid sampler #33100

Fix cuda kernel launch of grid sampler #33100

wanghaoshuang commented May 25, 2021 •

edited

Loading

paddle-bot-old bot commented May 25, 2021

jerrywgz left a comment

wanghaoshuang commented May 31, 2021 •

edited

Loading

zhangting2020 May 31, 2021

wanghaoshuang May 31, 2021

zhangting2020 left a comment

jerrywgz left a comment

Fix cuda kernel launch of grid sampler #33100

Fix cuda kernel launch of grid sampler #33100

Conversation

wanghaoshuang commented May 25, 2021 • edited Loading

PR types

PR changes

Describe

paddle-bot-old bot commented May 25, 2021

jerrywgz left a comment

Choose a reason for hiding this comment

wanghaoshuang commented May 31, 2021 • edited Loading

关于benchmark CI failing的说明

zhangting2020 May 31, 2021

Choose a reason for hiding this comment

wanghaoshuang May 31, 2021

Choose a reason for hiding this comment

zhangting2020 left a comment

Choose a reason for hiding this comment

jerrywgz left a comment

Choose a reason for hiding this comment

wanghaoshuang commented May 25, 2021 •

edited

Loading

wanghaoshuang commented May 31, 2021 •

edited

Loading