[HuaweiAscendNPU] Optimize the IR mapping of LayerNorm and GroupNorm #9869

shentanyue · 2022-12-23T04:41:45Z

No description provided.

paddle-bot · 2022-12-23T04:41:48Z

Thanks for your contribution!

hong19860320 · 2022-12-23T08:24:45Z

lite/backends/nnadapter/nnadapter/src/driver/huawei_ascend_npu/converter/group_normalization.cc

+
+  /**
+   * Use small operators to calculate, and the formula is as follows:
+   * input = reshape(input, (batch_size, groups, -1))


input = reshape(input, shape=[batch_size, groups, -1])

hong19860320 · 2022-12-23T08:24:59Z

lite/backends/nnadapter/nnadapter/src/driver/huawei_ascend_npu/converter/group_normalization.cc

+   * input = reshape(input, (batch_size, groups, -1))
+   * mean = reduce_mean(input, axis=2, keep_dims=True)
+   * var = reduce_sum(square(input - mean), axis=2, keep_dims=True) / (channel *
+   * height * width / grous)


hong19860320 · 2022-12-23T08:25:10Z

lite/backends/nnadapter/nnadapter/src/driver/huawei_ascend_npu/converter/group_normalization.cc

+   * mean = reduce_mean(input, axis=2, keep_dims=True)
+   * var = reduce_sum(square(input - mean), axis=2, keep_dims=True) / (channel *
+   * height * width / grous)
+   * std = sqrt(var + epsilon


std = sqrt(var + epsilon)

hong19860320 · 2022-12-23T08:25:28Z

lite/backends/nnadapter/nnadapter/src/driver/huawei_ascend_npu/converter/group_normalization.cc

+   * height * width / grous)
+   * std = sqrt(var + epsilon
+   * output = (input - mean) / std
+   * output = reshape(output, (batch_size, channel, height, width))


reshape(output, shape=[batch_size, channel, height, width])
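
For readers following the review, the small-operator formula in the quoted comment can be sketched in plain Python. This is only an illustration of the arithmetic, not the converter code: the function name `group_norm_small_ops`, the flat NCHW list layout, and the default `epsilon` are assumptions made for the example. Like the quoted comment, it stops at the final reshape.

```python
import math


def group_norm_small_ops(x, batch_size, channel, height, width, groups,
                         epsilon=1e-5):
    """Normalize a flat NCHW list group by group (hypothetical helper)."""
    # Elements per (batch, group) block: channel * height * width / groups.
    block = channel * height * width // groups
    out = []
    for b in range(batch_size):
        for g in range(groups):
            # input = reshape(input, shape=[batch_size, groups, -1])
            start = (b * groups + g) * block
            chunk = x[start:start + block]
            # mean = reduce_mean(input, axis=2, keep_dims=True)
            mean = sum(chunk) / block
            # var = reduce_sum(square(input - mean), axis=2) / block
            var = sum((v - mean) ** 2 for v in chunk) / block
            # std = sqrt(var + epsilon)
            std = math.sqrt(var + epsilon)
            # output = (input - mean) / std
            out.extend((v - mean) / std for v in chunk)
    # The flat list is already in NCHW order, which corresponds to
    # reshape(output, shape=[batch_size, channel, height, width]).
    return out
```

Each block of `channel * height * width / groups` contiguous elements is normalized independently, which is why the reshape to `[batch_size, groups, -1]` followed by a reduction over axis 2 is sufficient.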

hong19860320 · 2022-12-23T08:26:11Z

lite/backends/nnadapter/nnadapter/src/driver/huawei_ascend_npu/converter/group_normalization.cc

+  SET_INPUT(reduce_sum_op, x, square_operator);
+  SET_INPUT(reduce_sum_op, axes, reduce_sum_axes_operator);
+  auto reduce_sum_operator = MAP_OUTPUT(reduce_sum_op, y, output_operand);
+  // Varience


hong19860320 · 2022-12-23T08:26:20Z

lite/backends/nnadapter/nnadapter/src/driver/huawei_ascend_npu/converter/group_normalization.cc

+  SET_INPUT(div_op, x1, reduce_sum_operator);
+  SET_INPUT(div_op, x2, block_num_operator);
+  auto varience_operator = MAP_OUTPUT(div_op, y, output_operand);
+  // Add:


hong19860320 · 2022-12-23T08:26:40Z

lite/backends/nnadapter/nnadapter/src/driver/huawei_ascend_npu/converter/group_normalization.cc

+  auto sqrt_op = converter->AddOperator<ge::op::Sqrt>(output_operand, "sqrt");
+  SET_INPUT(sqrt_op, x, add_operator);
+  auto std_operator = MAP_OUTPUT(sqrt_op, y, output_operand);
+  // Input Normlazation


Normalization

hong19860320 · 2022-12-23T08:27:58Z

lite/backends/nnadapter/nnadapter/src/driver/huawei_ascend_npu/converter/layer_normalization.cc

+     * output = scale *((x - mean) / np.sqrt(variance + epsilon)) + bias
+     *
+     */
+    auto batch_size = ProductionOfDimensions(


Do the small operators perform better than the LayerNorm operator in all cases?

In theory, yes: https://gitee.com/ascend/modelzoo/issues/I5IA99?from=project-issue

Measured performance does improve somewhat as well.
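
As a point of reference for this exchange, the formula quoted from the layer_normalization.cc comment, `output = scale * ((x - mean) / np.sqrt(variance + epsilon)) + bias`, can be written as a chain of elementwise and reduction steps. The sketch below is plain Python, not the converter code. The name `layer_norm_small_ops`, the assumption that the input is already flattened to rows of shape `(batch_size, -1)`, and the default `epsilon` are all made up for the example.

```python
import math


def layer_norm_small_ops(rows, scale, bias, epsilon=1e-5):
    """Apply layer normalization to each row (hypothetical helper).

    rows:  list of equal-length lists, one per flattened batch entry
    scale: per-element scale, same length as a row
    bias:  per-element bias, same length as a row
    """
    out = []
    for row in rows:
        n = len(row)
        # mean over the normalized axis
        mean = sum(row) / n
        # variance = mean(square(x - mean))
        variance = sum((v - mean) ** 2 for v in row) / n
        # 1 / sqrt(variance + epsilon)
        inv_std = 1.0 / math.sqrt(variance + epsilon)
        # output = scale * ((x - mean) / sqrt(variance + epsilon)) + bias
        out.append([s * (v - mean) * inv_std + b
                    for v, s, b in zip(row, scale, bias)])
    return out
```

Whether such a decomposition is faster than a fused LayerNorm operator depends on the hardware and software stack; the thread reports an improvement on Huawei Ascend NPU only.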

hong19860320

LGTM

optimizer layer_norm and group_norm test=huawei_ascend_npu

f1e0c3b

shentanyue requested review from mjp9527, zhupengyang and hong19860320 as code owners December 23, 2022 04:41

hong19860320 reviewed Dec 23, 2022

code_style test=huawei_ascend_npu

d3988c6

hong19860320 approved these changes Jan 3, 2023

shentanyue merged commit a983d6a into PaddlePaddle:develop Jan 3, 2023

shentanyue deleted the optimizer_layer_norm_and_group_norm branch January 3, 2023 03:45
