[X86/ARM] add gru mode for rnn #7026

mjp9527 · 2021-09-23T06:59:49Z

[X86/ARM] add gru mode for rnn, fix elementwise left problem, move cast op to host

… host, fix precision_profile bug

chenjiaoAngel · 2021-09-26T02:56:59Z

lite/backends/arm/math/gru.h

@@ -0,0 +1,248 @@
+// Copyright (c) 2019 PaddlePaddle Authors. All Rights Reserved.
+//


When you have time, please open a follow-up PR to update the copyright year: 2019 -> 2021.

chenjiaoAngel · 2021-09-26T03:00:18Z

lite/kernels/arm/rnn_compute.cc

+      cur_h_ptr[offset] = out_ptr[offset] + pre_h_ptr[offset] * mask_ptr_1[i];
+    }
+  }
+  if ("LSTM" == mode) {


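So if mode is not LSTM, nothing is done here?

mjp9527 (Sep 29, 2021): Right. GRU needs no handling here; only GRU and LSTM are supported at the moment.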
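To make the exchange above concrete, here is a minimal sketch of a per-step post-processing routine in which the hidden state is blended with its previous value for every mode, while the cell state is touched only for LSTM (GRU has no cell state). Everything here is illustrative and not taken from the PR: `BlendWithPrevious` and `PostprocessStep` are hypothetical names, the mask is assumed to be 1 for valid time steps and 0 for padded ones, and the contents of the LSTM branch are an assumption because the quoted hunk does not show them.

```cpp
#include <cassert>
#include <cstddef>
#include <string>
#include <vector>

// Hypothetical helper, not part of the PR. Blends the freshly computed state
// with the previous one so that padded time steps keep their old value.
// Assumption: mask[i] is 1.0f when batch item i has a valid step, 0.0f if padded.
void BlendWithPrevious(const std::vector<float>& mask,
                       const std::vector<float>& prev,
                       std::vector<float>* cur,
                       std::size_t hidden_size) {
  assert(cur->size() == prev.size());
  assert(cur->size() == mask.size() * hidden_size);
  for (std::size_t i = 0; i < mask.size(); ++i) {
    for (std::size_t j = 0; j < hidden_size; ++j) {
      const std::size_t offset = i * hidden_size + j;
      (*cur)[offset] =
          (*cur)[offset] * mask[i] + prev[offset] * (1.0f - mask[i]);
    }
  }
}

// The hidden state is blended for every mode. The cell state exists only for
// LSTM, so the GRU path skips that step entirely.
void PostprocessStep(const std::string& mode,
                     const std::vector<float>& mask,
                     const std::vector<float>& prev_h,
                     std::vector<float>* cur_h,
                     const std::vector<float>& prev_c,
                     std::vector<float>* cur_c,
                     std::size_t hidden_size) {
  BlendWithPrevious(mask, prev_h, cur_h, hidden_size);
  if (mode == "LSTM") {
    BlendWithPrevious(mask, prev_c, cur_c, hidden_size);
  }
}
```

With a mask of `{1, 0}`, the first batch item keeps its new state and the second falls back to the previous one; calling the routine with `"GRU"` leaves the cell-state buffer untouched.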

chenjiaoAngel · 2021-09-26T03:01:35Z

lite/kernels/arm/rnn_compute.cc

+    last_c_temp.Resize(init_h[layer_idx].dims());
+    last_c_temp.mutable_data<float>();
+    last_c_holder = &last_c_temp;
+  }


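Are there any other modes? Whether or not there are, please add an else LOG(FATAL) << "mode not supported yet" branch.

mjp9527 (Sep 29, 2021): The mode is already checked at the entry of the run function, so it does not need to be checked again afterwards.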

chenjiaoAngel · 2021-09-26T03:01:48Z

lite/kernels/arm/rnn_compute.cc

+               &output_tensors[i],
+               &vec[3 + offset * 4],
+               &weight_hh_tmp);
+    }


chenjiaoAngel · 2021-09-26T03:02:27Z

lite/kernels/arm/rnn_compute.cc

+  } else if ("GRU" == mode) {
+    gate_num = 3;
+  } else {
+    LOG(FATAL) << "X86 RNN ERROR: unsupport mode except gru and lstm,"


This is the ARM kernel, so the message should say ARM is unsupported, not X86.
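The mode check discussed in this hunk can be sketched as a single up-front validation that also picks the gate count. This is an illustration under stated assumptions, not the PR's code: `GateNumForMode` is a hypothetical helper, and it throws `std::invalid_argument` where the kernel calls `LOG(FATAL)`, purely so the sketch compiles on its own. The GRU gate count of 3 comes from the diff; 4 for LSTM follows the standard four-gate LSTM formulation.

```cpp
#include <stdexcept>
#include <string>

// Hypothetical helper, not part of the PR: maps an RNN mode to its gate count
// and rejects every other mode before any computation starts.
int GateNumForMode(const std::string& mode) {
  if (mode == "LSTM") {
    return 4;  // input, forget, cell candidate, output
  } else if (mode == "GRU") {
    return 3;  // update, reset, candidate
  }
  // The kernel uses LOG(FATAL) here; an exception keeps this sketch standalone.
  // The message names ARM, matching the backend this file belongs to.
  throw std::invalid_argument("ARM RNN ERROR: unsupported mode '" + mode +
                              "', only GRU and LSTM are supported");
}
```

Validating once at the entry point is what lets the later `if ("LSTM" == mode)` branches omit their own `else`: by the time they run, the mode is known to be one of the two supported values.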

chenjiaoAngel · 2021-09-26T03:05:15Z

lite/kernels/x86/rnn_compute.cc

+    last_c_temp.Resize(init_h[layer_idx].dims());
+    last_c_temp.mutable_data<float>();
+    last_c_holder = &last_c_temp;
+  }


Add an else LOG(FATAL) here.

chenjiaoAngel · 2021-09-26T03:05:44Z

lite/kernels/x86/rnn_compute.cc

+               &output_tensors[i],
+               &vec[3 + offset * 4],
+               &weight_hh_tmp);
+    }


Likewise, add an else here.

chenjiaoAngel · 2021-09-26T03:06:34Z

lite/kernels/x86/elementwise_op_function.h

@@ -0,0 +1,802 @@
+/* Copyright (c) 2016 PaddlePaddle Authors. All Rights Reserved.


weishengying

LGTM

mjp9527 added 8 commits September 22, 2021 20:53

[X86] Add GRU for RNN, complete elementwise op, move cast from arm to…

011b6d9

… host, fix precision_profile bug

pre-commit

0723667

[ARM] add RNN-GRU OP; Optimize RNN-GRU OP

539d7c7

fix complie bug

8c9aa2e

fix elementwise left problem

b4e522f

merge develop

a3ec029

fix windows ci

d9272a2

change arm cast test to host cast test

0b143ec

chenjiaoAngel reviewed Sep 26, 2021

weishengying approved these changes Sep 27, 2021

weishengying merged commit c33793e into PaddlePaddle:develop Sep 27, 2021

mjp9527 deleted the bigru branch November 28, 2022 12:05
