Hi,
I found that at the line below:
CMSIS-NN/Source/NNSupportFunctions/arm_nn_mat_mul_kernel_s16.c
Line 93 in d071e9f
we use read_and_pad to expand the weight values from q7_t to q15_t, together with a pair of __PKHxx instructions to reorder the values from (a0, a2, a1, a3) back to (a0, a1, a2, a3).
My question is: why do we need these two PKHxx operations? It seems we could keep the (a0, a2, a1, a3) order as long as we process the input in the same way (I noticed that the 1x1 conv2d performs a similar operation without __PKHxx). Dropping them would save two instructions and so reduce the inference time.
Regards,
Crist
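For reference, here is a minimal scalar sketch of the byte-reordering pattern described above, assuming little-endian data. sxtb16/pkhbt/pkhtb are plain-C stand-ins for the __SXTB16/__PKHBT/__PKHTB intrinsics (modeled only for a shift of 16), and all variable names are illustrative rather than taken from the CMSIS-NN source:

```c
#include <stdint.h>
#include <stdio.h>

/* Model of __SXTB16: sign-extend bytes 0 and 2 of a word into two 16-bit lanes. */
static uint32_t sxtb16(uint32_t x)
{
    uint32_t lo = (uint32_t)(int32_t)(int8_t)(x & 0xFF) & 0xFFFF;
    uint32_t hi = (uint32_t)(int32_t)(int8_t)((x >> 16) & 0xFF) & 0xFFFF;
    return lo | (hi << 16);
}

/* Model of __PKHBT: bottom half of x, top half of (y << 16). */
static uint32_t pkhbt(uint32_t x, uint32_t y)
{
    return (x & 0xFFFF) | ((y << 16) & 0xFFFF0000u);
}

/* Model of __PKHTB: top half of x, bottom half of (y >> 16). */
static uint32_t pkhtb(uint32_t x, uint32_t y)
{
    return (x & 0xFFFF0000u) | ((y >> 16) & 0xFFFF);
}

int main(void)
{
    /* Four packed q7_t weights a0..a3, a0 in the least significant byte:
     * a0 = -1, a1 = 2, a2 = -3, a3 = 4. */
    uint32_t packed = 0x04FD02FF;

    uint32_t even = sxtb16(packed);                           /* lanes (a0, a2) */
    uint32_t odd  = sxtb16((packed >> 8) | (packed << 24));   /* ROR 8: lanes (a1, a3) */

    /* Without PKH, the expanded q15_t lanes sit in (a0, a2 | a1, a3) order.
     * The two PKH instructions restore (a0, a1 | a2, a3): */
    uint32_t out1 = pkhbt(even, odd);   /* lanes (a0, a1) */
    uint32_t out2 = pkhtb(odd, even);   /* lanes (a2, a3) */

    printf("even=0x%08X odd=0x%08X out1=0x%08X out2=0x%08X\n",
           even, odd, out1, out2);
    return 0;
}
```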
Hi @CristXu ,
Thanks for your comments!
You are right that we do additional ordering in some places.
We are looking into this now to see where we can get rid of PKHTB/PKHBT.
Thanks,
Måns
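To illustrate why dropping the PKH pair should be safe: an SMLAD-style dot product is invariant under any permutation applied to both operands, so keeping weights and input in the same interleaved (a0, a2, a1, a3) order does not change the accumulated sum. A minimal plain-C sketch, where smlad() models the __SMLAD intrinsic and the data values are made up for illustration:

```c
#include <stdint.h>
#include <stdio.h>

/* Model of __SMLAD: acc + (bottom lanes multiplied) + (top lanes multiplied). */
static int32_t smlad(uint32_t x, uint32_t y, int32_t acc)
{
    int16_t x0 = (int16_t)(x & 0xFFFF), x1 = (int16_t)(x >> 16);
    int16_t y0 = (int16_t)(y & 0xFFFF), y1 = (int16_t)(y >> 16);
    return acc + x0 * y0 + x1 * y1;
}

/* Pack two int16 lanes into one word, lo in the bottom half. */
static uint32_t pack16(int16_t lo, int16_t hi)
{
    return (uint32_t)(uint16_t)lo | ((uint32_t)(uint16_t)hi << 16);
}

int main(void)
{
    int16_t w[4] = {-1, 2, -3, 4};  /* weights a0..a3 */
    int16_t x[4] = {5, -6, 7, -8};  /* inputs         */

    /* Natural order: lane pairs (a0, a1) and (a2, a3). */
    int32_t s1 = smlad(pack16(w[0], w[1]), pack16(x[0], x[1]), 0);
    s1 = smlad(pack16(w[2], w[3]), pack16(x[2], x[3]), s1);

    /* Interleaved order as left by SXTB16: (a0, a2) and (a1, a3),
     * applied to BOTH operands. */
    int32_t s2 = smlad(pack16(w[0], w[2]), pack16(x[0], x[2]), 0);
    s2 = smlad(pack16(w[1], w[3]), pack16(x[1], x[3]), s2);

    printf("natural=%d interleaved=%d\n", s1, s2);  /* identical sums */
    return 0;
}
```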