Add ARM NEON intrinsics to unpack_yuy2 #13270

fateshelled · 2024-08-16T15:50:08Z

Changes:

adding unpack_yuy2_neon function
- unpack yuy2 to y8, y16, rgb8, rgba8, bgr8 and bgra8 format.
- tested on Ubuntu 22.04, OrangePi5 (RK3588s, 8GB of RAM) and RealSense D435.

sysrsbuild · 2024-08-16T15:50:13Z

Can one of the admins verify this patch?

fateshelled · 2024-08-19T23:20:31Z

I explained too little, my apologies.
I added the optimization code for ARM CPUs because the CPU load was too large when using ARM CPUs.
Please review if you like.

Nir-Az · 2024-08-28T20:56:23Z

Hi @fateshelled ,
Thanks for the PR, it will take some time but we will get to reviewing it.
Maybe you can undo format changes to make the PR more readable?
I see rs.cpp have format changes added..

fateshelled · 2024-08-29T15:24:29Z

Hi @Nir-Az ,
Thanks for the reply.
I fixed format changes. Please review it.

Nir-Az · 2024-09-03T11:36:43Z

It will take us some time to review and validate this PR on dedicated HW.
I would suggest to split bug fixes and new features as we may merge 1 faster than the other :)

fateshelled · 2024-09-03T14:21:59Z

Thanks for the reply.
I have changed the request to a pull request for new features only.

Nir-Az · 2024-09-04T12:21:49Z

src/proc/neon/image-neon.cpp

+            {
+                // Load 16 pixels
+                const uint8x8x4_t yuyv = vld4_u8(reinterpret_cast<const uint8_t *>(&src[i * 2]));
+                // yuyv.val[0] = y0, yuyv.val[1] = u, yuyv.val[2] = y1, yuyv.val[3] = v


It wasn't necessary, so I fixed it.

Nir-Az · 2024-09-04T12:23:13Z

src/proc/neon/image-neon.cpp

+
+        void unpack_yuy2_neon_y8(uint8_t * const d[], const uint8_t * s, int n)
+        {
+            unpack_yuy2_neon<RS2_FORMAT_Y8>(d, s, n);


Why not calling the templated function at the first place instead of adding functions that call it?

I used the code image-avx.cpp as a reference, but is this wrong?

librealsense/src/image-avx.cpp

Line 253 in e1688cc

unpack_yuy2<RS2_FORMAT_Y8>(d, s, n);

Not wrong.. just redundant.|
But since our code is a reference let's keep as is

Nir-Az · 2024-09-17T12:59:37Z

@fateshelled appreciate your contribution, we are always happy to integrate community pull requests :)
Since this feature looks safe for regression and would benefit you and other users, I merged it based on your testing.

Thank you :)

fateshelled · 2024-09-17T13:59:39Z

Thank you for merging the PR.
I am very happy to contribute.

fateshelled changed the title ~~Add NEON intrinsics to unpack_yuy2~~ Add ARM NEON intrinsics to unpack_yuy2 Aug 19, 2024

NEON support unpack_yuy2

a13850d

fateshelled force-pushed the neon-yuy2-support branch from 0e54f81 to a13850d Compare September 3, 2024 14:18

Nir-Az reviewed Sep 4, 2024

View reviewed changes

fateshelled added 2 commits September 5, 2024 23:47

remove unnecessary reinterpret_cast

f81c563

bug fix

766ae39

Nir-Az approved these changes Sep 17, 2024

View reviewed changes

Nir-Az merged commit 5a8da0a into IntelRealSense:development Sep 17, 2024
17 of 19 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add ARM NEON intrinsics to unpack_yuy2 #13270

Add ARM NEON intrinsics to unpack_yuy2 #13270

fateshelled commented Aug 16, 2024 •

edited

Loading

sysrsbuild commented Aug 16, 2024

fateshelled commented Aug 19, 2024

Nir-Az commented Aug 28, 2024

fateshelled commented Aug 29, 2024

Nir-Az commented Sep 3, 2024

fateshelled commented Sep 3, 2024

Nir-Az Sep 4, 2024

fateshelled Sep 5, 2024

Nir-Az Sep 4, 2024

fateshelled Sep 5, 2024

Nir-Az Sep 17, 2024

Nir-Az commented Sep 17, 2024

fateshelled commented Sep 17, 2024

Add ARM NEON intrinsics to unpack_yuy2 #13270

Add ARM NEON intrinsics to unpack_yuy2 #13270

Conversation

fateshelled commented Aug 16, 2024 • edited Loading

sysrsbuild commented Aug 16, 2024

fateshelled commented Aug 19, 2024

Nir-Az commented Aug 28, 2024

fateshelled commented Aug 29, 2024

Nir-Az commented Sep 3, 2024

fateshelled commented Sep 3, 2024

Nir-Az Sep 4, 2024

Choose a reason for hiding this comment

fateshelled Sep 5, 2024

Choose a reason for hiding this comment

Nir-Az Sep 4, 2024

Choose a reason for hiding this comment

fateshelled Sep 5, 2024

Choose a reason for hiding this comment

Nir-Az Sep 17, 2024

Choose a reason for hiding this comment

Nir-Az commented Sep 17, 2024

fateshelled commented Sep 17, 2024

fateshelled commented Aug 16, 2024 •

edited

Loading