osx-arm64 optimal code generation #41128

sdmaclea · 2020-08-20T23:49:12Z

The Apple Silicon dev kit reports the following hardware features.

$ sysctl-a | grep hw.optional
hw.optional.floatingpoint: 1                                                   
hw.optional.watchpoint: 4           
hw.optional.breakpoint: 6                                                      
hw.optional.neon: 1                
hw.optional.neon_hpfp: 1              
hw.optional.neon_fp16: 1                                                       
hw.optional.armv8_1_atomics: 1  
hw.optional.armv8_crc32: 1                                                     
hw.optional.armv8_2_fhm: 0                                                     
hw.optional.amx_version: 0           
hw.optional.ucnormal_mem: 0                                                    
hw.optional.arm64: 1

I believe there is at least draft support for armv8_1_atomics, but given these are performance critical we should try to make sure we have used them in any perf critical code.

I believe we have enabled armv8_crc32 intrinsics

@tannergooding was looking for half precision floating point for AI work. Given that it is supported here it might be good to at least add the intrinsics. Maybe consider higher level support too.

category:cq
theme:vector-codegen
skill-level:expert
cost:large
impact:large

The text was updated successfully, but these errors were encountered:

BruceForstall · 2020-10-10T00:29:18Z

@echesakovMSFT Arm64 intrinsics and Apple silicon

echesakov · 2020-10-10T01:12:05Z

@sdmaclea Does it support ARMv8.3-CompNum ? I was planning to work on these too.

sdmaclea · 2020-10-10T01:38:21Z

The above list was unabridged. It is not on the list on the Apple Silicon prototype. We will have to see what is on the commercial hardware.

JulieLeeMSFT · 2022-06-14T18:17:42Z

@kunalspathak this is related to atomics. Please feel free to move to .NET 8.

kunalspathak · 2022-06-14T18:18:56Z

Yes. I will update this with my work.

kunalspathak · 2022-07-08T18:16:21Z

We have handled atomics in #71512. But I would like to keep this issue open for more features enable. I will move this to 8.0

neon-sunset · 2022-07-12T20:40:47Z

We have handled atomics in #71512. But I would like to keep this issue open for more features enable. I will move this to 8.0

If it helps, other features are detected on osx-arm64 since 3580ba7
While it doesn't cover everything reported like Sha3 and Sha512, which might get support in JIT similar to other cryptography intrinsics, I guess all existing ARM-specific features are taken advantage of more or less since xplat Vector64/128/256 changes.

p.s.: Since 12.0, now in 13.0 Beta 3, the only new sysctl -a hw.optional line is hw.optional.arm.FEAT_DIT: 1 which is security specific and seems to be untouched for now (see https://lore.kernel.org/all/20210211125900.22777-5-peter.maydell@linaro.org/ and golang/go#49702)

kunalspathak · 2023-06-05T17:53:38Z

Don't think anything actionable will be done in .NET 8.

sdmaclea added arch-arm64 area-CodeGen-coreclr CLR JIT compiler in src/coreclr/src/jit and related components such as SuperPMI os-macos-bigsur (macOS11) labels Aug 20, 2020

sdmaclea added this to the 6.0.0 milestone Aug 20, 2020

Dotnet-GitSync-Bot added the untriaged New issue has not been triaged by the area owner label Aug 20, 2020

BruceForstall removed the untriaged New issue has not been triaged by the area owner label Aug 24, 2020

BruceForstall added the JitUntriaged CLR JIT issues needing additional triage label Oct 28, 2020

BruceForstall removed the JitUntriaged CLR JIT issues needing additional triage label Nov 10, 2020

JulieLeeMSFT assigned sandreenko Mar 23, 2021

JulieLeeMSFT added the needs-further-triage Issue has been initially triaged, but needs deeper consideration or reconsideration label Mar 23, 2021

JulieLeeMSFT removed the needs-further-triage Issue has been initially triaged, but needs deeper consideration or reconsideration label Jun 7, 2021

sandreenko modified the milestones: 6.0.0, 7.0.0 Jul 19, 2021

JulieLeeMSFT assigned kunalspathak and unassigned sandreenko Jun 14, 2022

kunalspathak modified the milestones: 7.0.0, 8.0.0 Jul 8, 2022

kunalspathak modified the milestones: 8.0.0, Future Jun 5, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

osx-arm64 optimal code generation #41128

osx-arm64 optimal code generation #41128

sdmaclea commented Aug 20, 2020 •

edited by BruceForstall

Loading

BruceForstall commented Oct 10, 2020

echesakov commented Oct 10, 2020

sdmaclea commented Oct 10, 2020

JulieLeeMSFT commented Jun 14, 2022

kunalspathak commented Jun 14, 2022

kunalspathak commented Jul 8, 2022

neon-sunset commented Jul 12, 2022 •

edited

Loading

kunalspathak commented Jun 5, 2023

osx-arm64 optimal code generation #41128

osx-arm64 optimal code generation #41128

Comments

sdmaclea commented Aug 20, 2020 • edited by BruceForstall Loading

BruceForstall commented Oct 10, 2020

echesakov commented Oct 10, 2020

sdmaclea commented Oct 10, 2020

JulieLeeMSFT commented Jun 14, 2022

kunalspathak commented Jun 14, 2022

kunalspathak commented Jul 8, 2022

neon-sunset commented Jul 12, 2022 • edited Loading

kunalspathak commented Jun 5, 2023

sdmaclea commented Aug 20, 2020 •

edited by BruceForstall

Loading

neon-sunset commented Jul 12, 2022 •

edited

Loading