arm neon optimization for layernorm fp32/bf16s/fp16s (#5746) #6291
linux-x64-cpu-gcc.yml
on: push
linux-gcc
24m 35s
linux-gcc-cpp03-nostdio-nostring-simplestl
12m 32s
linux-gcc-avx512
6m 29s