arm neon optimization for layernorm fp32/bf16s/fp16s #6290
linux-x64-cpu-gcc.yml
on: pull_request
linux-gcc
24m 18s
linux-gcc-cpp03-nostdio-nostring-simplestl
12m 26s
linux-gcc-avx512
8m 36s