arm neon optimization for layernorm fp32/bf16s/fp16s #4687
linux-mips-cpu-gcc.yml
on: pull_request
linux-gcc-mipsel
10m 1s
linux-gcc-mipsisa32r6el
9m 3s