1.0.1
- Add Eq128, i64 Mul, unsigned->float ConvertTo
- Faster sort for few unique keys, more robust pivot selection
- Fix: floating-point generator for sort tests, Min/MaxOfLanes for i16
- Fix: avoid always_inline in debug, link atomic
- GCC warnings: string.h, maybe-uninitialized, ignored-attributes
- GCC warnings: preprocessor int overflow, spurious use-after-free/overflow
- Doc: <=HWY_AVX3, Full32/64/128, how to use generic-inl