v0.6.0

angeloskath released this 07 Mar 22:22

· 510 commits to main since this release

2830180

Highlights:

Faster quantized matrix-vector multiplies
- Benchmarks
mx.fast.scaled_dot_product_attention fused op

Core

Memory allocation API improvements
Faster GPU reductions for smaller sizes (between 2 and 7x)
- Benchmarks
mx.fast.scaled_dot_product_attention fused op
Faster quantized matrix-vector multiplications
Pickle support for mx.array

NN

Dilation on convolution layers

Bugfixes

Fix mx.topk
Fix reshape for zero sizes

Assets 2