v0.0.3
github-actions
released this
08 Mar 10:06
·
0 commits
to 0d04571b614c944b5831d080882107a98b9c6e65
since this release
0.0.3 (2024-03-08)
Features
- adding
sm_scale
field for all attention APIs (#145) (85d4018) - enable
head_dim=256
for attention kernels (#132) (0372acc) - pytorch api of fp8 kv-cache (#156) (66ee066)
- support ALiBi (#146) (383518b)
Misc
Bug Fixes
- bugfix to pr 135 (#136) (3d55c71)
- fix bugs introduced in #132 (#135) (9b7b0b9)
- fix FindThrust.cmake (#161) (30fa584)