feat: expose decoupled kv-cache to pytorch api #383

yzh119 · 2024-07-19T09:01:34Z

Follow-up to #379.

🤖 I have created a release *beep* *boop*

---

## [0.1.1](v0.1.0...v0.1.1) (2024-07-20)

### Bugfix

* fix the invalid kernel configuration for architectures with small shared memory size ([#385](#385)) ([cdac57](cdac577))

### Features

* expose decoupled kv-cache to pytorch api ([#383](#383)) ([457a0ae](457a0ae))

### Performance Improvements

* use stmatrix in epilogue for sm90+ ([#380](#380)) ([c6f20d1](c6f20d1))

---

This PR was generated with [Release Please](https://github.com/googleapis/release-please). See [documentation](https://github.com/googleapis/release-please#release-please).

---------

Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: yzh119 <expye@outlook.com>

yzh119 added 2 commits July 19, 2024 09:01

upd

a9dd989

upd

d1460fc

yzh119 marked this pull request as ready for review July 20, 2024 01:23

yzh119 force-pushed the torch-decouple-kv branch from 4121cc8 to 93af3b8 Compare July 20, 2024 01:24

yzh119 merged commit 457a0ae into main Jul 20, 2024

github-actions bot mentioned this pull request Jul 20, 2024

chore(main): release 0.1.1 #381

Merged

upd

93af3b8

yzh119 deleted the torch-decouple-kv branch July 24, 2024 10:38

github-actions bot mentioned this pull request Jul 31, 2024

chore(main): release 0.1.4 #415

Merged

github-actions bot mentioned this pull request Dec 25, 2024

chore(main): release 0.3.0 #698

Closed
