Flash-attn performance: remove cuda sync during inference #33570

Merged
Cyrilvallez merged 1 commit into huggingface:main from Cyrilvallez:fix-flash, Oct 7, 2024