Flash-attn performance: remove cuda sync during inference #52129

Annotations

2 warnings

This job succeeded