Failed to use --fp16 and --use_dml_attn simultaneously in Whisper pytorch-directml #598

XciciciX · 2024-06-24T06:42:58Z

I tried to run Whisper with torch-directml. It runs with either the --fp16 flag or the --use_dml_attn flag on its own. However, when I pass both flags together, it fails with the following information:

joshjkim · 2024-08-19T21:45:19Z

@XciciciX Thank you for reporting this issue. Could you please try again with the latest Whisper sample? Multilingual models preserve the past_key_value tensors when detecting the language, which causes that parameter to be incorrect when running transcription inference on audio. The sample has been updated to remove DML attention from AudioEncoder, as there are no significant perf gains in the encoder.
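
To illustrate the stale-cache effect described above, here is a minimal toy sketch in plain Python. It is not code from the Whisper sample: `ToyDecoder`, `step`, `reset_cache`, and the helper functions are hypothetical names invented for illustration, and clearing the cache is only one assumed way to avoid the mismatch in this toy. It shows how a cache left over from a language-detection pass shifts the past length seen during transcription.

```python
class ToyDecoder:
    """Hypothetical stand-in for a decoder that keeps a key/value cache between calls."""

    def __init__(self):
        # One cached entry per token already processed.
        self.past_key_value = []

    def step(self, token):
        # Append this token's (pretend) key/value entry and return the cache
        # length, which an attention op would treat as the past sequence length.
        self.past_key_value.append(token)
        return len(self.past_key_value)

    def reset_cache(self):
        # Assumed remedy for this toy only: drop everything cached so far.
        self.past_key_value = []


def detect_language(decoder):
    # Language detection is modelled as a single decoding step on the start token.
    decoder.step("<|startoftranscript|>")


def transcribe(decoder, tokens):
    # Return the cache length observed at each transcription step.
    return [decoder.step(t) for t in tokens]


tokens = ["<|startoftranscript|>", "<|en|>", "<|transcribe|>"]

# Case 1: the cache from language detection is carried into transcription.
stale = ToyDecoder()
detect_language(stale)
stale_lengths = transcribe(stale, tokens)

# Case 2: the cache is cleared between the two passes.
fresh = ToyDecoder()
detect_language(fresh)
fresh.reset_cache()
fresh_lengths = transcribe(fresh, tokens)

print("stale cache lengths:", stale_lengths)
print("fresh cache lengths:", fresh_lengths)
```

In the first case every step sees one more cached entry than it should, which is the kind of off-by-one in past state that an attention operator with strict shape expectations could reject.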

joshjkim closed this as completed Aug 27, 2024
