Slow Stable Diffusion #735
Comments
Are you using cuDNN and flash-attention? If not, enabling them is likely to speed up generation massively. You can turn them on via |
I need some help here. I am failing to build 'flash-attn'. I just installed [... many other rerun-if-changed definitions above; showing only the relevant output below]
cargo:rerun-if-changed=kernels/static_switch.h
cargo:rustc-env=CUDA_INCLUDE_DIR=C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v12.2\include
cargo:rerun-if-env-changed=CUDA_COMPUTE_CAP
cargo:rustc-env=CUDA_COMPUTE_CAP=sm_86
cutlass/include\cute/numeric/math.hpp(299): error: identifier "not" is undefined
typename std::enable_if<(not std::is_unsigned<T>::value)>::type* = nullptr>
^
cutlass/include\cute/numeric/math.hpp(299): error: expected a ")"
typename std::enable_if<(not std::is_unsigned<T>::value)>::type* = nullptr>
^
cutlass/include\cute/numeric/math.hpp(299): error: expected a "," or ">"
typename std::enable_if<(not std::is_unsigned<T>::value)>::type* = nullptr>
^
cutlass/include\cute/numeric/math.hpp(299): error: the global scope has no "type"
typename std::enable_if<(not std::is_unsigned<T>::value)>::type* = nullptr>
^ |
Googling for this exact error, I came across this issue. In a nutshell, flash-attn-v2 doesn't seem to support building on Windows at the moment because of cutlass. |
Stable Diffusion is super slow. It takes more than 30 seconds to generate an example image with the default configuration on an RTX 4090 with CUDA enabled, while it would take less than 5 seconds with diffusers.