Skip to content

Latest commit

 

History

History
40 lines (23 loc) · 2.02 KB

README.md

File metadata and controls

40 lines (23 loc) · 2.02 KB

⚡ComfyUI-Lightning

Introduction

This repository integrates all the tricks I know to speed up Flux inference:

  1. Use TeaCache or FBCache or MBCache;
  2. Skip some unnessasery blocks;
  3. Compile and quantize model;
  4. Use fast CuDNN attention kernels;
  5. Use SageAttention;
  6. Fix AttributeError: 'SymInt' object has no attribute 'size' to speed up recompilation after resolution changing.

MBCache extends FBCache and is applied to cache multiple blocks. The codes are modified from SageAttention, ComfyUI-TeaCache, comfyui-flux-accelerator and Comfy-WaveSpeed. More details see above given repositories.

Updates

  • [2025/1/24] Now support Sana. Get your 1024*1024 images within 2s. All the codes are modified from Sana.

Usage

For Flux

fbcache_and_skip_blocks

teacache_and_skip_blocks

teacache_and_skip_blocks

For Sana

Sana Generation Results

  1. Download Sana diffusion model from Model Zoo and put the .pth file into models/diffusion_models;

  2. Download Gemma text encoder from google/gemma-2-2b-it, unsloth/gemma-2b-it-bnb-4bit or Efficient-Large-Model/gemma-2-2b-it and put the whole folder into models/text_encoders;

  3. Download DCAE image decoder from mit-han-lab/dc-ae-f32c32-sana-1.0 and put the .safetensors file into models/vae;

  4. Run the example workflow.