Gradient checkpointing not applied to UNet mid_block #4377

Closed
laksjdjf opened this issue Jul 31, 2023 · 2 comments

@laksjdjf (Contributor)

The mid block of the SDXL UNet is huge, so applying gradient checkpointing to it significantly reduces VRAM usage.

I have tested the following changes and have seen great results.
main...laksjdjf:diffusers:mid_block_gradient_checkpointing

However, there seem to be several variants of the mid block, including UNetMidBlock2DSimpleCrossAttn and UNetMidBlock3DCrossAttn, and I am not sure how to handle them.

By the way, Kohya's trainer applies gradient checkpointing to all blocks.
https://github.com/kohya-ss/sd-scripts/blob/4072f723c12822e2fa1b2e076cc1f90b8f4e30c9/library/sdxl_original_unet.py#L1035-L1041
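For illustration, here is a minimal sketch of the pattern being discussed: routing a UNet's mid block through `torch.utils.checkpoint` so its activations are recomputed during backward instead of being stored. The `ToyMidBlock`/`ToyUNet` classes are hypothetical stand-ins, not the actual diffusers implementation.

```python
import torch
import torch.nn as nn
from torch.utils.checkpoint import checkpoint


class ToyMidBlock(nn.Module):
    """Hypothetical stand-in for a UNet mid block (e.g. UNetMidBlock2DCrossAttn)."""

    def __init__(self, channels: int) -> None:
        super().__init__()
        self.conv1 = nn.Conv2d(channels, channels, 3, padding=1)
        self.conv2 = nn.Conv2d(channels, channels, 3, padding=1)

    def forward(self, hidden_states: torch.Tensor) -> torch.Tensor:
        return self.conv2(torch.relu(self.conv1(hidden_states)))


class ToyUNet(nn.Module):
    def __init__(self, channels: int = 8) -> None:
        super().__init__()
        self.mid_block = ToyMidBlock(channels)
        self.gradient_checkpointing = False

    def enable_gradient_checkpointing(self) -> None:
        self.gradient_checkpointing = True

    def forward(self, sample: torch.Tensor) -> torch.Tensor:
        if self.gradient_checkpointing and self.training:
            # Recompute the mid block's activations in the backward pass
            # instead of keeping them in memory, trading compute for VRAM.
            sample = checkpoint(self.mid_block, sample, use_reentrant=False)
        else:
            sample = self.mid_block(sample)
        return sample


unet = ToyUNet()
unet.enable_gradient_checkpointing()
unet.train()
x = torch.randn(1, 8, 4, 4, requires_grad=True)
unet(x).sum().backward()  # gradients flow through the checkpointed mid block
```

The gradients are identical to the non-checkpointed path; only the peak activation memory changes, which is why the savings grow with the size of the block being checkpointed.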

@sayakpaul (Member)

Cc: @patrickvonplaten

@patrickvonplaten (Contributor)

Great catch - yes for SDXL we should indeed apply gradient checkpointing to the midblock as well :-)

Would you like to open a PR for it? This would be a great addition for the community I believe :-)
