Animatediff Proposal #5413

DN6 · 2023-10-16T17:23:34Z

What does this PR do?

Adds the AnimateDiff model to diffusers.

This PR:

Introduces a dedicated UNetMotionModel and motion blocks that allow using existing 2D UNet models with the Motion Modules introduced in AnimateDiff.

The approach taken by AnimateDiff differs from a 3D UNet: no temporal convolutions are applied to the inputs, and the motion module weights are saved separately from the 2D UNet weights. This resembles the ControlNet/Adapter approach, so IMO it would be better to create a dedicated model and blocks to support this type of functionality rather than adapt the existing 3D UNets.

Adds a MotionAdapter module that acts as a dedicated container for saving and loading motion module weights from the hub.
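To illustrate what keeping motion module weights separate from the 2D UNet weights can look like, here is a minimal sketch that partitions a fused state dict by parameter name. The `MOTION_KEY` marker, the helper names, and the toy parameter names are assumptions made for illustration, not the PR's actual implementation.

```python
from typing import Dict, Tuple

# Assumption: motion-module parameters can be recognised by this substring in their names.
MOTION_KEY = "motion_modules"


def split_state_dict(state_dict: Dict[str, object]) -> Tuple[Dict[str, object], Dict[str, object]]:
    """Partition parameters into (base 2D UNet weights, motion module weights)."""
    base: Dict[str, object] = {}
    motion: Dict[str, object] = {}
    for name, value in state_dict.items():
        target = motion if MOTION_KEY in name else base
        target[name] = value
    return base, motion


def merge_state_dicts(base: Dict[str, object], motion: Dict[str, object]) -> Dict[str, object]:
    """Recombine both parts into a single fused state dict."""
    overlap = base.keys() & motion.keys()
    if overlap:
        raise ValueError(f"Duplicate parameter names: {sorted(overlap)}")
    return {**base, **motion}


# Toy parameters (plain lists standing in for tensors; names are hypothetical).
fused = {
    "down_blocks.0.resnets.0.conv1.weight": [0.1, 0.2],
    "down_blocks.0.motion_modules.0.proj_in.weight": [0.3],
    "mid_block.attentions.0.proj_out.bias": [0.0],
}
base_weights, motion_weights = split_state_dict(fused)
```

Under these assumptions, only `motion_weights` would need to be stored in an adapter repo, and merging it back with any compatible 2D UNet state dict reproduces the fused model.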

Proposed API

# Loading motion module into Pipeline
from diffusers import AnimateDiffPipeline, MotionAdapter, UNet2DConditionModel, UNetMotionModel

motion_adapter = MotionAdapter.from_pretrained("<path to saved motion modules>")
pipe = AnimateDiffPipeline.from_pretrained("runwayml/stable-diffusion-v1-5", motion_adapter=motion_adapter)

# Calling pipe.unet should return a UNetMotionModel object

# Create brand new UNetMotionModel with all random weights
unet = UNetMotionModel()

# Load from an existing 2D UNet and MotionAdapter
unet2D = UNet2DConditionModel.from_pretrained("...")
motion_adapter = MotionAdapter.from_pretrained("...")

# motion_adapter is optional and defaults to None
unet_motion = UNetMotionModel.from_unet2d(unet2D, motion_adapter=motion_adapter)

# Or load motion module after init
unet_motion.load_motion_modules(motion_adapter)

# Save only motion modules
unet_motion.save_motion_module("<path to save model>", push_to_hub=True)

# Save all weights to a single model repo (including UNet weights)
unet_motion.save_pretrained("<path to save model>")

# Load fused models (Where the motion weights are saved along with the UNet weights in a single repo)
unet_motion = UNetMotionModel.from_pretrained("<path to model>")
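As a rough illustration of how a 2D UNet and motion modules can share a video input, the sketch below folds frames into the batch for per-frame (spatial) layers and then runs a temporal layer along the frame axis at each spatial position. It uses plain Python lists instead of tensors; the helper names, the doubling function, and the frame-averaging function are hypothetical stand-ins, not the layers added in this PR.

```python
from typing import Callable, List

Frame = List[float]        # toy stand-in for one frame's flattened feature map
Video = List[Frame]        # frames of a single video


def apply_spatial(videos: List[Video], spatial_fn: Callable[[Frame], Frame]) -> List[Video]:
    """Fold frames into the batch, run a per-frame 2D layer, then unfold again."""
    num_frames = len(videos[0])
    flat = [frame for video in videos for frame in video]      # (batch * frames)
    flat = [spatial_fn(frame) for frame in flat]
    return [flat[i:i + num_frames] for i in range(0, len(flat), num_frames)]


def apply_temporal(videos: List[Video], temporal_fn: Callable[[List[float]], List[float]]) -> List[Video]:
    """Run a motion layer along the frame axis, independently per spatial position."""
    out = []
    for video in videos:
        # Transpose to one track per spatial position, each track spanning all frames.
        tracks = [temporal_fn(list(track)) for track in zip(*video)]
        # Transpose back to one feature map per frame.
        out.append([list(frame) for frame in zip(*tracks)])
    return out


# Hypothetical layers: a "2D" layer that doubles values, and a "motion" layer
# that replaces each value with the mean over frames.
double = lambda frame: [2.0 * v for v in frame]
frame_mean = lambda track: [sum(track) / len(track)] * len(track)

videos = [[[1.0, 2.0], [3.0, 4.0]]]          # 1 video, 2 frames, 2 positions
result = apply_temporal(apply_spatial(videos, double), frame_mean)
```

The point of the split is that the spatial path never sees the frame axis, so pretrained 2D weights can be reused unchanged while only the temporal path carries new parameters.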

TODO:

Test forward pass of UNetMotionModel to verify outputs match the original model
Test pipeline outputs for the same prompts defined in the original model repo
Support loading Motion LoRAs
Add documentation
Add tests
Add support for training/fine-tuning AnimateDiff models

src/diffusers/models/embeddings.py

src/diffusers/models/unet_motion_blocks.py

src/diffusers/models/unet_motion_model.py

src/diffusers/pipelines/animatediff/pipeline_animatediff.py

src/diffusers/models/unet_motion_model.py

patrickvonplaten

I've added some final comments. It would be great if you could go over them and then I think we can merge tomorrow!

src/diffusers/models/attention.py

src/diffusers/models/embeddings.py

src/diffusers/models/transformer_temporal.py

patrickvonplaten · 2023-11-01T19:39:59Z

src/diffusers/models/transformer_temporal.py


        hidden_states = self.proj_in(hidden_states)
+        encoder_hidden_states = encoder_hidden_states if self.use_cross_attention else None


I still don't understand why we need this. If cross_attention_dim is None, why do we have to manually set encoder_hidden_states to None? This looks more like a hacky bug fix. Why do we pass encoder_hidden_states in the first place if we don't have cross attention?
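For context on the question above, here is a minimal sketch of what the added line does: a block built without cross-attention discards whatever conditioning the caller passes. `TemporalBlockSketch` and its string return values are hypothetical and only mimic the guard in the diff; they are not the diffusers implementation.

```python
from typing import Optional, Sequence


class TemporalBlockSketch:
    """Hypothetical stand-in for a temporal transformer block."""

    def __init__(self, cross_attention_dim: Optional[int] = None):
        # Mirrors the flag used in the diff: cross-attention exists only if a dim is given.
        self.use_cross_attention = cross_attention_dim is not None

    def forward(
        self,
        hidden_states: Sequence[float],
        encoder_hidden_states: Optional[Sequence[float]] = None,
    ) -> str:
        # The guard under review: silently drop conditioning when cross-attention is disabled.
        encoder_hidden_states = encoder_hidden_states if self.use_cross_attention else None
        if encoder_hidden_states is None:
            return "self-attention"
        return "cross-attention"


frames = [0.1, 0.2]
text = [0.5]

# Conditioning is passed in but ignored because the block has no cross-attention.
without_cross = TemporalBlockSketch().forward(frames, text)
# With a cross-attention dim, the same call uses the conditioning.
with_cross = TemporalBlockSketch(cross_attention_dim=768).forward(frames, text)
```

The alternative the reviewer hints at would be for the caller not to pass `encoder_hidden_states` to such a block at all, which would make the guard unnecessary.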

src/diffusers/models/unet_3d_condition.py

src/diffusers/models/unet_motion_model.py

src/diffusers/pipelines/animatediff/pipeline_animatediff.py

src/diffusers/models/transformer_temporal.py

src/diffusers/pipelines/animatediff/pipeline_animatediff.py

docs/source/en/api/pipelines/animatediff.md

patrickvonplaten · 2023-11-02T14:04:08Z

Great job!

* draft design * clean up * clean up * clean up * clean up * clean up * clean up * clean up * clean up * clean up * update pipeline * clean up * clean up * clean up * add tests * change motion block * clean up * clean up * clean up * update * update * update * update * update * update * update * update * clean up * update * update * update model test * update * update * update * update * make style * update * fix embeddings * update * merge upstream * max fix copies * fix bug * fix mistake * add docs * update * clean up * update * clean up * clean up * fix docstrings * fix docstrings * update * update * clean up * update

DN6 added 4 commits October 15, 2023 21:13

draft design

d8ced0f

clean up

9e4c700

clean up

a026ea5

clean up

bbb2b6c

DN6 mentioned this pull request Oct 16, 2023

AnimateDiff #5296

Closed

8 tasks

DN6 added 2 commits October 18, 2023 18:52

clean up

36b3a44

clean up

2db7bd3

DN6 added 6 commits October 20, 2023 17:12

clean up

72e0fa6

clean up

d8d3515

clean up

7a5fbf8

clean up

9eeee36

update pipeline

86a4d31

clean up

c7ba4b8

patrickvonplaten reviewed Oct 23, 2023

src/diffusers/models/embeddings.py (outdated)

patrickvonplaten reviewed Oct 23, 2023

src/diffusers/models/unet_motion_blocks.py (outdated)

patrickvonplaten reviewed Oct 23, 2023

src/diffusers/models/unet_motion_blocks.py (outdated)

patrickvonplaten reviewed Oct 23, 2023

src/diffusers/models/unet_motion_blocks.py (outdated)

patrickvonplaten reviewed Oct 23, 2023

src/diffusers/models/unet_motion_model.py

patrickvonplaten reviewed Oct 23, 2023

src/diffusers/pipelines/animatediff/pipeline_animatediff.py

patrickvonplaten reviewed Oct 23, 2023

src/diffusers/models/unet_motion_model.py (outdated)

DN6 added 9 commits October 24, 2023 06:40

clean up

6ec184a

clean up

79f402f

add tests

b24f58a

change motion block

2688d07

clean up

0deab59

clean up

9c66c21

clean up

1bd65de

update

22c9f7b

update

0e1f7a8

DN6 added 2 commits November 1, 2023 16:09

fix mistake

ec8bb6e

add docs

d41f717

patrickvonplaten reviewed Nov 1, 2023

DN6 added 7 commits November 2, 2023 09:04

update

6d81f2a

clean up

840f576

update

a6d025b

clean up

ee51b90

clean up

dfa52fb

fix docstrings

c24c97b

fix docstrings

ef893c4