A method that provides greater control over generated images by guiding the internal representations of pre-trained Stable Diffusion (SD v1.5). The idea is inspired by the paper Diffusion Self-Guidance for Controllable Image Generation (NeurIPS 2023).
The method allows for various modifications, including changing the position or size of specific objects, combining the appearance of an object from one image with the layout of another image, and merging objects from multiple images into a single image.
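Conceptually, such edits are expressed as differentiable objectives over the model's cross-attention maps. Below is a minimal sketch (not code from this repository) of a "position" objective, assuming a 2D attention map per token: the attention centroid of an object's token is pulled toward a target location, and the gradient of this loss is what steers the denoising steps.

```python
import torch

def attention_centroid(attn_map: torch.Tensor) -> torch.Tensor:
    """Differentiable centroid (x, y) in [0, 1] of a 2D attention map of shape (H, W)."""
    h, w = attn_map.shape
    attn = attn_map / (attn_map.sum() + 1e-8)
    ys = torch.linspace(0, 1, h, device=attn_map.device)
    xs = torch.linspace(0, 1, w, device=attn_map.device)
    cy = (attn.sum(dim=1) * ys).sum()  # weight rows by their total attention
    cx = (attn.sum(dim=0) * xs).sum()  # weight columns by their total attention
    return torch.stack([cx, cy])

def position_loss(attn_map: torch.Tensor, target_xy: torch.Tensor) -> torch.Tensor:
    """Penalize the distance between the token's attention centroid and a target position."""
    return torch.norm(attention_centroid(attn_map) - target_xy)
```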
The core implementation is StableDiffusionFreeGuidancePipeline, written on top of 🧨 diffusers and defined in free_guidance.py. The class inherits from StableDiffusionAttendAndExcitePipeline and can be used directly. The notebook experiments.ipynb provides some visualization attempts as a reference for further improvement. All guidance functions are located in ./utils/guidance_function.py, and all visualization methods are defined in ./utils/vis_utils.py.
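A minimal usage sketch is shown below. The model ID and the keyword argument that accepts the guidance loss are assumptions for illustration, not the pipeline's confirmed signature; see experiments.ipynb for the actual calls.

```python
import torch
from free_guidance import StableDiffusionFreeGuidancePipeline

# Load SD v1.5 weights into the self-guidance pipeline (it inherits from a
# diffusers pipeline, so from_pretrained works as usual).
pipe = StableDiffusionFreeGuidancePipeline.from_pretrained(
    "runwayml/stable-diffusion-v1-5", torch_dtype=torch.float16
).to("cuda")

# Hypothetical keyword argument: a loss over the cross-attention maps whose
# gradient is added to the update at every denoising step.
image = pipe(
    prompt="a photo of a dog sitting on a bench",
    guidance_func=lambda attn_maps: position_loss(  # position_loss from the sketch above
        attn_maps["dog"], torch.tensor([0.7, 0.5], device="cuda")
    ),
).images[0]
image.save("moved_dog.png")
```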
The biggest challenge is that the guidance weights are very sensitive, and the method performs worse as prompts get more complex: when the subjects of the image interact, it becomes harder to isolate the attention of specific tokens.