Noël Vouitsis, Rasa Hosseinzadeh, Brendan Leigh Ross, Valentin Villecroze, Satya Krishna Gorti, Jesse C. Cresswell, Gabriel Loaiza-Ganem
This repository contains the official implementation of our NeurIPS 2024 workshop paper Inconsistencies In Consistency Models: Better ODE Solving Does Not Imply Better Samples. We release code for both training and evaluation of consistency models (CMs) and direct consistency models (Direct CMs). This code supports distillation of SDXL using LoRA and is based on the consistency distillation implementation from the diffusers library.
As a first step, create the following conda environment:
conda env create --file cm.yml
conda activate cm
And initialize an 🤗 Accelerate environment with:
accelerate config
Or for a default accelerate configuration without answering questions about your environment:
accelerate config default
To train our models, we use the 11k subset of LAION, similar to BK-SDM. You must first download the dataset:
bash scripts/get_laion_data.sh preprocessed_11k
Some of our evaluation metrics require downloading a few additional things. First, to measure the aesthetic score of generated samples, you must download the aesthetic predictor model's weights from here and copy them to data/aesthetics_mlp_weights.
For FID and FD-DINO metric evaluation, we use the deep generative models evaluation library. To install this library, run the following:
git clone git@github.com:layer6ai-labs/dgm-eval
cd dgm-eval
pip install -e .
cd ..
For the FID, FD-DINO, and ODE solving error metrics, you must first run the teacher model to generate reference samples used in the metric computations. Note that the choice of ODE solver, the number of discretization intervals, and the guidance scale for the teacher should match the corresponding choices used for training the model you wish to evaluate. For example, if you want to evaluate a student model distilled from a teacher model using the DDIM ODE solver with 100 discretization intervals and a classifier-free guidance scale of 9.0, you must first run the teacher model to generate reference samples with this same hyperparameter selection. To do this, specify --scheduler, --num_original_inference_steps, and --guidance_scale respectively in scripts/test_teacher.sh and then run the following:
bash scripts/test_teacher.sh
To ensure that the teacher and student are being evaluated with the same initial noise, you should additionally pre-generate a set of random noise tensors offline (using diffusers.utils.randn_tensor) and save them to data/init_noise.pt; you will see this referenced in both test_teacher.py and test.py.
We note that, following the diffusers library implementation of the SDXL pipeline, we use the Imagen formulation of classifier-free guidance. The corresponding guidance scale from the original classifier-free guidance formulation is simply 1 less than that in the Imagen formulation (e.g., a guidance scale of 9.0 in the Imagen formulation corresponds to a guidance scale of 8.0 in the original formulation).
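To make this concrete, here is a minimal sketch of the two formulations (the function names are illustrative, not from this repository):

```python
import torch

def cfg_imagen(noise_pred_uncond: torch.Tensor, noise_pred_text: torch.Tensor, w: float) -> torch.Tensor:
    """Imagen-style classifier-free guidance, as used by the diffusers SDXL pipeline."""
    return noise_pred_uncond + w * (noise_pred_text - noise_pred_uncond)

def cfg_original(noise_pred_uncond: torch.Tensor, noise_pred_text: torch.Tensor, w: float) -> torch.Tensor:
    """Original classifier-free guidance formulation: conditional prediction plus w times the difference."""
    return noise_pred_text + w * (noise_pred_text - noise_pred_uncond)

# The two coincide when the Imagen-style scale is 1 larger than the original one:
# cfg_imagen(u, c, w) == cfg_original(u, c, w - 1), e.g. 9.0 (Imagen) <-> 8.0 (original).
```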
We provide default training scripts for both consistency models (CMs) and direct consistency models (Direct CMs).
bash scripts/train_cm.sh
bash scripts/train_direct_cm.sh
The following shows how to evaluate a trained CM or Direct CM model. You must specify the --output_dir where your LoRA (CM or Direct CM) checkpoints are stored. The --train_data_dir should point to the directory where the corresponding reference samples generated from the teacher (see the Evaluation setup section above) are saved. Also make sure to set --num_original_inference_steps to the number of discretization intervals used to train the model you are evaluating. Evaluation is performed for the number of student sampling steps specified with --num_inference_steps. All evaluations are saved to TensorBoard. Run the following:
bash scripts/test.sh
If you find this work useful in your research, please cite the following paper:
@inproceedings{vouitsis2024inconsistencies,
title={Inconsistencies In Consistency Models: Better ODE Solving Does Not Imply Better Samples},
author={Vouitsis, No{\"e}l and Hosseinzadeh, Rasa and Ross, Brendan Leigh and Villecroze, Valentin and Gorti, Satya Krishna and Cresswell, Jesse C and Loaiza-Ganem, Gabriel},
booktitle={NeurIPS 2024 Workshop on Attributing Model Behavior at Scale},
year={2024}
}