The Code Repository for "CCDM: Continuous Conditional Diffusion Models for Image Generation"
This repository provides the source code for the experiments in our paper on CCDMs.
If you use this code, please cite:
@misc{ding2024ccdm,
title={{CCDM}: Continuous Conditional Diffusion Models for Image Generation},
author={Xin Ding and Yongwei Wang and Kao Zhang and Z. Jane Wang},
year={2024},
eprint={2405.03546},
archivePrefix={arXiv},
primaryClass={cs.CV}
}
Illustration of the CCGM task with sample images from the UTKFace and Steering Angle datasets.
The overall workflow of CCDMs. The training process entails adapting a modified U-Net using the proposed hard vicinal loss and integrating CcGANs' label embedding network, referred to as ILI. For the sampling process, a classifier-free guidance-based mechanism is employed to produce a linear combination of a conditional output and an unconditional output from the trained U-Net. A DDIM-based sampler is then used to generate new images based on this linear combination.
The network architecture of the denoising U-Net.
Below, we list the key software environments and Python packages used in our experiments. Please note that different computational platforms were used for different experiments.
For computing NIQE scores and implementing NIQE filtering (supports both Windows and Linux):
Item | Version | Item | Version |
---|---|---|---|
OS | Win11 | OS | Linux |
Python | 3.11.3 | Python | 3.10.12 |
Matlab | 2023a | Matlab | 2021b |
For implementing CCDM (supports both Windows and Linux):
Item | Version | Item | Version |
---|---|---|---|
OS | Win11 | OS | Linux |
Python | 3.11.3 | Python | 3.10.12 |
CUDA | 11.8 | CUDA | 12.1 |
numpy | 1.23.5 | numpy | 1.26.4 |
torch | 2.0.1 | torch | 2.2.1 |
torchvision | 0.15.2 | torchvision | 0.17.1 |
Pillow | 9.5.0 | Pillow | 9.0.1 |
accelerate | 0.20.3 | accelerate | 0.27.2 |
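As a hedged reference, the Linux column of the CCDM environment above can be reproduced roughly as follows; the PyTorch wheel index is an assumption, so adjust it to match your CUDA installation.

```bash
# Rough sketch of the Linux CCDM environment (Python 3.10.12, CUDA 12.1).
# The wheel index below is an assumption; pick the one matching your CUDA version.
pip install numpy==1.26.4 Pillow==9.0.1 accelerate==0.27.2
pip install torch==2.2.1 torchvision==0.17.1 --index-url https://download.pytorch.org/whl/cu121
```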
For implementing ReACGAN, ADCGAN, ADM-G, and CFG (supports Windows):
Item | Version |
---|---|
OS | Win11 |
Python | 3.11.3 |
CUDA | 11.8 |
numpy | 1.23.5 |
torch | 2.0.1 |
torchvision | 0.15.2 |
Pillow | 9.5.0 |
accelerate | 0.20.3 |
wandb | 0.15.7 |
For implementing CcGAN and Dual-NDA (supports Linux):
Item | Version |
---|---|
OS | Linux |
Python | 3.9 |
CUDA | 11.4 |
numpy | 1.23.0 |
torch | 1.12.1 |
torchvision | 0.13.1 |
Pillow | 8.4.0 |
accelerate | 0.18.0 |
We use the preprocessed datasets provided by Ding et al. (2023).
Download the following h5 file and put it in `./datasets/RC-49`.
RC-49_64x64_OneDrive_link
RC-49_64x64_BaiduYun_link
Download the following h5 files and put them in `./datasets/UTKFace`.
UTKFace_64x64_OneDrive_link
UTKFace_64x64_BaiduYun_link
UTKFace_128x128_OneDrive_link
UTKFace_128x128_BaiduYun_link
UTKFace_192x192_OneDrive_link
UTKFace_192x192_BaiduYun_link
Download the following h5 files and put them in `./datasets/SteeringAngle`.
SteeringAngle_64x64_OneDrive_link
SteeringAngle_64x64_BaiduYun_link
SteeringAngle_128x128_OneDrive_link
SteeringAngle_128x128_BaiduYun_link
Download the following h5 file and put it in `./datasets/Cell-200`.
Cell-200_64x64_OneDrive_link
Cell-200_64x64_BaiduYun_link
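For reference, the expected layout after downloading is sketched below; the .h5 file names shown are assumptions, so keep whatever names the downloaded files carry.

```bash
# Illustrative layout only; the .h5 file names are assumptions.
# ./datasets/RC-49/RC-49_64x64.h5
# ./datasets/UTKFace/UTKFace_64x64.h5             (plus the 128x128 and 192x192 files)
# ./datasets/SteeringAngle/SteeringAngle_64x64.h5 (plus the 128x128 file)
# ./datasets/Cell-200/Cell-200_64x64.h5
mkdir -p ./datasets/RC-49 ./datasets/UTKFace ./datasets/SteeringAngle ./datasets/Cell-200
```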
Please download the zip file from either OneDrive or BaiduYun and extract its contents to the `./CCDM` directory. This zip archive includes the necessary checkpoints for the ILI's embedding networks, as well as the corresponding checkpoints for the evaluation models related to each individual experiment.
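On Linux, the extraction step can be done as in the sketch below; the archive name is a placeholder for the file you actually download.

```bash
# The archive name is a placeholder; use the file downloaded from OneDrive/BaiduYun.
unzip checkpoints.zip -d ./CCDM
```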
Following Ding et al. (2023) and Ding et al. (2024), separate training code is provided for each dataset.
For simplicity, we only show how to run the proposed CCDM in each experiment.
Go to the directory `./RC-49/RC-49_64x64/CCGM/CCDM`. Run the training script `./scripts/run_train_ccdm.bat` for Windows or `./scripts/run_train_ccdm.sh` for Linux. Please correctly set `ROOT_PATH`, `DATA_PATH`, `EVAL_PATH`, and `NIQE_PATH`. Please note that we also provide training scripts named `run_train_ccdpm.bat` and `run_train_ccdpm.sh` for implementing CcDPM on RC-49.
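A minimal Linux sketch of this step is shown below; all paths are placeholders, and the four variables are the ones defined at the top of the provided training scripts.

```bash
# All paths below are placeholders; adjust them to your local setup.
cd ./RC-49/RC-49_64x64/CCGM/CCDM
# Edit ./scripts/run_train_ccdm.sh so that, for example:
#   ROOT_PATH="/path/to/CCDM/RC-49/RC-49_64x64/CCGM/CCDM"
#   DATA_PATH="/path/to/datasets/RC-49"
#   EVAL_PATH="/path/to/extracted/evaluation/checkpoints"
#   NIQE_PATH="/path/to/NIQE/fake_data"
bash ./scripts/run_train_ccdm.sh
```

The Windows workflow is analogous: edit the same variables in `./scripts/run_train_ccdm.bat` and run it from a command prompt.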
Go to the directory `./UTKFace/UK64/CCGM/CCDM`. Run the training script `./scripts/run_train.bat` for Windows or `./scripts/run_train.sh` for Linux. Please correctly set `ROOT_PATH`, `DATA_PATH`, `EVAL_PATH`, and `NIQE_PATH`.
Go to the directory `./UTKFace/UK128/CCGM/CCDM`. Run the training script `./scripts/run_train.bat` for Windows or `./scripts/run_train.sh` for Linux. Please correctly set `ROOT_PATH`, `DATA_PATH`, `EVAL_PATH`, and `NIQE_PATH`.
Go to the directory `./UTKFace/UK192/CCGM/CCDM`. Run the training script `./scripts/run_train.bat` for Windows or `./scripts/run_train.sh` for Linux. Please correctly set `ROOT_PATH`, `DATA_PATH`, `EVAL_PATH`, and `NIQE_PATH`.
Go to the directory `./SteeringAngle/SA64/CCGM/CCDM`. Run the training script `./scripts/run_train.bat` for Windows or `./scripts/run_train.sh` for Linux. Please correctly set `ROOT_PATH`, `DATA_PATH`, `EVAL_PATH`, and `NIQE_PATH`.
Go to the directory `./SteeringAngle/SA128/CCGM/CCDM`. Run the training script `./scripts/run_train.bat` for Windows or `./scripts/run_train.sh` for Linux. Please correctly set `ROOT_PATH`, `DATA_PATH`, `EVAL_PATH`, and `NIQE_PATH`.
Go to the directory `./Cell-200/Cell-200_64x64/CCGM/CCDM`. Run the training script `./scripts/run_train.bat` for Windows or `./scripts/run_train.sh` for Linux. Please correctly set `ROOT_PATH`, `DATA_PATH`, `EVAL_PATH`, and `NIQE_PATH`.
For all methods except ReACGAN and ADCGAN, evaluation usually starts automatically after training. The evaluation setups are consistent with Ding et al. (2023).
In the bash scripts for training each method, enable `--dump_fake_for_NIQE --niqe_dump_path <YOUR_NIQE_PATH>` to dump fake images for computing NIQE scores. Please note that you need to set `<YOUR_NIQE_PATH>` correctly. Fake images for computing NIQE are usually stored in `./CcGAN_TPAMI_NIQE/fake_data`. Run the batch script `run_test.bat` to compute the average NIQE scores.
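A hedged sketch of how these flags are typically appended to the training command inside a script is shown below; the Python entry point and the remaining training arguments are placeholders that vary per experiment.

```bash
# Placeholder sketch: the entry-point name and the (elided) training arguments
# vary per experiment; only the two NIQE flags are taken from this README.
NIQE_PATH="/path/to/NIQE/fake_data"   # i.e., <YOUR_NIQE_PATH>
python main.py \
    --dump_fake_for_NIQE \
    --niqe_dump_path "$NIQE_PATH"
```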