
🎨 Art Creation With Multi-Conditional StyleGANs 🎨

This repository contains the code for the paper "Art Creation With Multi-Conditional StyleGANs" accepted at IJCAI 2022. The source code is based on StyleGAN2-ADA by Karras et al. from NVIDIA.

Setup Instructions

In general, the instructions are the same as for the original StyleGAN2-ADA code. For multi-conditional training, you additionally need to supply a prepared_dataset.json file via the --cond-path flag of the python train.py command. The prepared_dataset.json contains the multi-conditions and has the following format:

{
  "condition_order": ["painter", "keywords", "emotions"], // condition names in the order they appear in the concatenated vector
  "shapes": [352, 768, 9], // the size of the vector representation for each condition part (in the same order)
  "labels": [
    ["/path/to/image/within/data/dir", [<label>, <label>, <label>],
    ... // for all images in dataset
  ]
}

Each <label> is either:

  • an integer, which is converted to a vector representation via one-hot encoding
  • an array of one or more strings, which are converted to a vector representation with a pretrained TinyBERT embedding; if multiple strings are given, one representation is randomly sampled each time the training sample is shown to the model
  • an array of floats, which should form a probability distribution and is used directly as the vector representation
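
The conversion could look roughly like the following sketch (illustrative only, not the repository's exact code; embed_with_tinybert is a placeholder for the pretrained TinyBERT embedding step):

import numpy as np

def label_to_vector(label, shape, rng=np.random.default_rng()):
    # Illustrative sketch of how one <label> entry maps to its vector.
    if isinstance(label, int):
        # integer -> one-hot vector of the condition part's size
        vec = np.zeros(shape, dtype=np.float32)
        vec[label] = 1.0
        return vec
    if all(isinstance(x, str) for x in label):
        # array of strings -> randomly sample one and embed it;
        # embed_with_tinybert is a placeholder, not a function in this repository
        return embed_with_tinybert(rng.choice(label))
    # array of floats -> already a probability distribution, used as-is
    return np.asarray(label, dtype=np.float32)
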
How to get the EnhancedArtEmis dataset?

Unfortunately, we cannot host the dataset or the emotion annotations due to copyright and licensing. We outline the process to reproduce the dataset and prepare it for training with StyleGAN2 below.

  1. Follow the instructions in the ArtEmis repository to download and preprocess the emotion annotations. We do not use their preprocessing for "deep" networks.
  2. Download the actual image files for ArtEmis. You can use our download_artemis_images.py script. This can take a while.
  3. Create a dataset.json file compatible with the dataset_tool.py that maps image names to emotion labels. We convert the multiple emotion annotations per image directly into a probability distribution (a sketch of this conversion follows the example below). The format for dataset.json should look like this:
{
  "labels": [
    ["vincent-van-gogh_the-starry-night-1889.jpg", [0.0, 0.0, 0.2, 0.4, 0.0, 0.0, 0.2, 0.0, 0.2]],
    ["claude-monet_water-lilies-1919.jpg", [0.0, 0.8, 0.2, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0]],
    ... // for every image in the dataset
    ["pablo-picasso_guernica-1937.jpg", [0.0, 0.0, 0.0, 0.0, 0.6, 0.0, 0.4, 0.0, 0.0]]
  ]
}
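
For illustration, converting the raw per-image emotion annotations into such a probability distribution could look like this (a sketch; the emotion ordering below is an assumption, not taken from the repository):

from collections import Counter

# The nine ArtEmis emotion categories (order assumed for this sketch)
EMOTIONS = ["amusement", "awe", "contentment", "excitement",
            "anger", "disgust", "fear", "sadness", "something else"]

def annotations_to_distribution(annotations):
    # e.g. annotations = ["awe", "awe", "contentment", "fear", "awe"]
    counts = Counter(annotations)
    total = sum(counts.values())
    return [counts.get(emotion, 0) / total for emotion in EMOTIONS]
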
  4. Prepare the image files and dataset.json for training with the dataset_tool.py (instructions in the original StyleGAN2-ADA repository). The dataset.json should be placed inside the folder containing the images downloaded in step 2. We used a command like this (we scale non-square images instead of cropping):
python dataset_tool.py --source=./artemis-download/ --dest=./processed-artemis.zip --width=512 --height=512
  5. To get additional annotations scraped from WikiArt, use our scrape_additional_annotations.py script, or use the enhanced_annotations.json we provide (scraped as of July 2021).
  6. Now, we only need to create the prepared_dataset.json to enable our multi-conditional training. We have prepared the create_label_json.py script for this.
  7. 🚀 Wow, you made it! 🚀 Time to create some 🎨 art 🎨.

You can start a multi-conditional training with a command like this:

python train.py --outdir=./results-out --data=</path/to/data.zip> --cond-path ./annotations/painter-style-keywords/prepared_dataset.json --gpus=1 --snap=50 --workers=4 --batch=64 --cond=1 -n my-multiconditional-stylegan --dataset-cache-dir </path/to/cache/if/wanted>

Conditional Truncation Trick

In the paper, we propose the conditional truncation trick for StyleGAN. If you use the truncation trick together with conditional generation or on diverse datasets, give our conditional truncation trick a try (it's a drop-in replacement). The effect is illustrated below (figure taken from the paper):

[Figure: comparison of the conditional truncation trick and the normal truncation trick]

The implementation is quite simple. The relevant lines of code are shown below:

if conditional_truncation:
    """
    For this implementation of conditional truncation, we expect all conditions in a batch to be the same.
    We choose 1000 samples to compute the conditional center of mass as a tradeoff between exactness and performance.
    """
    N = 1000
    fixed_rng = torch.Generator().manual_seed(42)
    cond_center_of_mass_z = torch.randn(
        N, self.z_dim, generator=fixed_rng
    ).type_as(z)
    # repeat the batch's (shared) condition N times
    cond_center_of_mass_c = torch.vstack([c[0]] * N)
    # map the N latents with that condition, without truncation
    cond_center_of_mass_ws = self.forward(
        cond_center_of_mass_z, cond_center_of_mass_c, truncation_psi=1
    )
    # conditional center of mass in W space
    w_center_of_mass = torch.mean(cond_center_of_mass_ws, dim=0)
else:
    assert self.w_avg_beta is not None
    # unconditional center of mass: the moving average of w tracked during training
    w_center_of_mass = self.w_avg
if self.num_ws is None or truncation_cutoff is None:
    # truncate by interpolating towards the (conditional) center of mass
    x = w_center_of_mass.lerp(x, truncation_psi)
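
Assuming the snippet above sits in the mapping network's forward pass behind a conditional_truncation argument (the exact call signature here is an assumption for illustration), usage could look like:

import torch

# Hypothetical usage sketch: a batch that shares a single condition vector.
z = torch.randn(8, G.z_dim)                # 8 latent codes
c = single_condition_vector.repeat(8, 1)   # same condition for the whole batch
ws = G.mapping(z, c, truncation_psi=0.7, conditional_truncation=True)
img = G.synthesis(ws)
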

License

The source code in this repository is distributed under the same NVIDIA Source Code License as the original StyleGAN2-ADA repository.

Citation

@inproceedings{dobler2022multiconditional,
  title     = {Art Creation with Multi-Conditional StyleGANs},
  author    = {Dobler, Konstantin and Hübscher, Florian and Westphal, Jan and Sierra-Múnera, Alejandro and de Melo, Gerard and Krestel, Ralf},
  booktitle = {Proceedings of the Thirty-First International Joint Conference on
               Artificial Intelligence, {IJCAI-22}},
  publisher = {International Joint Conferences on Artificial Intelligence Organization},
  editor    = {Lud De Raedt},
  pages     = {4936--4942},
  year      = {2022},
  month     = {7},
  note      = {AI and Arts},
  doi       = {10.24963/ijcai.2022/684},
  url       = {https://doi.org/10.24963/ijcai.2022/684},
}

If you use the code in this repository, please also cite StyleGAN2-ADA:

@inproceedings{Karras2020ada,
  title     = {Training Generative Adversarial Networks with Limited Data},
  author    = {Tero Karras and Miika Aittala and Janne Hellsten and Samuli Laine and Jaakko Lehtinen and Timo Aila},
  booktitle = {Proc. NeurIPS},
  year      = {2020}
}