This is the official PyTorch implementation of our paper: "Artistic Style Transfer with Internal-external Learning and Contrastive Learning" (NeurIPS 2021)
Although existing artistic style transfer methods have achieved significant improvement with deep neural networks, they still suffer from artifacts such as disharmonious colors and repetitive patterns. Motivated by this, we propose an internal-external style transfer method with two contrastive losses. Specifically, we utilize the internal statistics of a single style image to determine the colors and texture patterns of the stylized image, and meanwhile leverage the external information of a large-scale style dataset (WikiArt) to learn human-aware style information, which makes the color distributions and texture patterns in the stylized image more reasonable and harmonious. In addition, we argue that existing style transfer methods only consider the content-to-stylization and style-to-stylization relations, neglecting the stylization-to-stylization relations. To address this issue, we introduce two contrastive losses, which pull multiple stylization embeddings closer to each other when they share the same content or style, and push them away from each other otherwise. Extensive experiments show that our proposed method can not only produce visually more harmonious and satisfying artistic images, but also promote the stability and consistency of rendered video clips.
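For reference, the stylization-to-stylization relation described above can be expressed as an InfoNCE-style contrastive loss over embeddings of stylized images. The sketch below is a minimal illustration rather than the exact loss used in the paper; the function name, embedding shapes, and temperature value are assumptions made for the example.

```python
import torch
import torch.nn.functional as F

def stylization_contrastive_loss(anchor, positive, negatives, tau=0.2):
    """Minimal InfoNCE-style sketch (illustrative, not the paper's exact loss).

    anchor:    (D,)   embedding of one stylized image
    positive:  (D,)   embedding of a stylization sharing the same content (or style)
    negatives: (N, D) embeddings of stylizations sharing neither
    """
    anchor = F.normalize(anchor, dim=0)
    positive = F.normalize(positive, dim=0)
    negatives = F.normalize(negatives, dim=1)

    pos = torch.exp(torch.dot(anchor, positive) / tau)  # similarity to the positive
    neg = torch.exp(negatives @ anchor / tau).sum()      # similarities to all negatives
    return -torch.log(pos / (pos + neg))                 # pull the positive closer, push negatives away
```

In the paper, one such loss is computed over stylizations that share the same content image (content contrastive loss) and another over stylizations that share the same style image (style contrastive loss).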
We recommend the following configurations:
- python 3.8
- PyTorch 1.8.0
- CUDA 11.1
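For example, matching pip wheels for this combination are available from the official PyTorch wheel index (one possible setup; adjust to your driver and environment):

pip install torch==1.8.0+cu111 torchvision==0.9.0+cu111 -f https://download.pytorch.org/whl/torch_stable.html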
- Download the content dataset: MS-COCO.
- Download the style dataset: WikiArt.
- Download the pre-trained VGG-19 model.
- Set your available GPU ID on Line 94 of the file "train.py" (a minimal sketch follows below).
- Run the following command:
python train.py --content_dir /data/train2014 --style_dir /data/WikiArt/train
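The GPU setting mentioned above is a single line; a minimal sketch of what it typically looks like (the exact code in train.py may differ) is:

```python
import os

# Make only the chosen GPU visible to PyTorch; set this before any CUDA call.
os.environ['CUDA_VISIBLE_DEVICES'] = '0'  # replace '0' with your available GPU ID
```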
- Put your trained model into the ./model/ folder.
- Put some sample photographs into the ./input/content/ folder.
- Put some artistic style images into the ./input/style/ folder.
- Run the following command:
python Eval.py --content input/content/1.jpg --style input/style/1.jpg
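For batch testing or integrating the model into other code, the testing pipeline roughly follows the sketch below. The module name `net` and the helper `stylize` are hypothetical placeholders for this illustration; the actual entry point and checkpoint loading live in Eval.py.

```python
# Minimal sketch of the testing pipeline. The module `net` and the helper
# `stylize` are hypothetical placeholders; see Eval.py for the actual API.
import os
import torch
from PIL import Image
from torchvision import transforms
from torchvision.utils import save_image

import net  # hypothetical module exposing the VGG-19 encoder, transform module, and decoder

device = torch.device('cuda' if torch.cuda.is_available() else 'cpu')
to_tensor = transforms.Compose([transforms.Resize(512), transforms.ToTensor()])

content = to_tensor(Image.open('input/content/1.jpg').convert('RGB')).unsqueeze(0).to(device)
style = to_tensor(Image.open('input/style/1.jpg').convert('RGB')).unsqueeze(0).to(device)

with torch.no_grad():
    # Hypothetical wrapper: encode both images with VGG-19, fuse the features,
    # and decode the fused features back to an image.
    stylized = net.stylize(content, style)

os.makedirs('output', exist_ok=True)
save_image(stylized.cpu(), 'output/stylized.jpg')
```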
We provide the pre-trained model in link.
We compare our model with some existing artistic style transfer methods, including Gatys et al., AdaIN, WCT, Avatar-Net, LST, and SANet.
The code in this repository is built on SANet. We thank the authors for both their paper and code.