A Style-Based GAN Encoder for High Fidelity Reconstruction of Images and Videos

Official implementation of the paper A Style-Based GAN Encoder for High Fidelity Reconstruction of Images and Videos.

Abstract: We propose a novel architecture for GAN inversion, which we call Feature-Style encoder. The style encoder is key for the manipulation of the obtained latent codes, while the feature encoder is crucial for optimal image reconstruction. Our model achieves accurate inversion of real images from the latent space of a pre-trained style-based GAN model, obtaining better perceptual quality and lower reconstruction error than existing methods. Thanks to its encoder structure, the model allows fast and accurate image editing. Additionally, we demonstrate that the proposed encoder is especially well-suited for inversion and editing on videos. We conduct extensive experiments for several style-based generators pre-trained on different data domains. Our proposed method yields state-of-the-art results for style-based GAN inversion, significantly outperforming competing approaches.

Requirements

Dependencies

Python 3.6
PyTorch 1.8
OpenCV

You can install a new environment for this repo by running

conda env create -f environment.yml
conda activate feature_style
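
After activating the environment, a quick check can confirm that the listed dependencies are importable. The sketch below is not part of the repository: the helper name `missing_modules` is hypothetical, and it assumes PyTorch and OpenCV are exposed under their usual import names `torch` and `cv2`.

```python
import importlib.util
import sys


def missing_modules(names):
    """Return the module names from `names` that cannot be found for import."""
    return [name for name in names if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    # The dependency list above names Python 3.6, PyTorch 1.8 and OpenCV.
    print("Python version:", ".".join(map(str, sys.version_info[:3])))
    missing = missing_modules(["torch", "cv2"])  # assumed import names
    print("Missing modules:", missing if missing else "none")
```

An empty list means both packages were found; it does not verify their versions.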

Prepare StyleGAN2 model and other necessary models

We adapt the StyleGAN2 model from the implementation accompanying the paper Encoding in Style: a StyleGAN Encoder for Image-to-Image Translation (pixel2style2pixel). Here is their official implementation.
Download and save the pretrained models by running
```
sh download_models.sh
```

Training

Prepare the training data

To train the encoder for StyleGAN, we use both synthetic images generated by StyleGAN and real images from the FFHQ dataset. You can generate the synthetic images by running
```
python generate_imgs.py
```
Then download the FFHQ dataset (aligned faces) to data/ffhq-dataset/images/.
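
Before training, it can help to confirm that the real images actually landed in the expected folder. The following is a minimal sketch, not repository code: the function name `list_images` and the set of accepted file extensions are assumptions, and only the path data/ffhq-dataset/images/ comes from the instructions above.

```python
from pathlib import Path

IMAGE_EXTENSIONS = {".png", ".jpg", ".jpeg"}  # assumed set of image suffixes


def list_images(root):
    """Return the sorted image files found directly under `root`."""
    root = Path(root)
    if not root.is_dir():
        return []
    return sorted(
        p for p in root.iterdir()
        if p.is_file() and p.suffix.lower() in IMAGE_EXTENSIONS
    )


if __name__ == "__main__":
    images = list_images("data/ffhq-dataset/images")
    print(f"Found {len(images)} image(s) in data/ffhq-dataset/images/")
```

A count of zero usually means the dataset was extracted to a different directory or into a nested subfolder.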

Training

You can modify the training options in the config files in the directory configs/, then start training with the chosen config:
```
python train.py --config 001
```
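
To check which file a config name such as `001` refers to before launching a long run, a lookup along these lines can be used. It is a sketch under assumptions: the README does not state the config file extension, so the hypothetical helper `find_config` matches any extension, and how train.py itself resolves the name is not shown here.

```python
from pathlib import Path


def find_config(name, config_dir="configs"):
    """Return files in `config_dir` named `name` followed by any extension."""
    return sorted(p for p in Path(config_dir).glob(f"{name}.*") if p.is_file())


if __name__ == "__main__":
    matches = find_config("001")
    if matches:
        for path in matches:
            print("Config candidate:", path)
    else:
        print("No config named 001 found under configs/")
```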

Testing

Inversion

You can test the encoder on the images in test/. The output images are saved in output/image/.
```
python test.py --pretrained_model_path './pretrained_models/143_enc.pth' --input_path './test/'
```
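
If you want to run the same command over several input folders, a small wrapper can assemble the call for you. This sketch uses only the two flags shown in the command above; the function names and the `dry_run` switch are hypothetical and not part of the repository.

```python
import shlex
import subprocess
import sys


def build_test_command(model_path, input_path):
    """Build the argument list for test.py using the two flags shown above."""
    return [
        sys.executable, "test.py",
        "--pretrained_model_path", model_path,
        "--input_path", input_path,
    ]


def run_inversion(model_path, input_dirs, dry_run=True):
    """Print (and, if dry_run is False, execute) one test.py call per folder."""
    for input_dir in input_dirs:
        cmd = build_test_command(model_path, input_dir)
        print(" ".join(shlex.quote(part) for part in cmd))
        if not dry_run:
            subprocess.run(cmd, check=True)  # raises if test.py exits non-zero


if __name__ == "__main__":
    run_inversion("./pretrained_models/143_enc.pth", ["./test/"])
```

With `dry_run=True` the wrapper only prints the commands. Because the output directory is fixed, results from successive runs may end up in the same folder.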

Inversion and editing in notebook

You can explore the encoder and the attribute editing code in the notebook inference.ipynb. You can also open it in Google Colab.

Video Manipulation

We provide a script that performs inversion and attribute manipulation on the videos in the test directory data/video/. You can upload your own video and modify the options in run_video_inversion_editing.sh.

sh run_video_inversion_editing.sh
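
To inspect individual frames of your own video before running the script, a sketch like the following can dump them to disk with OpenCV, which is already a listed dependency. It is not the repository's video_processing.py: the function names, the PNG output layout, and the `max_frames` parameter are all hypothetical.

```python
from pathlib import Path


def frame_path(out_dir, index):
    """Return the path for frame `index`, zero-padded so files sort in order."""
    return Path(out_dir) / f"frame_{index:05d}.png"


def extract_frames(video_path, out_dir, max_frames=None):
    """Write frames of `video_path` to `out_dir` as PNG files; return the count."""
    import cv2  # imported lazily so frame_path works without OpenCV installed

    Path(out_dir).mkdir(parents=True, exist_ok=True)
    capture = cv2.VideoCapture(str(video_path))
    count = 0
    try:
        while max_frames is None or count < max_frames:
            ok, frame = capture.read()
            if not ok:  # end of stream or unreadable file
                break
            cv2.imwrite(str(frame_path(out_dir, count)), frame)
            count += 1
    finally:
        capture.release()
    return count
```

For example, `extract_frames("data/video/my_clip.mp4", "frames/", max_frames=10)` would write up to ten frames, where `my_clip.mp4` is a placeholder file name.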

Citation

@article{xuyao2022,
  title={A Style-Based GAN Encoder for High Fidelity Reconstruction of Images and Videos},
  author={Yao, Xu and Newson, Alasdair and Gousseau, Yann and Hellier, Pierre},
  journal={European conference on computer vision},
  year={2022}
}

License

This source code is made available under the license found in the LICENSE.md file in the root directory of this source tree.

