FineStyler

We provide a demo on colab that can be easily run !

⚠: demo in colab needs to download the model from the internet and may run slowly due to GPU limitations (30s-60s an iter, usually need to train about 200 iters to get better results)

The top left is the original image, and the bottom left is the mask generated using the stylized objects via CRIS. The rest are stylized images generated with different stylized content. Our stylized translation results in various text conditions. The stylized images have the spatial structure of the content images with realistic textures corresponding to the text, while retaining the original style of the non-target regions.

❤ A more detailed description of the source code is in the process of being organized and will be posted in a readme in this repository when the paper is accepted.

Pipeline

The overall architecture of the system.

Run

Environment

Python 3.10.13 & ptyorch 1.12.0+cu116 & ubuntu 20.04.1

git clone https://github.com/yisuanwang/Finestyler.git
cd Finestyler

conda create -n finestyler python=3.10
conda activate finestyler
pip install -r requirements.txt
pip install git+https://github.com/openai/CLIP.git

If you clone this repository locally, you will need to download this weight file to the root directory before running it. Running colab directly will automatically download the weights.

https://drive.google.com/file/d/10wo4R7sGWw5ITHpjtv3dIbIbkGpvkMiJ/view?usp=sharing

1. Single Image Style Transfer

Use this colab->

2. Multiple Image Style Transfer

If you think it's long for him to generate a graph (due to the presence of random cropping, this takes a long time to reduce the loss, we'll follow up with optimizations 😳), you can run demo.py, which is a multi-threaded batch-run script that allows for multiple Stylized Content trainings to be performed simultaneously on a single GPU.

Some commands for testing can be found in democases.md (this is just a temporary draft file of commands, a more detailed description of the training, inference detail steps will follow in this GitHub repository)

CUDA_VISIBLE_DEVICES=0 python demo.py --case=0 --style=0,7

To Do List

✅1. Colab online running demo

🔘2. CUDA version FineStyler

Citation

@article{chen2023soulstyler,
  title={Soulstyler: Using Large Language Model to Guide Image Style Transfer for Target Object},
  author={Chen, Junhao and Rong, Peng and Sun, Jingbo and Li, Chao and Li, Xiang and Lv, Hongwu},
  journal={arXiv preprint arXiv:2311.13562},
  year={2023}
}

License

This code and model are available only for non-commercial research purposes as defined in the LICENSE (i.e., MIT LICENSE). Check the LICENSE

Acknowledgment

This implementation is mainly built based on CRIS, CLIPstyler.

Name		Name	Last commit message	Last commit date
Latest commit History 19 Commits
config		config
engine		engine
exp		exp
img		img
mask		mask
model		model
style		style
testimg		testimg
tools		tools
utils		utils
.DS_Store		.DS_Store
LICENSE		LICENSE
README.md		README.md
demo.py		demo.py
democases.md		democases.md
requirements.txt		requirements.txt
yisuanwang_Finestyler.ipynb		yisuanwang_Finestyler.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

FineStyler

Pipeline

Run

Environment

1. Single Image Style Transfer

2. Multiple Image Style Transfer

To Do List

Citation

License

Acknowledgment

⭐️ Star History

About

Releases

Packages

Contributors 2

Languages

License

yisuanwang/Finestyler

Folders and files

Latest commit

History

Repository files navigation

FineStyler

Pipeline

Run

Environment

1. Single Image Style Transfer

2. Multiple Image Style Transfer

To Do List

Citation

License

Acknowledgment

⭐️ Star History

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages