This is the official model implementation and benchmark evaluation repository of AnyEdit: Mastering Unified High-Quality Image Editing for Any Idea.
This is the guide for the AnyBench evaluation tool. The relevant files are located in the `anybench` directory.
We have integrated the evaluations for AnyBench, Emu-Edit, and MagicBrush into the same codebase, which supports the following models: Null-Text, Uni-ControlNet, InstructPix2Pix, MagicBrush, HIVE, and UltraEdit (SD3).
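For orientation, below is a minimal sketch of running one of the supported baselines (InstructPix2Pix) through the `diffusers` library; the checkpoint name and hyperparameters are illustrative defaults, not the exact settings used by our evaluation scripts.
```python
import torch
from diffusers import StableDiffusionInstructPix2PixPipeline
from PIL import Image

# Illustrative checkpoint; our eval scripts may load a different one.
pipe = StableDiffusionInstructPix2PixPipeline.from_pretrained(
    "timbrooks/instruct-pix2pix", torch_dtype=torch.float16
).to("cuda")

image = Image.open("input.png").convert("RGB")
edited = pipe(
    "make the sky look like sunset",  # editing instruction
    image=image,
    num_inference_steps=50,
    image_guidance_scale=1.5,  # how strongly to stay faithful to the input image
).images[0]
edited.save("edited.png")
```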
Evaluation metrics: CLIPim↑, CLIPout↑, L1↓, L2↓, and DINO↑.
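The sketch below shows how metrics of this kind are typically computed with OpenAI CLIP; the exact image/caption pairings and model variants used by our scripts may differ, and `ViT-B/32` plus the helper names are assumptions.
```python
import numpy as np
import torch
import torch.nn.functional as F
import clip  # pip install git+https://github.com/openai/CLIP
from PIL import Image

device = "cuda" if torch.cuda.is_available() else "cpu"
model, preprocess = clip.load("ViT-B/32", device=device)

def clip_image_sim(img_a: Image.Image, img_b: Image.Image) -> float:
    """CLIPim: cosine similarity of CLIP image features of two images."""
    with torch.no_grad():
        fa = model.encode_image(preprocess(img_a).unsqueeze(0).to(device))
        fb = model.encode_image(preprocess(img_b).unsqueeze(0).to(device))
    return F.cosine_similarity(fa.float(), fb.float()).item()

def clip_text_sim(img: Image.Image, caption: str) -> float:
    """CLIPout: cosine similarity of the edited image and the target caption."""
    with torch.no_grad():
        fi = model.encode_image(preprocess(img).unsqueeze(0).to(device))
        ft = model.encode_text(clip.tokenize([caption]).to(device))
    return F.cosine_similarity(fi.float(), ft.float()).item()

def pixel_distance(img_a: Image.Image, img_b: Image.Image, p: int = 1) -> float:
    """L1 (p=1) / L2 (p=2): mean pixel distance in [0, 1] space."""
    a = np.asarray(img_a, dtype=np.float32) / 255.0
    b = np.asarray(img_b, dtype=np.float32) / 255.0
    return float(np.mean(np.abs(a - b) ** p))

# DINO↑ is computed analogously from DINO ViT features, e.g. via
# torch.hub.load("facebookresearch/dino:main", "dino_vits16")
```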
```bash
bash anybench/setup.sh  # open the script first and verify that the correct dependencies are installed
```
Emu-Edit
- Download the dataset via:
```bash
huggingface-cli download facebook/emu_edit_test_set_generations --repo-type dataset
```
- Run:
```bash
# generate images
CUDA_VISIBLE_DEVICES=0 PYTHONPATH='./' python anybench/eval/emu_gen.py
# evaluate
CUDA_VISIBLE_DEVICES=3 PYTHONPATH='./' python anybench/eval/emu_eval.py
```
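Once the download finishes, a convenience snippet (not part of the evaluation pipeline) for inspecting the test set with the `datasets` library:
```python
from datasets import load_dataset

# Loads from the HF cache (respects HF_HOME); printing shows splits and column names.
ds = load_dataset("facebook/emu_edit_test_set_generations")
print(ds)
```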
MagicBrush
- Download the test set from MagicBrush.
- Run:
```bash
CUDA_VISIBLE_DEVICES=0 PYTHONPATH='./' python anybench/eval/magicbrush_gen_eval.py
```
AnyBench
- Run:
```bash
CUDA_VISIBLE_DEVICES=0 PYTHONPATH='./' python anybench/eval/anybench_gen_eval.py
```
⚠ Notice: AnySD may output completely black images for certain sensitive instructions; this is expected behavior.
⚠ Notice: Final evaluation scores may vary slightly across runs depending on inference hyperparameters, random seeds, and batch size.
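If you need run-to-run comparability, the minimal sketch below pins the usual sources of randomness; the evaluation scripts may expose their own seed or batch-size flags, so treat this as a generic recipe.
```python
import random

import numpy as np
import torch

SEED = 42
random.seed(SEED)
np.random.seed(SEED)
torch.manual_seed(SEED)

# diffusers pipelines also accept an explicit generator per call:
generator = torch.Generator(device="cuda").manual_seed(SEED)
# edited = pipe(instruction, image=image, generator=generator).images[0]
```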
- Clone this repo

Set the HuggingFace cache directory first (the path below is only an example; point it at your own storage):
```bash
vim ~/.bashrc
# add the following line (example path; replace with your own):
export HF_HOME=/mnt/bn/magellan-product-audit/weic/data_hf
# press Esc to exit insert mode, then type :wq to save and quit vim
source ~/.bashrc
echo $HF_HOME  # verify the variable is set
```
Then clone the repository:
```bash
git clone https://github.com/weichow23/AnyDM
```
- Environment setup
```bash
conda create -n anyedit python=3.9
conda activate anyedit
pip install -r requirements.txt
pip install --upgrade torch diffusers xformers triton pydantic deepspeed
pip install git+https://github.com/openai/CLIP
```
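After installation, a quick sanity check that the key packages import and see the GPU (a convenience snippet, not part of the repo):
```python
import torch, diffusers, clip

print("torch:", torch.__version__, "| CUDA available:", torch.cuda.is_available())
print("diffusers:", diffusers.__version__)
print("CLIP models:", clip.available_models()[:3])
```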
- Stage I
```bash
bash train_stage1.sh
```
- Stage II
Since AnyEdit contains a wide range of editing instructions across various domains, it holds promising potential for developing a powerful editing model to address high-quality editing tasks. However, training such a model poses three additional challenges: (a) aligning the semantics of various multi-modal inputs; (b) identifying the semantic edits within each domain to control the granularity and scope of the edits; (c) coordinating the complexity of various editing tasks to prevent catastrophic forgetting. To this end, we propose a novel AnyEdit Stable Diffusion approach (🎨AnySD) to cope with various editing tasks in the real world.
Architecture of 🎨AnySD. 🎨AnySD is a novel architecture that supports three conditions (original image, editing instruction, visual prompt) for various editing tasks.
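For intuition only, here is a toy sketch of how a single denoising block could attend to all three conditions; this illustrates the general idea, not the actual 🎨AnySD implementation (module names, dimensions, and the gated fusion below are assumptions).
```python
import torch
import torch.nn as nn

class ThreeConditionBlock(nn.Module):
    """Toy fusion block: latent tokens attend to the editing instruction and the
    visual prompt. The original image is assumed to be injected by concatenation
    with the noisy latents (as in InstructPix2Pix-style editors)."""

    def __init__(self, dim: int = 320, text_dim: int = 768, vis_dim: int = 768):
        super().__init__()
        self.text_attn = nn.MultiheadAttention(
            dim, num_heads=8, kdim=text_dim, vdim=text_dim, batch_first=True)
        self.vis_attn = nn.MultiheadAttention(
            dim, num_heads=8, kdim=vis_dim, vdim=vis_dim, batch_first=True)
        self.gate = nn.Parameter(torch.zeros(1))  # visual-prompt branch starts disabled

    def forward(self, latents, text_emb, vis_emb):
        # latents: (B, N, dim) flattened latent tokens (original image already folded in)
        # text_emb: (B, T, text_dim) editing-instruction embedding
        # vis_emb:  (B, V, vis_dim) visual-prompt embedding
        h, _ = self.text_attn(latents, text_emb, text_emb)
        v, _ = self.vis_attn(latents, vis_emb, vis_emb)
        return latents + h + self.gate.tanh() * v

block = ThreeConditionBlock()
out = block(torch.randn(2, 64, 320), torch.randn(2, 77, 768), torch.randn(2, 4, 768))
print(out.shape)  # torch.Size([2, 64, 320])
```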
💖 Our model is built on the awesome Stable Diffusion 1.5.
```bibtex
@article{yu2024anyedit,
  title={AnyEdit: Mastering Unified High-Quality Image Editing for Any Idea},
  author={Yu, Qifan and Chow, Wei and Yue, Zhongqi and Pan, Kaihang and Wu, Yang and Wan, Xiaoyang and Li, Juncheng and Tang, Siliang and Zhang, Hanwang and Zhuang, Yueting},
  journal={arXiv preprint arXiv:2411.15738},
  year={2024}
}
```