Skip to content

Papers and codes collection for customized, personalized and editable generative models

License

Notifications You must be signed in to change notification settings

DuNGEOnmassster/awesome-customized-generative-AI

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

59 Commits
 
 
 
 
 
 
 
 

Repository files navigation

awesome-customized-generative-AI

Papers and codes collection for customize, personalized and editable generative models in 2D and 3D domains.

Awesome

Introduction

Artificial Intelligence Generated Content (AIGC) has become ubiquitous, demonstrating the power of generating mesmerizing results of random portraits. However, users generally show greater interest in personalized information (for example, faces from familiar people or celebrities) in these generated results than in generic faces. This tendency toward customization in AIGC arouses attention to customized, personalized, and editable generative AI.

This repo mainly focuses on visual generative models (leaving out LLMs), including 2D image-to-image, 2D text-to-image, and text-guided 3D generation/manipulation, collecting customized, personalized, and editable works in these specific domains. For any addition about other 2D/3D AIGC domains or bugs report, please open an issue, pull requests, or e-mail me at normanzheng6606@gmail.com for better communication.

Frequantly updating, please stay tuned!

Table of Contents

Customized 2D Image-to-Image

Image-prompted Generation

2024

  • FreeControl: Training-Free Spatial Control of Any Text-to-Image Diffusion Model with Any Condition CVPR2024 {Paper} {Code} {Webpage}

    Details

  • Face2Diffusion for Fast and Editable Face Personalization CVPR2024 {Paper} {Code} {Webpage}

    Details

  • InstantID: Zero-shot Identity-Preserving Generation in Seconds arxiv {Paper} {Code} {Webpage}

    Details

2023

  • When StyleGAN Meets Stable Diffusion: a đť’˛+ Adapter for Personalized Image Generation arxiv {Paper} {Code}
    Details

2022

Portrait Style Transfer

2024

  • InstantStyle: Free Lunch towards Style-Preserving in Text-to-Image Generation arxiv {Paper} {Code}
    Details

  • DEADiff: An Efficient Stylization Diffusion Model with Disentangled Representations CVPR 2024 {Paper} {Code} {Webpage}

    Details

  • Customizing Text-to-Image Models with a Single Image Pair arxiv {Paper}

    Details

2023

Interactive Image Editing

2024

  • Transparent Image Layer Diffusion using Latent Transparency arxiv {Paper} {Code}
    Powerful PhotoShop cutout replacement!Details

2023

  • Drag Your GAN: Interactive Point-based Manipulation on the Generative Image Manifold SIGGRAPH 2023 {Paper} {Code} {Webpage}

    Interactive manipulation of the generative image manifold!Details

  • DragDiffusion: Harnessing Diffusion Models for Interactive Point-based Image Editing CVPR 2024 {Paper} {Code} {Webpage}

    Details

  • Expressive Text-to-Image Generation with Rich Text ICCV 2023 {Paper} {Code} {Webpage}

    Details

Customized 2D Text-to-Image

Text-guided Portrait Generation

2024

  • Towards a Simultaneous and Granular Identity-Expression Control in Personalized Face Generation arxiv {Paper}
    Details

  • DisenBooth: Identity-Preserving Disentangled Tuning for Subject-Driven Text-to-Image Generation ICLR2024 {Paper} {Code} {Webpage}
    Details

  • DreamMatcher: Appearance Matching Self-Attention for Semantically-Consistent Text-to-Image Personalization CVPR2024 {Paper} {Code} {Webpage}
    Details

2023

Text-guided Image Editing

2024

  • BootPIG: Bootstrapping Zero-shot Personalized Image Generation Capabilities in Pretrained Diffusion Models arxiv {Paper}

    Details

  • Diff-Plugin: Revitalizing Details for Diffusion-based Low-level Tasks CVPR 2024 {Paper} {Code} {Webpage}

    Details

  • CustomText: Customized Textual Image Generation using Diffusion Models arxiv {Paper}

    Details

  • EmoEdit: Evoking Emotions through Image Manipulation arxiv {Paper}

    Details

  • Enhancing Text-to-Image Editing via Hybrid Mask-Informed Fusion arxiv {Paper}

    Details

  • Diffree: Text-Guided Shape Free Object Inpainting with Diffusion Model arxiv {Paper} {Code} {Webpage}

    Also suitable for Interactive Image EditingDetails

2023

  • DreamInpainter: Text-Guided Subject-Driven Image Inpainting with Diffusion Models arxiv {Paper}

    Details

  • HD-Painter: High-Resolution and Prompt-Faithful Text-Guided Image Inpainting with Diffusion Models arxiv {Paper} {Code} {HuggingFace}

    Details

  • Dynamic Prompt Learning: Addressing Cross-Attention Leakage for Text-Based Image Editing NeurIPS 2023 {Paper} {Code}

    Details

Customized 3D Generation/Manipulation

Image-prompted 3D Generation

2024

  • StableVITON: Learning Semantic Correspondence with Latent Diffusion Model for Virtual Try-On CVPR 2024 {Paper} {Code} {Webpage}
    Details

2023

  • GP-VTON: Towards General Purpose Virtual Try-on via Collaborative Local-Flow Global-Parsing Learning CVPR2023 {Paper} {Code}
    Details

Text-prompted 3D Manipulation

2024

2023

  • Instruct-NeRF2NeRF: Editing 3D Scenes with Instructions ICCV2023 {Paper} {Code}

    Details

  • DreamBooth3D: Subject-Driven Text-to-3D Generation with Dream Fields ICCV2023 {Paper} {Webpage}

    Details

  • GaussianEditor (Huawei): Editing 3D Gaussians Delicately with Text Instructions arxiv {Paper} {Webpage}

    Details

  • ViCA-NeRF: View-Consistency-Aware 3D Editing of Neural Radiance Fields NeurIPS 2023 {Paper} {Code} {Webpage}

    Details

Customized Video Generation

Image-prompted Video Generation

2024

Text-prompted Video Generation

2024

2023

  • Animate-A-Story: Storytelling with Retrieval-Augmented Video Generation arxiv {Paper} {Code} {Webpage}
    Details

Other Resources

Generic Benchmarks

Video Benchmarks

  • TC-Bench: Benchmarking Temporal Compositionality in Text-to-Video and Image-to-Video Generation Text-to-Video-Benchmark {Paper} {Code} {Webpage}

  • VBench: Comprehensive Benchmark Suite for Video Generative Models CVPR 2024 {Paper} {Code} {Webpage}

Generic Datasets

Generic Pre-trained Models

About

Papers and codes collection for customized, personalized and editable generative models

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published