Add InstructPix2Pix pipeline support. #625
Conversation
Force-pushed from ecd57f2 to 0eea205
Hi! I'd like to highlight a point about InstructPix2Pix unet inference. Since it feeds three inputs to the unet (for text guidance and image guidance), I can't use a static batch size with data parallel on 2 devices, so I'm passing `dynamic_batch_size=True` (splitting the batch 2:1). But that is a setting for all models in the pipeline, so it is a suboptimal solution. What do you think is better here: introduce a new parameter that allows setting dynamic batching exclusively for the unet? Also, do we need to address this anywhere else in the code tied to unet export?
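For reference, a minimal sketch of the export call being discussed — the checkpoint and input shapes are illustrative, and passing `data_parallel_mode` to `from_pretrained` is an assumption based on the options mentioned below, not something confirmed in this thread:

```python
from optimum.neuron import NeuronStableDiffusionInstructPix2PixPipeline

# dynamic_batch_size=True lets the compiled unet accept the tripled batch
# that classifier-free guidance produces (text-conditioned, image-conditioned,
# unconditional), split 2:1 across the two Neuron cores
pipe = NeuronStableDiffusionInstructPix2PixPipeline.from_pretrained(
    "timbrooks/instruct-pix2pix",   # illustrative checkpoint
    export=True,
    dynamic_batch_size=True,
    data_parallel_mode="unet",      # assumption: place only the unet on both cores
    **{"batch_size": 1, "height": 512, "width": 512},
)
```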
Hi @asntr,
Thanks for contributing, the PR looks awesome (a pleasure to review a PR with tests, snippet, comments, docstrings!!!).
Do not have much to change, just some small nits for this PR.

And for the case of classifier-free guidance, indeed it's the first time we meet a case where we don't just double but triple the input batch size. I think it would make sense to address it during compilation to avoid using dynamic batching: identify whether the task is pix2pix, and modify the batch size for compiling the unet according to how we place the models in the pipeline on the Neuron cores. E.g.:

- If we only leverage one core (`data_parallel_mode=="none"`) or place the whole pipeline on both Neuron cores (`data_parallel_mode=="all"`), then the batch size for compiling the unet shall be `3*batch_size`.
- If we place just the unet on both Neuron cores, then the batch size for compiling the unet shall be either `3*batch_size`, if `torch_neuronx.data_parallel` is able to split inputs with an odd batch size into 2 chunks (need to test, haven't tried yet but I think so), or `(3*batch_size + batch_size%2) // 2`, truncating the output `noise_pred` during the inference runtime (see the sketch after this list).
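A hypothetical sketch of that rule — the function name and the `data_parallel_mode` values are assumptions taken from the discussion above, not an existing API:

```python
def unet_compile_batch_size(batch_size: int, data_parallel_mode: str) -> int:
    """Batch size to compile the unet with when classifier-free guidance
    triples the runtime input batch (the InstructPix2Pix case)."""
    if data_parallel_mode in ("none", "all"):
        # One core, or the whole pipeline replicated on both cores:
        # the compiled unet sees the full tripled batch.
        return 3 * batch_size
    if data_parallel_mode == "unet":
        # The unet alone is split across 2 cores: pad the tripled batch to an
        # even total when batch_size is odd (e.g. 3 -> 4 for batch_size=1),
        # compile for half of it per core, and truncate the extra noise_pred
        # row at inference time.
        return (3 * batch_size + batch_size % 2) // 2
    raise ValueError(f"Unknown data_parallel_mode: {data_parallel_mode}")
```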
Hi @asntr, what do you think of the changes to the compilation that I suggested? Let me know if you are interested in working on this!
If you prefer to get this PR merged first, could you rebase your branch? There was a fix to the styling tool today; with it, the CI shall be good.
Hi @JingyaHuang! Sorry for the delayed response. I was thinking that choosing the batch size depending on the task touches more than just this pipeline. For example, the t2i pipeline doesn't allow using cfg when you are not using dynamic batching. So, maybe it is indeed a task for a next PR, and I can totally work on it!
I also have this example of inference output on INF2 for my snippet. Should I update the docs with this example (and create a PR into documentation-images)?
Sounds great, thanks @asntr! Ping me if you need any help.
Yeah please do! The image looks great! We could put it under the sdxl section: https://github.com/huggingface/optimum-neuron/blob/main/docs/source/tutorials/stable_diffusion.mdx#stable-diffusion-xl-turbo Thank you!
Hi @JingyaHuang, I placed the docs under the sdxl section. I also opened this PR: https://huggingface.co/datasets/optimum/documentation-images/discussions/4 Let me know if you're happy with this. Thanks!
Thanks @asntr for adding the doc, it looks great! Let's just wait for the CIs to finish to get this PR merged. (Trainium and INF1 CIs might fail, but that's totally irrelevant and we can ignore them.)
What does this PR do?
Fixes: #624
Adds support for loading and compiling the InstructPix2Pix pipeline using Neuron.
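A minimal usage sketch of the added pipeline — the checkpoint, image URL, prompt, and guidance values are illustrative, not taken from this PR:

```python
import requests
from PIL import Image

from optimum.neuron import NeuronStableDiffusionInstructPix2PixPipeline

# Compile once with fixed input shapes; dynamic batching accommodates the
# tripled unet batch produced by classifier-free guidance.
pipe = NeuronStableDiffusionInstructPix2PixPipeline.from_pretrained(
    "timbrooks/instruct-pix2pix",  # illustrative checkpoint
    export=True,
    dynamic_batch_size=True,
    **{"batch_size": 1, "height": 512, "width": 512},
)

# Fetch an input image to edit (illustrative URL).
url = "https://raw.githubusercontent.com/timothybrooks/instruct-pix2pix/main/imgs/example.jpg"
image = Image.open(requests.get(url, stream=True).raw).convert("RGB").resize((512, 512))

# image_guidance_scale is the InstructPix2Pix-specific knob; together with
# guidance_scale it is what triples the unet batch at inference time.
edited = pipe(
    "make it a snowy winter scene",
    image=image,
    num_inference_steps=20,
    image_guidance_scale=1.5,
    guidance_scale=7.5,
).images[0]
edited.save("edited.png")
```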