
SDXL lora on inf2 #359

Closed
MrD005 opened this issue Nov 29, 2023 · 10 comments

MrD005 commented Nov 29, 2023

I want to implement LoRA on the SDXL model on inf2. Is there any support or process for implementing this?

JingyaHuang self-assigned this Dec 3, 2023
JingyaHuang (Collaborator) commented

Hi @MrD005,

Thanks for opening the issue, and a contribution would be very welcome!

To support LoRA, the first thing we need to check is whether the Neuron compiler supports compiling the text encoder / UNet with LoRA weights loaded (theoretically yes).

To do so, we can do a quick hack: load the LoRA weights into the pipeline before it is compiled, for example with

pipeline.load_lora_weights("ostris/super-cereal-sdxl-lora", weight_name="cereal_box_sdxl_v1.safetensors")
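As a rough sketch of that hack (assuming a plain diffusers pipeline is patched before being handed to the Neuron export path; the fuse_lora() call and the hand-off step are assumptions, not the existing exporter code):

    # Sketch only: patch the diffusers SDXL pipeline with LoRA weights
    # before its submodels are traced by the Neuron compiler.
    from diffusers import StableDiffusionXLPipeline

    pipeline = StableDiffusionXLPipeline.from_pretrained(
        "stabilityai/stable-diffusion-xl-base-1.0"
    )

    # Load and fuse the LoRA weights so the patched UNet / text encoders are
    # exactly what gets compiled (fusing avoids tracing adapter layers).
    pipeline.load_lora_weights(
        "ostris/super-cereal-sdxl-lora", weight_name="cereal_box_sdxl_v1.safetensors"
    )
    pipeline.fuse_lora()

    # From here, pipeline.unet / pipeline.text_encoder / pipeline.text_encoder_2
    # would be handed to the existing optimum-neuron export path.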

If the model compiles successfully and doesn't hit any issues during inference, then we can add official support to the exporter:

  • Add a flag like "--lora_weight" in optimum/commands/export/neuronx.py to support it in the Optimum CLI
  • Add the LoRA info to main_export through the submodels argument
  • Load the LoRA weights through the function replace_stable_diffusion_submodels (a rough sketch follows below)
    def replace_stable_diffusion_submodels(pipeline, submodels):
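For illustration, here is a minimal sketch of what that loading step could look like; the dictionary keys lora_model_id / lora_weight_name are hypothetical and not part of the existing signature:

    # Rough sketch, not the actual optimum-neuron implementation: let the
    # `submodels` argument carry the LoRA info into the exporter.
    def replace_stable_diffusion_submodels(pipeline, submodels):
        if not submodels:
            return pipeline
        # Hypothetical keys; the real argument layout is up to the implementer.
        lora_id = submodels.get("lora_model_id")
        lora_weight_name = submodels.get("lora_weight_name")
        if lora_id is not None:
            pipeline.load_lora_weights(lora_id, weight_name=lora_weight_name)
            pipeline.fuse_lora()
        return pipeline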

Then we should be all set: the compiled artifacts will contain the LoRA weights, and there is nothing else to add during inference.

Much appreciated if you would like to work on it 🙏. Feel free to ping me for review or any further questions!

MrD005 (Author) commented Dec 4, 2023

Thanks @JingyaHuang, I will implement this and report back on whether it works.

Dev-hestabit commented

@JingyaHuang I have one more query, if you can help with that as well: when I run the SDXL model, it truncates my prompt to 77 tokens. I found several solutions for GPU and tried to port them to inf2, but I am stuck on the tensor calculations. Any help here would be much appreciated.
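The GPU-side approaches I found encode the prompt in 77-token chunks and concatenate the embeddings, roughly like the simplified sketch below (I assume the fixed 77-token input shape of the compiled text encoder on inf2 is where my tensor issue comes from, and SDXL's second text encoder with its pooled embedding would need extra handling):

    # Simplified sketch of the chunk-and-concatenate workaround for prompts
    # longer than CLIP's 77-token limit (per-chunk BOS/EOS handling omitted).
    import torch

    def encode_long_prompt(tokenizer, text_encoder, prompt, chunk_size=77):
        ids = tokenizer(prompt, truncation=False, return_tensors="pt").input_ids[0]
        chunks = [ids[i:i + chunk_size] for i in range(0, len(ids), chunk_size)]
        embeds = []
        for chunk in chunks:
            # Pad the last chunk so every forward pass matches the compiled
            # (fixed) sequence length expected on inf2.
            if len(chunk) < chunk_size:
                pad = torch.full(
                    (chunk_size - len(chunk),), tokenizer.pad_token_id, dtype=chunk.dtype
                )
                chunk = torch.cat([chunk, pad])
            embeds.append(text_encoder(chunk.unsqueeze(0))[0])
        return torch.cat(embeds, dim=1)  # shape: (1, n_chunks * 77, hidden_dim)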

HuggingFaceDocBuilderDev commented

This issue has been automatically marked as stale because it has not had recent activity. If you think this still needs to be addressed please comment on this thread. Thank you!

3 similar comments

khurramkhalil commented

Hi there,
Just wondering whether there is any update on adding LoRA support on inf2?
Thanks

JingyaHuang (Collaborator) commented

Hi @khurramkhalil, it's already supported through this PR: #483.

You can also find an example here: https://huggingface.co/docs/optimum-neuron/en/tutorials/stable_diffusion#load-adapters
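For quick reference, a usage sketch along the lines of that tutorial (argument names such as lora_model_ids / lora_weight_names / lora_scales follow recent optimum-neuron versions and may differ in yours):

    # Sketch following the linked tutorial: compile SDXL with the LoRA fused in.
    from optimum.neuron import NeuronStableDiffusionXLPipeline

    pipeline = NeuronStableDiffusionXLPipeline.from_pretrained(
        "stabilityai/stable-diffusion-xl-base-1.0",
        export=True,
        lora_model_ids="ostris/super-cereal-sdxl-lora",
        lora_weight_names="cereal_box_sdxl_v1.safetensors",
        lora_scales=0.9,
        batch_size=1,
        height=1024,
        width=1024,
    )
    pipeline.save_pretrained("sdxl_neuron_lora/")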

khurramkhalil commented

Thank you very much.
