
Improve config support for transformers with accelerate #630

Merged — 4 commits, Aug 4, 2024

Conversation

touchwolf
Contributor

This pull request improves configuration support for transformers when using the accelerate library. The changes address cases where the transformer's config was not fully taken into account in the original implementation.

Changes made:

  • Modified train.py to enhance config support.

Please review the changes and let me know if there are any issues or further modifications needed.

@bghira
Owner

bghira commented Aug 4, 2024

oh, i forgot to pull this in from my local dev branch. i manually swapped these when testing schnell. thank you for fixing that - there's another one, where the sequence length is cut to 256 for dev and schnell, but i'm not sure they actually have the proper model config for that to work.

@bghira bghira merged commit f58a5db into bghira:main Aug 4, 2024
1 check passed
@facok

facok commented Aug 4, 2024

oh, i forgot to pull this in from my local dev branch. i manually swapped these when testing schnell. thank you for fixing that - there's another one, where the sequence length is cut to 256 for dev and schnell, but i'm not sure they actually have the proper model config for that to work.

I saw these instructions in the diffusers docs; what impact will they have here?

https://github.com/huggingface/diffusers/blob/c370b90ff184a61bcbd58d486975ad4de095275e/docs/source/en/api/pipelines/flux.md

Flux comes in two variants:

  • Timestep-distilled (black-forest-labs/FLUX.1-schnell)
  • Guidance-distilled (black-forest-labs/FLUX.1-dev)

Timestep-distilled:

  • max_sequence_length cannot be more than 256.
  • guidance_scale needs to be 0.
  • As this is a timestep-distilled model, it benefits from fewer sampling steps.

Guidance-distilled:

  • The guidance-distilled variant takes about 50 sampling steps for good-quality generation.
  • It doesn't have any limitations around the max_sequence_length.
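To illustrate the variant-dependent limits quoted above, here is a minimal sketch of a helper that selects generation settings by variant. This helper and its default values are assumptions for illustration, not part of this PR or the diffusers API:

```python
# Hypothetical helper (not part of this PR) mapping a FLUX.1 variant
# to the sampling constraints quoted from the diffusers docs.
def flux_sampling_config(model_id: str) -> dict:
    """Return generation kwargs appropriate for the given FLUX.1 variant."""
    if "schnell" in model_id:
        # Timestep-distilled: hard 256-token cap, guidance must be 0,
        # and it benefits from few sampling steps.
        return {
            "max_sequence_length": 256,
            "guidance_scale": 0.0,
            "num_inference_steps": 4,  # assumed low step count
        }
    # Guidance-distilled (dev): no max_sequence_length limit,
    # ~50 steps for good-quality generation.
    return {
        "guidance_scale": 3.5,  # assumed default
        "num_inference_steps": 50,
    }
```

These kwargs could then be passed straight to the pipeline call for the matching checkpoint.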

3 participants