Implement zero terminal SNR noise schedule option #14145

drhead · 2023-11-29T22:55:07Z

Description

This is for implementing a zero terminal SNR noise schedule for models trained to use one.
This is a rough copy of ComfyUI's implementation that works with K-diffusion samplers by having the last sigma value be extremely small but not quite zero.
Fixes [Bug]: alphas_cumprod are downcasted to half precision during model load despite existing at full precision during sampling #14071
Adds compatibility option to reproduce old seeds for aforementioned bugfix
Implements [Feature Request]: Support Zero Terminal SNR betas for full dynamic range on any model trained with ZTSNR #13052
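
For readers unfamiliar with the technique, here is a minimal sketch of how a zero terminal SNR schedule can be derived from an existing alphas_cumprod. It is not the code from this PR. It assumes Stable Diffusion's scaled-linear beta schedule with its usual defaults; the function names and the `eps` value are illustrative assumptions, with `eps` chosen so that the final sigma stays finite instead of dividing by zero.

```python
import math


def scaled_linear_alphas_cumprod(num_steps=1000, beta_start=0.00085, beta_end=0.012):
    """Cumulative product of (1 - beta) for a scaled-linear beta schedule (assumed defaults)."""
    lo, hi = math.sqrt(beta_start), math.sqrt(beta_end)
    alphas_cumprod, running = [], 1.0
    for i in range(num_steps):
        beta = (lo + (hi - lo) * i / (num_steps - 1)) ** 2
        running *= 1.0 - beta
        alphas_cumprod.append(running)
    return alphas_cumprod


def rescale_zero_terminal_snr(alphas_cumprod, eps=4.8973451890853435e-08):
    """Shift and scale sqrt(alpha_bar) so the final step carries almost no signal."""
    sqrt_ab = [math.sqrt(a) for a in alphas_cumprod]
    first, last = sqrt_ab[0], sqrt_ab[-1]
    # Shift so the last value becomes zero, then scale so the first value is unchanged.
    rescaled = [((s - last) * first / (first - last)) ** 2 for s in sqrt_ab]
    # An exact zero would make sigma infinite, so keep a tiny positive value (assumed eps).
    rescaled[-1] = eps
    return rescaled


def to_sigmas(alphas_cumprod):
    """K-diffusion style sigma for each step: sqrt((1 - alpha_bar) / alpha_bar)."""
    return [math.sqrt((1.0 - a) / a) for a in alphas_cumprod]


original = scaled_linear_alphas_cumprod()
ztsnr = rescale_zero_terminal_snr(original)
print(f"largest sigma before: {to_sigmas(original)[-1]:.2f}")
print(f"largest sigma after:  {to_sigmas(ztsnr)[-1]:.2f}")
```

The rescaled schedule keeps the first step untouched and only pushes the final steps toward pure noise, which is why the largest sigma grows from the teens into the thousands.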

Draft progress

Currently the implementation works roughly as expected. I am sure it works correctly with K-diffusion samplers.
I can't verify that it works with DDIM or UniPC because current dynamic thresholding and CFG rescale extensions do not work with those samplers, so I have no proper way of testing them. PLMS doesn't work with v-prediction at all to begin with so it is effectively written off.
The relevant options, when active, are saved to PNG metadata, ~~but I don't know how to get them to apply setting overrides from the PNG info tab. This is one of the reasons why I am making this a draft for the moment.~~ fixed thanks to catboxanon

Checklist:

I have read contributing wiki page
I have performed a self-review of my own code
My code follows the style guidelines
My code passes tests

AUTOMATIC1111 · 2023-12-02T06:42:31Z

From the code it looks like the feature only works for half precision: alphas_cumprod_original is only added for half, and nothing is done if alphas_cumprod_original does not exist.
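
As a rough illustration of the precision issue discussed here, the sketch below round-trips values through IEEE 754 half precision using only the standard library, and keeps a full-precision copy around so that either variant can be selected. `to_half` and `select_alphas_cumprod` are hypothetical helpers, not functions from the webui code; only the option name `use_downcasted_alpha_bar` appears in this thread.

```python
import struct


def to_half(x: float) -> float:
    """Round-trip a Python float through IEEE 754 binary16 (half precision)."""
    return struct.unpack("<e", struct.pack("<e", x))[0]


def select_alphas_cumprod(original, use_downcasted_alpha_bar):
    """Return the schedule to sample with.

    `original` is the full-precision schedule. When the compatibility flag is
    set, the old behaviour is reproduced by downcasting every value to half.
    """
    if use_downcasted_alpha_bar:
        return [to_half(a) for a in original]
    return list(original)


full = [0.9991, 0.5, 0.0047]  # made-up sample values
compat = select_alphas_cumprod(full, use_downcasted_alpha_bar=True)
for a, b in zip(full, compat):
    print(f"{a!r} -> {b!r} (abs error {abs(a - b):.3e})")
```

Keeping the original values on every code path, not just the half-precision one, is what allows the choice to be made at sampling time.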

This looks like a good change overall, but since it alters generation by default, I don't want to be hasty with merging it. If anyone wants to discuss this, please do so here. Some images for comparison would also help.

drhead · 2023-12-02T18:04:15Z

I will fix its implementation on the other code paths. Here are a few grids for comparing the zero SNR noise schedule, generated using ZeroDiffusion 0.9. The first two are on Euler A with the default noise schedule. Note how they are overexposed and that there is a black border on some samples on the first grid. Also note that these images were generated without any CFG rescale or dynamic thresholding applied.

With the zero SNR noise schedule, these modes of failure are less likely to occur. You can also note that there's no longer a glowing outline around the subject in the second grid.

This is one sample of the compatibility mode:

This should be compared with the second grid posted above. The changes are extremely minor: I only notice differences in a few patches of the first image and one patch of the third image, and the others appear to be identical.

drhead · 2023-12-06T16:11:15Z

With further testing I am getting this error:
/home/drhead/miniconda3/envs/automatic/lib/python3.10/site-packages/torchsde/_brownian/brownian_interval.py:601: UserWarning: Should have ta>=t0 but got ta=0.0291675366461277 and t0=0.029168. warnings.warn(f"Should have ta>=t0 but got ta={ta} and t0={self._start}.")

Give me some time to investigate this before merging, since it seems obvious that something is being cast inappropriately (line numbers will be wrong because of my debug prints).

edit: It appears that this is not a casting issue, it is an issue with torchsde's brownian interval rounding the start and stop values. I don't know if this has a significant impact, but rounding the sigma schedule to 6 decimal places would match it. It might make more sense for this to be fixed upstream in the k-diffusion repo.
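
A small sketch of the rounding idea, using the two numbers from the warning above. `round_sigmas` is a hypothetical helper, and the choice of 6 decimal places is taken from the observation in this comment, not from torchsde's documentation.

```python
def round_sigmas(sigmas, decimals=6):
    """Round every sigma so it matches a start/stop value rounded the same way."""
    return [round(s, decimals) for s in sigmas]


ta, t0 = 0.0291675366461277, 0.029168  # values copied from the warning
print("unrounded ta >= t0:", ta >= t0)
print("rounded ta >= t0:  ", round_sigmas([ta])[0] >= t0)
```

Whether this belongs in the sigma schedule or upstream is a separate question; the sketch only shows that rounding both sides the same way removes the mismatch.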

drhead · 2023-12-06T17:42:08Z

Current implementation also seems to break Heun, but not Restart for some odd reason. Need to investigate.

drhead · 2023-12-09T17:08:09Z

I have identified the root cause for some samplers breaking. It appears that Heun, DDIM, and UniPC use other values that can be derived from alphas_cumprod at various points in sampling and I have to either update those values to match or update parts of the code to consistently re-derive those values (which theoretically would have a performance impact, but an insignificant one). I will stick to updating those values for now and expect to have a commit implementing these changes later today.

This is the only place these values are ever referenced outside of training code so this change is very justifiable and more consistent.

drhead · 2023-12-09T19:25:41Z

I went forward with re-deriving the values in the one place that used those values and now the DDIM sampler works more or less as expected.
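
The re-derivation can be sketched as follows. This assumes the two derived quantities are the square roots of alpha bar and of one minus alpha bar, as the commit title suggests; `derive_sqrt_terms` is an illustrative name, not the function in the codebase.

```python
import math


def derive_sqrt_terms(alphas_cumprod):
    """Recompute both square-root terms from the current alphas_cumprod.

    Deriving them on demand keeps them consistent with any schedule change,
    instead of relying on copies cached when the model was loaded.
    """
    sqrt_alpha_bar = [math.sqrt(a) for a in alphas_cumprod]
    sqrt_one_minus_alpha_bar = [math.sqrt(1.0 - a) for a in alphas_cumprod]
    return sqrt_alpha_bar, sqrt_one_minus_alpha_bar


sqrt_ab, sqrt_1m_ab = derive_sqrt_terms([0.9991, 0.5, 4.9e-08])  # made-up values
for s, n in zip(sqrt_ab, sqrt_1m_ab):
    print(f"signal scale {s:.6f}, noise scale {n:.6f}")
```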

I have further investigated the problems with Heun and UniPC and have concluded that both of these issues are present in the original ComfyUI implementation of zero terminal SNR sampling that I based this on. Model outputs for Heun start out in a ridiculous range and never seem to reach the expected range of -1 to 1, probably partly because it starts at sigma ~4500 and then goes to a sigma around 60. Under Restart, model outputs still start out in a rather high range but end up leveling out since the second sigma is around 3300. Outputs look normal under Heun if I use CFG rescale with fi=1.0, so I think the issue is lack of said rescale, and I would consider that out of scope of this pull request.
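
For context, a minimal sketch of CFG rescale on flat lists of numbers. It assumes that "fi" above refers to the rescale strength usually written as phi, and it computes one standard deviation over the whole list, whereas real implementations work on tensors and compute the statistic per sample. `cfg_rescale` and the sample values are illustrative only.

```python
from statistics import pstdev


def cfg_rescale(cond, uncond, guidance_scale, phi):
    """Classifier-free guidance followed by a standard-deviation rescale.

    phi = 0.0 returns plain CFG; phi = 1.0 fully matches the spread of the
    conditional prediction.
    """
    cfg = [u + guidance_scale * (c - u) for c, u in zip(cond, uncond)]
    factor = pstdev(cond) / pstdev(cfg)
    rescaled = [x * factor for x in cfg]
    return [phi * r + (1.0 - phi) * x for r, x in zip(rescaled, cfg)]


cond = [0.5, -0.25, 1.0, 0.1]    # made-up conditional prediction
uncond = [0.1, -0.1, 0.3, 0.0]   # made-up unconditional prediction
plain = cfg_rescale(cond, uncond, guidance_scale=7.5, phi=0.0)
full = cfg_rescale(cond, uncond, guidance_scale=7.5, phi=1.0)
print("std of cond:       ", pstdev(cond))
print("std with phi = 0.0:", pstdev(plain))
print("std with phi = 1.0:", pstdev(full))
```

With phi at 1.0 the guided output has the same spread as the conditional prediction, which is consistent with the observation that outputs return to a normal range once the rescale is applied.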

As far as I am concerned, this pull request is complete and ready. There's a number of other things that could be altered in order to make zero terminal SNR better supported (I would want to look at the discrepancy in Restart's sigma schedule for instance now that I know how to get reasonable outputs from Heun), but those can be investigated separately later.

AUTOMATIC1111 · 2023-12-16T16:49:10Z

Here's an example of differences, for whoever is reading:

Without this PR applied:

With it:

BurnZeZ · 2023-12-22T05:12:52Z

I’ve been seeing much closer prompt adherence especially with regard to fine details after applying this patch.

I haven’t used the new noise scheduler yet so is this caused by the alphas_cumprod fix?

drhead · 2023-12-30T17:15:02Z

> I’ve been seeing much closer prompt adherence especially with regard to fine details after applying this patch.
>
> I haven’t used the new noise scheduler yet so is this caused by the alphas_cumprod fix?

The changes shouldn't really be substantial enough to have a noticeable impact on prompt adherence. If it does then that may indicate something is wrong. Could you provide a side-by-side comparison with and without the patch on the same seed and settings to demonstrate?

AUTOMATIC1111 · 2024-01-01T12:08:00Z

@drhead

I decided to add the system I was considering previously, and that was recently suggested to me in discord, for automatically enabling backwards compatibility options; it does not completely solve the problem, but alleviates it a bit. And with that the PR is accepted.

One thing I'd like to ask you to do is to make another PR where you remove unneeded if hasattr(p.sd_model, 'alphas_cumprod') and hasattr(p.sd_model, 'alphas_cumprod_original'): check, and move your code inside process_images_inner into two functions outside of it (because it's already a very crowded function), preferably between program_version and create_infotext, leaving just a single call to your function inside process_images_inner. I can do it myself, but then git would show me as author of those lines.

ibrainventures · 2024-01-09T16:46:25Z

@drhead

I updated to the dev version today, and the improvement from your PR in fine details and photorealistic images is clearly visible.

The only difference between the two generations was the following setting (a hint for users of the API's override_setting param):

Img 1: 'use_downcasted_alpha_bar' => true

Img 2: 'use_downcasted_alpha_bar' => false / (or no param) (the "new" calculation)
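
A sketch of what the two requests might look like. It assumes the webui API's txt2img request body accepts an `override_settings` object; the prompt, seed, and the `build_txt2img_payload` helper are placeholders, and no request is actually sent here.

```python
import json


def build_txt2img_payload(prompt, seed, use_downcasted_alpha_bar):
    """Build a request body that differs only in the compatibility option."""
    return {
        "prompt": prompt,
        "seed": seed,
        "override_settings": {
            "use_downcasted_alpha_bar": use_downcasted_alpha_bar,
        },
    }


old_behaviour = build_txt2img_payload("a photo of a cat", 1234, True)   # Img 1
new_behaviour = build_txt2img_payload("a photo of a cat", 1234, False)  # Img 2
print(json.dumps(old_behaviour, indent=2))
print(json.dumps(new_behaviour, indent=2))
```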

Thanks for this PR.

drhead and others added 5 commits November 29, 2023 17:38

Protect alphas_cumprod from downcasting

b25c126

Add options for zero terminal SNR

588a528

Implement zero terminal SNR schedule option

6d0a8dc

Fix infotext for ztSNR

ec6ee5c

Lint

ffa7f82

drhead marked this pull request as ready for review November 29, 2023 23:13

drhead requested a review from AUTOMATIC1111 as a code owner November 29, 2023 23:13

catboxanon and others added 2 commits November 29, 2023 18:33

Only apply ztSNR related code if alphas_cumprod exists

de79597

remove debug print

668ae34

drhead and others added 5 commits December 2, 2023 13:07

ensure that original alpha bar always exists

309a606

fix linting

81c4ddf

Revert 309a606

4a43334

Create alphas_cumprod_original on full precision path

dc1adee

fix variable

78acdcf

catboxanon linked an issue Dec 4, 2023 that may be closed by this pull request

[Bug]: alphas_cumprod are downcasted to half precision during model load despite existing at full precision during sampling #14071

Closed

1 task

re-derive sqrt alpha bar and sqrt one minus alphabar

5381405

This is the only place these values are ever referenced outside of training code so this change is very justifiable and more consistent.

AUTOMATIC1111 approved these changes Jan 1, 2024

AUTOMATIC1111 merged commit 267fd5d into AUTOMATIC1111:dev Jan 1, 2024
3 checks passed

AUTOMATIC1111 added a commit that referenced this pull request Jan 1, 2024

add automatic version support for zero terminal SNR noise schedule op…

45b7bba

…tion from #14145

Cyberbeing mentioned this pull request Jan 11, 2024

[Bug]: 1.7x DEV v1.7.0-329-g85bf2eb4 alphas_cumprod Downcast setting got lost and stuck on some models #14610

Open

6 tasks

w-e-w mentioned this pull request Feb 17, 2024

1.8.0-RC #14948

Closed

pawel665j mentioned this pull request Apr 16, 2024

1.8.0-RC #15537

Closed

ruchej pushed a commit to ruchej/stable-diffusion-webui that referenced this pull request Sep 30, 2024

add automatic version support for zero terminal SNR noise schedule op…

a8b1df5

…tion from AUTOMATIC1111#14145

catboxanon mentioned this pull request Oct 19, 2024

Support and automatically detect SDXL V-prediction models #16567

Merged

4 tasks
