
unet keeps producing nan during training #18

Open
EYcab opened this issue Jan 22, 2024 · 2 comments

EYcab commented Jan 22, 2024

(screenshot of the training output attached)
Does anyone know why this UNet always produces NaN results during training, even though all the settings are configured as described and all the other input variables are the same?

@junyongyou

Yes, I encountered the same issue here: the loss becomes NaN after some epochs. I tried different reward functions, and the result was the same.

@junyongyou

I figured out the reason. You can set config.mixed_precision to "no" in base.py so that training runs in full precision, which should prevent the UNet from producing NaN.
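
For reference, a minimal sketch of what that change looks like, assuming the config is an ml_collections ConfigDict defined in a base.py similar to those used in diffusion RL training repos (the exact file layout and any fields other than mixed_precision are assumptions, so adapt this to your copy):

```python
# base.py -- sketch only; everything except mixed_precision is an assumption
import ml_collections


def get_config():
    config = ml_collections.ConfigDict()
    # Mixed precision ("fp16"/"bf16") can overflow or underflow inside the
    # UNet and turn the loss into NaN; "no" keeps everything in full fp32.
    config.mixed_precision = "no"
    return config
```

If, as in similar setups, this value is passed through to Hugging Face Accelerate, note that its mixed_precision setting accepts "no", "fp16", or "bf16", so "no" simply disables half-precision computation at the cost of more memory and somewhat slower training.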
