
Why are the mean and std in the Reward Model set to 0.16717362830052426 and 1.0333394966054072? #7

Closed
liming-ai opened this issue Apr 23, 2023 · 1 comment

Comments


liming-ai commented Apr 23, 2023

Hi, @tongyx361

Thanks for your contribution. I want to figure out why the mean and std in the Reward Model are set to the following values:

self.mean = 0.16717362830052426
self.std = 1.0333394966054072
In addition, there are negative reward values during inference, which confuses me: what is the range of rewards during training?

xujz18 (Member) commented Apr 24, 2023

The two values you mentioned are the mean and std of the reward values on the test set. They are used to normalize the reward to have a mean of 0 and a standard deviation of 1, so that the average reward on the test set is 0. When testing Stable Diffusion v1.4 on our metric set, the observed reward falls within [-1, 1] 62.4% of the time and within [-2, 2] 98.2% of the time.
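The normalization described above can be sketched as follows. This is a minimal illustration, not the repository's exact code: only the two constants come from this thread, and the function name is an assumption.

```python
# Test-set statistics of the raw reward (values from this thread).
MEAN = 0.16717362830052426  # mean of raw rewards on the test set
STD = 1.0333394966054072    # std of raw rewards on the test set

def normalize_reward(raw_reward: float) -> float:
    """Shift and scale a raw reward so that test-set rewards
    have mean 0 and standard deviation 1."""
    return (raw_reward - MEAN) / STD

# A raw reward equal to the test-set mean maps to 0; values below
# the mean become negative, which explains the negative rewards
# seen at inference time.
print(normalize_reward(MEAN))        # 0.0
print(normalize_reward(MEAN + STD))  # 1.0
```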
