Isn't min-SNR training strategy counter-intuitive? #13054
Unanswered
AbrightWay
asked this question in
Q&A
Replies: 1 comment 2 replies
-
|
@drhead could you please explain this phenomenon? |
Beta Was this translation helpful? Give feedback.
2 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
Uh oh!
There was an error while loading. Please reload this page.
-
Consider the case 'prediction_type' = 'sample' (predict x0). min-SNR training strategy assign weights to each sample of the loss (as in the figure below, but replace$\gamma$ with values > $\gamma$ :

This is quite counter-intuitive as (when t → 0), the denoising task of diffusion model is easier than (when t → T). Therefore, the weights (when t → T) should be higher than (when t → 0), and the formula for the weighs should be the inverse of min-SNR weighting, but the paper https://arxiv.org/abs/2303.09556 suggests the opposite weighting.
Can anyone please explain, please?
Beta Was this translation helpful? Give feedback.
All reactions