
A problem about the weight λ of L_vlb #114

Open

yinguanchun opened this issue Sep 30, 2023 · 4 comments

Comments

@yinguanchun

In the paper, λ is 0.001. The code sets learn_sigma to True and rescale_learned_sigmas to False, so the loss type will be gd.LossType.MSE; with this loss type, L_vlb is never multiplied by 0.001. Even when the loss type is gd.LossType.RESCALED_MSE, the code only does terms["vb"] *= self.num_timesteps / 1000.0. What is self.num_timesteps, and what is its effect?
Thank you.
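
For context, here is a minimal runnable sketch of how I read that branch; the variable names and placeholder values are my own, not the upstream code verbatim:

```python
# Minimal sketch of the RESCALED_MSE weighting (my paraphrase, assuming the
# usual T = 1000 diffusion steps; placeholder loss values, not real outputs).
num_timesteps = 1000  # self.num_timesteps: the number of diffusion steps T

l_simple = 1.0  # placeholder MSE term for one uniformly sampled timestep t
l_vb = 1.0      # placeholder variational-bound term for the same t

# The line in question: terms["vb"] *= self.num_timesteps / 1000.0
# With T = 1000 the factor is exactly 1, so no explicit 0.001 shows up here.
l_vb *= num_timesteps / 1000.0

loss = l_simple + l_vb
print(loss)  # 2.0 with these placeholder values
```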

@toyot-li

@yinguanchun I am also confused about this scaling factor. Have you figured it out?

@Feynman1999

I am also confused about this scaling factor. Have you figured it out?


yhy258 commented Aug 1, 2024

In my opinion, the authors define L_{vlb} = L_0 + ... + L_T, not a single per-timestep L_t.
Thus, they may compute the vlb loss with a scale factor of T (self.num_timesteps).
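
If that reading is right, the factor decomposes into an unbiased estimate of the full sum plus the paper's weight (my own derivation, assuming t is drawn uniformly from the T timesteps):

$$\mathbb{E}_{t \sim \mathrm{Unif}\{0,\dots,T-1\}}\bigl[\,T \cdot L_t\,\bigr] = \sum_{t=0}^{T-1} L_t \approx L_{\mathrm{vlb}}$$

so scaling a single sampled term by self.num_timesteps / 1000.0 = T / 1000 gives, in expectation, (1/1000) * L_{vlb}, i.e. λ = 0.001.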


unl1002 commented Nov 23, 2024

@yhy258 Thank you for your answer. So does that mean we use L_t * T (self.num_timesteps) to approximate L_{vlb}?
