Skip to content

Latest commit

 

History

History
34 lines (23 loc) · 1.65 KB

File metadata and controls

34 lines (23 loc) · 1.65 KB

Diffusion models

Succession of layers that try to predict the input in one step less noisy.

What we are trying to do is find the distribution that maximizes the likelihood of the data.

What we do in practice is to minimize the negative ELBO.

The important part about diffusion models is that this objective becomes tenable because of (1) the Markov assumption and (2) the fact that transitions are conditional Gaussians

if you forward diffuse and image and then reverse diffuse the result, you will very probably get a different image than the one you started at.

GANs are slower than diffusion models:

  • False during inference
  • True during training. Because the min max game is complex with the discriminator, it's hard to reach the equilibrium. For diffusion models, the output is straigthforward to learn and objective is obvious and clear.

What conditions do diffusion model architectures need to fulfill?

  • Input dimension == output dimension

More