Diffusion models are the foundation of all image generation models.
语速
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
noise image -> -------------
| |----> cleared image
step (int) -> | denoiser |
| |
--------------|
```The charm of deep learning lies in the fact that once a new task achieves improved performance with a certain architecture, many other tasks can refer to this architecture and benefit from it.
I believe the diffusion model is a typical example. Although I do not conduct research on diffusion models and currently have no related projects, there is no harm in understanding this network architecture.
The diffusion model is one that benefits from the image processing process.
By learning the reverse process of adding noise to images, the diffusion model acquires the ability to generate images from noise.

To enable the model to achieve better performance, the denoising step of the model is included as one of the inputs.