Chapter 5

Diffusion Models

This chapter covers the forward and reverse diffusion processes, then builds up the Latent Stable Diffusion architecture (Rombach et al., 2022) — showing how autoencoders, U-Nets, transformer text encoders, and cross-attention integrate into the system that powers DALL-E, Stable Diffusion, and Midjourney.