Cloud Diffusion Part 1: Theory and Motivation
Cloud Diffusion Part 1: Theory and Motivation
Diffusion models for image generation function by progressively adding noise to an image set and training a model to separate out the signal from the noise. The noise profile used by these models is white noise -- that is, noise based on independent normal distributions at each point whose mean and variance is independent of the scale. By contrast, most natural image sets exhibit a type of scale invariance in their low-order statistical properties characterized by a power-law scaling. Consequently, natural images are closer (in a quantifiable sense) to a different probability distribution that emphasizes large scale correlations and de-emphasizes small scale correlations. These scale invariant noise profiles can be incorporated into diffusion models in place of white noise to form what we will call a ``Cloud Diffusion Model". We argue that these models can lead to faster inference, improved high-frequency details, and greater controllability. In a follow-up paper, we will build and train a Cloud Diffusion Model that uses scale invariance at a fundamental level and compare it to classic, white noise diffusion models.
Andrew Randono
计算技术、计算机技术
Andrew Randono.Cloud Diffusion Part 1: Theory and Motivation[EB/OL].(2025-07-07)[2025-07-23].https://arxiv.org/abs/2507.05496.点此复制
评论