首页|Cloud Diffusion Part 1: Theory and Motivation

Cloud Diffusion Part 1: Theory and Motivation

来源：

英文摘要

Diffusion models for image generation function by progressively adding noise to an image set and training a model to separate out the signal from the noise. The noise profile used by these models is white noise -- that is, noise based on independent normal distributions at each point whose mean and variance is independent of the scale. By contrast, most natural image sets exhibit a type of scale invariance in their low-order statistical properties characterized by a power-law scaling. Consequently, natural images are closer (in a quantifiable sense) to a different probability distribution that emphasizes large scale correlations and de-emphasizes small scale correlations. These scale invariant noise profiles can be incorporated into diffusion models in place of white noise to form what we will call a ``Cloud Diffusion Model". We argue that these models can lead to faster inference, improved high-frequency details, and greater controllability. In a follow-up paper, we will build and train a Cloud Diffusion Model that uses scale invariance at a fundamental level and compare it to classic, white noise diffusion models.

作者：Andrew Randono

作者单位：

学科分类：计算技术、计算机技术

推荐引用：Andrew Randono.Cloud Diffusion Part 1: Theory and Motivation[EB/OL].(2025-07-07)[2025-07-25].https://arxiv.org/abs/2507.05496.点此复制

Cloud Diffusion Part 1: Theory and Motivation

Cloud Diffusion Part 1: Theory and Motivation

评论