Video URL
https://pirsa.org/26020036Local Diffusion Models and Phases of Data Distributions
APA
Gao, X. (2026). Local Diffusion Models and Phases of Data Distributions. Perimeter Institute for Theoretical Physics. https://pirsa.org/26020036
MLA
Gao, Xun. Local Diffusion Models and Phases of Data Distributions. Perimeter Institute for Theoretical Physics, Feb. 04, 2026, https://pirsa.org/26020036
BibTex
@misc{ scivideos_PIRSA:26020036,
doi = {10.48660/26020036},
url = {https://pirsa.org/26020036},
author = {Gao, Xun},
keywords = {Quantum Information},
language = {en},
title = {Local Diffusion Models and Phases of Data Distributions},
publisher = {Perimeter Institute for Theoretical Physics},
year = {2026},
month = {feb},
note = {PIRSA:26020036 see, \url{https://scivideos.org/pirsa/26020036}}
}
Xun Gao University of Colorado, Boulder
Abstract
As a class of generative artificial intelligence frameworks inspired by statistical physics, diffusion models have shown extraordinary performance in synthesizing complicated data distributions through a denoising process gradually guided by score functions. Real-life data, like images, is often spatially structured in low-dimensional spaces. However, ordinary diffusion models ignore this local structure and learn spatially global score functions, which are often computationally expensive. In this work, motivated by recent advances in statistical physics, we develop a generic framework for defining phases of data distributions and use it to analyze the locality requirements of denoisers in diffusion models. We define two distributions as belonging to the same data distribution phase if they can be mutually connected via spatially local operations such as local denoisers. We demonstrate that the reverse denoising process consists of an early trivial phase and a late data phase, sandwiching a rapid phase transition where local denoisers must fail. We further demonstrate that the performance of local denoisers is closely tied to spatial Markovianity, which provides an operational criterion for diagnosing such phase transitions. We validate this criterion through numerical experiments on real-world datasets. Our work suggests guidance for simpler and more efficient architectures of diffusion models: far from the phase transition point, we can use small local neural networks to compute the score function; global neural networks are only necessary around the narrow time interval of phase transitions. This result also opens up new directions for studying phases of data distributions, the broader science of generative artificial intelligence, and guiding the design of neural networks inspired by physics concepts.