Dynamical regimes of diffusion models.

Autor: Biroli G; Laboratoire de Physique de l'Ecole Normale Supérieure, ENS, Université PSL, CNRS, Sorbonne Université, Université Paris Cité, Paris, France., Bonnaire T; Laboratoire de Physique de l'Ecole Normale Supérieure, ENS, Université PSL, CNRS, Sorbonne Université, Université Paris Cité, Paris, France. tony.bonnaire@ens.fr., de Bortoli V; Computer Science Department, ENS, CNRS, PSL University, Paris, France., Mézard M; Department of Computing Sciences, Bocconi University, Milano, Italy.
Jazyk: angličtina
Zdroj: Nature communications [Nat Commun] 2024 Nov 17; Vol. 15 (1), pp. 9957. Date of Electronic Publication: 2024 Nov 17.
DOI: 10.1038/s41467-024-54281-3
Abstrakt: We study generative diffusion models in the regime where both the data dimension and the sample size are large, and the score function is trained optimally. Using statistical physics methods, we identify three distinct dynamical regimes during the generative diffusion process. The generative dynamics, starting from pure noise, first encounters a speciation transition, where the broad structure of the data emerges, akin to symmetry breaking in phase transitions. This is followed by a collapse phase, where the dynamics is attracted to a specific training point through a mechanism similar to condensation in a glass phase. The speciation time can be obtained from a spectral analysis of the data's correlation matrix, while the collapse time relates to an excess entropy measure, and reveals the existence of a curse of dimensionality for diffusion models. These theoretical findings are supported by analytical solutions for Gaussian mixtures and confirmed by numerical experiments on real datasets.
Competing Interests: Competing interests The authors declare no competing interests.
(© 2024. The Author(s).)
Databáze: MEDLINE