ICTS:32499

What does guidance do? (Online)

APA

(2025). What does guidance do? (Online). SciVideos. https://scivideos.org/icts-tifr/32499

MLA

What does guidance do? (Online). SciVideos, Aug. 13, 2025, https://scivideos.org/icts-tifr/32499

BibTex

          @misc{ scivideos_ICTS:32499,
            doi = {},
            url = {https://scivideos.org/icts-tifr/32499},
            author = {},
            keywords = {},
            language = {en},
            title = {What does guidance do? (Online)},
            publisher = {},
            year = {2025},
            month = {aug},
            note = {ICTS:32499 see, \url{https://scivideos.org/icts-tifr/32499}}
          }
          
Sitan Chen
Talk numberICTS:32499
Source RepositoryICTS-TIFR

Abstract

When sampling from a base measure tilted by a reward model, a popular trick is to approximate the score of the tilted measure with the sum of the base score and the gradient of the reward. It is well-known that this does not sample from the base distribution but nevertheless seems to do something interesting and useful, e.g., classifier-free guidance (CFG) and diffusion posterior sampling (DPS). In this talk, I provide some theoretical perspectives on what this method actually samples from, focusing on a simple mixture model setting. In the first part, I will rigorously characterize the dynamics of CFG, proving that it generates archetypal and low-diversity samples in a certain precise sense. In the second part, I will show that for linear inverse problems, DPS with a careful choice of initialization simultaneously boosts reward and likelihood under the prior. I will then describe some experiments demonstrating that DPS with this initialization scheme achieves strong performance on hard image restoration tasks like large box inpainting. Based on https://arxiv.org/abs/2409.13074 and https://arxiv.org/abs/2506.10955