What does guidance do? (Online)
APA
(2025). What does guidance do? (Online). SciVideos. https://scivideos.org/icts-tifr/32499
MLA
What does guidance do? (Online). SciVideos, Aug. 13, 2025, https://scivideos.org/icts-tifr/32499
BibTex
@misc{ scivideos_ICTS:32499, doi = {}, url = {https://scivideos.org/icts-tifr/32499}, author = {}, keywords = {}, language = {en}, title = {What does guidance do? (Online)}, publisher = {}, year = {2025}, month = {aug}, note = {ICTS:32499 see, \url{https://scivideos.org/icts-tifr/32499}} }
Abstract
When sampling from a base measure tilted by a reward model, a popular trick is to approximate the score of the tilted measure with the sum of the base score and the gradient of the reward. It is well-known that this does not sample from the base distribution but nevertheless seems to do something interesting and useful, e.g., classifier-free guidance (CFG) and diffusion posterior sampling (DPS). In this talk, I provide some theoretical perspectives on what this method actually samples from, focusing on a simple mixture model setting. In the first part, I will rigorously characterize the dynamics of CFG, proving that it generates archetypal and low-diversity samples in a certain precise sense. In the second part, I will show that for linear inverse problems, DPS with a careful choice of initialization simultaneously boosts reward and likelihood under the prior. I will then describe some experiments demonstrating that DPS with this initialization scheme achieves strong performance on hard image restoration tasks like large box inpainting. Based on https://arxiv.org/abs/2409.13074 and https://arxiv.org/abs/2506.10955