22742

The Statistical Complexity of Interactive Decision Making

APA

(2022). The Statistical Complexity of Interactive Decision Making. The Simons Institute for the Theory of Computing. https://old.simons.berkeley.edu/talks/statistical-complexity-interactive-decision-making

MLA

The Statistical Complexity of Interactive Decision Making. The Simons Institute for the Theory of Computing, Oct. 11, 2022, https://old.simons.berkeley.edu/talks/statistical-complexity-interactive-decision-making

BibTex

          @misc{ scivideos_22742,
            doi = {},
            url = {https://old.simons.berkeley.edu/talks/statistical-complexity-interactive-decision-making},
            author = {},
            keywords = {},
            language = {en},
            title = {The Statistical Complexity of Interactive Decision Making},
            publisher = {The Simons Institute for the Theory of Computing},
            year = {2022},
            month = {oct},
            note = {22742 see, \url{https://scivideos.org/simons-institute/22742}}
          }
          
Dylan Foster (Microsoft Research)
Talk number22742
Source RepositorySimons Institute

Abstract

A fundamental challenge in interactive learning and decision making, ranging from bandit problems to reinforcement learning, is to provide sample-efficient, adaptive learning algorithms that achieve near-optimal regret. This question is analogous to the classical problem of optimal (supervised) statistical learning, where there are well-known complexity measures (e.g., VC dimension and Rademacher complexity) that govern the statistical complexity of learning. However, characterizing the statistical complexity of interactive learning is substantially more challenging due to the adaptive nature of the problem. In this talk, we will introduce a new complexity measure, the Decision-Estimation Coefficient, which is necessary and sufficient for sample-efficient interactive learning. In particular, we will provide: 1. a lower bound on the optimal regret for any interactive decision making problem, establishing the Decision-Estimation Coefficient as a fundamental limit. 2. a unified algorithm design principle, Estimation-to-Decisions, which attains a regret bound matching our lower bound, thereby achieving optimal sample-efficient learning as characterized by the Decision-Estimation Coefficient. Taken together, these results give a theory of learnability for interactive decision making. When applied to reinforcement learning settings, the Decision-Estimation Coefficient recovers essentially all existing hardness results and lower bounds.