Video URL

https://old.simons.berkeley.edu/talks/near-optimal-no-regret-learning-general-convex-games

Near-Optimal No-Regret Learning for General Convex Games

(2022). Near-Optimal No-Regret Learning for General Convex Games. The Simons Institute for the Theory of Computing. https://old.simons.berkeley.edu/talks/near-optimal-no-regret-learning-general-convex-games

Near-Optimal No-Regret Learning for General Convex Games. The Simons Institute for the Theory of Computing, Oct. 14, 2022, https://old.simons.berkeley.edu/talks/near-optimal-no-regret-learning-general-convex-games

          @misc{ scivideos_22752,
            doi = {},
            url = {https://old.simons.berkeley.edu/talks/near-optimal-no-regret-learning-general-convex-games},
            author = {},
            keywords = {},
            language = {en},
            title = {Near-Optimal No-Regret Learning for General Convex Games},
            publisher = {The Simons Institute for the Theory of Computing},
            year = {2022},
            month = {oct},
            note = {22752 see, \url{https://scivideos.org/simons-institute/22752}}
          }

Gabriele Farina (Carnegie Mellon University)

October 14, 2022

Talk number22752

Source RepositorySimons Institute

Subject

Computer Science

Abstract

A recent line of work has established uncoupled learning dynamics such that, when employed by all players in a game, each player's regret after T repetitions grows polylogarithmically in T, an exponential improvement over the traditional guarantees within the no-regret framework. However, so far these results have only been limited to certain classes of games with structured strategy spaces---such as normal-form and extensive-form games. The question as to whether O(polylog T) regret bounds can be obtained for general convex and compact strategy sets---which occur in many fundamental models in economics and multiagent systems---while retaining efficient strategy updates is an important question. In this talk, we answer this in the positive by establishing the first uncoupled learning algorithm with O(log T) per-player regret in general convex games, that is, games with concave utility functions supported on arbitrary convex and compact strategy sets. Our learning dynamics are based on an instantiation of optimistic follow-the-regularized-leader over an appropriately lifted space using a self-concordant regularizer that is, peculiarly, not a barrier for the feasible region. Further, our learning dynamics are efficiently implementable given access to a proximal oracle for the convex strategy set, leading to O(loglog T) per-iteration complexity; we also give extensions when access to only a linear optimization oracle is assumed. Finally, we adapt our dynamics to guarantee O(sqrt(T)) regret in the adversarial regime. Even in those special cases where prior results apply, our algorithm improves over the state-of-the-art regret bounds either in terms of the dependence on the number of iterations or on the dimension of the strategy sets. Based on joint work with Ioannis Anagnostides, Haipeng Luo, Chung-Wei Lee, Christian Kroer, and Tuomas Sandholm. Paper link: https://arxiv.org/abs/2206.08742

Supported by

Video URL

Near-Optimal No-Regret Learning for General Convex Games

Abstract

Intro to Meta-Complexity: Part 2

Intro to Meta-Complexity: Part 1

Aggregative Efficiency of Bayesian Learning in Networks

(Relaxing) Common Belief for Social Networks

Organizing Modular Production

Video URL

Near-Optimal No-Regret Learning for General Convex Games

APA

MLA

BibTex

Abstract

Intro to Meta-Complexity: Part 2

Intro to Meta-Complexity: Part 1

Aggregative Efficiency of Bayesian Learning in Networks

(Relaxing) Common Belief for Social Networks

Organizing Modular Production