Video URL

https://old.simons.berkeley.edu/talks/power-adaptivity-representation-learning-meta-learning-federa…

The Power of Adaptivity in Representation Learning: From Meta-Learning to Federated Learning

(2022). The Power of Adaptivity in Representation Learning: From Meta-Learning to Federated Learning. The Simons Institute for the Theory of Computing. https://old.simons.berkeley.edu/talks/power-adaptivity-representation-learning-meta-learning-federated-learning

The Power of Adaptivity in Representation Learning: From Meta-Learning to Federated Learning. The Simons Institute for the Theory of Computing, Oct. 13, 2022, https://old.simons.berkeley.edu/talks/power-adaptivity-representation-learning-meta-learning-federated-learning

          @misc{ scivideos_22751,
            doi = {},
            url = {https://old.simons.berkeley.edu/talks/power-adaptivity-representation-learning-meta-learning-federated-learning},
            author = {},
            keywords = {},
            language = {en},
            title = {The Power of Adaptivity in Representation Learning: From Meta-Learning to Federated Learning},
            publisher = {The Simons Institute for the Theory of Computing},
            year = {2022},
            month = {oct},
            note = {22751 see, \url{https://scivideos.org/simons-institute/22751}}
          }

Sanjay Shakkottai (University of Texas at Austin)

October 13, 2022

Talk number22751

Source RepositorySimons Institute

Subject

Computer Science

Abstract

A central problem in machine learning is as follows: How should we train models using data generated from a collection of clients/environments, if we know that these models will be deployed in a new and unseen environment? In the setting of few-shot learning, two prominent approaches are: (a) develop a modeling framework that is “primed” to adapt, such as Model Adaptive Meta Learning (MAML), or (b) develop a common model using federated learning (such as FedAvg), and then fine tune the model for the deployment environment. We study both these approaches in the multi-task linear representation setting. We show that the reason behind generalizability of the models in new environments trained through either of these approaches is that the dynamics of training induces the models to evolve toward the common data representation among the clients’ tasks. In both cases, the structure of the bi-level update at each iteration (an inner and outer update with MAML, and a local and global update with FedAvg) holds the key — the diversity among client data distributions are exploited via inner/local updates, and induces the outer/global updates to bring the representation closer to the ground-truth. In both these settings, these are the first results that formally show representation learning, and derive exponentially fast convergence to the ground-truth representation. Based on joint work with Liam Collins, Hamed Hassani, Aryan Mokhtari, and Sewoong Oh. Papers: https://arxiv.org/abs/2202.03483 , https://arxiv.org/abs/2205.13692

Supported by

Video URL

The Power of Adaptivity in Representation Learning: From Meta-Learning to Federated Learning

Abstract

Intro to Meta-Complexity: Part 2

Intro to Meta-Complexity: Part 1

Aggregative Efficiency of Bayesian Learning in Networks

(Relaxing) Common Belief for Social Networks

Organizing Modular Production

Video URL

The Power of Adaptivity in Representation Learning: From Meta-Learning to Federated Learning

APA

MLA

BibTex

Abstract

Intro to Meta-Complexity: Part 2

Intro to Meta-Complexity: Part 1

Aggregative Efficiency of Bayesian Learning in Networks

(Relaxing) Common Belief for Social Networks

Organizing Modular Production