Video URL

https://simons.berkeley.edu/talks/tba-153

Is Overfitting Actually Benign? On the Consistency of Interpolating Methods

(2021). Is Overfitting Actually Benign? On the Consistency of Interpolating Methods. The Simons Institute for the Theory of Computing. https://simons.berkeley.edu/talks/tba-153

Is Overfitting Actually Benign? On the Consistency of Interpolating Methods. The Simons Institute for the Theory of Computing, Dec. 07, 2021, https://simons.berkeley.edu/talks/tba-153

          @misc{ scivideos_18850,
            doi = {},
            url = {https://simons.berkeley.edu/talks/tba-153},
            author = {},
            keywords = {},
            language = {en},
            title = {Is Overfitting Actually Benign? On the Consistency of Interpolating Methods},
            publisher = {The Simons Institute for the Theory of Computing},
            year = {2021},
            month = {dec},
            note = {18850 see, \url{https://scivideos.org/Simons-Institute/18850}}
          }

Preetum Nakkiran (UCSD)

December 07, 2021

Talk number18850

Source RepositorySimons Institute

Subject

Computer Science

Abstract

This talk will be very informal: I will discuss ongoing research with open questions and partial results. We study the asymptotic consistency of modern interpolating methods, including deep networks, in both classification and regression settings. That is, we consider learning methods which scale the data size while simultaneously scaling the model size, in a way which always interpolates the train set. We present empirical evidence that, perhaps contrary to intuitions in theory, many natural interpolating learning methods are *inconsistent* for a wide variety of distributions. That is, they do not approach Bayes-optimality even in the limit of infinite data. The message is, in settings with nonzero Bayes risk, overfitting is not benign: interpolating the noise significantly harms the classifier, to the point of preventing consistency. This work is motivated by: (1) understanding differences between the overparameterized and underparameterized regime, (2) guiding theory towards more "realistic" assumptions to capture deep learning practice, and (3) understanding common structure shared by "natural" interpolating methods. Based on joint work with: Neil Mallinar, Amirhesam Abedsoltan, Gil Kur, and Misha Belkin. And is a follow-up work to the paper "Distributional Generalization", joint with Yamini Bansal.

Supported by

Video URL

Is Overfitting Actually Benign? On the Consistency of Interpolating Methods

Abstract

Intro to Meta-Complexity: Part 2

Intro to Meta-Complexity: Part 1

Aggregative Efficiency of Bayesian Learning in Networks

(Relaxing) Common Belief for Social Networks

Organizing Modular Production

Video URL

Is Overfitting Actually Benign? On the Consistency of Interpolating Methods

APA

MLA

BibTex

Abstract

Intro to Meta-Complexity: Part 2

Intro to Meta-Complexity: Part 1

Aggregative Efficiency of Bayesian Learning in Networks

(Relaxing) Common Belief for Social Networks

Organizing Modular Production