Overparameterization Regime

A regime in ML training where the model has more parameters than training samples; classical theory predicts a high-variance, overfitted model, yet such models often generalize well in practice.

The Overparameterization Regime refers to the situation in training machine learning (ML) models where the number of parameters exceeds the number of training samples. This may seem counterintuitive: classical learning theory suggests such a model will inevitably overfit, memorizing the training data instead of learning to generalize. Recent theoretical and empirical studies, however, suggest otherwise. In the Overparameterization Regime, models with excess capacity can generalize well even when they interpolate the training data, noise included. This concept has widespread implications for deep learning, an area that routinely works with overparameterized models.
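As a concrete illustration, below is a minimal sketch of this behaviour, assuming a simple random-Fourier-features regression in NumPy; the toy dataset, feature scale, and variable names are illustrative choices, not drawn from any particular study. A model with 500 parameters is fit to only 10 training samples via the minimum-norm least-squares solution: the training error is essentially zero because the model interpolates the data, yet the test error on the underlying function typically stays modest.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy 1-D regression task with far fewer samples than model parameters.
n_train, n_test, n_features = 10, 200, 500
x_train = np.sort(rng.uniform(-1, 1, size=n_train))
x_test = np.linspace(-1, 1, n_test)
y_train = np.sin(np.pi * x_train) + 0.1 * rng.standard_normal(n_train)
y_test = np.sin(np.pi * x_test)

# Random Fourier features: phi_j(x) = cos(w_j * x + b_j).
W = 4.0 * rng.standard_normal(n_features)
b = rng.uniform(0, 2 * np.pi, size=n_features)

def features(x):
    return np.cos(np.outer(x, W) + b)  # shape (len(x), n_features)

Phi_train = features(x_train)
Phi_test = features(x_test)

# Minimum-norm least-squares solution: with 500 parameters and 10 samples
# the system is underdetermined, and pinv picks the interpolating solution
# of smallest norm (the one gradient descent reaches from zero init).
theta = np.linalg.pinv(Phi_train) @ y_train

train_mse = np.mean((Phi_train @ theta - y_train) ** 2)
test_mse = np.mean((Phi_test @ theta - y_test) ** 2)
print(f"train MSE: {train_mse:.2e}  (near zero: the model interpolates)")
print(f"test MSE:  {test_mse:.2e}  (can stay small despite 500 params for 10 samples)")
```

The exact numbers vary with the random seed; the point of the sketch is the qualitative pattern, near-zero training error together with reasonable test error.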

Overparameterization in machine learning models has been observed since the early days of artificial neural networks. However, interest in understanding its implications, and hence in the term 'Overparameterization Regime', surged with the advent and popularity of deep learning after 2010. While its exact origin is hard to pinpoint, the term's usage in the literature gained momentum during the mid-2010s.

The term and the underlying principles are not attributed to a single individual or group. Instead, they are the collective output of the AI research community working to understand the counter-intuitive generalization abilities of overparameterized models, with teams at universities, research institutions, and technology companies contributing to developing and refining the concept.

Explainer

Overparameterization in ML

[Interactive demo: a model with 2 parameters fit to 5 training data points. Underfitting: the model's prediction curve is too simple to capture the pattern in the training data.]