Robert Tibshirani

(16 articles)
Sampling
1936

Sampling

Fundamental technique used to reduce computational cost and simplify data management

Generality: 870

Statistical Classification
1956

Statistical Classification

The problem of identifying which category or class an object belongs to based on its features or characteristics.

Generality: 500

Supervised Classifier
1959

Supervised Classifier

Algorithm that, given a set of labeled training data, learns to predict the labels of new, unseen data.

Generality: 870

Regularization
1970

Regularization

Technique used in machine learning to reduce model overfitting by adding a penalty to the loss function based on the complexity of the model.

Generality: 845

Curse of Dimensionality
1970

Curse of Dimensionality

Phenomenon where the complexity and computational cost of analyzing data increase exponentially with the number of dimensions or features.

Generality: 827

Sparsity
1986

Sparsity

Technique and principle of having models that utilize minimal data representation and processing, typically through zero or near-zero values.

Generality: 855

Feature Importance
1986

Feature Importance

Techniques used to identify and rank the significance of input variables (features) in contributing to the predictive power of a ML model.

Generality: 800

Feature Extraction
1986

Feature Extraction

Process of transforming raw data into a set of features that are more meaningful and informative for a specific task, such as classification or prediction.

Generality: 880

Boosting
1989

Boosting

ML ensemble technique that combines multiple weak learners to form a strong learner, aiming to improve the accuracy of predictions.

Generality: 800

Similarity Computation
1990

Similarity Computation

A mathematical process to quantify the likeness between data objects, often used in AI to enhance pattern recognition and data clustering.

Generality: 675

Ensamble Algorithm
1992

Ensamble Algorithm

Combines multiple machine learning models to improve overall performance by reducing bias, variance, or noise.

Generality: 860

Bias-Variance Dilemma
1992

Bias-Variance Dilemma

Fundamental problem in supervised ML that involves a trade-off between a model’s ability to minimize error due to bias and error due to variance.

Generality: 893

Ensemble Methods
1996

Ensemble Methods

ML technique where multiple models are trained and used collectively to solve a problem.

Generality: 860

Ensemble Learning
1996

Ensemble Learning

ML paradigm where multiple models (often called weak learners) are trained to solve the same problem and combined to improve the accuracy of predictions.

Generality: 795

Sparsability
2015

Sparsability

Ability of algorithms to effectively handle and process data matrices where most elements are zero (sparse), improving computational efficiency and memory usage.

Generality: 675

Model-Based Classifier
2015

Model-Based Classifier

ML algorithm that uses a pre-defined statistical model to make predictions based on input data.

Generality: 835