Trevor Hastie

(24 articles)

1805

Regression

Statistical method used in ML to predict a continuous outcome variable based on one or more predictor variables.

Generality: 860

1901

PCA
Principal Component Analysis

A statistical procedure that transforms a dataset into a set of orthogonal components, intended to reduce dimensionality while preserving as much variability as possible.

Generality: 500

1931

Cross Validation

Statistical method used to estimate the skill of ML models on unseen data by partitioning the original dataset into a training set to train the model and a test set to evaluate it.

Generality: 852

1956

Statistical Classification

The problem of identifying which category or class an object belongs to based on its features or characteristics.

Generality: 500

1958

Unsupervised Learning

Type of ML where algorithms learn patterns from untagged data, without any guidance on what outcomes to predict.

Generality: 905

1959

Supervised Classifier

Algorithm that, given a set of labeled training data, learns to predict the labels of new, unseen data.

Generality: 870

1970

Regularization

Technique used in machine learning to reduce model overfitting by adding a penalty to the loss function based on the complexity of the model.

Generality: 845

1970

Bias-Variance Trade-off

In ML, achieving optimal model performance involves balancing bias and variance to minimize overall error.

Generality: 818

1970

Curse of Dimensionality

Phenomenon where the complexity and computational cost of analyzing data increase exponentially with the number of dimensions or features.

Generality: 827

1974

Probabilistic Programming

Programming paradigm designed to handle uncertainty and probabilistic models, allowing for the creation of programs that can make inferences about data by incorporating statistical methods directly into the code.

Generality: 820

1974

Empirical Risk Minimization

A foundational principle in statistics and ML (Machine Learning), focused on minimizing the average of the loss function over a sample dataset.

Generality: 814

1976

Overfitting

When a ML model learns the detail and noise in the training data to the extent that it negatively impacts the performance of the model on new data.

Generality: 890

1986

Feature Importance

Techniques used to identify and rank the significance of input variables (features) in contributing to the predictive power of a ML model.

Generality: 800

1986

Feature Extraction

Process of transforming raw data into a set of features that are more meaningful and informative for a specific task, such as classification or prediction.

Generality: 880

1989

Boosting

ML ensemble technique that combines multiple weak learners to form a strong learner, aiming to improve the accuracy of predictions.

Generality: 800

1990

Similarity Computation

A mathematical process to quantify the likeness between data objects, often used in AI to enhance pattern recognition and data clustering.

Generality: 675

1992

Ensamble Algorithm

Combines multiple machine learning models to improve overall performance by reducing bias, variance, or noise.

Generality: 860

1992

Bias-Variance Dilemma

Fundamental problem in supervised ML that involves a trade-off between a model’s ability to minimize error due to bias and error due to variance.

Generality: 893

1996

Ensemble Methods

ML technique where multiple models are trained and used collectively to solve a problem.

Generality: 860

1996

Ensemble Learning

ML paradigm where multiple models (often called weak learners) are trained to solve the same problem and combined to improve the accuracy of predictions.

Generality: 795