Richard Bellman
(14 articles)Objective Function
Objective function used in ML which quantitatively defines the goal of an optimization problem by measuring the performance of a model or solution.
Generality: 858
Optimization Problem
Optimization problem in AI which involves finding the best solution from all feasible solutions, given a set of constraints and an objective to achieve or optimize.
Generality: 895
DP
Dynamic Programming
Dynamic Programming
Method used in computer science and mathematics to solve complex problems by breaking them down into simpler subproblems and solving each of these subproblems just once, storing their solutions.
Generality: 830
Bellman Equation
Recursive formula used to find the optimal policy in decision-making processes, particularly in the context of dynamic programming and RL.
Generality: 815
SSM
State-Space Model
State-Space Model
Mathematical frameworks used to model dynamic systems by describing their states in space and how these states evolve over time under the influence of inputs, disturbances, and noise.
Generality: 625
State Space Model
Mathematical frameworks used to represent systems that are governed by a set of latent (hidden) variables evolving over time, observed through another set of variables.
Generality: 834
Function Approximation
Method used in AI to estimate complex functions using simpler, computationally efficient models.
Generality: 810
Curse of Dimensionality
Phenomenon where the complexity and computational cost of analyzing data increase exponentially with the number of dimensions or features.
Generality: 827
Minimax
A decision-making strategy in game theory and AI that aims to minimize the possible losses in worst-case scenarios by maximizing the worst-case outcome.
Generality: 775
Minimax Loss
A strategy used in optimization and decision-making problems to minimize the maximum possible loss.
Generality: 500
Matrix Models
Mathematical frameworks that use matrices with parameters to represent and solve complex problems, often in ML, statistics, and systems theory.
Generality: 728
Discount Factor
Multiplicative factor used to reduce future values or rewards to their present value in decision-making processes, particularly in reinforcement learning.
Generality: 790
Cross Entropy Loss
Loss function used to measure the difference between two probability distributions for a given random variable or set of events.
Generality: 851
Robustness
Ability of an algorithm or model to deliver consistent and accurate results under varying operating conditions and input perturbations.
Generality: 885