Michael Jordan
(6 articles)Unsupervised Learning
Type of ML where algorithms learn patterns from untagged data, without any guidance on what outcomes to predict.
Generality: 905
ML
Machine Learning
Machine Learning
Development of algorithms and statistical models that enable computers to perform tasks without being explicitly programmed for each one.
Generality: 965
Statistical AI
Utilizes statistical methods to analyze data and make probabilistic inferences, aimed at emulating aspects of human intelligence through quantitative models.
Generality: 890
Universal Learning Algorithms
Theoretical frameworks aimed at creating systems capable of learning any task to human-level competency, leveraging principles that could allow for generalization across diverse domains.
Generality: 840
TRPO
Trust Region Policy Optimization
Trust Region Policy Optimization
Advanced algorithm used in RL to ensure stable and reliable policy updates by optimizing within a trust region, thus preventing drastic policy changes.
Generality: 635
PPO
Proximal Policy Optimization
Proximal Policy Optimization
RL algorithm that aims to balance ease of implementation, sample efficiency, and reliable performance by using a simpler but effective update method for policy optimization.
Generality: 670