Peter Dayan
(2 articles)1986
Prediction Error
The discrepancy between predicted outcomes by an AI model and the actual observed results in a dataset.
Generality: 675
1989
Q-Learning
Model-free reinforcement learning algorithm that seeks to learn the value of actions in a given state, enabling an agent to maximize cumulative reward over time.
Generality: 870