Type of ML where an agent learns to make decisions by performing actions in an environment to achieve a goal, guided by rewards.
Generality: 890