Technique that combines reinforcement learning (RL) with human feedback to guide the learning process towards desired outcomes.
Generality: 625