Marcin Andrychowicz

(1 article)

Policy Gradient

A class of reinforcement learning (RL) algorithms that optimize the parameters of a policy directly, by gradient ascent on the expected future reward.
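As an illustration of the definition, here is a minimal sketch of one policy gradient method (REINFORCE with a running-average baseline) on a toy multi-armed bandit. The setup is an assumption made for this example and is not part of the entry: the function names `softmax` and `reinforce_bandit`, the softmax policy with one logit per arm, the Gaussian rewards, and all hyperparameter values are hypothetical choices.

```python
import math
import random


def softmax(logits):
    """Convert logits to action probabilities (numerically stable)."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def reinforce_bandit(arm_means, steps=5000, lr=0.05, noise=0.5, seed=0):
    """Train a softmax policy on a bandit with REINFORCE.

    arm_means: assumed mean reward of each arm (hypothetical environment).
    Returns the final action probabilities of the learned policy.
    """
    rng = random.Random(seed)
    theta = [0.0] * len(arm_means)  # policy parameters: one logit per arm
    baseline = 0.0                  # running average reward, reduces variance

    for t in range(1, steps + 1):
        probs = softmax(theta)
        # Sample an action from the current policy.
        action = rng.choices(range(len(theta)), weights=probs)[0]
        reward = rng.gauss(arm_means[action], noise)
        advantage = reward - baseline

        # For a softmax policy, d/d theta_k of log pi(action) is
        # 1[k == action] - probs[k]. Gradient ascent step on expected reward:
        for k in range(len(theta)):
            grad_log_prob = (1.0 if k == action else 0.0) - probs[k]
            theta[k] += lr * advantage * grad_log_prob

        # Update the running average of observed rewards.
        baseline += (reward - baseline) / t

    return softmax(theta)


if __name__ == "__main__":
    # Arm 1 pays more on average, so the policy should come to prefer it.
    print(reinforce_bandit([0.0, 1.0]))
```

The update adjusts the policy parameters directly, with no learned value function: actions that earn more than the baseline have their log-probability increased, and actions that earn less have it decreased.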

Generality: 675