What is: Deterministic Policy Gradient?
Year | 2014 |
Data Source | CC BY-SA - https://paperswithcode.com |
Deterministic Policy Gradient, or DPG, is a policy gradient method for reinforcement learning. Instead of modeling the policy as a probability distribution over actions, DPG computes gradients for a deterministic policy, that is, one that maps each state directly to a single action.
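To make the idea concrete, here is a minimal sketch of a deterministic policy gradient update on a toy problem. Everything in it is an assumption made for illustration and does not come from the entry above: the linear policy `mu(theta, s) = theta * s`, the hand-written critic Q(s, a) = -(a - 2s)^2, the uniform state distribution, and the helper names `mu`, `dq_da`, and `dpg_step`. In practice the critic is learned rather than known. The update applies the chain rule, multiplying the gradient of the policy with respect to its parameter by the gradient of the critic with respect to the action, averaged over sampled states.

```python
import numpy as np


def mu(theta, s):
    """Hypothetical deterministic policy: maps a state directly to an action."""
    return theta * s


def dq_da(s, a):
    """Action-gradient of the assumed toy critic Q(s, a) = -(a - 2 s)^2."""
    return -2.0 * (a - 2.0 * s)


def dpg_step(theta, states, lr=0.1):
    """One gradient-ascent step on the policy parameter."""
    actions = mu(theta, states)
    # Chain rule: d mu / d theta = s for this linear policy, so the
    # gradient estimate is the batch mean of s * dQ/da evaluated at a = mu(s).
    grad = np.mean(states * dq_da(states, actions))
    return theta + lr * grad


rng = np.random.default_rng(0)
theta = 0.0
for _ in range(200):
    # States sampled uniformly, standing in for the state distribution.
    states = rng.uniform(-1.0, 1.0, size=64)
    theta = dpg_step(theta, states)

print(theta)
```

Because the toy critic is maximized at a = 2s, the parameter should approach 2. Note that no sampling over actions is needed: the gradient flows through the action itself, which is the main contrast with stochastic policy gradient methods.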