What is: Eligibility Trace?
Year | 2000 |
Data Source | CC BY-SA - https://paperswithcode.com |
An Eligibility Trace is a memory vector $\mathbf{z}_t \in \mathbb{R}^d$ that parallels the long-term weight vector $\mathbf{w}_t \in \mathbb{R}^d$. The idea is that when a component of $\mathbf{w}_t$ participates in producing an estimated value, the corresponding component of $\mathbf{z}_t$ is bumped up and then begins to fade away. Learning will then occur in that component of $\mathbf{w}_t$ if a nonzero TD error occurs before the trace falls back to zero. The trace-decay parameter $\lambda \in [0, 1]$ determines the rate at which the trace falls.
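The mechanism described above can be sketched as one update step of semi-gradient TD(λ) with an accumulating trace. This is a minimal illustration rather than a reference implementation: it assumes a linear value estimate (the dot product of the weights and a feature vector), and the function name `td_lambda_step`, the step size `alpha`, the discount `gamma`, and the toy three-state episode are all hypothetical choices made for the example.

```python
import numpy as np

def td_lambda_step(w, z, x, x_next, reward, alpha=0.1, gamma=0.9, lam=0.8):
    """One semi-gradient TD(lambda) update, assuming a linear value estimate w @ x."""
    # Fade the old trace by gamma * lam, then bump up the components
    # whose features took part in the current estimate.
    z = gamma * lam * z + x
    # TD error: reward plus discounted next estimate, minus current estimate.
    delta = reward + gamma * (w @ x_next) - (w @ x)
    # Learning only reaches weight components whose trace is still nonzero.
    w = w + alpha * delta * z
    return w, z

# Hypothetical toy episode: one-hot states s0 -> s1 -> s2 -> terminal,
# with a reward of 1 only on the final transition.
features = np.eye(3)
terminal = np.zeros(3)  # the terminal state is assumed to have value 0
w = np.zeros(3)
z = np.zeros(3)
for s, s_next, r in [(0, 1, 0.0), (1, 2, 0.0), (2, None, 1.0)]:
    x_next = terminal if s_next is None else features[s_next]
    w, z = td_lambda_step(w, z, features[s], x_next, r)

print(w)  # weights of earlier states are updated too, by smaller amounts
```

In this sketch the first nonzero TD error arrives only on the last step, yet all three weights move, because the traces of the earlier states have faded but not yet reached zero.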
Intuitively, eligibility traces tackle the credit assignment problem by capturing both a frequency heuristic (states that are visited more often deserve more credit) and a recency heuristic (states that are visited more recently deserve more credit).
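The two heuristics can be seen in a small tabular sketch of an accumulating trace. The helper `trace_after_visits`, the visit sequence, and the single `decay` factor (standing in for the product of the discount and the trace-decay parameter) are assumptions made for illustration only.

```python
import numpy as np

def trace_after_visits(visits, n_states, decay):
    """Accumulating trace after a sequence of state visits (tabular case)."""
    z = np.zeros(n_states)
    for s in visits:
        z *= decay   # every component fades a little on each step
        z[s] += 1.0  # the state just visited is bumped up
    return z

# State 0 is visited three times, then states 1 and 2 once each;
# state 3 is never visited.
z = trace_after_visits([0, 0, 0, 1, 2], n_states=4, decay=0.9)
print(z)
```

Here state 0 ends with the largest trace even though it was visited least recently of the three (frequency), state 2 ends above state 1 because it was visited later (recency), and the unvisited state 3 stays at zero.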
Source: Sutton and Barto, Reinforcement Learning, 2nd Edition