What is: True Online TD Lambda?
Year | 2000 |
Data Source | CC BY-SA - https://paperswithcode.com |
True Online seeks to approximate the ideal online -return algorithm. It seeks to invert this ideal forward-view algorithm to produce an efficient backward-view algorithm using eligibility traces. It uses dutch traces rather than accumulating traces.
Source: Sutton and Seijen