Viet-Anh on Software Logo

What is: True Online TD Lambda?

Year2000
Data SourceCC BY-SA - https://paperswithcode.com

True Online TD(λ)TD\left(\lambda\right) seeks to approximate the ideal online λ\lambda-return algorithm. It seeks to invert this ideal forward-view algorithm to produce an efficient backward-view algorithm using eligibility traces. It uses dutch traces rather than accumulating traces.

Source: Sutton and Seijen