What is: Random Ensemble Mixture?

Source: An Optimistic Perspective on Offline Reinforcement Learning
Year: 2020
Data Source: CC BY-SA - https://paperswithcode.com

Random Ensemble Mixture (REM) is an easy-to-implement extension of DQN inspired by Dropout. The key intuition behind REM is that, given multiple estimates of the Q-values, any convex combination of them (a weighted average whose weights are non-negative and sum to one) is also an estimate of the Q-values. Accordingly, at each training step REM draws a random convex combination of its Q-value estimates and trains on that combination, which makes training more robust.
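
The training step described above can be sketched in a few lines of NumPy. This is a minimal illustration, not the authors' implementation: it assumes a network with K Q-value heads whose outputs are already available as arrays, draws the mixing weights by normalizing uniform random numbers, and uses a Huber loss on the TD error. The function names (`sample_simplex_weights`, `huber`, `rem_loss`) and the array layout are hypothetical choices made for this example.

```python
import numpy as np


def sample_simplex_weights(num_heads, rng):
    """Draw random mixing weights that are non-negative and sum to 1."""
    raw = rng.uniform(0.0, 1.0, size=num_heads)
    return raw / raw.sum()


def huber(x, delta=1.0):
    """Element-wise Huber loss: quadratic near zero, linear in the tails."""
    abs_x = np.abs(x)
    return np.where(abs_x <= delta, 0.5 * x**2, delta * (abs_x - 0.5 * delta))


def rem_loss(q_heads, target_q_heads, actions, rewards, dones, gamma, rng):
    """One REM-style loss evaluation (assumed layout, for illustration only).

    q_heads:        (batch, K, A) online-network Q-values at the current states
    target_q_heads: (batch, K, A) target-network Q-values at the next states
    actions:        (batch,) integer actions taken
    rewards, dones: (batch,) rewards and terminal flags (0.0 or 1.0)
    """
    num_heads = q_heads.shape[1]
    alpha = sample_simplex_weights(num_heads, rng)  # fresh weights every step

    # Mix the K heads with the same random weights for online and target nets.
    q_mix = np.einsum("k,bka->ba", alpha, q_heads)
    target_mix = np.einsum("k,bka->ba", alpha, target_q_heads)

    # Standard DQN-style TD target, computed on the mixed estimate.
    q_taken = q_mix[np.arange(len(actions)), actions]
    bootstrap = target_mix.max(axis=1)
    td_target = rewards + gamma * (1.0 - dones) * bootstrap

    return huber(q_taken - td_target).mean(), alpha


# Usage with random placeholder data: batch of 8, K = 4 heads, 3 actions.
rng = np.random.default_rng(0)
q = rng.normal(size=(8, 4, 3))
q_next = rng.normal(size=(8, 4, 3))
loss, alpha = rem_loss(
    q, q_next,
    actions=rng.integers(0, 3, size=8),
    rewards=rng.normal(size=8),
    dones=np.zeros(8),
    gamma=0.99,
    rng=rng,
)
```

If all K heads agree, the mixture equals any single head and the loss reduces to the ordinary DQN loss; the random weights only matter where the heads disagree.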