What is: ENIGMA?
Source | Towards Automatic Evaluation of Dialog Systems: A Model-Free Off-Policy Evaluation Approach |
Year | 2021 |
Data Source | CC BY-SA - https://paperswithcode.com |
ENIGMA is a framework for automatically evaluating dialog systems using model-free off-policy evaluation. Its quality is measured by the Pearson correlation and Spearman's rank correlation between the estimated rewards and the true rewards. ENIGMA requires only a small amount of pre-collected experience data, so no human needs to interact with the target policy during evaluation, which makes automatic evaluation feasible. More importantly, ENIGMA is model-free and agnostic to the behavior policies used to collect the experience data (see Section 2 of the source paper), which significantly alleviates the technical difficulty of modeling complex dialogue environments and human behaviors.
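The two correlation measures named above can be sketched in a few lines of plain Python. This is only an illustration of the metrics, not ENIGMA's own code: the function names and the sample scores are hypothetical, and the estimated rewards are assumed to come from some evaluator that has already been run.

```python
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation between two equal-length score lists."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two equal-length lists with at least 2 items")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for constant input")
    return cov / sqrt(var_x * var_y)


def average_ranks(values):
    """1-based ranks; tied values share the average of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j to cover the whole run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks


def spearman(xs, ys):
    """Spearman's rank correlation: Pearson correlation of the ranks."""
    return pearson(average_ranks(xs), average_ranks(ys))


# Hypothetical scores for four dialog policies (made up for illustration).
estimated_rewards = [0.2, 0.5, 0.4, 0.9]
true_rewards = [0.1, 0.6, 0.3, 0.8]

print("Pearson: ", pearson(estimated_rewards, true_rewards))
print("Spearman:", spearman(estimated_rewards, true_rewards))
```

Pearson correlation rewards estimates that track the true rewards linearly, while Spearman's rank correlation only checks whether the policies are ordered the same way, so reporting both separates calibration from ranking quality.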