Computation and Language
Re-Evaluating ADEM: A Deeper Look at Scoring Dialogue Responses
Automatically evaluating the quality of dialogue responses for unstructured domains is a challenging problem. ADEM (Lowe et al. 2017) formulated the automatic evaluation of dialogue systems as a learning problem and …
