On Evaluating and Comparing Conversational Agents
The subjectivity associated with evaluating conversations is a key element underlying the challenge of building non-goal oriented dialogue systems. This paper proposes a comprehensive evaluation strategy with multiple metrics designed to reduce subjectivity by selecting metrics which correlate well with […]