Manual and automatic evaluation of summaries (paper)

Year: 2002. Citations: 265

Natural Language Processing Techniques · Semantic Web and Ontologies · Speech and dialogue systems

Abstract

In this paper we discuss manual and automatic evaluations of summaries using data from the Document Understanding Conference 2001 (DUC-2001). We first show the instability of the manual evaluation. Specifically, the low inter-human agreement indicates that more reference summaries are needed. To investigate the feasibility of automated summary evaluation based on the recent BLEU method from machine translation, we use accumulative n-gram overlap scores between system and human summaries. The initial results provide encouraging correlations with human judgments, based on the Spearman rank-order correlation coefficient. However, relative ranking of systems needs to take into account the instability.
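The two ingredients named in the abstract, an accumulated n-gram overlap score and the Spearman rank-order correlation, can be illustrated with a minimal sketch. This is not the paper's exact formula: the whitespace tokenisation, the clipping of matches at the reference count, the recall-style normalisation, and the function names `accumulated_overlap`, `ranks`, and `spearman` are all assumptions made for illustration. The scores in the usage section are made-up toy values, not DUC-2001 data.

```python
from collections import Counter


def ngrams(tokens, n):
    """Return a Counter of the n-grams (as tuples) in a token list."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def accumulated_overlap(system, reference, max_n=4):
    """Clipped n-gram matches pooled over n = 1..max_n, divided by the
    number of reference n-grams (an assumed, recall-style normalisation)."""
    sys_tokens = system.lower().split()
    ref_tokens = reference.lower().split()
    matched = total = 0
    for n in range(1, max_n + 1):
        sys_counts = ngrams(sys_tokens, n)
        ref_counts = ngrams(ref_tokens, n)
        # A reference n-gram is matched at most as often as the system has it.
        matched += sum(min(c, sys_counts[g]) for g, c in ref_counts.items())
        total += sum(ref_counts.values())
    return matched / total if total else 0.0


def ranks(values):
    """1-based ranks in ascending order; tied values share their mean rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            result[order[k]] = mean_rank
        i = j + 1
    return result


def spearman(xs, ys):
    """Spearman rho, computed as the Pearson correlation of the rank vectors."""
    rx, ry = ranks(xs), ranks(ys)
    n = len(xs)
    mx, my = sum(rx) / n, sum(ry) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(rx, ry))
    var_x = sum((a - mx) ** 2 for a in rx)
    var_y = sum((b - my) ** 2 for b in ry)
    denom = (var_x * var_y) ** 0.5
    # Rho is undefined when one ranking is constant.
    return cov / denom if denom else float("nan")


if __name__ == "__main__":
    # Toy data: three hypothetical system summaries and one reference.
    reference = "the council approved the new budget on monday"
    systems = [
        "the council approved the budget",
        "a budget was discussed",
        "the council approved the new budget on monday evening",
    ]
    auto_scores = [accumulated_overlap(s, reference) for s in systems]
    human_scores = [3.0, 1.0, 4.0]  # invented human judgments
    print(auto_scores)
    print(spearman(auto_scores, human_scores))
```

With several reference summaries per document, as the abstract recommends, one option would be to average `accumulated_overlap` over the references before ranking systems; that choice is likewise an assumption rather than something the abstract specifies.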

Authors (2)

Eduard Hovy

Chin-Yew Lin
