Better Evaluation for Grammatical Error Correction 论文

2012引用 302
Natural Language Processing TechniquesTopic ModelingAlgorithms and Data Compression

摘要

We present a novel method for evaluating grammatical error correction. The core of our method, which we call MaxMatch (M2), is an algorithm for efficiently computing the sequence of phrase-level edits between a source sentence and a system hypothesis that achieves the highest overlap with the goldstandard annotation. This optimal edit sequence is subsequently scored using F1 measure. We test our M 2 scorer on the Helping Our Own (HOO) shared task data and show that our method results in more accurate evaluation for grammatical error correction. 1