A Monolingual Tree-based Translation Model for Sentence Simplification 论文

2010TUbilio (Technical University of Darmstadt)引用 328
Text Readability and SimplificationNatural Language Processing TechniquesTopic Modeling

摘要

In this paper, we consider sentence sim-plification as a special form of translation with the complex sentence as the source and the simple sentence as the target. We propose a Tree-based Simplification Model (TSM), which, to our knowledge, is the first statistical simplification model covering splitting, dropping, reordering and substitution integrally. We also de-scribe an efficient method to train our model with a large-scale parallel dataset obtained from the Wikipedia and Simple Wikipedia. The evaluation shows that our model achieves better readability scores than a set of baseline systems. 1