An unsupervised method for word sense tagging using parallel corpora 论文

2001引用 231

Natural Language Processing TechniquesTopic ModelingSpeech and dialogue systems

Natural Language Processing Techniques Topic Modeling Speech and dialogue systems

作者

摘要

We present an unsupervised method for word sense disambiguation that exploits translation correspondences in parallel corpora. The technique takes advantage of the fact that cross-language lexicalizations of the same concept tend to be consistent, preserving some core element of its semantics, and yet also variable, reflecting differing translator preferences and the influence of context. Working with parallel corpora introduces an extra complication for evaluation, since it is difficult to find a corpus that is both sense tagged and parallel with another language; therefore we use pseudo-translations, created by machine translation systems, in order to make possible the evaluation of the approach against a standard test set. The results demonstrate that word-level translation correspondences are a valuable source of information for sense disambiguation.

作者查看全部 (2)

Philip Resnik

Mona Diab

An unsupervised method for word sense tagging using parallel corpora 论文

摘要

作者查看全部 (2)

相关技术查看全部 (2)

相关事件

相关文章