NLTK Paper
2004, cited by 380
Natural Language Processing Techniques; Topic Modeling; Syntax, Semantics, Linguistic Variation
Abstract
The Natural Language Toolkit is a suite of program modules, data sets, tutorials and exercises, covering symbolic and statistical natural language processing. NLTK is written in Python and distributed under the GPL open source license. Over the past three years, NLTK has become popular in teaching and research. We describe the toolkit and report on its current state of development.