KNN with TF-IDF based Framework for Text Categorization 论文
2014Procedia Engineering引用 356
Text and Document Classification TechnologiesSpam and Phishing DetectionAdvanced Text Analysis Techniques
摘要
KNN is a very popular algorithm for text classification. This paper presents the possibility of using KNN algorithm with TF-IDF method and framework for text classification. Framework enables classification according to various parameters, measurement and analysis of results. Evaluation of framework was focused on the speed and quality of classification. The results of testing showed the good and bad features of algorithm, providing guidance for the further development of similar frameworks.