文本分類
分類 和 聚類 不同
Clustering determines how to put documents into groups.
Classification assigns documents to groups that have been defined.
文本分類基本概念
類別及其數量要先訂好。
可以用不同的機器學習演算法實驗。Popular supervised text categorization algorithms: Naive Bayes, k Nearest Neighbor (kNN) and Logistic Regression, Support Vector Machines and Decision Trees
R 套件 RTextTools
練習
Last updated