In text categorization, the feature space is high-dimensional and sparse, so dimension reduction is a key step, particularly for large-scale text categorization.
Moreover, the S-TFIDF algorithm is as efficient as the TFIDF algorithm, which makes it well suited to large-scale text categorization tasks.