A model for Chinese text clustering is studied in light of the compositional characteristics of Chinese text.
针对中文文本组成上的特点,研究了中文文本聚类的模型。
However, research on Chinese text clustering in China is still at an early stage, and many problems remain to be solved.
但是国内中文文本聚类的研究还处于初期阶段,还存在许多问题亟待解决。
Using HowNet's relatively complete knowledge system to construct a concept dictionary and a concept hierarchy, we implemented a concept-based Chinese text clustering algorithm that uses HowNet as background knowledge.
利用知网较完备的知识体系来构造概念词典和概念层次结构,实现了一种以知网为背景知识的基于概念的中文文本聚类算法。
However, the inherent characteristics of Chinese online short texts, such as low keyword frequency and a large number of variant word forms, make it difficult to directly apply existing clustering algorithms designed for long texts.
然而,中文网络短文本固有的关键词词频低、存在大量变形词等特点,使得难以直接使用现有面向长文本的聚类算法。