The core of Chinese Search Engine is the key content extracting, and the bottleneck is Chinese Word Automatic Segmentation.
中文搜索引擎的重点在于中文关键信息提取,其中的难点就是中文自动分词。
Especially, it is very difficult to deal with special noun in Chinese automatic word segmentation.
特别是对专有名词的处理是中文自动分词中的又一个难点。
To extend word segmentation repository and enhance word segmentation capacity, a Chinese word segmentation system based on automatic learning is proposed in this paper.
为扩展分词知识库,提高自动分词能力,本文提出了一种基于自学习机制的汉语自动分词系统。
应用推荐