首页 | 本学科首页   官方微博 | 高级检索  
     检索      

Apriori and N-gram Based Chinese Text Feature Extraction Method
作者姓名:王晔  黄上腾
作者单位:Dept.ofComputerScienceandEng.,ShanghaiJiaotongUniv.,Shanghai200030,China
摘    要:A feature extraction, which means extracting the representative words from a text, is an important issue in text mining field. This paper presented a new Apriori and N-gram based Chinese text feature extraction method, and analyzed its correctness and performance. Our method solves the question that the exist extraction methods cannot find the frequent words with arbitrary length in Chinese texts. The experimental results show this method is feasible.

关 键 词:演绎算法  汉语分割  特征提取  中文文本

Apriori and N-gram Based Chinese Text Feature Extraction Method
WANG Ye,HUANG Shang-teng.Apriori and N-gram Based Chinese Text Feature Extraction Method[J].Journal of Shanghai Jiaotong university,2004,9(4):11-14,20.
Authors:WANG Ye  HUANG Shang-teng
Institution:Dept. of Computer Science and Eng., Shanghai Jiaotong Univ., Shanghai 200030, China
Abstract:A feature extraction, which means extracting the representative words from a text, is an important issue in text mining field. This paper presented a new Apriori and N-gram based Chinese text feature extraction method, and analyzed its correctness and performance. Our method solves the question that the exist extraction methods cannot find the frequent words with arbitrary length in Chinese texts. The experimental results show this method is feasible.
Keywords:Apriori algorithm  N-gram  Chinese words segmentation  feature extraction
本文献已被 CNKI 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号