
Research on Extracting Multi-Words Units From Corpus
Cite this article: YUN Jiali, HE Jun, HUANG Houkuan. Research on Extracting Multi-Words Units From Corpus[J]. Journal of Northern Jiaotong University, 2009(5): 121-125.
Authors: YUN Jiali (恽佳丽)  HE Jun (何军)  HUANG Houkuan (黄厚宽)
Institution: School of Computer and Information Technology, Beijing Jiaotong University, Beijing 100044, China
Abstract: This paper analyzes prior work on algorithms for extracting multi-word units, covering both the scoring and the selection of candidate units. The scoring algorithms are divided into three categories according to the rationale behind their design; each category is summarized and analyzed, and the analysis is verified experimentally. The paper also analyzes several methods for combining multiple scoring algorithms. With these combinations the individual scoring algorithms complement one another and achieve better extraction results.

Keywords: multi-word unit  corpus  statistics  extraction algorithm  data mining
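
The abstract does not name the specific scoring algorithms or combination methods studied, so the following is only a minimal sketch of the general idea, not the authors' method. It assumes two widely used association measures (pointwise mutual information and the Dice coefficient) as stand-ins for the scoring algorithms, and rank averaging as a stand-in for the combination step. The function names `bigram_scores` and `combine_by_rank` are hypothetical.

```python
import math
from collections import Counter


def bigram_scores(tokens):
    """Score every adjacent word pair with PMI and the Dice coefficient.

    Both measures are illustrative assumptions; the paper's own
    scoring algorithms are not given in the abstract.
    """
    unigrams = Counter(tokens)
    bigrams = Counter(zip(tokens, tokens[1:]))
    n_uni = len(tokens)
    n_bi = max(len(tokens) - 1, 1)
    scores = {}
    for (w1, w2), f12 in bigrams.items():
        p12 = f12 / n_bi
        p1 = unigrams[w1] / n_uni
        p2 = unigrams[w2] / n_uni
        pmi = math.log2(p12 / (p1 * p2))
        dice = 2 * f12 / (unigrams[w1] + unigrams[w2])
        scores[(w1, w2)] = {"pmi": pmi, "dice": dice}
    return scores


def combine_by_rank(scores, measures=("pmi", "dice")):
    """Order candidates by their average rank across the given measures.

    A candidate that ranks well under every measure ends up near the
    front. Ties keep the order in which candidates were first seen.
    """
    total_rank = {cand: 0 for cand in scores}
    for m in measures:
        ranked = sorted(scores, key=lambda c: scores[c][m], reverse=True)
        for rank, cand in enumerate(ranked, start=1):
            total_rank[cand] += rank
    return sorted(scores, key=lambda c: total_rank[c] / len(measures))


if __name__ == "__main__":
    corpus = "new york is big new york is old".split()
    for cand in combine_by_rank(bigram_scores(corpus)):
        print(" ".join(cand))
```

Rank averaging is used here because it needs no normalization across measures with different scales. Other combinations, such as weighted sums of normalized scores or voting, would fit the same structure.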
This article is indexed in VIP (维普) and other databases.