首页 | 官方网站   微博 | 高级检索  
     

基于无监督学习的部分-整体关系获取
引用本文:贾真,何大可,尹红风,李天瑞.基于无监督学习的部分-整体关系获取[J].西南交通大学学报,2014,27(4):590-596.
作者姓名:贾真  何大可  尹红风  李天瑞
基金项目:国家自然科学基金资助项目(61170111,61202043,61262058)中央高校基本科研业务费专项基金资助项目(SWJTU11ZT08)
摘    要:针对面向中文自由文本的部分-整体关系抽取问题,提出一种基于无监督学习的方法. 首先提出子模式提取算法,从领域文本集中获取概念对和概念对所在上下文模式,利用概念对和概念对上下文模式建立分布式语义模型;然后采用协同聚类算法将具有相同语义关系的概念对聚合成簇,通过训练L1正则化逻辑回归模型提取簇的特征并得到代表每个簇语义关系的概念对上下文模式;最后根据模式识别表达部分-整体关系的簇,从而获取部分-整体关系概念对. 实验结果表明,该方法取得较好的性能,F度量达到68.97%,优于传统聚类方法(55.77%)和模式匹配方法(61.95%). 

关 键 词:本体    无监督学习    部分-整体关系    分布式语义模型    协同聚类
收稿时间:2013-08-20

Acquisition of Part-Whole Relations Based on Unsupervised Learning
JIA Zhen,HE Dake,YIN Hongfeng,LI Tianrui.Acquisition of Part-Whole Relations Based on Unsupervised Learning[J].Journal of Southwest Jiaotong University,2014,27(4):590-596.
Authors:JIA Zhen  HE Dake  YIN Hongfeng  LI Tianrui
Abstract:An unsupervised learning method was proposed to solve the problem of part-whole relation extraction from Chinese free texts. A subsequence extraction algorithm was firstly introduced that can acquire concept pairs and their context patterns from domain texts, and a distributional semantic model was constructed according to concept pairs and context patterns of concept pairs. Then a co-clustering algorithm was applied to group the concept pairs with the same semantic relations together. L1 regularized logistic regression model was trained to select clustering feature and obtain the context pattern which represents semantic relation of each cluster. At last, according to the patterns, the clusters expressing part-whole relation were identified and part-whole relation concept pairs were acquired. The experimental results indicate the proposed method is effective and its F measure is up to 68.97% which is superior to the traditional clustering (55.77%) and pattern matching methods(61.95%). 
Keywords:
点击此处可从《西南交通大学学报》浏览原始摘要信息
点击此处可从《西南交通大学学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号