首页 | 本学科首页   官方微博 | 高级检索  
     检索      

基于分词的语句相似度计算的改进
引用本文:邸书灵,刘晓飞,李欢.基于分词的语句相似度计算的改进[J].石家庄铁道学院学报,2011(4):94-97.
作者姓名:邸书灵  刘晓飞  李欢
作者单位:石家庄铁道大学信息科学与技术学院;河北联合大学现代教育技术中心;
摘    要:语句相似度体现的是两个句子之间的相似程度。语句相似度计算是FAQ和信息检索等方面核心技术之一。针对基于分词的相似度计算过于依赖实际的分词效果,在原相似度计算模型中增加了两个句子不分词时的词形相似度计算,以缓解因为句子分词不准确而导致相似度计算结果偏低的情况。结合“数据结构”课程问答系统的实验,结果表明,改进的方法比原方法有较高的准确率。

关 键 词:语句相似度  分词  词形  词序  词长

Improvement on Sentence Similarity Computing Based on Word Segmentation
Di Shuling,Liu Xiaofei,Li Huan.Improvement on Sentence Similarity Computing Based on Word Segmentation[J].Journal of Shijiazhuang Railway Institute,2011(4):94-97.
Authors:Di Shuling  Liu Xiaofei  Li Huan
Institution:Di Shuling1,Liu Xiaofei1,Li Huan2(1.School of Information Science and Technology,Shijiazhuang Tiedao University,Shijiazhuang 050043,China,2.Modern Education and Technology Center,Hebei United University,Tangshan 063009,China)
Abstract:Sentence similarity reflects the similarity degree between two sentences.Similarity computing is one of the core technologies of the FAQ and information retrieval.This paper,in view of the of over-reliance on real word effect in sentence similarity calculation based on word segmentation,the word form similarity calculation without word segmentation is added in the original calculation model,in order to alleviate the low level of similarity calculation result due to the inaccurate sentence segmentation.Exper...
Keywords:sentence similarity  word segmentation  word form  word order  word length  
本文献已被 CNKI 维普 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号