首页 | 官方网站   微博 | 高级检索  
     

基于多特征的汉语句子相似度计算模型的研究
引用本文:李春梅,徐庆生.基于多特征的汉语句子相似度计算模型的研究[J].计算机技术与发展,2014(6):136-139.
作者姓名:李春梅  徐庆生
作者单位:云南楚雄师范学院计算机科学系,云南楚雄675000
基金项目:云南省教育专项课题基金(08Y0313)
摘    要:句子相似度的计算在自然语言处理的各个领域中都占有很重要的地位。文中深入分析了现有的一些句子相似度计算的方法,这些方法各自从词特征、词义特征或句法特征等某一侧面描述了句子相似的情况,未能全面地描述一个句子的完整信息。文中提出了一种新的基于多特征的汉语句子相似度的计算模型。该方法在基于词的基础上,从句子中词的表层到词的逻辑联系,从句子的局部结构到整体结构,用句子的区分度、相同词的相似度、长度相似度、词性相似度及词序相似度五个方面来综合考虑两个句子相似度的计算。实验结果表明,该方法合理、简便、可行。

关 键 词:自然语言处理  区分度  词性  词序  句子相似度

Research on Chinese Sentence Similarity Calculation Model Based on Multi-features
LI Chun-mei,XU Qing-sheng.Research on Chinese Sentence Similarity Calculation Model Based on Multi-features[J].Computer Technology and Development,2014(6):136-139.
Authors:LI Chun-mei  XU Qing-sheng
Affiliation:( Department of Computer Science, Chuxiong Normal University, Chuxiong 675000, China)
Abstract:Sentence similarity calculation plays an important role in various areas of natural language processing.Analyze the existing some sentence similarity calculation method.These methods describe the sentence similarity from the word characteristics,semantic features or syntactic features,all the information of a sentence can't be described fully.A new model of Chinese sentence similarity based on the multi-feature is proposed.This method is based on the word,from the surface to the logical connection of the word,from local structure to the overall structure of a sentence,five aspects of sentence similarity such as degree of differentiation,the same word similarity,length similarity,the part of speech similarity and word order similarity have been studied in depth.Experimental results show that the method is reasonable,simple and feasible.
Keywords:natural language processing  degree of differentiation  discrimination part of speech  word order  sentence similarity
本文献已被 维普 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号