首页 | 官方网站   微博 | 高级检索  
     

基于关系向量模型的句子相似度计算
引用本文:殷耀明,张东站.基于关系向量模型的句子相似度计算[J].计算机工程与应用,2014(2):198-203.
作者姓名:殷耀明  张东站
作者单位:厦门大学信息科学与技术学院,福建厦门361005
摘    要:句子相似度的计算在自然语言处理的各个领域占有很重要的地位,一些传统的计算方法只考虑句子的词形、句长、词序等表面信息,并没有考虑句子更深层次的语义信息,另一些考虑句子语义的方法在实用性上的表现不太理想。在空间向量模型的基础上提出了一种同时考虑句子结构和语义信息的关系向量模型,这种模型考虑了组成句子的关键词之间的搭配关系和关键词的同义信息,这些信息反应了句子的局部结构成分以及各局部之间的关联关系,因此更能体现句子的结构和语义信息。以关系向量模型为核心,提出了基于关系向量模型的句子相似度计算方法。同时将该算法应用到网络热点新闻自动摘要生成算法中,排除文摘中意思相近的句子从而避免文摘的冗余。实验结果表明,在考虑网络新闻中的句子相似度时,与考虑词序与语义的算法相比,关系向量模型算法不但提高了句子相似度计算的准确率,计算的时间复杂度也得到了降低。

关 键 词:句子相似度  关系向量模型  句子语法  句子语义

Sentence similarity computing based on relation vector model
YIN Yaoming,ZHANG Dongzhan.Sentence similarity computing based on relation vector model[J].Computer Engineering and Applications,2014(2):198-203.
Authors:YIN Yaoming  ZHANG Dongzhan
Affiliation:School of Information Science and Engineering, Xiamen University, Xiamen, Fujian 361005, China
Abstract:Sentence similarity computation is very important in all fields of natural language process. Some of the tradi- tional algorithms only compare sentences based on their surface form such as same words, sentence length, word order and do not consider the sentence deep-level semantic information, some methods considered the sentence semantics get an unsatisfactory performance on the algorithm practicality. Therefore, a relation vector model which taking into account the relationship of sentence structure and semantic information based on space vector model is presented, this model is com- posed of a mix between the key words of the sentence and the key words synonymous information, which reflects local structural component of the sentence as well as the correlation between the local structure and therefore better reflects the structure and semantics of the sentence. An algorithm of sentence similarity based on relation vector model is put forward. The algorithm is applied to the network news summary generation algorithm in order to avoid redundancy. The experimental results show that, compared with the algorithm which considers the word order and semantic, relation vector model algo- rithm not only improves the accuracy of sentence similarity calculation, the time complexity of calculation is also reduced.
Keywords:sentence similarity  relation vector model  sentence syntax  sentence semantics
本文献已被 CNKI 维普 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号