首页 | 官方网站   微博 | 高级检索  
     

基于语义与情感的句子相似度计算方法
引用本文:杨延娇,赵国涛,王丕栋.基于语义与情感的句子相似度计算方法[J].计算机工程与应用,2021,57(16):151-158.
作者姓名:杨延娇  赵国涛  王丕栋
作者单位:西北师范大学 计算机科学与工程学院,兰州 730070
摘    要:针对汉语语句表意灵活复杂多变的特点,提出一种基于语义与情感的句子相似度计算方法,从表意层面计算句子相似度。该方法使用哈工大LTP平台对句子进行预处理,提取词语、词性、句法依存标记与语义角色标记,将语义角色标注结果作为句中语义独立成分赋予相似度权重系数,综合句法依存关系与词法关系计算两句相同标签语义独立成分相似度得到部分相似度,加权计算部分相似度得到句子整体相似度。另外,考虑到情感与句式因子,在整体相似度的基础上对满足条件的两句计算情感减益与句式减益。实验结果表明,该方法能有效提取出句子语义独立成分,从语义层面上计算句子相似度,解决了信息遗漏与句子组成成分不一致的问题,提高了句子相似度计算的准确率与鲁棒性。

关 键 词:句子相似度  句法结构  情感信息  哈工大LTP平台  句法依存分析  语义角色标注  

Sentence Similarity Calculation Method Based on Semantics and Emotion
YANG Yanjiao,ZHAO Guotao,WANG Pidong.Sentence Similarity Calculation Method Based on Semantics and Emotion[J].Computer Engineering and Applications,2021,57(16):151-158.
Authors:YANG Yanjiao  ZHAO Guotao  WANG Pidong
Affiliation:College of Computer Science and Engineering, Northwest Normal University, Lanzhou 730070, China
Abstract:Aiming at the characteristics of flexible, complex and changeable expressions in Chinese sentences, a sentence similarity calculation method based on semantics and emotion is proposed, which calculates sentence similarity from the ideographic level. This method uses the Harbin Institute of Technology LTP platform to preprocess sentences, extracts words, parts of speech, syntactic dependency tags and semantic role tags, assigns semantic role labeling results as semantic independent components in sentences to similarity weight coefficients, synthesizes syntactic dependency relationships and lexical relationships to calculate the similarity of semantic independent components of two sentences with the same label to obtain partial similarity, and calculates the partial similarity by weighting to obtain the overall similarity of the sentence. In addition, considering the sentiment and sentence factors, the sentiment deduction and sentence deduction are calculated for the two sentences that satisfy the condition based on the overall similarity. The experimental results show that the method can effectively extract the semantic independent components of the sentence, calculate the sentence similarity from the semantic level, solve the problem of inconsistency between information leakage and sentence composition, and improve the accuracy and robustness of sentence similarity calculation.
Keywords:sentence similarity  syntactic structure  emotional information  Harbin Institute of Technology LPT platform  syntactic dependency analysis  semantic role labeling  
本文献已被 万方数据 等数据库收录!
点击此处可从《计算机工程与应用》浏览原始摘要信息
点击此处可从《计算机工程与应用》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号