首页 | 官方网站   微博 | 高级检索  
     

基于语义角色标注的汉语句子相似度算法
引用本文:田 堃,柯永红,穗志方.基于语义角色标注的汉语句子相似度算法[J].中文信息学报,2016,30(6):126-132.
作者姓名:田 堃  柯永红  穗志方
作者单位:北京大学 信息科学技术学院,北京 100871
基金项目:国家“973”计划(2014CB340504)
摘    要:在语义角色标注过程中,经常需要检索相似的已标注语料,以便进行参考和分析。现有方法未能充分利用动词及其支配的成分信息,无法满足语义角色标注的相似句检索需求。基于此,本文提出一种新的汉语句子相似度计算方法。该方法基于已标注好语义角色的语料资源,以动词为分析核心,通过语义角色分析、标注句型的相似匹配、标注句型间相似度计算等步骤来实现句子语义的相似度量。为达到更好的实验效果,论文还综合比较了基于知网、词向量等多种计算词语相似度的算法,通过分析与实验对比,将实验效果最好的算法应用到句子相似度计算的研究中。实验结果显示,基于语义角色标注的句子相似度计算方法相对传统方法获得了更好的测试结果。

关 键 词:语义角色标注  词语相似度  知网  词向量  标注句型匹配  />  

Chinese Sentence Similarity Computing Based on Semantic Roles Annotation
TIAN Kun,KE Yonghong,SUI Zhifang.Chinese Sentence Similarity Computing Based on Semantic Roles Annotation[J].Journal of Chinese Information Processing,2016,30(6):126-132.
Authors:TIAN Kun  KE Yonghong  SUI Zhifang
Affiliation:School of Electronic Engineering and Computer Science, Peking University, Beijing 100871, China
Abstract:In the process of semantic roles annotation, searching for similar annotated sentences is a common way to analyze such corpus. Existing methods cannot take full advantage of verbs and related elements, so they are unable to meet the demand of searching for similar annotated sentences. This article develops a new method to calculate Chinese sentence similarity focused on the verbs. Based on semantic roles annotation, the algorithm detects the similar sentences by analyzing the semantic roles, matching the annotated sentences, and calculating similarity between these matched sentences. To get a better result, the article also compares several other methods for word similarity, including algorithms based on How-net and Distributed Representation, and applies the best one into our algorithm. The experimental result indicates that the sentence similarity algorithm based semantic roles annotation performs better than traditional methods.
Keywords:semantic roles annotation  word similarity  How-net  word vector  annotated sentence match
        
        
        
点击此处可从《中文信息学报》浏览原始摘要信息
点击此处可从《中文信息学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号