首页 | 官方网站   微博 | 高级检索  
     

基于浅层语义树核的阅读理解答案句抽取
引用本文:张志昌,张宇,刘挺,李生.基于浅层语义树核的阅读理解答案句抽取[J].中文信息学报,2008,22(1).
作者姓名:张志昌  张宇  刘挺  李生
作者单位:哈尔滨工业大学,计算机学院,信息检索研究室,黑龙江,哈尔滨,150001
基金项目:国家自然科学基金资助项目(60435020,60675034),国家863项目(2006AA01Z145)
摘    要:阅读理解系统是通过对一篇自然语言文本的分析理解,对用户根据该文本所提的问题,自动抽取或者生成答案。本文提出一种利用浅层语义信息的英文阅读理解抽取方法,首先将问题和所有候选句的语义角色标注结果表示成树状结构,用树核(tree kernel)的方法计算问题和每个候选句之间的语义结构相似度,将该相似度值和词袋方法获得的词匹配数融合在一起,选择具有最高分值的候选句作为最终的答案句。在Remedia测试语料上,本文方法取得43.3%的HumSent准确率。

关 键 词:计算机应用  中文信息处理  阅读理解  答案句抽取  浅层语义  树核

Answer Sentence Extraction of Reading Comprehension Based on Shallow Semantic Tree Kernel
ZHANG Zhi-chang,ZHANG Yu,LIU Ting,LI Sheng.Answer Sentence Extraction of Reading Comprehension Based on Shallow Semantic Tree Kernel[J].Journal of Chinese Information Processing,2008,22(1).
Authors:ZHANG Zhi-chang  ZHANG Yu  LIU Ting  LI Sheng
Abstract:Automatic reading comprehension systems can analyze a given passage and generate/extract answers in response to questions about the passage.An approach integrating shallow semantic information to extract answer sentence is proposed in this paper.The labeled semantic roles in question and candidate sentences are represented as semantic trees,then the structure similarity is calculated using tree kernel between them.After combining the similarity with matching words count obtained using bag-of-words method,the sentence with the highest score is chosen as answer sentence.The proposed approach achieves 43.3% HumSent accuracy on the Remedia corpora.
Keywords:computer application  Chinese information processing  reading comprehension  answer sentence extraction  shallow semantic  tree kernel
本文献已被 CNKI 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号