首页 | 官方网站   微博 | 高级检索  
     

面向自然语言文本的否定性与不确定性信息抽取
引用本文:邹博伟,钱忠,陈站成,朱巧明,周国栋.面向自然语言文本的否定性与不确定性信息抽取[J].软件学报,2016,27(2):309-328.
作者姓名:邹博伟  钱忠  陈站成  朱巧明  周国栋
作者单位:苏州大学 计算机科学与技术学院, 江苏 苏州 215006,苏州大学 计算机科学与技术学院, 江苏 苏州 215006,苏州大学 计算机科学与技术学院, 江苏 苏州 215006,苏州大学 计算机科学与技术学院, 江苏 苏州 215006,苏州大学 计算机科学与技术学院, 江苏 苏州 215006
基金项目:国家自然科学基金(61272260, 61331011, 61273320)
摘    要:目前,信息抽取研究主要面向肯定性信息,而自然语言文本中包含了大量否定性和不确定性信息,为了将此类信息与肯定性信息区分开,有必要针对否定性与不确定性信息抽取进行深入研究.针对这一任务,首次构建了一个16 841句的汉语语料资源,利用序列标注模型与卷积树核模型,系统地探索了各种序列化依存特征和结构化句法树特征的有效性,并提出了元决策树模型,对二者进行融合.实验结果显示,该方法在否定性和不确定性信息抽取任务上的精确率分别达到69.84%和58.57%,为相关研究打下了坚实的基础.

关 键 词:信息抽取  否定性信息  不确定性信息  线索词检测  覆盖域界定
收稿时间:2015/1/30 0:00:00
修稿时间:2015/4/14 0:00:00

Negation and Uncertainty Information Extraction Oriented to Natural Language Text
ZOU Bo-Wei,QIAN Zhong,CHEN Zhan-Cheng,ZHU Qiao-Ming and ZHOU Guo-Dong.Negation and Uncertainty Information Extraction Oriented to Natural Language Text[J].Journal of Software,2016,27(2):309-328.
Authors:ZOU Bo-Wei  QIAN Zhong  CHEN Zhan-Cheng  ZHU Qiao-Ming and ZHOU Guo-Dong
Affiliation:School of Computer Science and Technology, Soochow University, Suzhou 215006, China,School of Computer Science and Technology, Soochow University, Suzhou 215006, China,School of Computer Science and Technology, Soochow University, Suzhou 215006, China,School of Computer Science and Technology, Soochow University, Suzhou 215006, China and School of Computer Science and Technology, Soochow University, Suzhou 215006, China
Abstract:The current research on information extraction mainly focuses on affirmative information. However there are more negation and uncertainty information in natural language texts. For purpose of separating them from affirmative information, it is necessary to make an intensive study of negation and uncertainty information extraction. For this task, this study firstly constructs a Chinese corpus including 16 841 sentences. Employing the sequence labeling model and the convolution tree kernel model, it systematically explores the efficiency of various kinds of serialized dependency features and structured parsing features. Finally, it proposes a meta-decision tree model to integrate the above two models. Experimental results show that the performances of the new method on negation and uncertainty information extraction achieve 69.84% and 58.57% of accuracy respectively, providing a solid foundation for related studies in the future.
Keywords:information extraction  negation information  uncertainty information  cue detection  scope resolution
点击此处可从《软件学报》浏览原始摘要信息
点击此处可从《软件学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号