首页 | 官方网站   微博 | 高级检索  
     

基于协同训练的文本蕴含识别
引用本文:任函,万菁,吴泓缈,冯文贺.基于协同训练的文本蕴含识别[J].中文信息学报,2014,28(6):114-119.
作者姓名:任函  万菁  吴泓缈  冯文贺
作者单位:1. 武汉大学 外国语言文学学院,湖北 武汉 430072;
2. 武汉大学 湖北省语言与智能信息处理研究基地,湖北 武汉 430072;
3. 武汉大学 计算机学院,湖北 武汉 430072
基金项目:国家自然科学基金(61402341,61373108,61173062),中国博士后科学基金(2014M552073, 2013M540594),中央高校基本科研业务费专项资金(2012GSP017)
摘    要:针对文本蕴含的训练数据不足的问题,该文提出了基于协同训练的文本蕴含识别方法。该方法利用少量已标注的蕴含数据和大量未标注数据进行协同训练。为此,该文利用改写视图和评估视图,从结构和非结构两个角度考察蕴含关系,并将语义树核分类器和基于统计特征的分类器应用于两个视图,同时利用协同训练的结果训练一个综合分类器,用于对新数据进行预测。实验表明,基于协同训练的蕴含识别方法能在少量训练数据的情况下获得较好的识别性能。

关 键 词:文本蕴含识别  协同训练  语义树核  

A Co-training Based Approach to Recognizing Textual Entailment
REN Han,WAN Jing,WU Hongmiao,FENG Wenhe.A Co-training Based Approach to Recognizing Textual Entailment[J].Journal of Chinese Information Processing,2014,28(6):114-119.
Authors:REN Han  WAN Jing  WU Hongmiao  FENG Wenhe
Affiliation:1. School of Foreign Languages and Literature, Wuhan University, Wuhan, Hubei 430072, China;
2. Hubei Research Base of Language and Intelligent Information Processing,Wuhan University, Wuhan, Hubei 430072, China;
3. School of Computer, Wuhan University, Wuhan, Hubei 430072, China
Abstract:This paper introduces a co-training approach to recognizing textual entailment. In this approach, a small labeled entailment dataset as well as a large unlabeled one are employed for co-training, which aims at solving the lack of entailment data . Two different views, rewriting view and assessing view, are proposed to measure structural and non-structural entailment relations, likewise two classifiers, namely semantic tree kernel based classifier and statistical features based classifier, are applied to train under the two views respectively. For predication, a global classifier is built, trained by the results of co-training. Experiments show that the co-training based approach achieves a good performance in the case of a small training dataset.
Keywords:recognizing textual entailment  co-training  semantic tree kernel  
本文献已被 CNKI 等数据库收录!
点击此处可从《中文信息学报》浏览原始摘要信息
点击此处可从《中文信息学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号