首页 | 官方网站   微博 | 高级检索  
     

基于答案辅助的半监督问题分类方法
引用本文:张栋,李寿山,周国栋.基于答案辅助的半监督问题分类方法[J].计算机工程与科学,2015,37(12):2352-2357.
作者姓名:张栋  李寿山  周国栋
作者单位:;1.苏州大学计算机科学与技术学院
基金项目:国家自然科学基金重点项目(61331011);国家自然科学基金资助项目(61375073,61273320)
摘    要:问题分类旨在对问题的类型进行自动分类,该任务是问答系统研究的一项基本任务。提出了一种基于答案辅助的半监督问题分类方法。首先,将答案特征结合问题特征一起实现样本表示;然后,利用标签传播方法对已标注问题训练分类器,自动标注未标注问题的类别;最后,将初始标注的问题和自动标注的问题合并作为训练样本,利用最大熵模型对问题的测试文本进行分类。实验结果表明,本文提出的基于答案辅助的半监督分类方法能够充分利用未标注样本提升性能,明显优于其他的基准方法。

关 键 词:问答系统  问题分类  答案辅助  半监督分类  标签传播
收稿时间:2015-08-15
修稿时间:2015-12-25

A classification method for semi-supervised question classification with answers
ZHANG Dong,LI Shou shan,ZHOU Guo dong.A classification method for semi-supervised question classification with answers[J].Computer Engineering & Science,2015,37(12):2352-2357.
Authors:ZHANG Dong  LI Shou shan  ZHOU Guo dong
Affiliation:(School of Computer Science & Technology,Soochow University,Suzhou 215006,China)
Abstract:Question classification aims at classifying the types of questions automatically, and this is a basic task of the question answering system. We propose a classification method for semi-supervised questions with answers. Firstly, we combine answer features with question features to realize sample expressions. Then we train a question classifier on labeled questions using label propagation algorithm to annotate the category of unlabeled questions automatically. The questions of initial annotation and automatic annotation are merged with each other as training samples, and the maximum entropy model is adopted to classify the testing samples. Experimental results demonstrate that the classification method for semi supervised questions with answers in this paper can make full use of the unlabeled samples to improve the performance, and it outperforms other benchmark methods.
Keywords:question answering system  question classification  answer aiding  semi-supervised classification  label propagation  
本文献已被 万方数据 等数据库收录!
点击此处可从《计算机工程与科学》浏览原始摘要信息
点击此处可从《计算机工程与科学》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号