首页 | 官方网站   微博 | 高级检索  
     

基于分类模型的查询扩展方法
引用本文:李维银,石玉龙,陈杰,施重阳.基于分类模型的查询扩展方法[J].计算机科学,2015,42(6):18-22.
作者姓名:李维银  石玉龙  陈杰  施重阳
作者单位:北京理工大学计算机学院 北京100081
基金项目:本文受中国科学院自动化研究所复杂系统管理与控制国家重点实验室开放课题(99S9021F4D),国家自然科学基金(61472034),教育部新世纪优秀人才支持计划(NCET-13-0041),北京理工大学基础研究基金资助
摘    要:查询扩展作为查询优化的重要组成部分,对改善信息检索系统的性能起到了至关重要的作用.传统的伪相关反馈查询扩展方法虽然在一定程度上提高了检索性能,但选择的扩展词中会包含一部分与原查询不相关的词语,这对检索性能的提升产生了不利影响.提出了一种基于分类模型的查询扩展方法,该算法综合候选扩展词的统计信息和多种特征,采用朴素贝叶斯分类模型对初次得到的候选扩展词进行再次分类选择,进一步去除与查询词相关性小的扩展词.在TREC 2013数据集上的实验结果表明,提出的查询扩展方法能够有效提高用户查询的查准率和查全率.

关 键 词:查询扩展  分类模型  信息检索  伪相关反馈

Query Expansion Based on Classification Model
LI Wei-yin,SHI Yu-long,CHEN Jie and SHI Chong-yang.Query Expansion Based on Classification Model[J].Computer Science,2015,42(6):18-22.
Authors:LI Wei-yin  SHI Yu-long  CHEN Jie and SHI Chong-yang
Affiliation:School of Computer Science and Technology,Beijing Institute of Technology,Beijing 100081,China,School of Computer Science and Technology,Beijing Institute of Technology,Beijing 100081,China,School of Computer Science and Technology,Beijing Institute of Technology,Beijing 100081,China and School of Computer Science and Technology,Beijing Institute of Technology,Beijing 100081,China
Abstract:As a key component of query optimization,query expansion plays an important role in improving the perfor-mance of information retrieval systems.Traditional query expansion methods on pseudo-relevance feedback improve the performance of retrieval to some extent.However,the selected expansion terms will also include some irrelevant ones,which leads to adverse effect.In this paper,a novel query expansion method based on classification model was proposed.Combining with statistical information and various features of the candidate expansion terms,this method employs Naive Bayes classification model to reselect the candidate expansion terms so as to further filter the irrelevant ones.Experimental results on TREC 2013 datasets show that the proposed query expansion method can efficiently improve the precision and recall of user queries.
Keywords:Query expansion  Classification model  Information retrieval  Pseudo relevance feedback
本文献已被 万方数据 等数据库收录!
点击此处可从《计算机科学》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号