首页 | 官方网站   微博 | 高级检索  
     

基于矩阵加权关联规则挖掘的伪相关反馈查询扩展
引用本文:黄名选,严小卫,张师超.基于矩阵加权关联规则挖掘的伪相关反馈查询扩展[J].软件学报,2009,20(7):1854-1865.
作者姓名:黄名选  严小卫  张师超
作者单位:1. 广西教育学院,数学与计算机科学系,广西,南宁,530023
2. 广西师范大学,计算机科学与信息工程学院,广西,桂林,541004
3. 中山大学,逻辑与认知研究所,广东,广州,510275;广西师范大学,计算机科学与信息工程学院,广西,桂林,541004
基金项目:Supported by the National Natural Science Foundation of China under Grant No.90718020 (国家自然科学基金); the National BasicResearch Program of China under Grant No.2008CB317108 (国家重点基础研究发展计划(973)); the Australian Research Council Discovery under Grant No.DP0667060 (澳大利亚ARC 项目); the MOE Project of Key Research Institute of Humanities and Social Sciences at Universities of China under Grant No.07JJD720044 (教育部人文重点研究基地重大项目)
摘    要:提出一种面向查询扩展的矩阵加权关联规则挖掘算法,给出与其相关的定理及其证明过程.该算法采用4种剪枝策略,挖掘效率得到极大提高.实验结果表明,其挖掘时间比原来的平均时间减少87.84%.针对现有查询扩展的缺陷,将矩阵加权关联规则挖掘技术应用于查询扩展,提出新的查询扩展模型和更合理的扩展词权重计算方法.在此基础上提出一种伪相关反馈查询扩展算法——基于矩阵加权关联规则挖掘的伪相关反馈查询扩展算法,该算法能够自动地从前列n 篇初检文档中挖掘与原查询相关的矩阵加权关联规则,构建规则库,从中提取与原查询相关的扩展词,实现查询扩展.实验结果表明,该算法的检索性能确实得到了很好的改善.与现有查询扩展算法相比,在相同的查全率水平级下,其平均查准率有了明显的提高.

关 键 词:信息检索  伪相关反馈  查询扩展  关联规则  矩阵加权
收稿时间:2007/10/10 0:00:00
修稿时间:2008/4/15 0:00:00

Query Expansion of Pseudo Relevance Feedback Based on Matrix-Weighted Association Rules Mining
HUANG Ming-Xuan,YAN Xiao-Wei,ZHANG Shi-Chao.Query Expansion of Pseudo Relevance Feedback Based on Matrix-Weighted Association Rules Mining[J].Journal of Software,2009,20(7):1854-1865.
Authors:HUANG Ming-Xuan  YAN Xiao-Wei  ZHANG Shi-Chao
Affiliation:Department of Math and Computer Science;Guangxi College of Education;Nanning 530023;China;Institute of Logic and Cognition;SUN YAT-SEN University;Guangzhou 510275;China;College of Computer Science and Information Engineering;Guangxi Normal University;Guilin 541004;China
Abstract:An algorithm of matrix-weighted association rule mining for query expansion is presented based on the quadruple pruning, and a related theorem and its proof are given. This method can tremendously nhance the mining efficiency. Experimental results demonstrate that its mining time is averagely reduced by 87.84%, compared to that of the original one. And a query expansion algorithm of pseudo relevance feedback is proposed based on matrix-weighted association rule mining, which combines the association rules mining technique with the query expansion. The algorithm can automatically mine those matrix-weighted association rules related to the original query in the top-ranked retrieved documents to construct an association rules-based database, and extract expansion terms related to the original query from the database for query expansion. At the same time, a new computing method for weights of expansion terms is given. It makes the weighted value of an expansion term more reasonable. Experimental results show that this method is better than traditional ones in average precision.
Keywords:information retrieval  pseudo relevance feedback  query expansion  association rule  matrix-weighted
本文献已被 CNKI 万方数据 等数据库收录!
点击此处可从《软件学报》浏览原始摘要信息
点击此处可从《软件学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号