首页 | 官方网站   微博 | 高级检索  
     

负关联规则挖掘与特征词抽取融合的局部反馈查询扩展
引用本文:黄名选.负关联规则挖掘与特征词抽取融合的局部反馈查询扩展[J].计算机工程与科学,2011,33(11):144.
作者姓名:黄名选
作者单位:广西教育学院科研处,广西南宁,530023
基金项目:广西教育厅科研课题,广西教育学院2010年度院级重点课题
摘    要:针对现有信息检索系统中存在的词不匹配问题,本文提出一种基于负关联规则挖掘与特征词抽取融合的局部反馈查询扩展算法。该算法首先从前列n篇初检局部文档中抽取特征词,建立特征词库;然后,对特征词库挖掘同时含有查询词和非查询词的频繁项集和非频繁项集,由此挖掘前件是查询项的负关联规则,提取负关联规则的后件作为负关联特征词,计算负关联特征词与原查询的相关性,根据相关性在特征词库中删除负关联特征词,将余下的特征词作为最终扩展词,和原查询组合成新查询实现查询扩展。实验结果表明,该算法能有效地提高和改善信息检索性能。

关 键 词:查询扩展  局部反馈  特征词  负关联规则

Query Expansion of Local Feedback Based on the Fusion of Negative Association Rules Mining and Feature Terms Extraction
HUANG Ming-xuan.Query Expansion of Local Feedback Based on the Fusion of Negative Association Rules Mining and Feature Terms Extraction[J].Computer Engineering & Science,2011,33(11):144.
Authors:HUANG Ming-xuan
Abstract:Aiming at the term mismatch issues of the existing information retrieval systems, a novel query expansion algorithm of local feedback is proposed based on the fusion of negative association rules mining and feature terms extraction. Firstly, the feature terms from the top-ranked n retrieved local documents are extracted to construct the feature terms database, and the frequent itemsets and non-frequent itemsets containing original query terms and non- query terms synchronously are mined in the feature terms database. And then, the negative association rules, the antecedent of which is the original query terms, are mined in frequent itemsets and non-frequent itemsets, and the consequents of the negative association rules are extracted as the negative association terms, and the correlation of each negative association term and the entire original query is calculated. Finally, the terms which are the same as the negative association terms are removed from the feature terms database according to the correlation and the rest of the terms of the feature terms database are combined with the original query for query expansion. The results of the experiment show that the proposed algorithm can effectively improve and enhance the information retrieval performance.
Keywords:query expansion  local feedback  feature term  negative association rule
本文献已被 万方数据 等数据库收录!
点击此处可从《计算机工程与科学》浏览原始摘要信息
点击此处可从《计算机工程与科学》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号