首页 | 官方网站   微博 | 高级检索  
     

基于频繁项集与负规则的局部反馈查询扩展
引用本文:黄名选,钟智,张师超.基于频繁项集与负规则的局部反馈查询扩展[J].计算机工程与设计,2012,33(5):1863-1866,1880.
作者姓名:黄名选  钟智  张师超
作者单位:1. 广西教育学院科研处,广西南宁,530023
2. 广西师范学院计算机与信息工程学院,广西南宁,530023
3. 广西师范大学计算机学院,广西桂林,541004
基金项目:澳大利亚ARC基金项目(DP0985456);广西高校优秀人才资助计划基金项目(桂教人[2011]40号);广西教育厅科研基金项目(201106LX388);广西自然科学基金项目(2012GXNSFAA053235)
摘    要:针对信息检索中存在的词不匹配问题,提出了基于频繁项集和负关联规则挖掘的局部反馈查询扩展模型及其算法.该算法对前列n篇初检文档挖掘频繁项集和非频繁项集,并从频繁项集中提取关联词;从频繁项集和非频繁项集中挖掘负关联规则,提取负关联规则后件作为负关联词,计算负关联词与整个原查询词的相关性;根据相关性删除关联词库中与负关联词相同的词项,将余下的关联词项作为最终扩展词,并与原查询组合成新查询,实现查询扩展.实验结果表明,该算法能发现虚假的负关联词,有效地提高和改善信息检索性能.

关 键 词:频繁项集  负关联规则  局部反馈  查询扩展  信息检索

Query expansion of local feedback based on frequent itemsets and negative rules
HUANG Ming-xuan , ZHONG Zhi , ZHANG Shi-chao.Query expansion of local feedback based on frequent itemsets and negative rules[J].Computer Engineering and Design,2012,33(5):1863-1866,1880.
Authors:HUANG Ming-xuan  ZHONG Zhi  ZHANG Shi-chao
Affiliation:1.Scientific Research Office,Guangxi Institute of Education,Nanning 530023,China;2.College of Computer and Information Engineering,Guangxi Teachers Education University,Nanning 530023,China;3.College of Computer Science,Guangxi Normal University,Guilin 541004,China)
Abstract:Aiming at the term mismatch issues of existing information retrieval system,a novel query expansion model and its algorithm of local feedback is proposed based on frequent itemsets and negative association rules mining.Firstly,the frequent itemsets and non-frequent itemsets are mined synchronously in the top-ranked n chapter retrieved local documents.On one hand,the association terms are extracted from the frequent itemsets,on the other hand,negative association rules are mined in frequent itemsets and non-frequent itemsets and the consequents of negative association rules are extracted to make into negative association term.And then,final negative association terms are obtained according to the correlation of each negative association term and the entire original query.Finally,the terms the same as negative association terms are removed from association terms database and the rest of the terms of the association terms database are combined with original query for query expansion.The experimental results show that the proposed algorithm can not only detect those false negative association terms but also effectively improve and enhance the information retrieval performance.
Keywords:frequent itemset  negative association rule  local feedback  query expansion  information retrieval
本文献已被 CNKI 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号