首页 | 官方网站   微博 | 高级检索  
     

基于FP-Tree的反向频繁项集挖掘
引用本文:郭宇红,童云海,唐世渭,杨冬青.基于FP-Tree的反向频繁项集挖掘[J].软件学报,2008,19(2):338-350.
作者姓名:郭宇红  童云海  唐世渭  杨冬青
作者单位:1. 北京大学,计算机科学技术系,北京,100871
2. 北京大学,视觉与听觉信息处理国家重点实验室,北京,100871
3. 北京大学,计算机科学技术系,北京,100871;北京大学,视觉与听觉信息处理国家重点实验室,北京,100871
基金项目:Supported by the National Natural Science Foundation of China under Grant No.60403041 (国家自然科学基金)
摘    要:在拓展现有反向频繁挖掘问题定义,探索反向频繁项集的3个具体应用后,提出了一种基于FP-tree的反向频繁项集挖掘方法.该方法首先采用分治思想,将目标约束划分为若干子约束,每步求解一个子线性约束问题,经过若干步迭代后找到一个满足整个给定约束的目标FP-tree;然后根据目标FP-tree生成一个仅含频繁项的临时事务数据库TempD;最后通过向TempD中撒入非频繁项得到目标数据集.理论分析和实验表明该方法是正确的、高效的,且与现有方法仅能输出1个目标数据集相比,该方法能够输出较多的目标数据集.

关 键 词:反向挖掘  FP树  频繁项集  隐私保护  知识隐藏
收稿时间:2006-06-10
修稿时间:2/5/2007 12:00:00 AM

Inverse Frequent Itemset Mining Based on FP-Tree
GUO Yu-Hong,TONG Yun-Hai,TANG Shi-Wei and YANG Dong-Qing.Inverse Frequent Itemset Mining Based on FP-Tree[J].Journal of Software,2008,19(2):338-350.
Authors:GUO Yu-Hong  TONG Yun-Hai  TANG Shi-Wei and YANG Dong-Qing
Abstract:After the current definition of the inverse frequent set mining problem is expanded and its three practical applications are explored, an FP-tree-based method is proposed for the inverse mining problem. First, the method divides target constraints into some sub constraints and each time it solves a sub linear constraint problem. After some iterations, it finds an FP-tree satisfying the whole given constraints. Then, based on the FP-tree it generates a temporary database TempD that only involves frequent items. The target datasets are obtained by scattering infrequent items into TempD. Theoretic analysis and experiments show that the method is right and efficient. Moreover, compared with the current methods, the method can output more than one target data set.
Keywords:inverse mining  FP-tree  frequent itemset  privacy preserving  knowledge hiding
本文献已被 CNKI 维普 万方数据 等数据库收录!
点击此处可从《软件学报》浏览原始摘要信息
点击此处可从《软件学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号