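Analysis and Research on the PrefixSpan and CloSpan Algorithms
Cite this article: LI Xi-juan, LIU Feng, YAN Na-na, XU Jie, GENG Bo. Analysis and Research on the PrefixSpan and CloSpan Algorithms[J]. Computer Technology and Development, 2008, 18(1): 70-73, 76.
Authors: LI Xi-juan  LIU Feng  YAN Na-na  XU Jie  GENG Bo
Affiliation: College of Computer Information, Anhui University, Hefei, Anhui 230039
Abstract: The discovery of sequential patterns, that is, finding all frequent subsequences in a sequence database, is an active branch of data mining. This paper introduces the basic concepts of sequential pattern mining, then describes the classic PrefixSpan algorithm and CloSpan, a closed sequential pattern mining algorithm built on the PrefixSpan framework. It analyzes and compares the execution processes and characteristics of the two algorithms and summarizes their respective strengths and weaknesses. The comparison indicates that PrefixSpan is suited to mining short sequences, whereas CloSpan outperforms PrefixSpan on long sequences or at low support thresholds and performs better when mining large databases. These results are a useful reference for the design of sequential pattern mining algorithms.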

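Keywords: sequential pattern mining  PrefixSpan algorithm  CloSpan algorithm  PrefixSpan  classic algorithm  analysis  research  algorithm  value  design  results  performance  sequence database  mining  threshold  long sequences  short sequences  comparison  execution process  description  closed sequential patterns  framework
Article ID: 1673-629X(2008)01-0070-04
Received: 2007-04-02
Revised: 2007-04-02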

Research and Analysis Between Algorithm of PrefixSpan and CloSpan
LI Xi-juan,LIU Feng,YAN Na-na,XU Jie,GENG Bo.Research and Analysis Between Algorithm of PrefixSpan and CloSpan[J].Computer Technology and Development,2008,18(1):70-73,76.
Authors:LI Xi-juan  LIU Feng  YAN Na-na  XU Jie  GENG Bo
Abstract:The discovery of sequential patterns, which finds all frequent subsequences in a sequence database, is an active research area in data mining. This paper first introduces the basic concepts of sequential pattern mining, then describes the PrefixSpan algorithm and CloSpan, which is based on the PrefixSpan framework. The execution processes and features of these classic sequential mining algorithms are then compared and analysed. The comparison shows that PrefixSpan is suited to mining short sequences, but CloSpan outperforms PrefixSpan when the minimum support is low and the sequences are long; furthermore, CloSpan performs better when mining longer frequent sequences in a large data set. The results can serve as a valuable reference for the design of sequence mining.
Keywords:sequential pattern  PrefixSpan algorithm  CloSpan algorithm
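
To make the two notions in the abstract concrete, the following is a minimal Python sketch, not the paper's code. It assumes a simplified setting in which every sequence is a list of single items rather than a list of itemsets. The function names `prefixspan`, `is_subsequence` and `closed_only` are hypothetical, chosen for this illustration. `prefixspan` grows patterns by recursively projecting the database on each frequent prefix. `closed_only` is a naive post-filter that only shows what "closed" means: a pattern is kept if no longer pattern containing it has the same support. It is not CloSpan's search-space pruning, which avoids generating the non-closed patterns in the first place.

```python
from collections import defaultdict


def prefixspan(db, min_support):
    """Return {pattern tuple: support} for all frequent subsequences.

    Assumption of this sketch: each sequence in db is a list of single items.
    """
    results = {}

    def mine(prefix, projected):
        # projected holds (sequence index, start offset) pairs; db[i][start:]
        # is the suffix of sequence i that follows the current prefix.
        support = defaultdict(int)
        next_projection = defaultdict(list)
        for i, start in projected:
            seen = set()
            for pos in range(start, len(db[i])):
                item = db[i][pos]
                if item not in seen:
                    # Count each item once per sequence, at its first occurrence.
                    seen.add(item)
                    support[item] += 1
                    next_projection[item].append((i, pos + 1))
        for item, count in support.items():
            if count >= min_support:
                pattern = prefix + (item,)
                results[pattern] = count
                mine(pattern, next_projection[item])

    mine((), [(i, 0) for i in range(len(db))])
    return results


def is_subsequence(short, long):
    """True if short can be obtained from long by deleting items."""
    it = iter(long)
    return all(x in it for x in short)


def closed_only(patterns):
    """Naive filter: drop patterns absorbed by a longer one with equal support."""
    closed = {}
    for p, sup in patterns.items():
        absorbed = any(
            len(q) > len(p) and sup_q == sup and is_subsequence(p, q)
            for q, sup_q in patterns.items()
        )
        if not absorbed:
            closed[p] = sup
    return closed


if __name__ == "__main__":
    db = [["a", "b", "c"], ["a", "c"], ["a", "b", "c"], ["b"]]
    frequent = prefixspan(db, min_support=2)
    print(len(frequent))          # 7 frequent patterns
    print(closed_only(frequent))  # 3 closed patterns remain
```

On this toy database the seven frequent patterns reduce to three closed ones, `("b",)`, `("a", "c")` and `("a", "b", "c")`. That reduction in output size is the kind of effect the abstract attributes to CloSpan when sequences are long or the support threshold is low.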
This article is indexed in databases including VIP (维普) and Wanfang Data (万方数据).