首页 | 官方网站   微博 | 高级检索  
     

基于AC算法的比特流频繁序列挖掘
引用本文:雷东,王韬,马云飞. 基于AC算法的比特流频繁序列挖掘[J]. 计算机科学, 2017, 44(1): 128-133
作者姓名:雷东  王韬  马云飞
作者单位:军械工程学院信息工程系 石家庄050003,军械工程学院信息工程系 石家庄050003,军械工程学院信息工程系 石家庄050003
基金项目:本文受军内科研基金资助
摘    要:为解决比特流频繁序列挖掘效率不高以及易受用户数据影响而导致准确率低的问题,首先从理论上论证了短频繁序列挖掘存在的局限性,根据不同长度的频繁序列挖掘时存在的特点,将其分为长频繁序列与短频繁序列,提出比特流协议头部字段定位算法;基于AC多模式匹配算法分别针对长、短频繁序列挖掘的不同特点,提出了相应的挖掘方法,提高了挖掘结果的准确性。最后通过实验验证了所提算法的有效性。

关 键 词:比特流  AC算法  长频繁序列挖掘  短频繁序列挖掘
收稿时间:2015-12-16
修稿时间:2016-04-20

Frequent Pattern Mining in Bit Stream Based on AC Algorithm
LEI Dong,WANG Tao and MA Yun-fei. Frequent Pattern Mining in Bit Stream Based on AC Algorithm[J]. Computer Science, 2017, 44(1): 128-133
Authors:LEI Dong  WANG Tao  MA Yun-fei
Affiliation:Department of Information Engineering,Ordnance Engineering College,Shijiazhuang 050003,China,Department of Information Engineering,Ordnance Engineering College,Shijiazhuang 050003,China and Department of Information Engineering,Ordnance Engineering College,Shijiazhuang 050003,China
Abstract:The existing method of frequent pattern mining in bit stream is inefficient and the precision of the results is low under the influence of redundant data.In order to solve the problem,it was proved that the mining of short frequent pattern has great limitations.According to the different features when the patterns are mined with different lengths,the frequent sequences are divided into two types:long frequent pattern and short frequent pattern.An algorithm of finding the header fields of the protocol in bit stream was proposed,and the efficient algorithm of mining the long frequent pattern and the short frequent pattern were proposed based on AC multi-pattern matching algorithm.Simulation results on the Ethernet show that the proposed algorithm is effective.
Keywords:Bit stream  AC algorithm  Long frequent pattern mining  Short frequent pattern mining
点击此处可从《计算机科学》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号