首页 | 官方网站   微博 | 高级检索  
     

一种多样性和精度加权的数据流集成分类算法
引用本文:张本才,王志海,孙艳歌,.一种多样性和精度加权的数据流集成分类算法[J].智能系统学报,2019,14(1):179-185.
作者姓名:张本才  王志海  孙艳歌  
作者单位:1. 北京交通大学 计算机与信息技术学院, 北京 100044;2. 信阳师范学院 计算机与信息技术学院, 河南 信阳 464000
摘    要:为了克服数据流中概念漂移对分类的影响,提出了一种基于多样性和精度加权的集成分类方法(diversity and accuracy weighting ensemble classification algorithm, DAWE),该方法与已有的其他集成方法不同的地方在于,DAWE同时考虑了多样性和精度这两种度量标准,将分类器在最新数据块上的精度及其在集成分类器中的多样性进行线性加权,以此来衡量一个分类器对于当前集成分类器的价值,并将价值度量用于基分类器替换策略。提出的DAWE算法与MOA中最新算法分别在真实数据和人工合成数据上进行了对比实验,实验表明,提出的方法是有效的,在所有数据集上的平均精度优于其他算法,该方法能有效处理数据流挖掘中的概念漂移问题。

关 键 词:数据流  概念漂移  多样性  精度  集成学习  数据块  价值度量  MOA

An ensemble classification algorithm based on diversity and accuracy weighting for data streams
ZHANG Bencai,WANG Zhihai,SUN Yan’ge,.An ensemble classification algorithm based on diversity and accuracy weighting for data streams[J].CAAL Transactions on Intelligent Systems,2019,14(1):179-185.
Authors:ZHANG Bencai  WANG Zhihai  SUN Yan’ge  
Affiliation:1. School of Computer and Information Technology, Beijing Jiaotong University, Beijing 100044, China;2. School of Computer and Information Technology, Xinyang Normal University, Xinyang 464000, China
Abstract:To overcome the effect of concept drift on data stream classification, we propose an ensemble classification algorithm based on diversity and accuracy weighting named DAWE. The difference between DAWE and other existing ensemble methods is that DAWE considers both diversity and accuracy. The classifier’s accuracy on the new data chunk and its diversity in the ensemble were linearly weighted to measure the value of the current ensemble classifier and the measured value was applied to the substitute strategy of the base classifier. The DAWE algorithm proposed in this paper was experimentally compared with the latest algorithms in massive online analysis (MOA), using both synthetic and real-world datasets. Experiments showed that the method proposed in this paper was effective and the average overall accuracy of the data sets was superior to that of other algorithms. Overall, this method can effectively manage concept drift in data stream mining.
Keywords:data stream  concept drift  diversity  accuracy  ensemble learning  data chunk  value measurement  MOA
点击此处可从《智能系统学报》浏览原始摘要信息
点击此处可从《智能系统学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号