首页 | 官方网站   微博 | 高级检索  
     


A survey of outlier detection in high dimensional data streams
Affiliation:1. Department of Network and Information Technology, Baoji University of Arts and Sciences, Shaanxi, China;2. Malaysian Institute of Information Technology, Universiti Kuala Lumpur, Kuala Lumpur, Malaysia;3. Department of Computer Science and Engineering, University of Barishal, 8254 Barishal, Bangladesh;4. College of Science and Engineering, Hamad Bin Khalifa University, Qatar;1. State Key Laboratory of Hydrology-Water Resources and Hydraulic Engineering, Hohai University, Nanjing 210098, China;2. College of Water Conservancy and Hydropower Engineering, Hohai University, Nanjing 210098, China;3. National Engineering Research Center of Water Resources Efficient Utilization and Engineering Safety, Hohai University, Nanjing 210098, China
Abstract:The rapid evolution of technology has led to the generation of high dimensional data streams in a wide range of fields, such as genomics, signal processing, and finance. The combination of the streaming scenario and high dimensionality is particularly challenging especially for the outlier detection task. This is due to the special characteristics of the data stream such as the concept drift, the limited time and space requirements, in addition to the impact of the well-known curse of dimensionality in high dimensional space. To the best of our knowledge, few studies have addressed these challenges simultaneously, and therefore detecting anomalies in this context requires a great deal of attention. The main objective of this work is to study the main approaches existing in the literature, to identify a set of comparison criteria, such as the computational cost and the interpretation of outliers, which will help us to reveal the different challenges and additional research directions associated with this problem. At the end of this study, we will draw up a summary report which summarizes the main limits identified and we will detail the different directions of research related to this issue in order to promote research for this community.
Keywords:Outlier detection  High dimensional data  Data streams
本文献已被 ScienceDirect 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号