首页 | 官方网站   微博 | 高级检索  
     

基于局部密度的快速离群点检测算法
引用本文:邹云峰,张昕,宋世渊,倪巍伟. 基于局部密度的快速离群点检测算法[J]. 计算机应用, 2017, 37(10): 2932-2937. DOI: 10.11772/j.issn.1001-9081.2017.10.2932
作者姓名:邹云峰  张昕  宋世渊  倪巍伟
作者单位:1. 国网江苏省电力公司 电力科学研究院, 南京 210036;2. 东南大学 计算机科学与工程学院, 南京 211189
基金项目:国家自然科学基金资助项目(61370077)。
摘    要:已有的密度离群点检测算法LOF不能适应数据分布异常情况离群点检测,INFLO算法虽引入反向k近邻点集有效地解决了数据分布异常情况的离群点检测问题,但存在需要对所有数据点不加区分地分析其k近邻和反向k近邻点集导致的效率降低问题。针对该问题,提出局部密度离群点检测算法--LDBO,引入强k近邻点和弱k近邻点概念,通过分析邻近数据点的离群相关性,对数据点区别对待;并提出数据点离群性预判断策略,尽可能避免不必要的反向k近邻分析,有效提高数据分布异常情况离群点检测算法的效率。理论分析和实验结果表明,LDBO算法效率优于INFLO,算法是有效可行的。

关 键 词:离群点检测  局部密度  k近邻点  弱k近邻点  反向k近邻点集  
收稿时间:2017-04-12
修稿时间:2017-07-02

Fast outlier detection algorithm based on local density
ZOU Yunfeng,ZHANG Xin,SONG Shiyuan,NI Weiwei. Fast outlier detection algorithm based on local density[J]. Journal of Computer Applications, 2017, 37(10): 2932-2937. DOI: 10.11772/j.issn.1001-9081.2017.10.2932
Authors:ZOU Yunfeng  ZHANG Xin  SONG Shiyuan  NI Weiwei
Affiliation:1. State Grid, Jiangsu Electronic Power Research Institute, Nanjing Jiangsu 210036, China;2. School of Computer Science and Engineering, Southeast University, Nanjing Jiangsu 211189, China
Abstract:Mining outliers is to find exceptional objects that deviate from the most rest of the data set. Outlier detection based on density has attracted lots of attention, but the density-based algorithm named Local Outlier Factor (LOF) is not suitable for the data set with abnormal distribution, and the algorithm named INFLuenced Outlierness (INFLO) solves this problem by analyzing both k nearest neighbors and reverse k nearest neighbors of each data point at cost of inferior efficiency. To solve this problem, a local density-based algorithm named Local Density Based Outlier detection (LDBO) was proposed, which can improve outlier detection efficiency and effectiveness simultaneously. LDBO introduced definitions of strong k nearest neighbors and weak k nearest neighbors to realize outlier relation analysis of those data points located nearby. Furthermore, to improve the outlier detection efficiency, prejudgement was applied to avoid unnecessary reverse k nearest neighbor analysis as far as possible. Theoretical analysis and experimental results Indicate that LDBO outperforms INFLO in efficiency, and it is effective and feasible.
Keywords:outlier detection  local density  strong k nearest neighbors  weak k nearest neighbors  Reverse k Nearest Neighbors (RkNN)  
点击此处可从《计算机应用》浏览原始摘要信息
点击此处可从《计算机应用》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号