首页 | 官方网站   微博 | 高级检索  
     


Finding key attribute subset in dataset for outlier detection
Authors:Peng Yang  Qingsheng Zhu
Affiliation:1. Department of Electrical and Computer Engineering, Mississippi State University, MS, United States;2. Geospatial Sciences and Technology Branch, Naval Research Laboratory, MS, United States;3. Department of Electrical and Computer Engineering, Michigan Technological University, MI, United States;4. Department of Computer Science, Michigan Technological University, MI, United States
Abstract:Detection of outlier from high dimensional dataset have found important applications in many fields, yet the unexpected time consumption is likely to hinder its practical use. Thus, it makes sense to build an efficient method for finding meaningful outliers and analyzing their intentional knowledge. In this paper, we utilize the concept of rough set to construct a method for outlying reduction, based on an outlier detection and analysis system. By defining outlying partition similarity, we can mine outliers on the key attribute subset rather than on the full dimensional attribute set of dataset, as long as the similarity between outlying partitions produced on them is large enough. For this purpose, we propose a novel method for finding the key attribute subset in dataset, which starts by seeking all outliers on the full attribute set, and then searches through all outlying attribute subsets for these points. After that, it turns out to be able to determine the key attribute subset in accordance with the similarity between outlying partitions. By experiments, we show that our method allows more efficient seeking of key attribute subset than the previous methods, thereby improving the feasibility of outlier detection.
Keywords:
本文献已被 ScienceDirect 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号