Finding key attribute subset in dataset for outlier detection期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

Finding key attribute subset in dataset for outlier detection

Authors:	Peng Yang Qingsheng Zhu

Affiliation:	1. Department of Electrical and Computer Engineering, Mississippi State University, MS, United States;2. Geospatial Sciences and Technology Branch, Naval Research Laboratory, MS, United States;3. Department of Electrical and Computer Engineering, Michigan Technological University, MI, United States;4. Department of Computer Science, Michigan Technological University, MI, United States

Abstract:	Detection of outlier from high dimensional dataset have found important applications in many fields, yet the unexpected time consumption is likely to hinder its practical use. Thus, it makes sense to build an efficient method for finding meaningful outliers and analyzing their intentional knowledge. In this paper, we utilize the concept of rough set to construct a method for outlying reduction, based on an outlier detection and analysis system. By defining outlying partition similarity, we can mine outliers on the key attribute subset rather than on the full dimensional attribute set of dataset, as long as the similarity between outlying partitions produced on them is large enough. For this purpose, we propose a novel method for finding the key attribute subset in dataset, which starts by seeking all outliers on the full attribute set, and then searches through all outlying attribute subsets for these points. After that, it turns out to be able to determine the key attribute subset in accordance with the similarity between outlying partitions. By experiments, we show that our method allows more efficient seeking of key attribute subset than the previous methods, thereby improving the feasibility of outlier detection.

Keywords:
本文献已被 ScienceDirect 等数据库收录！

设为首页 | 免责声明 | 关于勤云 | 加入收藏