首页 | 官方网站   微博 | 高级检索  
     

一种基于近邻规则的缺失数据填补方法
引用本文:王凤梅,胡丽霞.一种基于近邻规则的缺失数据填补方法[J].计算机工程,2012,38(21):53-55,62.
作者姓名:王凤梅  胡丽霞
作者单位:湖南科技学院计算机与通信工程系,湖南永州,425100
摘    要:数据缺失是数据挖掘与分析过程中的常见问题,若直接删除含缺失的事例可能导致不可靠的决策。为此,针对缺失数据的填补问题,提出一种基于近邻规则的缺失数据填补方法。根据关联规则的后件数据项进行分类,计算分类后的规则项与缺失项集间的相似度,用最相似的规则项值填补缺失值。实验结果表明,该方法具有较高的填补正确率。

关 键 词:关联规则  缺失数据  填补  近邻规则  相似度  K最近邻法
收稿时间:2012-01-05

A Missing Data Imputation Method Based on Neighbor Rules
WANG Feng-mei , HU Li-xia.A Missing Data Imputation Method Based on Neighbor Rules[J].Computer Engineering,2012,38(21):53-55,62.
Authors:WANG Feng-mei  HU Li-xia
Affiliation:(Department of Computer and Communication Engineering, Hunan University of Science and Engineering, Yongzhou 425100, China)
Abstract:Data missing is a common problem in data mining and data analysis process, it can lead to reliable decision-making if it is deleted with the cases directly. An imputation method of solving the missing data is put forward, which is based on association rule. In this method, the rules are classified by the rules' consequent, and then calculate the similarity of constrained rules cases' items and missing cases' items, impute the missing value with the most similar rule's item. Experimental results show this method has higher imputation accuracy.
Keywords:association rules  missing data  imputation  neighbor rule  similarity  K-Nearest Neighbor(KNN) algorithm
本文献已被 CNKI 维普 万方数据 等数据库收录!
点击此处可从《计算机工程》浏览原始摘要信息
点击此处可从《计算机工程》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号