首页 | 官方网站   微博 | 高级检索  
     


Mixed intelligent-multivariate missing imputation
Authors:K Gibert
Affiliation:1. Department Statistics and Operations Research, Universitat Politècnica de Catalunya-BarcelonaTech, Ed. C5, CN, C. Jordi Girona 1-3, Barcelona 08034, Spainkarina.gibert@upc.edu
Abstract:In real applications, important rates of missing data are often found and have to be preprocessed before the analysis. The literature for missing imputation is abundant. However, the most precise imputation methods require long time, and sometimes specific software; this implies a significant delay to get final results. The Mixed Intelligent-Multivariate Missing Imputation (MIMMI) method is proposed as a hybrid missing imputation methodology based on clustering. The MIMMI is a non-parametric method that combines the prior expert knowledge with multivariate analysis without requiring assumptions on the probabilistic models of the variables (normality, exponentiality, etc.). The proposed imputation values implicitly take into account the joint distribution of all variables and can be determined in a relatively short time. The MIMMI uses the conditional mean according to the self-underlying structure of the data set. It provides a good trade-off between accuracy and both simplicity and required time to data preparation. The mechanics of the method is illustrated with some case-studies, both synthetic and real applications related with human behaviour. In both cases, acceptable quality results were obtained in short time.
Keywords:clustering  multivariate imputation  prior expert knowledge
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号