Data Feature Selection Method and Empirical Study of Bank Customer Segmentation
Cite this article: DUAN Ganglong, WANG Yan, MA Xin, YANG Zeyang. Data Feature Selection Method and Empirical Study of Bank Customer Segmentation[J]. Computer Engineering and Applications, 2022, 58(11): 302-312. DOI: 10.3778/j.issn.1002-8331.2010-0238
Authors: DUAN Ganglong  WANG Yan  MA Xin  YANG Zeyang
Affiliation: School of Economics and Management, Xi’an University of Technology, Xi’an 710054, China
Funding: Shaanxi Province Soft Science Project; Research Program on Major Theoretical and Practical Issues of the Shaanxi Social Science Community; Shaanxi Province Soft Science Research Program (General Project); Scientific Research Program of the Shaanxi Provincial Department of Education
Abstract: Bank customer data are high-dimensional, large in scale, and contain many redundant features. To address these problems, this paper proposes a comprehensive feature selection method that borrows the idea of multimodal fusion: it computes and compares the comprehensive contribution of each feature in the data set and uses the result to screen out redundant features. First, based on the characteristics of real bank customer data, the paper presents a data preprocessing scheme with three parts (type conversion and discretization, missing value filling, and standardization) and applies it to real bank customer data. Second, it applies four methods to screen redundant features in the preprocessed data set: the Pearson correlation coefficient, random forest, quantitative prior knowledge, and the proposed multimodal comprehensive feature selection method. These yield 14, 8, 15, and 11 selected features, respectively. Finally, based on the experimental results, the four feature selection methods are compared both qualitatively and quantitatively. The results show that the proposed comprehensive feature selection method, which borrows the idea of multimodal fusion, can compensate for the weaknesses of the individual feature selection methods, reduce the data dimension, and in turn improve the performance of the bank customer classification model.
Keywords: customer segmentation   feature selection   knowledge mining   quantitative prior knowledge   multimodal
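
The abstract describes scoring each feature with several methods and comparing a comprehensive contribution per feature. The sketch below illustrates that general idea only. The abstract does not give the fusion formula, so the weighted average of min-max-normalised scores used here is an assumption, not the authors' method. The function names, toy data, and the random forest and prior-knowledge scores are all hypothetical placeholders.

```python
from math import sqrt
from statistics import mean


def pearson(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    mx, my = mean(xs), mean(ys)
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = sqrt(sum((x - mx) ** 2 for x in xs))
    sy = sqrt(sum((y - my) ** 2 for y in ys))
    if sx == 0 or sy == 0:
        return 0.0  # a constant column carries no linear signal
    return cov / (sx * sy)


def min_max(scores):
    """Rescale a {feature: score} mapping to the range [0, 1]."""
    lo, hi = min(scores.values()), max(scores.values())
    if hi == lo:
        return {name: 0.0 for name in scores}
    return {name: (value - lo) / (hi - lo) for name, value in scores.items()}


def combined_contribution(score_tables, weights=None):
    """ASSUMED fusion rule: weighted average of normalised scores."""
    weights = weights or [1.0 / len(score_tables)] * len(score_tables)
    normalised = [min_max(table) for table in score_tables]
    return {
        name: sum(w * table[name] for w, table in zip(weights, normalised))
        for name in normalised[0]
    }


def select_top_k(contribution, k):
    """Keep the k features with the highest combined contribution."""
    return sorted(contribution, key=contribution.get, reverse=True)[:k]


# Toy, made-up data: three features and a binary customer class label.
features = {
    "balance": [1.0, 2.0, 3.0, 4.0, 5.0, 6.0],
    "age": [30.0, 25.0, 41.0, 38.0, 29.0, 35.0],
    "branch_id": [7.0, 7.0, 7.0, 7.0, 7.0, 7.0],
}
label = [0, 0, 0, 1, 1, 1]

pearson_scores = {name: abs(pearson(col, label)) for name, col in features.items()}
# Hypothetical stand-ins for random forest importances and expert prior scores.
rf_scores = {"balance": 0.6, "age": 0.3, "branch_id": 0.1}
prior_scores = {"balance": 3, "age": 2, "branch_id": 1}

contribution = combined_contribution([pearson_scores, rf_scores, prior_scores])
selected = select_top_k(contribution, 2)
print(selected)
```

Normalising each score table before averaging keeps one method's scale from dominating the others. In practice the random forest importances would come from a fitted model rather than hard-coded values.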
This article is indexed in Wanfang Data and other databases.