
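Cost-sensitive Random Forest Classifier Based on a New Impurity Measure (基于新型不纯度度量的代价敏感随机森林分类器)
Cite this article: 师彦文,王宏杰.基于新型不纯度度量的代价敏感随机森林分类器[J].计算机科学,2017,44(Z11):98-101.
Authors: SHI Yan-wen (师彦文), WANG Hong-jie (王宏杰)
Affiliation: School of Computer Science, Southwest Petroleum University, Chengdu 610500 (both authors)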
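Abstract: To classify imbalanced data sets effectively, this paper proposes a classifier that combines cost-sensitive learning with the random forest algorithm. First, a new impurity measure is proposed that accounts not only for the total cost of the decision tree but also for the cost differences that the same node incurs for different samples. Second, the random forest algorithm is run: the data set is sampled K times to build K base classifiers. Each decision tree is then grown with the classification and regression tree (CART) algorithm using the proposed impurity measure, and together the trees form the forest. Finally, the forest classifies data by a voting mechanism. In experiments on UCI data sets, the classifier performs well against the traditional random forest and existing cost-sensitive random forest classifiers on three performance measures: classification accuracy, AUC, and the Kappa coefficient.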

Keywords: Cost-sensitive learning  Random forest  Impurity measure  Classification and regression tree (CART)  Imbalanced data
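The four steps in the abstract (impurity measure, K bootstrap samples, CART-style trees, voting) can be sketched as follows. The abstract does not give the formula of the proposed impurity measure, so this sketch substitutes a simple cost-weighted Gini impurity as a stand-in; it is not the authors' measure. The function names (`cost_weighted_gini`, `fit_forest`, and so on), the per-class cost table, the depth limit, and the toy data are all assumptions made for illustration.

```python
import random
from collections import Counter

def cost_weighted_gini(labels, cost):
    """Gini impurity with class counts scaled by misclassification cost.
    Stand-in measure (assumption); the paper's own formula is not reproduced here."""
    if not labels:
        return 0.0
    weighted = {c: n * cost[c] for c, n in Counter(labels).items()}
    total = sum(weighted.values())
    return 1.0 - sum((w / total) ** 2 for w in weighted.values())

def best_split(X, y, cost, feature_ids):
    """Return (score, feature, threshold) minimising the size-weighted child impurity."""
    best = None
    for f in feature_ids:
        for t in sorted({row[f] for row in X}):
            left = [lab for row, lab in zip(X, y) if row[f] <= t]
            right = [lab for row, lab in zip(X, y) if row[f] > t]
            if not left or not right:
                continue
            score = (len(left) * cost_weighted_gini(left, cost)
                     + len(right) * cost_weighted_gini(right, cost)) / len(y)
            if best is None or score < best[0]:
                best = (score, f, t)
    return best

def leaf_label(y, cost):
    """Predict the class with the largest cost-weighted mass in the leaf."""
    counts = Counter(y)
    return max(sorted(counts), key=lambda c: counts[c] * cost[c])

def build_tree(X, y, cost, rng, depth=0, max_depth=4):
    """Grow a binary tree; internal nodes are tuples, leaves are (non-tuple) labels."""
    if depth >= max_depth or len(set(y)) == 1:
        return leaf_label(y, cost)
    n_features = len(X[0])
    k = max(1, int(n_features ** 0.5))  # random feature subset per node
    split = best_split(X, y, cost, rng.sample(range(n_features), k))
    if split is None:
        return leaf_label(y, cost)
    _, f, t = split
    left = [i for i, row in enumerate(X) if row[f] <= t]
    right = [i for i, row in enumerate(X) if row[f] > t]
    return (f, t,
            build_tree([X[i] for i in left], [y[i] for i in left],
                       cost, rng, depth + 1, max_depth),
            build_tree([X[i] for i in right], [y[i] for i in right],
                       cost, rng, depth + 1, max_depth))

def predict_tree(node, row):
    while isinstance(node, tuple):
        f, t, left, right = node
        node = left if row[f] <= t else right
    return node

def fit_forest(X, y, cost, K=15, seed=0):
    """Draw K bootstrap samples and grow one tree on each."""
    rng = random.Random(seed)
    forest = []
    for _ in range(K):
        idx = [rng.randrange(len(X)) for _ in range(len(X))]
        forest.append(build_tree([X[i] for i in idx], [y[i] for i in idx], cost, rng))
    return forest

def predict_forest(forest, row):
    """Majority vote over the trees."""
    votes = Counter(predict_tree(tree, row) for tree in forest)
    return votes.most_common(1)[0][0]

# Toy imbalanced data (assumed): 32 majority samples (0), 8 minority samples (1).
X = [[float(i)] for i in range(40)]
y = [1 if i >= 32 else 0 for i in range(40)]
cost = {0: 1.0, 1: 4.0}  # assumed: misclassifying the minority class costs 4x more
forest = fit_forest(X, y, cost, K=15, seed=0)
print(predict_forest(forest, [35.0]), predict_forest(forest, [0.0]))
```

Scaling class counts by cost makes a node that still holds a few expensive minority samples look more impure than plain Gini would, which pushes the splitter to isolate them. Any other cost-aware impurity could be swapped in by replacing `cost_weighted_gini`.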

Cost-sensitive Random Forest Classifier with New Impurity Measurement
SHI Yan-wen and WANG Hong-jie. Cost-sensitive Random Forest Classifier with New Impurity Measurement[J]. Computer Science, 2017, 44(Z11): 98-101.
Authors: SHI Yan-wen and WANG Hong-jie
Affiliation: School of Computer Science, Southwest Petroleum University, Chengdu 610500, China (both authors)
Abstract: To classify imbalanced data sets effectively, a classifier combining cost-sensitive learning with the random forest algorithm is proposed. First, a new impurity measure is proposed that takes into account not only the total cost of the decision tree but also the cost difference of the same node for different samples. Second, the random forest algorithm is executed: the data set is sampled K times and K base classifiers are built. Third, each decision tree is constructed by the classification and regression tree (CART) algorithm using the proposed impurity measure, and the trees together form the forest. Finally, the random forest makes the classification decision by a voting mechanism. In experiments on UCI data sets, the classifier performs well against the traditional random forest and existing cost-sensitive random forest classifiers in terms of classification accuracy, AUC, and the Kappa coefficient.
Keywords: Cost-sensitive learning  Random forest  Impurity measurement  Classification and regression tree (CART)  Imbalanced data
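The three performance measures named in the abstract can be computed as in the minimal sketch below. The abstract does not describe the evaluation code, so the function names, the binary-label setting, and the pairwise AUC formulation are assumptions chosen for brevity rather than the authors' procedure.

```python
from collections import Counter

def accuracy(y_true, y_pred):
    """Fraction of samples whose predicted label matches the true label."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)

def cohen_kappa(y_true, y_pred):
    """Cohen's kappa: agreement beyond what chance alone would produce.
    Undefined (division by zero) when chance agreement is exactly 1."""
    n = len(y_true)
    p_o = accuracy(y_true, y_pred)
    true_counts, pred_counts = Counter(y_true), Counter(y_pred)
    p_e = sum(true_counts[c] * pred_counts[c] for c in true_counts) / (n * n)
    return (p_o - p_e) / (1.0 - p_e)

def auc(y_true, scores, positive=1):
    """Area under the ROC curve as the fraction of (positive, negative) pairs
    ranked correctly, counting ties as half. Quadratic in the sample count."""
    pos = [s for t, s in zip(y_true, scores) if t == positive]
    neg = [s for t, s in zip(y_true, scores) if t != positive]
    wins = sum(1.0 if p > q else 0.5 if p == q else 0.0 for p in pos for q in neg)
    return wins / (len(pos) * len(neg))

# Always predicting the majority class on a 9:1 data set:
y_true = [0] * 9 + [1]
y_pred = [0] * 10
print(accuracy(y_true, y_pred), cohen_kappa(y_true, y_pred))
```

On the 9:1 example, accuracy is high even though the minority class is never detected, while kappa reports no agreement beyond chance. This is why accuracy alone is a weak measure on imbalanced data and is usually paired with AUC and kappa.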