首页 | 官方网站   微博 | 高级检索  
     

使用PCA建立基于规则的组合分类器
引用本文:石国强,牛常勇,范明.使用PCA建立基于规则的组合分类器[J].计算机科学与探索,2010,4(5):455-463.
作者姓名:石国强  牛常勇  范明
作者单位:郑州大学,信息工程学院,郑州,450052
基金项目:国家自然科学基金No.60773048;;国家“十一五”计划科技支撑课题子课题No.2006BAF01A00~~
摘    要:提出了一种使用基于规则的基分类器建立组合分类器的新方法PCARules。尽管新方法也采用基分类器预测的加权投票来决定待分类样本的类,但是为基分类器创建训练数据集的方法与bagging和boosting完全不同。该方法不是通过抽样为基分类器创建数据集,而是随机地将特征划分成K个子集,使用PCA得到每个子集的主成分,形成新的特征空间,并将所有训练数据映射到新的特征空间作为基分类器的训练集。在UCI机器学习库的30个随机选取的数据集上的实验表明:算法不仅能够显著提高基于规则的分类方法的分类性能,而且与bagging和boosting等传统组合方法相比,在大部分数据集上都具有更高的分类准确率。

关 键 词:组合分类器  特征提取  主成分分析
修稿时间: 

Constructing Ensembles of Rule-based Classifiers Using PCA
SHI Guoqiang,NIU Changyong,FAN Ming.Constructing Ensembles of Rule-based Classifiers Using PCA[J].Journal of Frontier of Computer Science and Technology,2010,4(5):455-463.
Authors:SHI Guoqiang  NIU Changyong  FAN Ming
Affiliation:School of Information and Engineering, Zhengzhou University, Zhengzhou 450052, China
Abstract:A new method, called PCARules, is presented for constructing ensembles of rule-based classifiers. Although the class label of a sample to be classified is also determined by taking weighted vote among the predictions made by each base classifier, this method is very different from bagging and boosting in the way of creating the training data for a base classifier. Instead of creating a training data for each base classifier by sampling, this method splits the feature set into K subsets randomly, upon each of which principal component analysis (PCA) is applied to find the corresponding principal components. And then all principal components are put together to form a new feature space, into which all original training data are mapped to create the training set for a base classifier. Experiments carried on 30 benchmark datasets selected randomly from the UCI machine learning repository show that the method not only improves performance of rule-based classifiers significantly, but also achieves higher accuracy in most of data sets than traditional combining methods such as bagging and boosting.
Keywords:classifier ensemble  feature extraction  principal component analysis (PCA)
本文献已被 CNKI 维普 万方数据 等数据库收录!
点击此处可从《计算机科学与探索》浏览原始摘要信息
点击此处可从《计算机科学与探索》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号