首页 | 官方网站   微博 | 高级检索  
     

一种基于置信度的代表点选择算法
引用本文:黄云,洪佳明,覃遵跃.一种基于置信度的代表点选择算法[J].计算机工程,2012,38(19):167-169,174.
作者姓名:黄云  洪佳明  覃遵跃
作者单位:1. 吉首大学软件学院,湖南张家界427000;中山大学信息科学与技术学院,广州510006
2. 中山大学信息科学与技术学院,广州,510006
摘    要:代表点选择是实现缩减数据集规模的有效途径,可以提高分类的准确率和执行效率.为此,通过引入分类置信度熵的概念,提出适应度评价函数,用于评估代表点的选择效果,以此找到最优的代表点集.该方法可与其他代表点选择方法结合,得到性能更优的代表点选择方法.与多个经典代表点选择方法进行实验比较,结果表明基于置信度的代表点选择方法在分类准确率和数据降低率上有一定优势.

关 键 词:置信度熵  适应度评价函数  代表点选择  k最近邻  半监督学习  遗传算法
收稿时间:2012-01-09

An Algorithm of Representative Point Selection Based on Confidence
HUANG Yun , HONG Jia-ming , QIN Zun-yue.An Algorithm of Representative Point Selection Based on Confidence[J].Computer Engineering,2012,38(19):167-169,174.
Authors:HUANG Yun  HONG Jia-ming  QIN Zun-yue
Affiliation:1.School of Software,Jishou University,Zhangjiajie 427000,China;2.School of Information Science and Technology,Sun Yat-Sen University,Guangzhou 510006,China)
Abstract:Representative point selection method aims to reduce the amount of training data instances for nearest neighbor classification algorithms,in order to improve the implementation efficiency and the classification accuracy.By introducing the concept of classification confidence entropy,a new fitness evaluation function is proposed to evaluate the prototype instances,and a new genetic algorithm is designed for representative point selection.This paper demonstrates that the new concept can also be used in other kind of representative point selection methods,in order to improve their performances.Compared with some other famous representative point selection algorithms,experimental results show that confidence based approach has some advantages in improving both the classification accuracy and the data reduction rate.
Keywords:confidence entropy  fitness evaluation function  representative point selection  k-nearest neighbor  semi-supervised learning  genetic algorithm
本文献已被 CNKI 维普 万方数据 等数据库收录!
点击此处可从《计算机工程》浏览原始摘要信息
点击此处可从《计算机工程》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号