首页 | 官方网站   微博 | 高级检索  
     

一种新的用于连续值属性离散化的约简算法
引用本文:刘震宇,郭宝龙,杨林耀.一种新的用于连续值属性离散化的约简算法[J].控制与决策,2002,17(5):545-549.
作者姓名:刘震宇  郭宝龙  杨林耀
作者单位:西安电子科技大学,测控工程系,陕西,西安,710071
基金项目:国家自然科学基金项目 (6 9975 0 15 )
摘    要:针对在Nguyen和Skowron的离散化算法中进行启发式约简时会出现某些属性不能进行离散化问题,以及在无核数据集中启发式约简算法计算量比较大等问题,在粗糙集理论和属性频率函数的基础上给出一个新概念-候选核,并提出一种新的用于连续值属性离散化的约简算法-基于候选核的启发式约简算法(简称BCC)。该算法可以寻找到能对所有属性进行离散化的约简,实验表明,所提出的BCC算法能提高大数据集的离散化效果。

关 键 词:连续值属性离散化  约简算法  数据挖掘  粗糙集理论  人工智能
文章编号:1001-0920(2002)05-0545-05
修稿时间:2001年8月13日

A new reduction algorithm for discretization of continuous features
LIU Zhen yu,GUO Bao long,YANG Lin yao.A new reduction algorithm for discretization of continuous features[J].Control and Decision,2002,17(5):545-549.
Authors:LIU Zhen yu  GUO Bao long  YANG Lin yao
Abstract:Therearetwoproblemsinthetraditionaldiscretization algorithm when heuristic reduction algorithms are used to find the reduction. One is that the reduction discretizing all attributes may not be found. The other is that the heuristic reduction algorithm needs a great deal of time to get the reduction in the data sets without core. To solve the two problems , a new concept called candidate core is given, which is built on the rough set theory and attribute frequency function, and a new heuristic reduction algorithm based on candidate core (named BCC) is presented. This new heuristic reduction algorithm of BCC can find the reduction of the data sets, which discretizes all attributes. The results of experiments show that the new algorithm can improve the performance of discretization for large data sets.
Keywords:data mining  rough set theory  discretization  reduction algorithm  
本文献已被 CNKI 维普 万方数据 等数据库收录!
点击此处可从《控制与决策》浏览原始摘要信息
点击此处可从《控制与决策》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号