New heuristic method for data discretization based on rough set theory |
| |
Authors: | ZHAO Jun ZHOU Ying-hua |
| |
Affiliation: | Institute of Computer Science and Technology,Chongqing University of Posts and Telecommunications,Chongqing 400065,China |
| |
Abstract: | Data discretization contributes much to the induction of classification rules or trees by machine learning methods. The rough set theory is a valid tool for discretizing continuous information systems. Herein, a new method is proposed to improve those typical rough set based heuristic algorithms for data discretization, by utilizing decision information to reduce the scales of candidate cuts, and by more reasonably measuring cut significance with a new conception of cut selection probability. Simulations demonstrate that compared with other typical discretization algorithms based on the rough set theory, the proposed method is more capable and valid to discretize continuous information systems. It can effectively improve the predictive accuracies of information systems while still conceptually keeping their consistency. |
| |
Keywords: | data discretization rough set theory cut cut significance selection probability |
本文献已被 维普 万方数据 ScienceDirect 等数据库收录! |
|