首页 | 官方网站   微博 | 高级检索  
     


Classification cost: An empirical comparison among traditional classifier, Cost-Sensitive Classifier, and MetaCost
Authors:Jungeun Kim  Keunho Choi  Gunwoo Kim  Yongmoo Suh
Affiliation:a 437-070, Hyundai Autoever Corp. 576, Sam-dong, Uiwang-si, Gyeonggi-Do, Republic of Korea
b 136-701, Business School, Korea University, Anam-dong 5-Ga, Sungbuk-Gu, Seoul, Republic of Korea
c 305-719, 612, Department of Business Administration, College of Business and Economics, Hanbat National University, San 16-1, Dukmyung-dong, Yuseong-Gu, Daejeon, Republic of Korea
Abstract:Loan fraud is a critical factor in the insolvency of financial institutions, so companies make an effort to reduce the loss from fraud by building a model for proactive fraud prediction. However, there are still two critical problems to be resolved for the fraud detection: (1) the lack of cost sensitivity between type I error and type II error in most prediction models, and (2) highly skewed distribution of class in the dataset used for fraud detection because of sparse fraud-related data. The objective of this paper is to examine whether classification cost is affected both by the cost-sensitive approach and by skewed distribution of class. To that end, we compare the classification cost incurred by a traditional cost-insensitive classification approach and two cost-sensitive classification approaches, Cost-Sensitive Classifier (CSC) and MetaCost. Experiments were conducted with a credit loan dataset from a major financial institution in Korea, while varying the distribution of class in the dataset and the number of input variables. The experiments showed that the lowest classification cost was incurred when the MetaCost approach was used and when non-fraud data and fraud data were balanced. In addition, the dataset that includes all delinquency variables was shown to be most effective on reducing the classification cost.
Keywords:Fraud detection  Cost-sensitive learning  Cost-Sensitive Classifier  MetaCost
本文献已被 ScienceDirect 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号