Robust cost-sensitive kernel method with Blinex loss and its applications in credit risk evaluation.

Journal: Neural networks : the official journal of the International Neural Network Society
Published Date:

Abstract

Credit risk evaluation is a crucial yet challenging problem in financial analysis. It can not only help institutions reduce risk and ensure profitability, but also improve consumers' fair practices. The data-driven algorithms such as artificial intelligence techniques regard the evaluation as a classification problem and aim to classify transactions as default or non-default. Since non-default samples greatly outnumber default samples, it is a typical imbalanced learning problem and each class or each sample needs special treatment. Numerous data-level, algorithm-level and hybrid methods are presented, and cost-sensitive support vector machines (CSSVMs) are representative algorithm-level methods. Based on the minimization of symmetric and unbounded loss functions, CSSVMs impose higher penalties on the misclassification costs of minority instances using domain specific parameters. However, such loss functions as error measurement cannot have an obvious cost-sensitive generalization. In this paper, we propose a robust cost-sensitive kernel method with Blinex loss (CSKB), which can be applied in credit risk evaluation. By inheriting the elegant merits of Blinex loss function, i.e., asymmetry and boundedness, CSKB not only flexibly controls distinct costs for both classes, but also enjoys noise robustness. As a data-driven decision-making paradigm of credit risk evaluation, CSKB can achieve the "win-win" situation for both the financial institutions and consumers. We solve linear and nonlinear CSKB by Nesterov accelerated gradient algorithm and Pegasos algorithm respectively. Moreover, the generalization capability of CSKB is theoretically analyzed. Comprehensive experiments on synthetic, UCI and credit risk evaluation datasets demonstrate that CSKB compares more favorably than other benchmark methods in terms of various measures.

Authors

  • Jingjing Tang
    School of Business Administration, Southwestern University of Finance and Economics, Chengdu 611130, China. Electronic address: tangjingjing13@mails.ucas.ac.cn.
  • Jiahui Li
    College of Communication Engineering, Jilin University, Changchun, Jilin China.
  • Weiqi Xu
    School of Business Administration, Faculty of Business Administration, Southwestern University of Finance and Economics, Chengdu 611130, China. Electronic address: oda@smail.swufe.edu.cn.
  • Yingjie Tian
    Research Center on Fictitious Economy and Data Science, Chinese Academy of Sciences, Beijing 100190, China; Key Laboratory of Big Data Mining and Knowledge Management, Chinese Academy of Sciences, Beijing 100190, China. Electronic address: tyj@ucas.ac.cn.
  • Xuchan Ju
    Research Center on Fictitious Economy and Data Science, Chinese Academy of Sciences, Beijing 100190, China; School of Mathematical Sciences, University of Chinese Academy of Sciences, Beijing 101408, China; Key Laboratory of Big Data Mining and Knowledge Management, Chinese Academy of Sciences, Beijing 100190, China. Electronic address: juxuchan@126.com.
  • Jie Zhang
    College of Physical Education and Health, Linyi University, Linyi, Shandong, China.