A hybrid unsupervised machine learning model with spectral clustering and semi-supervised support vector machine for credit risk assessment.

Journal: PloS one
Published Date:

Abstract

In credit risk assessment, unsupervised classification techniques can be introduced to reduce human resource expenses and expedite decision-making. Despite the efficacy of unsupervised learning methods in handling unlabeled datasets, their performance remains limited owing to challenges such as imbalanced data, local optima, and parameter adjustment complexities. Thus, this paper introduces a novel hybrid unsupervised classification method, named the two-stage hybrid system with spectral clustering and semi-supervised support vector machine (TSC-SVM), which effectively addresses the unsupervised imbalance problem in credit risk assessment by targeting global optimal solutions. Furthermore, a multi-view combined unsupervised method is designed to thoroughly mine data and enhance the robustness of label predictions. This method mitigates discrepancies in prediction outcomes from three distinct perspectives. The effectiveness, efficiency, and robustness of the proposed TSC-SVM model are demonstrated through various real-world applications. The proposed algorithm is anticipated to expand the customer base for financial institutions while reducing economic losses.

Authors

  • Tao Yu
    Department of Smart Experience Design Kookmin University, Seoul 02707, Republic of Korea.
  • Wei Huang
    Shaanxi Institute of Flexible Electronics, Northwestern Polytechnical University, 710072 Xi'an, China.
  • Xin Tang
    School of Science, Huazhong Agricultural University, Wuhan, Hubei province, China.
  • Duosi Zheng
    College of Business, Southern University of Science and Technology, Shenzhen, China.