TY - JOUR
T1 - Two-stage credit rating prediction using machine learning techniques
AU - Wu, Hsu Che
AU - Hu, Ya Han
AU - Huang, Yen Hao
N1 - Publisher Copyright: © Emerald Group Publishing Limited.
PY - 2014/7/29
Y1 - 2014/7/29
N2 - Purpose – Credit ratings have become one of the primary references that financial institutions use to assess credit risk. Conventional credit rating approaches have mainly concentrated on two-class classification (i.e. good or bad credit), which lacks the precision needed for credit risk evaluation in practice. In addition, most previous research has focused directly on applying various data mining techniques, and few studies have discussed the influence of data preprocessing before classifier construction. The paper aims to discuss these issues. Design/methodology/approach – This study considers nine-class classification (i.e. nine credit risk levels) for credit rating prediction. To develop more accurate classifiers, the paper adopts a two-stage analysis that integrates multiple data preprocessing and supervised learning techniques. Specifically, the first stage applies feature selection, data clustering, and data resampling methods to preprocess the data, and the second stage uses several classification techniques and classifier ensembles to construct prediction models. Findings – The results show that Bagging-DT combined with data resampling achieves an accuracy of 82.96 percent, indicating that the proposed two-stage prediction model outperforms conventional one-stage models. Originality/value – In practice, the approach in this study can lower credit rating expenses and allow corporations to obtain credit rating information instantly.
AB - Purpose – Credit ratings have become one of the primary references that financial institutions use to assess credit risk. Conventional credit rating approaches have mainly concentrated on two-class classification (i.e. good or bad credit), which lacks the precision needed for credit risk evaluation in practice. In addition, most previous research has focused directly on applying various data mining techniques, and few studies have discussed the influence of data preprocessing before classifier construction. The paper aims to discuss these issues. Design/methodology/approach – This study considers nine-class classification (i.e. nine credit risk levels) for credit rating prediction. To develop more accurate classifiers, the paper adopts a two-stage analysis that integrates multiple data preprocessing and supervised learning techniques. Specifically, the first stage applies feature selection, data clustering, and data resampling methods to preprocess the data, and the second stage uses several classification techniques and classifier ensembles to construct prediction models. Findings – The results show that Bagging-DT combined with data resampling achieves an accuracy of 82.96 percent, indicating that the proposed two-stage prediction model outperforms conventional one-stage models. Originality/value – In practice, the approach in this study can lower credit rating expenses and allow corporations to obtain credit rating information instantly.
KW - Decision making
KW - Knowledge management
UR - http://www.scopus.com/inward/record.url?scp=84927523946&partnerID=8YFLogxK
U2 - 10.1108/K-10-2013-0218
DO - 10.1108/K-10-2013-0218
M3 - Journal article
AN - SCOPUS:84927523946
SN - 0368-492X
VL - 43
SP - 1098
EP - 1113
JO - Kybernetes
JF - Kybernetes
IS - 7
ER -