DOI QR코드

DOI QR Code

Ensemble Learning with Support Vector Machines for Bond Rating

회사채 신용등급 예측을 위한 SVM 앙상블학습

  • Received : 2011.12.01
  • Accepted : 2012.05.30
  • Published : 2012.06.30

Abstract

Bond rating is regarded as an important event for measuring financial risk of companies and for determining the investment returns of investors. As a result, it has been a popular research topic for researchers to predict companies' credit ratings by applying statistical and machine learning techniques. The statistical techniques, including multiple regression, multiple discriminant analysis (MDA), logistic models (LOGIT), and probit analysis, have been traditionally used in bond rating. However, one major drawback is that it should be based on strict assumptions. Such strict assumptions include linearity, normality, independence among predictor variables and pre-existing functional forms relating the criterion variablesand the predictor variables. Those strict assumptions of traditional statistics have limited their application to the real world. Machine learning techniques also used in bond rating prediction models include decision trees (DT), neural networks (NN), and Support Vector Machine (SVM). Especially, SVM is recognized as a new and promising classification and regression analysis method. SVM learns a separating hyperplane that can maximize the margin between two categories. SVM is simple enough to be analyzed mathematical, and leads to high performance in practical applications. SVM implements the structuralrisk minimization principle and searches to minimize an upper bound of the generalization error. In addition, the solution of SVM may be a global optimum and thus, overfitting is unlikely to occur with SVM. In addition, SVM does not require too many data sample for training since it builds prediction models by only using some representative sample near the boundaries called support vectors. A number of experimental researches have indicated that SVM has been successfully applied in a variety of pattern recognition fields. However, there are three major drawbacks that can be potential causes for degrading SVM's performance. First, SVM is originally proposed for solving binary-class classification problems. Methods for combining SVMs for multi-class classification such as One-Against-One, One-Against-All have been proposed, but they do not improve the performance in multi-class classification problem as much as SVM for binary-class classification. Second, approximation algorithms (e.g. decomposition methods, sequential minimal optimization algorithm) could be used for effective multi-class computation to reduce computation time, but it could deteriorate classification performance. Third, the difficulty in multi-class prediction problems is in data imbalance problem that can occur when the number of instances in one class greatly outnumbers the number of instances in the other class. Such data sets often cause a default classifier to be built due to skewed boundary and thus the reduction in the classification accuracy of such a classifier. SVM ensemble learning is one of machine learning methods to cope with the above drawbacks. Ensemble learning is a method for improving the performance of classification and prediction algorithms. AdaBoost is one of the widely used ensemble learning techniques. It constructs a composite classifier by sequentially training classifiers while increasing weight on the misclassified observations through iterations. The observations that are incorrectly predicted by previous classifiers are chosen more often than examples that are correctly predicted. Thus Boosting attempts to produce new classifiers that are better able to predict examples for which the current ensemble's performance is poor. In this way, it can reinforce the training of the misclassified observations of the minority class. This paper proposes a multiclass Geometric Mean-based Boosting (MGM-Boost) to resolve multiclass prediction problem. Since MGM-Boost introduces the notion of geometric mean into AdaBoost, it can perform learning process considering the geometric mean-based accuracy and errors of multiclass. This study applies MGM-Boost to the real-world bond rating case for Korean companies to examine the feasibility of MGM-Boost. 10-fold cross validations for threetimes with different random seeds are performed in order to ensure that the comparison among three different classifiers does not happen by chance. For each of 10-fold cross validation, the entire data set is first partitioned into tenequal-sized sets, and then each set is in turn used as the test set while the classifier trains on the other nine sets. That is, cross-validated folds have been tested independently of each algorithm. Through these steps, we have obtained the results for classifiers on each of the 30 experiments. In the comparison of arithmetic mean-based prediction accuracy between individual classifiers, MGM-Boost (52.95%) shows higher prediction accuracy than both AdaBoost (51.69%) and SVM (49.47%). MGM-Boost (28.12%) also shows the higher prediction accuracy than AdaBoost (24.65%) and SVM (15.42%)in terms of geometric mean-based prediction accuracy. T-test is used to examine whether the performance of each classifiers for 30 folds is significantly different. The results indicate that performance of MGM-Boost is significantly different from AdaBoost and SVM classifiers at 1% level. These results mean that MGM-Boost can provide robust and stable solutions to multi-classproblems such as bond rating.

회사채 신용등급은 투자자의 입장에서는 수익률 결정의 중요한 요소이며 기업의 입장에서는 자본비용 및 기업 가치와 관련된 중요한 재무의사결정사항으로 정교한 신용등급 예측 모형의 개발은 재무 및 회계 분야에서 오랫동안 전통적인 연구 주제가 되어왔다. 그러나, 회사채 신용등급 예측 모형의 성과와 관련된 가장 중요한 문제는 등급별 데이터의 불균형 문제이다. 예측 문제에 있어서 데이터 불균형(Data imbalance) 은 사용되는 표본이 특정 범주에 편중되었을 때 나타난다. 데이터 불균형이 심화됨에 따라 범주 사이의 분류경계영역이 왜곡되므로 분류자의 학습성과가 저하되게 된다. 본 연구에서는 데이터 불균형 문제가 존재하는 다분류 문제를 효과적으로 해결하기 위한 다분류 기하평균 부스팅 기법 (Multiclass Geometric Mean-based Boosting MGM-Boost)을 제안하고자 한다. MGM-Boost 알고리즘은 부스팅 알고리즘에 기하평균 개념을 도입한 것으로 오분류된 표본에 대한 학습을 강화할 수 있으며 불균형 분포를 보이는 각 범주의 예측정확도를 동시에 고려한 학습이 가능하다는 장점이 있다. 회사채 신용등급 예측문제를 활용하여 MGM-Boost의 성과를 검증한 결과 SVM 및 AdaBoost 기법과 비교하여 통계적으로 유의적인 성과개선 효과를 보여주었으며 데이터 불균형 하에서도 벤치마킹 모형과 비교하여 견고한 학습성과를 나타냈다.

Keywords

References

  1. 강필성, 조성준, "데이터 불균형 해결을 위한 Undersampling 기반 앙상블 SVMs", 대한산업공학회/한국경영과학회 2006 춘계공동학술대회, 2006.
  2. 김명종, "기업부실 예측 데이터의 불균형 문제 해결을 위한 앙상블 학습", 지능정보연구, 2009, 15권 3호(2009), 1-15.
  3. 신택수, 홍태호, "AdaBoost 알고리즘 기반 SVM을 이용한 부실 확률분포 기반의 기업신용평가", 지능정보연구, 17권 3호(2011), 25-41.
  4. 안현철, 김경재, 한인구, "다분류 Support Vector Machine을 이용한 한국기업의 지능형 기업채권평가모형", 경영학연구, 35권 5호(2006), 1479-1496.
  5. 옥중경, 김경재, "유전자 알고리즘 기반의 기업부실예측 통합모형", 지능정보연구, 15권 4호(2009), 99-121.
  6. Bruzzone, L. and S. B. Serpico, "Classifications of imbalanced remote-sensing data by neural networks", Pattern recognition letters, Vol.18, No.11-13(1997), 1323-1328. https://doi.org/10.1016/S0167-8655(97)00109-8
  7. Cao, L. and F. E. H. Tay, "Financial forecasting using support vector machines", Neural Computing and Applications, Vol.10(2001), 184- 192. https://doi.org/10.1007/s005210170010
  8. Chawla, N., A. Lazarevic, L. Hall, and K. Bowyer, "SMOTEBoost : Improving prediction of the minority class in boosting", 7th European conference on principles and practice of knowledge discovery in databases(2003), Cavtatv Dubrovnik, Croatia, 107-119.
  9. Chaveesuk, R., Srivaree-Ratana, C., and A. E. Smith, "Alternative neural network approaches to corporate bond rating", Journal of Engineering Valuation and Cost Analysis, Vol.2, No.2(1993), 117-131.
  10. Cover, T. M. and J. A. Thomas, Element of information theory, John Wiley and Sons, 1991.
  11. Darbellay, G. A., "An estimator of the mutual information based on a criterion for independence", Computational Statistics and Data Analysis, Vol.32(1999), 1-17. https://doi.org/10.1016/S0167-9473(99)00020-1
  12. Dutta, S. and S. Shekhar, "Bond rating : A non-conservative application application of neural networks", Proceedings of IEEE International Conference on Neural Networks, (1988), II443-II450.
  13. Ederington, H. L., "Classification models and bond ratings", Financial Review, Vol.20, No.4(1985), 237-262. https://doi.org/10.1111/j.1540-6288.1985.tb00306.x
  14. Fawcett, T. and F. Provost, "Adaptive fraud detection", Data Mining and Knowledge discovery, Vol.1, No.3(1997), 291-316. https://doi.org/10.1023/A:1009700419189
  15. Fisher, L., "Determinants of risk premiums on corporate bonds", Journal of Political Economy, Vol.67(1959), 217-237. https://doi.org/10.1086/258172
  16. Freund, Y. and R. E. Schapire, "A decision theoretic generalization of online learning and an application to boosting", Journal of Computer and System Science, Vol.55, No.1(1997), 119- 139. https://doi.org/10.1006/jcss.1997.1504
  17. Gentry, J. A., Whitford, D. T., and P. Newbold, "Predicting industrial bond ratings with a probit model and funds flow components", Financial Review, Vol.23, No.3(1988), 269-286. https://doi.org/10.1111/j.1540-6288.1988.tb01267.x
  18. Hong, X., "A kernel-based two-class classifier for imbalanced data sets", IEEE Transactions on neural networks, Vol.18, No.1(2007), 28-40. https://doi.org/10.1109/TNN.2006.882812
  19. Hsu, C. W. and C. J. Lin, "A Comparison of Methods for Multiclass Support Vector Machines", IEEE Transactions on Neural Networks, Vol.13, No.2(2002), 415-425. https://doi.org/10.1109/72.991427
  20. Huang, Zan, Chen, Hsinchun, Hsu, Chia-Jung, Chen, Wun-Hwa, and Wu, Soushan, "Credit rating analysis with support vector machines and neural networks. A market comparative study", Decision Support Systems, Vol.37(2004), 543-558. https://doi.org/10.1016/S0167-9236(03)00086-1
  21. Jackson, J. D. and J. W. Boyd, "A statistical approach to modeling the behavior of bond raters", The Journal of Behavioral Economics, Vol.17, No.3(1988), 173-193. https://doi.org/10.1016/0090-5720(88)90008-3
  22. Kim, K., "Financial time series forecasting using support vector machines", Neurocomputing, Vol.55(2004), 307-319.
  23. Kotsiantis, S., D. Tzelepis, E. Kounmanakos, and V. Tampakas, "Selective costing voting for bankruptcy prediction", International Journal of Knowledge-based and Intelligent Engineering Systems, Vol.11(2007), 115-127. https://doi.org/10.3233/KES-2007-11204
  24. Kubat, M., Holte, R., and S. Matwin, "Learning when Negative example abound", Proceedings of the 9th European Conference on Machine Learning, ECML'97, 1997.
  25. Kwon, Y. S., Han, I. G., and K. C. Lee, "Ordinal Pairwise Partitioning (OPP) approach to neural networks training in bond rating", Intelligent Systems in Accounting, Finance and Management, Vol.6(1997), 23-40. https://doi.org/10.1002/(SICI)1099-1174(199703)6:1<23::AID-ISAF113>3.0.CO;2-4
  26. Maher, J. J. and T. K. Sen, "Predicting bond ratings using neural networks : A comparison with logistic regression", Intelligent Systems in Accounting, Finance and Management, Vol.6(1997), 59-72. https://doi.org/10.1002/(SICI)1099-1174(199703)6:1<59::AID-ISAF116>3.0.CO;2-H
  27. Maia, T. T., A. P. Braga, and A. F. Carvalho, "Hybrid classification algorithms based on boosting and support vector machines", Kybernetes, Vol.37, No.9(2008), 1469-1491. https://doi.org/10.1108/03684920810907814
  28. Min, S. H., J. M. Lee, and I. G. Han, Hybrid genetic algorithms and support vector machines for bankruptcy prediction, Expert Systems with Applications, Vol.31(2006), 652-660. https://doi.org/10.1016/j.eswa.2005.09.070
  29. Optiz, D. and R. Maclin, "Popular ensemble methods : an empirical study", Journal of Artificial Intelligence, Vol.11(1999), 169-198.
  30. Pinches, G. E. and K. A. Mingom, "A multivariate analysis of industrial bond ratings", Journal of Finance, Vol.28, No.1(1973), 1-18. https://doi.org/10.1111/j.1540-6261.1973.tb01341.x
  31. Platt, J., "Fast Training of Support Vector Machines using Sequential Minimal Optimization. In B. Schoelkopf, C. Burges, and A. Smola, (Eds.)", Advances in Kernel MethodsSupport Vector Learning, MIT Press, 1998.
  32. Pogue, T. F. and R. M. Soldofsky, "What's in a bond rating?", Journal of Financial and Quantitative Analysis, Vol.4, No.2(1969), 201-228. https://doi.org/10.2307/2329840
  33. Seiffert, C., T. M. Khoshgoftaar, J. Van Hulse, and A. Napolitano, "RUSBoost : Improving classification performance when training data is skewed", 19th International Conference on Pattern Recognition, (2008), 1-4.
  34. Shin, H. J. and S. Z. Cho, "Response modeling with support vector machines", Expert Systems with applications, Vol.30, No.4(2006), 746-760. https://doi.org/10.1016/j.eswa.2005.07.037
  35. Shin, K., T. Lee, and H. Kim, "An application of support vector machines in bankruptcy prediction", Expert Systems with Applications, Vol.28(2005), 127-135. https://doi.org/10.1016/j.eswa.2004.08.009
  36. Tay, F. E. J. and L. J. Cao, "Modified support vector machine in financial time series forecasting", Neurocomputing, Vol.48(2002), 847-861. https://doi.org/10.1016/S0925-2312(01)00676-2
  37. Vapnik, V. N., "The nature of statistical learning theory", New York:Springer, 1995.
  38. Wang, B. X. and N. Japkowicz, "Boosting support vector machines for imbalanced data sets", Knowledge and Information Systems, Vol.25(2010), 1-10. https://doi.org/10.1007/s10115-009-0198-y
  39. Weiss, G. M., "Mining with rarity : A unifying framework", SIGKDD Explorations, Vol.T, No.1(2004), 7-19.
  40. West, R. R., "An alternative approach to predicting corporate bond ratings", Journal of Accounting Research, Vol.8, No.1(1970), 118-125. https://doi.org/10.2307/2674717
  41. Wu, G. and E. Chang, "Adaptive feature-space conformal transformation for imbalanced data learning", Proceedings of the 20th International Conference on Machine Learning, 2003.
  42. Wu, G. and E. Chang, "KBA : Kernel boundary alignment considering imbalanced data distribution", IEEE Transactions on knowledge and data engineering, Vol.17, No.6(2005), 786- 795. https://doi.org/10.1109/TKDE.2005.95
  43. Wu, G. Y. Wu, L. Jiao, Y. F. Wang, and E. Chang, "Multi-camera spatio-temporal fusion and biased sequence-data learning for security surveillance", Proceedings of 20th International Conference on Multimedia, 2003.

Cited by

  1. 경영분석지표와 의사결정나무기법을 이용한 유상증자 예측모형 개발 vol.18, pp.4, 2012, https://doi.org/10.13088/jiis.2012.18.4.059