DOI QR코드

DOI QR Code

Impact of Ensemble Member Size on Confidence-based Selection in Bankruptcy Prediction

부도예측을 위한 확신 기반의 선택 접근법에서 앙상블 멤버 사이즈의 영향에 관한 연구

  • 김나라 (이화여자대학교 경영학과) ;
  • 신경식 (이화여자대학교 경영학과) ;
  • 안현철 (국민대학교 경영대학 경영정부학부)
  • Received : 2013.05.22
  • Accepted : 2013.06.19
  • Published : 2013.06.30

Abstract

The prediction model is the main factor affecting the performance of a knowledge-based system for bankruptcy prediction. Earlier studies on prediction modeling have focused on the building of a single best model using statistical and artificial intelligence techniques. However, since the mid-1980s, integration of multiple techniques (hybrid techniques) and, by extension, combinations of the outputs of several models (ensemble techniques) have, according to the experimental results, generally outperformed individual models. An ensemble is a technique that constructs a set of multiple models, combines their outputs, and produces one final prediction. The way in which the outputs of ensemble members are combined is one of the important issues affecting prediction accuracy. A variety of combination schemes have been proposed in order to improve prediction performance in ensembles. Each combination scheme has advantages and limitations, and can be influenced by domain and circumstance. Accordingly, decisions on the most appropriate combination scheme in a given domain and contingency are very difficult. This paper proposes a confidence-based selection approach as part of an ensemble bankruptcy-prediction scheme that can measure unified confidence, even if ensemble members produce different types of continuous-valued outputs. The present experimental results show that when varying the number of models to combine, according to the creation type of ensemble members, the proposed combination method offers the best performance in the ensemble having the largest number of models, even when compared with the methods most often employed in bankruptcy prediction.

부도예측을 위한 지식기반시스템에서 모델은 실적에 영향을 끼치는 주요한 요인이다. 예측 모형의 개발에 있어 초기 연구들은 통계기법 및 인공지능기법들을 이용하여 최고 실적을 가지는 단일 모델을 만드는데 주력하였다. 1980년대 중반 이후에는 다수 기술의 통합(하이브리드), 더 나아가, 다수 모델의 결과의 결합(앙상블) 기법이 수많은 실험에서 개별 모델들보다 더 나은 결과를 보여왔다. 다수 모델들의 출력값들을 결합하여 한 개의 최종 예측값을 산출하는 앙상블 모델링에서 결합기법은 앙상블의 예측 정확도에 영향을 끼치는 중요한 이슈이다. 본 논문은 부도예측을 위한 앙상블 결합기법으로서 앙상블 멤버들이 다른 유형의 연속형 수치 출력값들을 산출하더라도 통일된 확신을 측정할 수 있는 확신 기반의 선택 접근법을 제안하고 이에 대한 앙상블 멤버 사이즈의 영향을 연구하였다. 실험 결과는 앙상블 멤버들의 생성 타입에 따라 결합하는 모델 개수를 변화시켰을 때 가장 많은 기본 모델들을 가지는 앙상블에서의 제안 결합기법이 부도예측에 가장 자주 사용되는 다른 방법들에 비해서도 가장 높은 실적을 가진다는 것을 보였다.

Keywords

References

  1. Abdelwahed, T. and E. M. Amir, "New Evolutionary Bankruptcy Forecasting Model based on Genetic Algorithms and Neural Networks," Proceedings of the 17th IEEE International Conference on Tools with Artificial Intelligence, (2005).
  2. Alfaro, E., N. Garcia, M. Gamez, and D. Elizondo, "Bankruptcy forecasting : an empirical comparison of AdaBoost and neural networks," Decision Support Systems, Vol.45(2008), 110-122. https://doi.org/10.1016/j.dss.2007.12.002
  3. Altman, E. I., "Financial Ratios, Discriminant Analysis and the Prediction of Corporate Bankruptcy," The Journal of Finance, Vol.23, No.4(1968), 589-609. https://doi.org/10.1111/j.1540-6261.1968.tb00843.x
  4. Atiya, A. F., "Bankruptcy Prediction for Credit Risk Using Neural Networks : A Survey and New Results," IEEE transactions on neural networks, Vol.12, No.4(July, 2001), 929-935. https://doi.org/10.1109/72.935101
  5. Back, B., T. Laitinen, and K. Sere, "Neural networks and genetic algorithms for bankruptcy predictions," Expert Systems with Applications, Vol.11(1996), 407-413. https://doi.org/10.1016/S0957-4174(96)00055-3
  6. Beaver, W., "Financial Ratios as Predictors of Failure, Journal of Accounting Research, Vol.4(1966), 71-111. https://doi.org/10.2307/2490171
  7. Breiman, L., "Bagging predictors," Machine Learning, Vol.24, No.2(1996), 497-501.
  8. Chan, A. P. F., W. Y. Ng, D. S. Yeung, E. C. C. Tsang, and M. Firth, "Bankruptcy prediction using multiple classifier system with mutual information feature grouping," IEEE international conference on systems, man, and cybernetics, (2006), 845-850.
  9. Chen, W. S. and Y. K. Du, "Using Neural Networks and Data Mining Techniques for the Financial Distress Prediction Model," Expert Systems with Applications, Vol.36(2009), 4075-4086. https://doi.org/10.1016/j.eswa.2008.03.020
  10. Cho, S. B. and J. H. Kim, "Combining multiple neural networks by fuzzy integral for robust classification," IEEE Transactions on System, Man, and Cybernetics, Vol.25, No.2(1995), 380-384. https://doi.org/10.1109/21.364825
  11. Clemen, R. T., "Combining forecasts : A review and annotated bibliography," International Journal of Forecasting, Vol.5(1989), 559-583. https://doi.org/10.1016/0169-2070(89)90012-5
  12. Duda, R. O., P. E. Hart, and D. G. Stork, Pattern Classification, 2nd ed. John Wiley & Sons, NY, 2001.
  13. Finlay, S., "Multiple classifier architectures and their application to credit risk assessment," European Journal of Operational Research, Vol.210(2011), 368-378. https://doi.org/10.1016/j.ejor.2010.09.029
  14. Freund, Y. and R. E. Schapire, "A decisiontheoretic generalization of on-line learning and an application to boosting," Unpublished manuscript. Quoted in Quinlan, J.R., "Bagging, boosting, and C4. 5," Proceedings of the National Conference on Artificial Intelligence, (1996).
  15. Hansen, L. K. and P. Salamon, "Neural network ensembles," IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol.12, No.10(1990), 993-1001. https://doi.org/10.1109/34.58871
  16. Hashem, S., "Optimal linear combinations of neural networks," Neural Networks, Vol.10, No.4 (1997), 599-614. https://doi.org/10.1016/S0893-6080(96)00098-6
  17. Hayashia, Y., and R. Setionob, "Combining neural network predictions for medical diagnosis," Computers in Biology and Medicine, Vol.32 (2002), 237-246. https://doi.org/10.1016/S0010-4825(02)00006-9
  18. Heskes, T., Balancing between bagging and bumping, In : Mozer, M. C., Jordan, M. I., Petsche, T. (Eds.), Advances in Neural Information Processing Systems 9. Cambridge, MIT Press, (1997), 466-472.
  19. Ho, T. K., J. J. Hull, and S. N. Srihari, "Decision combination in multiple classifier systems," Pattern Analysis and Machine Intelligence, Vol.16, No.1(1994), 66-75. https://doi.org/10.1109/34.273716
  20. Hong, S. H. and K. S. Shin, "Using GA based Input Selection Method for Artificial Neural Network Modeling Application to Bankruptcy Prediction," Journal of Intelligence and Information Systems, Vol.9, No.1(2003), 227-249.
  21. IBM SPSS Modeler Help, Modeling Nodes, Neural Networks, embedded in IBM SPSS Modeler, (2012).
  22. Jo, H. K. and I. G. Han, "Integration of Case-Based Forecasting, Neural Network, and Discriminant Analysis for Bankruptcy Prediction," Expert Systems With Applications, Vol.2, No.4(1996), 415-422.
  23. Jo, H. K., I. G. Han, and H. Lee, "Bankruptcy prediction using case-based reasoning, neural networks, and discriminant analysis," Expert Systems With Applications, Vol.13(1997), 97-108. https://doi.org/10.1016/S0957-4174(97)00011-0
  24. Kim, K. J, "Data Mining using Instance Selection in Artificial Neural Networks for Bankruptcy Prediction," Journal of Intelligence and Information Systems, Vol.10, No.1(2004), 109-123.
  25. Kim, M. J. and D. K. Kang, "Ensemble with neural networks for bankruptcy prediction," Expert Systems with Applications, Vol.37(2010), 3373-3379. https://doi.org/10.1016/j.eswa.2009.10.012
  26. Kuncheva, L. I., Combining pattern classifiers : Methods and algorithms, Hoboken, New Jersey, John Wiley & Sons, 2004.
  27. Leshno, M. and Y. Spector, "Neural network prediction analysis : The bankruptcy case," Neurocomputing, Vol.10(1996), 125-147. https://doi.org/10.1016/0925-2312(94)00060-3
  28. Lin, F. Y. and S. McClean, "A data mining approach to the prediction of corporate failure," Knowledge-Based Systems, Vol.14(2001), 189-195. https://doi.org/10.1016/S0950-7051(01)00096-X
  29. Mar-Molinero, C. and C. Serrano-Cinca, "Bank failure : a multidimensional scaling approach," The European Journal of Finance, Vol.7 (2001), 165-183. https://doi.org/10.1080/13518470122202
  30. Neophytou, E. and C. Mar-Molinero, "Predicting Corporate Failure in the UK : A Multidimensional Scaling Approach," Journal of Business Finance and Accounting, Vol.31 (2004), 667-710.
  31. Ruta, D. and B. Gabrys, "An Overview of Classifier Fusion Methods," Computing and Information Systems, Vol.7(2000), 1-10.
  32. Odom, M. and R. Sharda, "A neural network model for bankruptcy prediction," Proc. Int. Joint Conf. Neural Networks, San Diego, CA, (1990).
  33. Ohlson, J., "Financial ratios and the probabilistic prediction of bankruptcy," Journal of Accounting Research, Vol.18, No.1(1980), 109-131. https://doi.org/10.2307/2490395
  34. Olmeda, I. and E. Fernandez, "Hybrid classifiers for financial multicriteria decision making_the case of bankruptcy prediction," Computational Economics, Vol.10(1997), 317-335. https://doi.org/10.1023/A:1008668718837
  35. Pal, S. K. and A. Pal, Pattern Recognition : From Classical to Modern Approaches, World Scientific, 2001.
  36. Platt, J. C., Probabilistic outputs for support vector machines and comparisons to regularized likelihood methods, In Advances in large margin classifiers, Alexander J. S., P. Bartlett, B, Scholkopf, D. Schuurmans, eds., Cambridge, MIT Press, 1999.
  37. Ravi, V., H. Kurniawana, P. Thaia, and P. Kumarb, "Soft computing system for bank performance prediction," Applied Soft Computing, Vol.8 (2008), 305-315. https://doi.org/10.1016/j.asoc.2007.02.001
  38. Sewell, M., "Ensemble Learning," Research Note, RN/11/02, Dept. of Computer Science, UCL, 2011.
  39. Shin, K. S. and I. G. Han, "Bankruptcy Prediction Modeling Using Multiple Neural Network Models," Proceedings of the Fall Conference on the Korean Operations Research and Management Science Society (KORMS), (1998), 145-148.
  40. Shin, K. S. and K. J. Lee, "Bankruptcy Prediction Modeling Using Multiple Neural Network Models," Lecture Notes in Computer Science, Vol.3214(2004), 668-674.
  41. Shmueli, G., N. R. Patel, and P. C. Bruce, Data Mining for Business Intelligence : Concepts, Techniques, and Applications in Microsoft Office Excel With Xlminer, Hardcover, 2nd Edition, Wiley, 2010.
  42. Sun, J. and Li, h. "Listed companies' financial distress prediction based on weighted majority voting combination of multiple classifiers," Expert Systems with Applications, Vol.35 (2008), 818-827. https://doi.org/10.1016/j.eswa.2007.07.045
  43. Tai, Q. Y. and K. S. Shin, "GA-based Normalization Approach in Back-propagation Neural Network for Bankruptcy Prediction Modeling," Journal of Intelligence and Information Systems, Vol.16, No.3(2010), 1-14.
  44. Taniguchi, M. and V. Tresp, "Averaging regularized estimators," Neural Computation, Vol.9(1997), 1163-1178. https://doi.org/10.1162/neco.1997.9.5.1163
  45. Tsai, C. F. and J. W. Wu, "Using Neural Network Ensembles for Bankruptcy Prediction and Credit Scoring," Expert Systems with Applications, Vol.34(2008), 2639-2649. https://doi.org/10.1016/j.eswa.2007.05.019
  46. Verikas, A., A. Lipnickas, K. Malmqvist, M. Bacauskiene, and A. Gelzinis, "Soft combination of neural classifiers : A comparative study," Pattern Recognition Letters, Vol.20 (1999), 429-444. https://doi.org/10.1016/S0167-8655(99)00012-4
  47. Verikas, A., Z. Kalsyte, M. Bacauskiene, and A. Gelzinis, "Hybrid and ensemble-based soft computing techniques in bankruptcy prediction : a survey," Soft Computing, Vol.14 (2010), 995-1010. https://doi.org/10.1007/s00500-009-0490-5
  48. Wahba, G., Multivariate function and operator estimation, based on smoothing splines and reproducing kernels, Nonlinear Modeling and Forecasting, SFI Studies in the Sciences of Complexity, Proc. Vol.XII, Eds. M. Casdagli and S. Eubank, Addison-Wesley, 1992.
  49. Wahba, G., "Support vector machines, reproducing kernel Hilbert spaces and the randomized GACV," Technical Report, NO.984rr, Department of Statistics, University of Wisconsin, 1998.
  50. West, D., S. Dellana, and J. Qian, "Neural network ensemble strategies for financial decision applications," Computers and Operations Research, Vol.32(2005), 2543-2559. https://doi.org/10.1016/j.cor.2004.03.017
  51. Xu, L., A. Krzyzak, and C. Y. Suen, "Methods for combining multiple classifiers and their applications to handwriting recognition," IEEE Transactions on Systems, Man, and Cybernetics, Vol.22(1992), 418-435. https://doi.org/10.1109/21.155943

Cited by

  1. KOSDAQ 시장의 관리종목 지정 탐지 모형 개발 vol.24, pp.3, 2013, https://doi.org/10.13088/jiis.2018.24.3.157