DOI QR코드

DOI QR Code

System Trading using Case-based Reasoning based on Absolute Similarity Threshold and Genetic Algorithm

절대 유사 임계값 기반 사례기반추론과 유전자 알고리즘을 활용한 시스템 트레이딩

  • 한현웅 (국민대학교 비즈니스IT전문대학원) ;
  • 안현철 (국민대학교 비즈니스IT전문대학원)
  • Received : 2017.06.14
  • Accepted : 2017.09.28
  • Published : 2017.09.29

Abstract

Purpose This study proposes a novel system trading model using case-based reasoning (CBR) based on absolute similarity threshold. The proposed model is designed to optimize the absolute similarity threshold, feature selection, and instance selection of CBR by using genetic algorithm (GA). With these mechanisms, it enables us to yield higher returns from stock market trading. Design/Methodology/Approach The proposed CBR model uses the absolute similarity threshold varying from 0 to 1, which serves as a criterion for selecting appropriate neighbors in the nearest neighbor (NN) algorithm. Since it determines the nearest neighbors on an absolute basis, it fails to select the appropriate neighbors from time to time. In system trading, it is interpreted as the signal of 'hold'. That is, the system trading model proposed in this study makes trading decisions such as 'buy' or 'sell' only if the model produces a clear signal for stock market prediction. Also, in order to improve the prediction accuracy and the rate of return, the proposed model adopts optimal feature selection and instance selection, which are known to be very effective in enhancing the performance of CBR. To validate the usefulness of the proposed model, we applied it to the index trading of KOSPI200 from 2009 to 2016. Findings Experimental results showed that the proposed model with optimal feature or instance selection could yield higher returns compared to the benchmark as well as the various comparison models (including logistic regression, multiple discriminant analysis, artificial neural network, support vector machine, and traditional CBR). In particular, the proposed model with optimal instance selection showed the best rate of return among all the models. This implies that the application of CBR with the absolute similarity threshold as well as the optimal instance selection may be effective in system trading from the perspective of returns.

Keywords

References

  1. 김선웅, 안현철. "Support Vector Machines 와 유전자 알고리즘을 이용한 지능형 트레이딩 시스템 개발." 지능정보연구 16권 1호, 2010, pp.71-92.
  2. 박윤주, "통계적 분석 기법을 기반으로 한 사례 기반추론에 대한 연구," 박사학위논문, 경영공학전공, 한국과학기술원, 2006.
  3. 안현철, 김경재, 한인구, "효과적인 고객관계관리를 위한 사례기반추론 동시 최적화 모형." 지능정보연구 11권 2호, 2005a, pp.175-195.
  4. 안현철, 이형용. "투자 의사결정 지원을 위한 유전자 알고리즘 기반의 다중 인공지능기법 결합 모형: KOSPI 에의 응용." e-비즈니스연구 10권 1호, 2009, pp.215-236.
  5. 안현철. "사례기반추론의 유사 임계치 및 커버리지 최적화." 정보처리학회논문지. 소프트웨어 및 데이터 공학 2권 8호, 2013, pp.535-542.
  6. 옥중경, 김경재. "유전자 알고리즘 기반의 기업 부실예측 통합모형." 지능정보연구 15권 4호, 2009, pp.99-120.
  7. 이우식. "딥러닝분석과 기술적 분석 지표를 이용한 한국 코스피주가지수 방향성 예측." 한국데이터정보과학회지 28권 2호, 2017, pp.287-295.
  8. 이재식, 송영균, 허성회. "인공신경망 앙상블을 이용한 옵션 투자예측 시스템," 한국지능정보시스템학회 학술대회논문집, 2000, pp.489-497.
  9. 이형용. "한국 주가지수 등락 예측을 위한 유전자 알고리즘 기반 인공지능 예측기법결합모형." Entrue Journal of Information Technology Vol.7, No.2, 2008, pp.33-43.
  10. 이훈영, 박기남. "사례기반예측시스템의 정확한 예측을 위한 최적 결합 사례개수결정방법에 관한 연구." 경영학연구 27권 5호, 1999, pp.1239-1252.
  11. 송성환, 황선호, 이용희, 이현경, 한경석, 김종배, "트레이딩을 위한 소셜 빅데이터 분석 모델", Asia-pacific Journal of Multimedia Services Convergent with Art, Humanities, and Sociology, Vol.6 No.3, 2016, pp.91-100.
  12. 송유정, 이종우, "텐서플로우를 이용한 주가 변동 예측 딥러닝 모델 설계 및 개발." 한국정보과학회 학술발표논문집, 2017, pp.799-801.
  13. 홍승현, 신경식, "유전자 알고리즘을 활용한 인공신경망 모형 최적입력변수의 선정: 부도예측 모형을 중심으로." 한국지능정보시스템학회 9권 1호, 2003, pp.227-249.
  14. Aamodt, A., and Plaza, E.. "Case-based reasoning: Foundational issues, methodological variations, system approaches." AI communications, Vol.7, No.1, 1994, pp.39-59.
  15. Achelis, S. B., Technical Analysis from A to Z. New York: McGraw Hill, 2001.
  16. Ahn, H., and Kim, K.-j., "Using genetic algorithms to optimize nearest neighbors for data mining," Annals of Operations Research, Vol. 163, No.1, 2008, pp. 5-18. https://doi.org/10.1007/s10479-008-0325-2
  17. Ahn, H., and Kim, K. J., "Bankruptcy prediction modeling with hybrid case-based reasoning and genetic algorithms approach." Applied Soft Computing 9.2 (2009): 599-607. https://doi.org/10.1016/j.asoc.2008.08.002
  18. Ahn, H., Kim, K J.., and Han, I., "Global optimization of feature weights and the number of neighbors that combine in a case‐based reasoning system." Expert Systems, Vol.23, No.5, 2006a, pp.290-301. https://doi.org/10.1111/j.1468-0394.2006.00410.x
  19. Ahn, H., Kim, K. J., and Han, I., "A case-based reasoning system with the twodimensional reduction technique for customer classification." Expert Systems with Applications, Vol.32 No.4, 2007, pp.1011-1019. https://doi.org/10.1016/j.eswa.2006.02.021
  20. Ahn, H., Kim, K. J., and Han, I., "Hybrid genetic algorithms and case‐based reasoning systems for customer classification." Expert Systems, Vol.23, No.3, 2006b, pp.127-144. https://doi.org/10.1111/j.1468-0394.2006.00329.x
  21. Alexander, S. S. "Price Movements in Speculative Markets: Trends or Random Walks, Number 2." IMR; Industrial Management Review (pre-1986), Vol.5, No.2, 1964, 25.
  22. Atsalakis, G. S., and Valavanis, K. P., "Forecasting stock market short-term trends using a neuro-fuzzy based methodology." Expert Systems with Applications, Vol.36, No.7, 2009a, pp.10696-10707. https://doi.org/10.1016/j.eswa.2009.02.043
  23. Atsalakis, G. S., and Valavanis, K. P., "Surveying stock market forecasting techniques-Part II: Soft computing methods." Expert Systems with Applications, Vol.36, No.3, 2009b, pp.5932-5941. https://doi.org/10.1016/j.eswa.2008.07.006
  24. Babu, T. R. and M. N. Murty, "Comparison of genetic algorithm based prototype selection schemes", Pattern Recognition, Vol.34, No.2, 2001, pp.523-525. https://doi.org/10.1016/S0031-3203(00)00094-7
  25. Bao, D., and Yang. Z., "Intelligent stock trading system by turning point confirming and probabilistic reasoning." Expert Systems with Applications, Vol.34, No.1 (2008, pp.620-627. https://doi.org/10.1016/j.eswa.2006.09.043
  26. Brock, W., Lakonishok, J., and LeBaron, B, "Simple technical trading rules and the stochastic properties of stock returns." The Journal of Finance, Vol.47, No.5, 1992, pp.1731-1764. https://doi.org/10.1111/j.1540-6261.1992.tb04681.x
  27. Caporale, G. M., and Pittis, N., "Cointegration and predictability of asset prices." Journal of International Money and Finance, Vol.17, No.3, 1998, pp.441-453. https://doi.org/10.1016/S0261-5606(98)00015-1
  28. Cardie, C., "Using decision trees to improve case-based learning", Proceedings of the Tenth International Conference on Machine Learning, San Francisco, CA, 1993, pp.25-32.
  29. Chang, C. C., and Lin, C. J.,. "LIBSVM: a library for support vector machines." ACM Transactions on Intelligent Systems and Technology (TIST), Vol.2, No.3, 2011, 27.
  30. Chavarnakul, T., and Enke, D., "A hybrid stock trading system for intelligent technical analysis-based equivolume charting." Neurocomputing, Vol.72, No.16, 2009, pp.3517-3528. https://doi.org/10.1016/j.neucom.2008.11.030
  31. Chiu, C., "A case-based customer classification approach for direct marketing." Expert Systems with Applications, Vol.22, No.2, 2002, pp.163-168. https://doi.org/10.1016/S0957-4174(01)00052-5
  32. Chiu, C., Chang, P. C., and Chiu, N. H., "A case-based expert support system for due-date assignment in a wafer fabrication factory." Journal of Intelligent Manufacturing, Vol.14, No.3, 2003, pp.287-296. https://doi.org/10.1023/A:1024693524603
  33. Choi, J. "Technical indicators." Seoul: Jinritamgu Publishing, 1995.
  34. Chun, S. H., and Park, Y. J., "Dynamic adaptive ensemble case-based reasoning: application to stock market prediction." Expert Systems with Applications, Vol.28, No.3, 2005, pp.435-443. https://doi.org/10.1016/j.eswa.2004.12.004
  35. Dao, T., and Ahn, H., "An Optimized Combination of ${\pi}$-fuzzy Logic and Support Vector Machine for Stock Market Prediction," Journal of Intelligence and Information Systems, Vol.20, No.4, 2014, pp.43-58. https://doi.org/10.13088/jiis.2014.20.4.43
  36. Donaldson, R. G., and Kamstra, M., "An artificial neural network-GARCH model for international stock return volatility." Journal of Empirical Finance, Vol.4, No.1, 1997, pp.17-46. https://doi.org/10.1016/S0927-5398(96)00011-4
  37. Elton, E. J. and M. J. Gruber, Modern Portfolio Theory and Investment Analysis, Wiley, 1984.
  38. Fama, E. F. "The behavior of stock-market prices." The journal of Business, Vol.38, No.1, 1965, pp.34-105. https://doi.org/10.1086/294743
  39. Fama, E. F., "Efficient capital markets: A review of theory and empirical work." The journal of Finance, Vol.25, No.2, 1970, pp.383-417. https://doi.org/10.1111/j.1540-6261.1970.tb00518.x
  40. Fuller, R. J., & Kling, J. L., "Is the stock market predictable?." The Journal of Portfolio Management, Vol.16, No.4, 1990, pp.28-36. https://doi.org/10.3905/jpm.1990.409279
  41. Granger, C. W, "Some properties of time series data and their use in econometric model specification." Journal of econometrics, Vol.16, No.1, 1981, pp.121-130. https://doi.org/10.1016/0304-4076(81)90079-8
  42. Hart, P.E., "The condensed nearest neighbor rule", IEEE Transactions on Information Theory, Vol.14, No.3, 1968, pp.515-516. https://doi.org/10.1109/TIT.1968.1054155
  43. Huang, Y. S., C.C. Chiang, J. W. Shieh and E. Grimson, "Prototype optimization for nearest-neighbor classification", Pattern Recognition, Vol. 35, No.6, 2002, pp.1237-1245. https://doi.org/10.1016/S0031-3203(01)00124-8
  44. i Guiu, J. G., i Ribé, E. G., i Mansilla, E. B., and i Fàbrega, X. L., "Automatic diagnosis with genetic algorithms and case-based reasoning." Artificial Intelligence in Engineering, Vol.13, No.4, 1999, pp.367-372. https://doi.org/10.1016/S0954-1810(99)00009-6
  45. Jarmulak, J., Craw, S., and Rowe, R., "Self-optimising CBR retrieval." Tools with Artificial Intelligence, 2000. ICTAI 2000. Proceedings. 12th IEEE International Conference on. IEEE, 2000.
  46. Kim, K. J., "Toward global optimization of case-based reasoning systems for financial forecasting." Applied intelligence, Vol.21, No.3, 2004, pp.239-249. https://doi.org/10.1023/B:APIN.0000043557.93085.72
  47. Kim, K.-j., and Ahn, H., "Simultaneous optimization of artificial neural networks for financial forecasting," Applied Intelligence, Vol.36, No.4, 2012, pp.887-898. https://doi.org/10.1007/s10489-011-0303-2
  48. Kim, K. J., and Han, I., "Application of a hybrid genetic algorithm and neural network approach in activity-based costing." Expert Systems with Applications, Vol.24, No.1, 2003, pp.73-77. https://doi.org/10.1016/S0957-4174(02)00084-2
  49. Kim, K. J., and Han, I., "Genetic algorithms approach to feature discretization in artificial neural networks for the prediction of stock price index." Expert systems with Applications, Vol.19, No.2, 2000, pp.125-132. https://doi.org/10.1016/S0957-4174(00)00027-0
  50. Kim, K. J., and Han, I., "Maintaining case-based reasoning systems using a genetic algorithms approach." Expert Systems with Applications, Vol.21, No.3, 2001, pp.139-145. https://doi.org/10.1016/S0957-4174(01)00035-5
  51. Kim, K. J., and Lee, W. B., "Stock market prediction using artificial neural networks with optimal feature transformation." Neural computing & applications, Vol.13, No.3, 2004, pp.255-260. https://doi.org/10.1007/s00521-004-0428-x
  52. Kuncheva, L. I., and Jain, L. C., "Nearest neighbor classifier: Simultaneous editing and feature selection." Pattern recognition letters, Vol.20, No.11, 1999, pp.1149-1156. https://doi.org/10.1016/S0167-8655(99)00082-3
  53. Lipowezky, U., "Selection of the optimal prototype subset for 1-NN classification", Pattern Recognition Letters, Vol.19, No.10, 1998, pp.907-918. https://doi.org/10.1016/S0167-8655(98)00075-0
  54. Lo, A. W., and MacKinlay, A. C., "Stock market prices do not follow random walks: Evidence from a simple specification test." Review of financial studies, Vol.1, No.1, 1988, pp.41-66. https://doi.org/10.1093/rfs/1.1.41
  55. Malkiel, B. G., A random walk down Wall Street: including a life-cycle guide to personal investing. WW Norton & Company, 1999.
  56. McMillan, D. G., :"Non-linear forecasting of stock returns: Does volume help?." International Journal of forecasting, Vol.23, No.1, 2007, pp.115-126. https://doi.org/10.1016/j.ijforecast.2006.06.002
  57. Nunez-Letamendia, L., "Fitting the control parameters of a genetic algorithm: An application to technical trading systems design." European journal of operational research, Vol.179, No.3, 2007, pp.847-868. https://doi.org/10.1016/j.ejor.2005.03.067
  58. Poon, S. H., and Taylor, S. J., "Stock returns and volatility: an empirical study of the UK stock market." Journal of banking & finance, Vol.16, No.1, 1992, pp.37-59. https://doi.org/10.1016/0378-4266(92)90077-D
  59. Sanchez, J. S., F. Pla and F. J. Ferri, "Prototype selection for the nearest neighbour rule through proximity graphs", Pattern Recognition Letters, Vol.18, No.6, 1997, pp.507-513. https://doi.org/10.1016/S0167-8655(97)00035-4
  60. Schulmeister, S., "Profitability of technical stock trading: Has it moved from daily to intraday data?." Review of Financial Economics, Vol.18, No.4, 2009, pp.190-201. https://doi.org/10.1016/j.rfe.2008.10.001
  61. Shin, K. S., and Han, I., "Case-based reasoning supported by genetic algorithms for corporate bond rating." Expert Systems with Applications, Vol.16, No.2, 1999, pp.85-95. https://doi.org/10.1016/S0957-4174(98)00063-3
  62. Siedlecki, W., and Sklansky, J., "A note on genetic algorithms for large-scale feature selection." Pattern recognition letters, Vol.10, No.5, 1989, pp.335-347. https://doi.org/10.1016/0167-8655(89)90037-8
  63. Silvapulle, P., and Choi, J. S., "Testing for linear and nonlinear Granger causality in the stock price-volume relation: Korean evidence." The Quarterly Review of Economics and Finance, Vol.39, No.1, 1999, pp.59-76. https://doi.org/10.1016/S1062-9769(99)80004-0
  64. Skalak, D.B., "Prototype and feature selection by sampling and random mutation hill climbing algorithms", Proceedings of the Eleventh International Conference on Machine Learning, New Jersey, NJ, 293-301, 1994.
  65. Sun, J., and Hui, X. F., "Financial distress prediction based on similarity weighted voting CBR." International Conference on Advanced Data Mining and Applications. Springer Berlin Heidelberg, 2006.
  66. Vince, R., Portfolio management formulas: mathematical trading methods for the futures, options, stock markets, Vol. 1. John Wiley & Sons, 1990.
  67. Wang, Y., and Ishii, N., "A method of similarity metrics for structured representations." Expert Systems with Applications, Vol.12, No.1, 1997, pp.89-100. https://doi.org/10.1016/S0957-4174(96)00083-8
  68. Wilder Jr, J. W., "The Relative Strength Index," Journal of Technical Analysis of Stocks and Commodities, Vol.4, 1986, pp.343-346.
  69. Williams, L., "The Ultimate Oscillator." Technical Analysis of Stocks and Commodities, Vol.3, No.4, 1985, pp.140-141.
  70. Wilson, D.L., "Asymptotic properties of nearest neighbor rules using edited data", IEEE Transactions on Systems, Man, and Cybernetics, Vol.2, No.3., 1972, pp.408-421. https://doi.org/10.1109/TSMC.1972.4309137
  71. Zhang, Y., & Wu, L., "Stock market prediction of S&P 500 via combination of improved BCO approach and BP neural network." Expert systems with applications, Vol.36, No.5, 2009, pp.8849-8854. https://doi.org/10.1016/j.eswa.2008.11.028