DOI QR코드

DOI QR Code

Study on the Prediction Model for Employment of University Graduates Using Machine Learning Classification

머신러닝 기법을 활용한 대졸 구직자 취업 예측모델에 관한 연구

  • 이동훈 (단국대학교 대학원 데이터지식서비스공학과) ;
  • 김태형 (단국대학교 대학원 데이터지식서비스공학과)
  • Received : 2020.05.18
  • Accepted : 2020.06.24
  • Published : 2020.06.30

Abstract

Purpose Youth unemployment is a social problem that continues to emerge in Korea. In this study, we create a model that predicts the employment of college graduates using decision tree, random forest and artificial neural network among machine learning techniques and compare the performance between each model through prediction results. Design/methodology/approach In this study, the data processing was performed, including the acquisition of the college graduates' vocational path survey data first, then the selection of independent variables and setting up dependent variables. We use R to create decision tree, random forest, and artificial neural network models and predicted whether college graduates were employed through each model. And at the end, the performance of each model was compared and evaluated. Findings The results showed that the random forest model had the highest performance, and the artificial neural network model had a narrow difference in performance than the decision tree model. In the decision-making tree model, key nodes were selected as to whether they receive economic support from their families, major affiliates, the route of obtaining information for jobs at universities, the importance of working income when choosing jobs and the location of graduation universities. Identifying the importance of variables in the random forest model, whether they receive economic support from their families as important variables, majors, the route to obtaining job information, the degree of irritating feelings for a month, and the location of the graduating university were selected.

Keywords

References

  1. Agatonovic, S. and Beresford, R. "Basic concepts of artificial neural network (ANN) modeling and its application in pharmaceutical research," Journal of Pharmaceutical and Biomedical Analysis, Vol. 22. 1999, pp. 718-720.
  2. Alpaydin, E. "Introduction to Machine Learning. second edition," The MIT Press, Cambridge, Massachusetts, USA. 2010. pp. 20-24.
  3. Anyanwu. M. and Shiva. S., "Comparative Analysis of Serial Decision Tree Classification Algorithms," International Journal of Computer Science and Security, Vol. 3, No. 3, 2009, pp. 233.
  4. Breiman, L., Friedman, J., Olshen, L., and Stone, J., "Classification and Regression trees," CHAPMAN and HALL/CRC, USA. 1989. pp. 55-58.
  5. Breiman, L. "RANDOM FORESTS," 1999. pp. 2-3.
  6. Breiman, L. "RANDOM FORESTS," 2001. pp. 7-8.
  7. Chae, C. G. and Kim, T. G., "Analysis of Determinants of Employment Performance of Young College Students," The Journal of Vocational Education Research, Vol. 28, No. 2., 2009. pp. 96-99.
  8. Chawla, N. V., Bower, K. W., Hall, L. O., and Kegelmeyer, W. P., "SMOTE: Synthetic Minority Over-sampling Technique," Journal of Artificial Intelligence Research, Vol. 16, 2002, pp. 321-357. https://doi.org/10.1613/jair.953
  9. Choi, I. S. and Shin, E. J., "An Empirical Study of the Determinants of Successful Job Seeking of College Students - Focusing on the Impacts of Job Education Programs," Economic Education Research, Vol. 23, No. 1. 2016, pp. 23-49.
  10. Choi, J. H. and Seo, D. S., "Application of data mining decision tree," Statistical Analysis Research. Vol. 4, No. 1. 1999, pp. 62, 63-67.
  11. Choi, P. S. and Min, I. S., "Employment prediction model for college graduates using machine learning techniques," Vocational competency development research. Vol. 21, No. 1, 2018, pp. 32-38.
  12. Chung, T. Y., and Lee, K. Y., "Determinants of Job Finding among College Graduates - with Emphasis on the Effects of GPA -," Korea Business Review, Vol. 8, No. 2, 2006, pp. 159-184
  13. Geron, A. "Hands-On Machine Learning with Scikit-Learn and TensorFlow," O'Reilly, USA. 2017, pp. 10-12.
  14. Gil, H. Y. and Choi, Y. M., "Analysis of Determinants of Employment Types of Graduates," The Journal of Vocational Education Research, Vol. 33, No. 6. 2014, pp. 13-19.
  15. Hong, G. S., "Study on determinants of youth unemployment," Analysis of the Korean economy. Vol. 24, No. 2, 2018, pp. 91-93.
  16. Hssina, B., Merbouha, A., Ezzikouri, H., and Erritali, M., "A comparative study of decision tree ID3 and C4.5," International Journal of Advanced Computer Science and Applications, Special Issue on Advances in Vehicular Ad Hoc Networking and Applications: 2014, pp. 13-18.
  17. Jung, M. N. and Yim, Y. S., "Path analysis of variables related to entering the labor market among college graduates," The Journal of Career Education Research, Vol. 23, No. 2, 2010, pp. 143-146.
  18. Kaisler, S., Armour, F., Espinosa, J., and Monet, W., "Big Data: Issues and Challenges Moving Forward," Hawaii International Conference on System Sciences, 46, 2016, pp. 996-997
  19. Kang, H. Y., "Big data application examples and utilization strategies," Magazine of the SAREK, Vol. 45, No. 1, 2016, pp. 32-33.
  20. Kim, D. A., Kang, D. A., and Song, J. W., "Classification Analysis for Unbalanced Data," The Korean Journal of Applied Statistics, Vol. 28, No. 3, 2015, pp. 495. https://doi.org/10.5351/KJAS.2015.28.3.495
  21. Kim, J. K., "Domestic and foreign big data trends and success stories," Industrial Engineering Magazine, Vol. 23, No. 1, 2016. pp. 48-49.
  22. Kim, J. S., "Big data analysis technology and use cases," Journal of Contents, Vol. 12, No. 1. 2014, pp. 18-19.
  23. Kim, S. H., "The effect of job preparation activities of young college graduates on entering the labor market: focusing on whether and when to get a full-time job," Education Culture Research, Vol. 24, 2014, pp. 313-318.
  24. Kwak, H. and Lee, S. W., "Competitiveness Analysis for Artificial Intelligence Technology through Patent Analysis," Journal of Information Systems, Vol 28, No. 3., 2019, pp. 141-158. https://doi.org/10.5859/KAIS.2019.28.3.141
  25. Lee, J. H., Lee, C. K. and Lee, H. H., "An Analysis of College Graduates Employment factors using Data Mining: The Importance of Volunteer Services," Logos Management Review, Vol. 17, No. 2., 2019, pp. 143-156.
  26. Lee, H. W., Lee, S. R., and Chung, K, K., "The impact of Artificial Intelligence Adoption in Canadidates Screening and Joub Interview on Intentions to Apply," Journal of Information Systems, Vol. 28, No. 2, 2019, pp. 25-52. https://doi.org/10.5859/KAIS.2019.28.2.25
  27. Lee, Y. J., Lee, S. H., and Lee, J. S., "KB Card's marketing activities and bigdata utilization," Korea Business Review, Vol. 18, No. 1, 2014, pp. 162-163.
  28. Magerman, D., "Statistical Decision-Tree Models for Parsing," ACL '95 Proceedings of the 33rd annual meeting on Association for Computational Linguistics. 1995, pp. 276-279.
  29. Marsland, S., "Machine Learning Algorithmic Perspective," CRC Press, USA. 2015. pp. 6-9.
  30. Mingers, J., "An empirical comparison of pruning methods for decision tree induction," Machine Learning, 4, 1989, pp. 241-242. https://doi.org/10.1023/A:1022604100933
  31. Noh, K. R. and Heo, S. J., "Analysis of Factors Influencing Achievement of Employment Goals," The Journal of Vocational Education Research, Vol. 1, No. 22, 2015, pp. 10-14.
  32. Oh, J. Y, Lee, W. G., Lee, J. M. and Park, M. S., "Big Data Use Cases of Last Mile Logistics," The Korea Institute of Information and Communication Engineering., 2019, pp. 121-123
  33. Oh, S. G., "An analysis of the determinants of youth employment probability in Korea," Master's thesis, 2003, Yonsei University.
  34. Park, G. H., "To foster leaders of the 4th Industrial Revolution Training market research," Ministry of Employment and Labor. 2018, pp. 167-171
  35. Pal, M., "Random forest classifier for remote sensing classification," International Journal of Remote Sensing, Vol. 26, No. 1, 2005, pp. 2-3.
  36. Park, K. Y. and Cheon, Y. M., "A Factor analysis of employment for college graduates," Employment survey Conference 2016, 2016.
  37. Podgorelec, V., Kokol, P., Stiglic, B. and Rozman, I., "Decision trees: an overview and their use in medicine," Journal of Medical Systems, Kluwer Academic/ Plenum Press, Vol. 26, No. 5, 2002, pp. 9-10. https://doi.org/10.1023/A:1013034719088
  38. Son, J. S., "A study on transportation policy using big data: Focusing Seoul city late night bus case," Korean Policy Studies Review. 2019. pp. 19-34
  39. Sutton, R. and Barto, A., "Reinforcement Learning: An Introduction," A Bradford Book, USA, England. 2015, pp. 2-4
  40. Zhao, Y. and Zhang, Y., "Comparison of decision tree methods for finding active objects," Advances in Space Research, Vol. 41, No. 12, 2007, pp. 3-4.
  41. Zhu, W., Zeng, W. and Wang, N., "Sensitivity, Specificity, Accuracy, Associated Confidence Interval and ROC Analysis with Practical SAS${(R)}$," Health Care and Life Sciences, Implementations, 2010, pp. 1-2.
  42. Zurada, J., "Introduction to Arificial Neural Systems," West Publishing Company, USA. 1992, pp. 37-38