Prediction of box office using data mining

  • Received : 2016.07.22
  • Accepted : 2016.10.20
  • Published : 2016.12.31

Abstract

This study predicts the total audience count of a movie as a measure of its box office performance. Prediction is carried out with data mining classification techniques (decision tree, multilayer perceptron (MLP) neural network, multinomial logit model, and support vector machine) at four time points: before release, on the release day, one week after release, and two weeks after release. The predictors include online word-of-mouth (OWOM) variables, such as the portal movie rating, the number of portal raters, the number of blogs, and the number of news items, as well as variables describing the inherent properties of the film, such as nationality, grade, release month, release season, director, actors, distributor, audience count, and number of screens. Under 10-fold cross-validation, the neural network model achieved an accuracy of more than 90%, even at the pre-release time point. In addition, prediction accuracy improves further when estimates of the final OWOM variables are added as predictors.
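
The evaluation protocol named in the abstract, 10-fold cross-validation of classification accuracy, can be sketched as follows. This is a minimal illustration and not the authors' implementation: the data are synthetic, the two predictors (`rating`, `log_screens`) and the three audience classes are hypothetical, and a simple nearest-centroid classifier stands in for the decision tree, MLP, multinomial logit, and support vector machine models used in the study.

```python
import random
from statistics import mean


def make_synthetic_movies(n, seed=0):
    """Generate hypothetical (features, label) pairs; NOT real box office data."""
    rng = random.Random(seed)
    rows = []
    for _ in range(n):
        label = rng.randrange(3)  # assumed classes: 0 = low, 1 = medium, 2 = high audience
        # Two made-up predictors loosely tied to the class.
        rating = rng.gauss(5.0 + 1.5 * label, 0.7)
        log_screens = rng.gauss(4.0 + 1.0 * label, 0.5)
        rows.append(([rating, log_screens], label))
    return rows


def fit_centroids(train):
    """Stand-in classifier: store the mean feature vector of each class."""
    by_class = {}
    for x, y in train:
        by_class.setdefault(y, []).append(x)
    return {y: [mean(col) for col in zip(*xs)] for y, xs in by_class.items()}


def predict(centroids, x):
    """Assign x to the class whose centroid is nearest (squared Euclidean distance)."""
    return min(
        centroids,
        key=lambda y: sum((a - b) ** 2 for a, b in zip(x, centroids[y])),
    )


def k_fold_accuracy(rows, k=10, seed=0):
    """Return the mean held-out accuracy over k folds."""
    rows = rows[:]                      # copy so the caller's list is not reordered
    random.Random(seed).shuffle(rows)
    folds = [rows[i::k] for i in range(k)]  # k disjoint folds of near-equal size
    scores = []
    for i, test in enumerate(folds):
        # Train on every fold except the i-th, then score on the i-th.
        train = [r for j, fold in enumerate(folds) if j != i for r in fold]
        model = fit_centroids(train)
        hits = sum(predict(model, x) == y for x, y in test)
        scores.append(hits / len(test))
    return mean(scores)


if __name__ == "__main__":
    data = make_synthetic_movies(300)
    print(f"10-fold CV accuracy: {k_fold_accuracy(data):.3f}")
```

Each movie is used for testing exactly once and for training nine times, so the reported accuracy is an average over held-out data only. Comparing models at the four time points would amount to repeating this loop with the predictor set available at each point.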
