DOI QR코드

DOI QR Code

CNN Architecture Predicting Movie Rating from Audience's Reviews Written in Korean

한국어 관객 평가기반 영화 평점 예측 CNN 구조

  • 김형찬 (한국기술교육대학교 컴퓨터공학부) ;
  • 오흥선 (한국기술교육대학교 컴퓨터공학부) ;
  • 김덕수 (한국기술교육대학교 컴퓨터공학부)
  • Received : 2019.10.18
  • Accepted : 2019.11.15
  • Published : 2020.01.31

Abstract

In this paper, we present a movie rating prediction architecture based on a convolutional neural network (CNN). Our prediction architecture extends TextCNN, a popular CNN-based architecture for sentence classification, in three aspects. First, character embeddings are utilized to cover many variants of words since reviews are short and not well-written linguistically. Second, the attention mechanism (i.e., squeeze-and-excitation) is adopted to focus on important features. Third, a scoring function is proposed to convert the output of an activation function to a review score in a certain range (1-10). We evaluated our prediction architecture on a movie review dataset and achieved a low MSE (e.g., 3.3841) compared with an existing method. It showed the superiority of our movie rating prediction architecture.

본 논문에서는 합성곱 신경망 기반의 영화 평점 예측 구조를 제안한다. 제안하는 구조는 문장 분류을 위하 고안된 TextCNN를 세 가지 측면에서 확장하였다. 첫 번째로 문자 임베딩을 이용하여 단어의 다양한 변형들을 처리할 수 있다. 두 번째로 주목 메커니즘을 적용하여 중요한 특징을 더욱 부각하였다. 세 번째로 활성 함수의 출력을 1-10 사이의 평점으로 만드는 점수 함수를 제안하였다. 제안하는 영화 평점 예측 구조를 평가하기 위해서 영화 리뷰 데이터를 이용하여 평가해 본 결과 기존의 방법을 사용했을 때보다 더욱 낮은 MSE를 확인하였다. 이는 제안하는 영화 평점 예측 구조의 우수성을 보여 주었다.

Keywords

References

  1. A. Krizhevsky, I. Sutskever, and G. E. Hinton, "Imagenet classification with deep convolutional neural networks," Advances in Neural Information Processing Systems, pp. 1097-1105, 2012.
  2. O. Abdel-Hamid, A. Mohamed, H. Jiang, L. Deng, G. Penn, and D. Yu, "Convolutional neural networks for speech recognition," IEEE/ACM Transactions on Audio, Speech, and Language Processing, 22, 10, pp.1533-1545, 2014. https://doi.org/10.1109/TASLP.2014.2339736
  3. R. Collobert and J. Weston, "A unified architecture for natural language processing: Deep neural networks with multitask learning," In Proceedings of the 25th Inter-National Conference on Machine Learning, 160-167, 2008.
  4. Y. Kim, "Convolutional Neural Networks for Sentence Classification," ArXiv e-prints:1408.5882, 2014.
  5. K. Simonyan and A. Zisserman. "Very Deep Convolutional Networks for Large-Scale Image Recognition," ArXiv e-prints:1409.1556, 2014.
  6. A. Conneau, H. Schwenk, L. Barrault, and Y. Lecun, "Very Deep Convolutional Networks for Text Classification," ArXiv eprints:1606.01781, 2016.
  7. H. T. Le, C. Cerisara, and A. Denis, "Do convolutional networks need to be deep for text classification?," arXiv preprint 1707.04108, 2017.
  8. C. M. Chang, J. Cho, H. Liu, R. K. Wagner, H. Shu, A. Zhou, C. S. Cheuk, and A. Muse, "Changing models across cultures: Associations of phonological awareness and morphological structure awareness with vocabulary and word recognition in second graders from beijing, hong kong, korea, and the united states," Journal of Experimental Child Psychology, Vol.92, No.2, pp.140-160, 2005. https://doi.org/10.1016/j.jecp.2005.03.009
  9. S. Petrov, D. Das, and R. McDonald. "A universal part-ofspeech tagset," arXiv preprint arXiv:1104.2086, 2011.
  10. Mecab-ko morphological analyzer [internet], http://eunjeon.blogspot.com/.
  11. Twitter morphological analyzer [internet], https://openkoreantext.org.
  12. Hannanum morphological analyzer [internet], http://semanticweb.kaist.ac.kr/research/hannanum/index.html
  13. KoNLPy benchmark performance among the libraries, [internet] http://konlpy.org/en/latest/morph/#compariso n-between-pos-tagging-classes.
  14. J. Hu, L. Shen, S. Albanie, G. Sun, and E. Wu, "Squeezeand-Excitation Networks," ArXiv e-prints:1709.01507, 2017.
  15. Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N. Gomez, Lukasz Kaiser, and Illia Polosukhin, "Attention Is All You Need," In Advances in Neural Information Processing Systems, 2017, pp. 5998-6008.
  16. Devlin, J., Chang, M.-W., Lee, K., & Toutanova, K., "BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding," arXiv preprint arXiv:1810.04805, 2018.
  17. X. Yu, A. Falenska, and N. T. Vu. A, "general-purpose tagger with convolutional neural networks," arXiv preprint:1706.01723, 2017.
  18. D. P. Kingma and J. Ba., "Adam: A Method for Stochastic Optimization," ArXiv e-prints, 1412.6980, 2014.
  19. M. D. Zeiler, "Adadelta: an adaptive learning rate method," arXiv preprint:1212.5701, 2012.