A Sentiment Classification Method Using Context Information in Product Review Summarization

상품 리뷰 요약에서의 문맥 정보를 이용한 의견 분류 방법

  • 양정연 (서울대학교 컴퓨터공학부) ;
  • 명재석 (서울대학교 컴퓨터공학부) ;
  • 이상구 (서울대학교 컴퓨터공학부)
  • Published : 2009.08.15

Abstract

As the trend of e-business activities develop, customers come into contact with products through on-line shopping sites and lots of customers refer product reviews before the purchasing on-line. However, as the volume of product reviews grow, it takes a great deal of time and effort for customers to read and evaluate voluminous product reviews. Lately, attention is being paid to Opinion Mining(OM) as one of the effective solutions to this problem. In this paper, we propose an efficient method for opinion sentiment classification of product reviews using product specific context information of words occurred in the reviews. We define the context information of words and propose the application of context for sentiment classification and we show the performance of our method through the experiments. Additionally, in case of word corpus construction, we propose the method to construct word corpus automatically using the review texts and review scores in order to prevent traditional manual process. In consequence, we can easily get exact sentiment polarities of opinion words in product reviews.

e비즈니스가 활발히 이루어지면서 소비자들은 온라인 쇼핑몰올 통해 수많은 상품을 접할 수 있게 되었고, 상품구매 시 다른 사람들의 리뷰를 참고하게 되었다. 하지만, 리뷰의 수도 많아짐에 따라 소비자가 모든 리뷰들을 살펴보기가 힘들다는 문제점이 대두되었으며 이를 해결하기 위해서 리뷰의 상품에 대한 평가를 요약하고 성향을 파악하는 오피니언 마이닝 연구가 나타나게 되었다. 본 논문에서는 상품리뷰를 대상으로 오피니언 마이닝을 수행하는 경우 어휘의 의견 성향을 파악할 때, 문맥정보를 활용하여 기존의 의견분류방법 보다 좀 더 정확한 의견 판단이 가능한 방법에 대해 다루고 있다. 이를 위해, 어휘가 사용될 때의 문맥정보를 정의하고 이를 의견분류에 적용하는 방법을 제안하였으며, 실험을 통하여 기존 연구 보다 상황별 알맞은 의견분류가 가능함을 보였다. 또한 수작업으로 말뭉치의 핵심 어휘들을 정의했던 기존 연구들에서의 방식에서 벗어나, 리뷰본문과 리뷰점수를 활용하여 자동으로 상황에 맞는 말뭉치를 구축하는 방법도 제안하였다. 이를 통해 상품리뷰에 나타난 어휘들의 문맥에 맞는 의미 성향을 정확하고 쉽게 판별해 낼 수 있게 되었다.

Keywords

References

  1. M. Hu, B. Liu, 'Mining and summarizing customer reviews,' Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp.168-177, 2004 https://doi.org/10.1145/1014052.1014073
  2. D. Lee, O. Jeong, S. Lee, 'Opinion Mining of Customer Feedback Data on the Web,' Proceedings of The Second International Conference on Ubiquitous Information Management and Communication, pp.230-235, 2008 https://doi.org/10.1145/1352793.1352842
  3. X. Ding, B. Liu, P. S Yu, 'A holistic lexicon-based approach to opinion mining,' Proceedings of the international conference on Web search and web data mining, pp.231-240, 2008 https://doi.org/10.1145/1341531.1341561
  4. Tumey, P. D, 'Thumbs up or thumbs down': semantic orientation applied to unsupervised classification of reviews,' In Proceedings of the 40th Annual Meeting on Association for Computational Linguistics, 2002 https://doi.org/10.3115/1073083.1073153
  5. Tumey, P. D., Littman, M. L., 'Measuring praise and criticism: Inference of semantic orientation from association,' ACM Trans. Inf. Syst., 21, 4, Oct. 2003 https://doi.org/10.1145/944012.944013
  6. Namrata Godbole, Manjunath Srinivasaiah, Steven Skiena, 'Large-Scale Sentiment Analysis for News and Blogs,' Int'I AAAI Conference on Weblogs and Social Media (ICWSM 2007), 2007
  7. E. Boiy, P. Hens, K. Deschacht, M. Moens, 'Automatic Sentiment Analysis in On-line Text,' ELPUB2007 Conference on Electronic Publishing, June 2007
  8. J. Yi, W. Niblack, 'Sentiment Mining in WebFountain,' International Conference on Data Engineering (ICDE'05), pp.1073-1083, 2005 https://doi.org/10.1109/ICDE.2005.132
  9. T. Nasukawa, J. Yi, 'Sentiment analysis: capturing favorability using natural language processing,' Proceedings of the K- CAP-03, 2nd International Conference on Knowledge Capture, pp.70-77, 2003 https://doi.org/10.1145/945645.945658
  10. Xiaowen Ding, Bing Liu. 'The Utility of Linguistic Rules in Opinion Mining,' pp.811-812, SlGIR2007, 2007 https://doi.org/10.1145/1277741.1277921
  11. M. Gamon, A. Aue, S. Corston-Oliver, E. Ringger. 'Pulse: Mining Customer Opinions from Free Text,' In Lecture Notes in Computer Science, VoI.3646. Springer Verlag. (IDA 2005), 2005 https://doi.org/10.1007/11552253_12
  12. Wilson, T., Wiebe, J.. Hoffmann, P. 'Recognizing contextual polarity in phrase-level sentiment analysis,' In Proceedings of the Conjerence on Human Language Technology and Empirical Methods in Natural Language Processing, pp.347-354, 2005 https://doi.org/10.3115/1220575.1220619
  13. Scaffidi, C., Bierhoff, K., Chang, E., Felker, M., Ng, H., Jin, C. 'Red Opal: product-feature scoring from reviews,' In Proceedings of the 8th ACM Conference on Electronic Commerce 2007, pp.182-191, 2007 https://doi.org/10.1145/1250910.1250938
  14. Miller, G., Beckwith, R, Fellbaum, C., Gross, D., Miler, K. 'Introduction to WordNet: An on-line lexical database,' International Journal of Lexicography, pp.235-244, 1990 https://doi.org/10.1093/ijl/3.4.235