DOI QR코드

DOI QR Code

Product Feature Extraction and Rating Distribution Using User Reviews

사용자 리뷰를 이용한 상품 특징 추출 및 평점 분배

  • Son, Soobin (Department of Computer Science and Engineering, College of Engineering, Myongji University) ;
  • Chun, Jonghoon (Department of Data Technology, School of Convergence Software, College of ICT Convergence, Myongji University)
  • Received : 2017.01.20
  • Accepted : 2017.02.24
  • Published : 2017.02.28

Abstract

We propose a method to analyze the user reviews and ratings of the products in the online shopping mall and automatically extracts the features of the products to determine the characteristics of a product. By judging whether a rating is given by a specific feature of a product, our method distributes the score to each feature. Conventional methods force users to wastes time reading overflowing number of reviews and ratings to decide whether to buy the product or not. Moreover, it is difficult to grasp the merits and demerits of the product, because of the way reviews and ratings are provided. It is structured in a way that it is impossible to decide which rating is given to the which characteristics of the product. Therefore, in this paper, to resolve this problem, we propose a method to automatically extract the feature of the product from the user review and distribute the score to appropriate characteristics of the product by calculating the rating of each feature from the overall rating. proposed method collects product reviews and ratings, conducts morphological analysis, and extracts features and emotional words of the products. In addition, a method for determining the polarity of a sentence in which the feature appears is given a weight value for each feature. results of the experiment and the questionnaires comparing the existing methods show the usefulness of the proposed method. We also validates the results by comparing the analysis conducted by the product review experts.

온라인 쇼핑몰에서 상품에 대한 사용자 리뷰와 평점을 분석하여 상품의 특징을 자동으로 추출하고 평점이 어떤 특징에 의해 부여된 것인지 판단하여 각 특징에 분배하여 점수화함으로써 상품의 특징을 파악할 수 있는 방법을 제안한다. 기존 방식은 상품 구매 여부를 결정하기 위해서 많은 리뷰와 평점을 읽는데 시간을 허비하거나, 상품의 장단점을 파악하기 어려울 뿐더러 상품에 부여된 평점이 어떠한 특징에 의해서 부여되었는지 알 수 없는 구조로 되어있다. 따라서 본 논문에서는 이러한 문제를 해소하기 위하여 사용자 리뷰에서 상품의 특징을 자동으로 추출하고 각 특징별 평점을 전체 평점에서 자동으로 분배 계산하여 보여주는 방법을 제안한다. 제안하는 방법은 상품별 리뷰와 평점을 수집하여 형태소 분석을 수행하고 이를 통해 상품의 특징과 이에 대한 감성어를 추출한다. 또한, 상품의 특징을 파악할 수 있도록 각 특징에 대한 가중치를 특징이 출현한 문장의 극성을 판단하여 부여하는 방법을 기술한다. 실험을 통하여 얻은 결과와 기존 방법을 비교하는 설문조사를 통하여 제안하는 방법의 유용성을 입증하였고, 상품 리뷰 전문가의 분석과 실험의 결과를 비교함으로써 타당성을 입증하였다.

Keywords

References

  1. Ahan, J. K. and Kim, H. W., "Building a Korean Sentiment Dictionary and Applications of Natural Languages Processing," Korea Intelligent Information Systems Society, 2014.
  2. Hogenboom, A., Bal, D., and Frasincar, F., Exploiting Emoticons in Sentiment Analysis, SAC, 2013.
  3. Hu, M. and Liu, B., "Mining and Summarizing Customer Reviews," KDD, pp. 168-177, 2004.
  4. Hu, M. H., Sun, A. and Lim, E. P., "Comments-Oriented Blog Summarization by Sentence Extraction," CIKM, pp. 901-904, 2007.
  5. Internet Price Comparision Site (www.bb.co.kr).
  6. Khabiri, E., Caverlee, J., and Hsu, C. F., "Summarizing User-Contributed Comments," Association for the Advancement of Artificial Intelligence, 2011.
  7. Kim, J. O., Lee, S. S., and Yong, H. S., "Automatic Opinion Classfication of Korean Text," Korean Institute of Information Scientists and Engineers, Vol. 38, No. 6, pp. 423-428, 2011.
  8. Kokoma Morphological Analyzer http://kkma.snu.ac.kr/.
  9. Korean Information Science Society: Linguistic Engineering Research Association https://sites.google.com/site/sighclt/.
  10. Lee, K. S., Song, H. J., Son, Y. W., Hwang, M. J., and Park, Y. S., "Cognitive Psychological Approach to Statistics," Statistical Survey, 2008.
  11. Lee, W. C., Lee, H. A., and Lee, K. J., "Automatic Product Feature Extraction for Efficient Analysis of Product Reviews Using Term Statistics," Journal of Korean Institute of Information Scientists and Engineers, Vol. 16B, , No. 6, pp. 497-502, 2009.
  12. Lee, Y. J., Ji, J. H., Woo, G., and Cho, H. G., "Analysis and Visualization for Comment Messages of Internet Posts," Journal of The Korea Contents Association, Vol. 9, No. 7, pp. 45-56, 2009. https://doi.org/10.5392/JKCA.2009.9.7.045
  13. Menupan.com, http://www.menupan.com/.
  14. Myung, J. S., Lee, D. J., and Lee, S. G., "A Korean Product Review Analysis System Using a Semi-Automatically Constructed Semantic Dictionary," Journal of KISS: Software and Applications. Vol. 35, No. 6, pp. 392-403, 2008.
  15. National Statistics Portal, http://kosis.kr/.
  16. Naver Shopping, http://shopping.naver.com/.
  17. Oh, P. H., Hwang, B. Y., "Real-time Spatial Recommendation System based on Sentiment Analysis of Twitter," The Journal of Society for e-Business Studies, Vol. 21, No. 3, pp. 15-28, 2016. https://doi.org/10.7838/JSEBS.2016.21.3.015
  18. SeleniumHQ Browser Automation (http://www.seleniumhq.org).
  19. SentiWordNet http://sentiwordnet.isti.cnr.it/.
  20. Sipos, R. and Joachims, T., "Generating Comparative Summaries from Reviews," CIKM, pp. 1853-1856, 2013.
  21. Socher, R., Perelygin, A., Wu, J. Y., Chuang, J., Manning, C. D., Ng, A. Y., and Potts, C., Recursive Deep Models for Semantic Compositionality Over a Sentiment, Treebank, EMNLP, 2013.
  22. Somprasertsri, G. and Lalitrojwong, P., "Mining Feature-Opinion in Online Customer Reviews for Opinion Summarization," Journal of Universal Computer Science, Vol. 16, No. 6, pp. 938-955, 2010.
  23. Song, J. S. and Lee, S. W., "Automatic Construction of Positive/Negative Feature-Predicate Dictionary for Polarity Classification of Product Reviews," Korean Institute of Information Scientists and Engineers, Vol. 38, No. 3, pp. 157-168, 2011.
  24. Song, S. I., Lee, D. J., Lee, S. G., "Identifying Sentiment Polarity of Korean Vocabulary Using PMI," Korea Computer Integrated Circuits Conference Vol. 37, No. 1, pp. 260-265, 2010.
  25. Wikipedia https://en.wikipedia.org/wiki/Normalization_(statistics).
  26. Yatani, K., Novati, M., Trusty, A., and Truong, K. N., "Review Spotlight: A User Interface for Summarizing User-generated Reviews Using Adjective-Noun Word Pairs," CHI, pp. 1541-1550, 2011.
  27. Zhai, Z., Liu, B., Xu, H., and Jia, P., "Clustering Product Features for Opinion Mining," ACM, pp. 347-354, 2011.

Cited by

  1. 사용자 리뷰 분석을 통한 호텔 평가 항목별 누락 평점 예측 방법론 vol.16, pp.4, 2017, https://doi.org/10.9716/kits.2017.16.4.161
  2. 시각장애인을 위한 라즈베리 파이 기반 지폐 인식기 개발 vol.23, pp.2, 2017, https://doi.org/10.7838/jsebs.2018.23.2.021
  3. 비휘발적 소셜 큐레이션 서비스가 가능한 대화형 상거래 플랫폼 개발 vol.23, pp.3, 2018, https://doi.org/10.7838/jsebs.2018.23.3.145
  4. 온라인 리뷰 분석을 통한 상품 평가 기준 추출: LDA 및 k-최근접 이웃 접근법을 활용하여 vol.26, pp.1, 2017, https://doi.org/10.13088/jiis.2020.26.1.097