Automatic Generation of Issue Analysis Report Based on Social Big Data Mining

  • Received : 2014.07.11
  • Accepted : 2014.11.24
  • Published : 2014.12.31

Abstract

In this paper, we propose a system that automatically generates issue analysis reports based on social big data mining. It aims to resolve three problems of previous technologies for social media analysis and analytic report generation: the isolation of analysis, the subjectivity of experts, and the closed nature of information caused by high prices. The system consists of five components: natural language query analysis, issue analysis, social big data analysis, social big data correlation analysis, and automatic report generation. To evaluate the usefulness of the generated reports, two big data analysis experts rated them on a Likert scale. The results show that the reports were rated as comparatively useful and reliable. Because it offers low-cost report generation, correlation analysis of social big data, and objective social big data analysis, the proposed system is expected to lead the popularization of social big data analysis.
