DOI QR코드

DOI QR Code

Individual Interests Tracking : Beyond Macro-level Issue Tracking

거시적 이슈 트래킹의 한계 극복을 위한 개인 관심 트래킹 방법론

  • 류신 (국민대학교 비즈니스IT전문대학원) ;
  • 김남규 (국민대학교 경영정보학부)
  • Received : 2014.10.26
  • Accepted : 2014.12.13
  • Published : 2014.12.31

Abstract

Recently, the volume of unstructured text data generated by various social media has been increasing rapidly; consequently, the use of text mining to support decision-making has also been growing. In particular, academia and industry are paying significant attention to topic analysis in order to discover the main issues from a large volume of text documents. Topic analysis can be regarded as static analysis because it analyzes a snapshot of the distribution of various issues. In contrast, some recent studies have attempted to perform dynamic issue tracking, which analyzes and traces issue trends during a predefined period. However, most traditional issue tracking methods have a common limitation : when a new period is included, topic analysis must be repeated for all the documents of the entire period, rather than being conducted only on the new documents of the added period. Additionally, traditional issue tracking methods do not concentrate on the transition of individuals' interests from certain issues to others, although the methods can illustrate macro-level issue trends. In this paper, we propose an individual interests tracking methodology to overcome the two limitations of traditional issue tracking methods. Our main goal is not to track macro-level issue trends but to analyze trends of individual interests flow. Further, our methodology has extensible characteristics because it analyzes only newly added documents when the period of analysis is extended. In this paper, we also analyze the results of applying our methodology to news articles and their access logs.

Keywords

References

  1. Aggarwal, A., G. Waghmare, and A. Sureka, "Mining Issue Tracking Systems Using Topic Models for Trend Analysis, Corpus Exploration, and Understanding Evolution", Proceedings of the 3rd International Workshop on Realizing Artificial Intelligence Synergies in Software Engineering, 2014, 52-58.
  2. Albright, R., Taming Text with the SVD, SAS Institute Inc., 2006.
  3. Bae, J., N. Han, and M. Song, "Twitter Issue Tracking System by Topic Modeling Techniques", Journal of Intelligence and Information Systems, Vol.20, No.2, 2014, 109-122.
  4. Bae, J., J. Son, and M. Song, "Analysis of Twitter for 2012 South Korea Presidential Election by Text Mining Techniques", Journal of Intelligence and Information Systems, Vol.19, No.3, 2013, 141-156.
  5. Ding, W. and C. Chen, "Dynamic Topic Detection and Tracking : A Comparison of HDP, C-word, and Cocitation Methods", Journal of the Association for Information Science and Technology, Vol.65, No.10, 2014, 2084-2097. https://doi.org/10.1002/asi.23134
  6. Han, J., M. Kamber, and J. Pei, Data Mining : Concepts and Techniques, 3rd Edition, Morgan Kaufmann Publishers, 2011.
  7. Hong, J., N. Kim, and S. Lee, "A Methodology for Automatic Multi-Categorization of Single-Categorized Documents", Journal of Intelligence and Information Systems, Vol.20, No.3, 2014.
  8. Huang, S., W. Peng, J. Li, and D. Lee, "Sentiment and Topic Analysis on Social Media : a Multi-task Multi-label Classification Approach", Proceedings of the 5th Annual ACM Web Science Conference, 2013, 172-181.
  9. Jin, H., R. Schwarts, S., Sista, F. Walls, "Topic Tracking for Radio, TV Broadcast and Newswire", Proceedings of DARPA Broadcast News Workshop, 1999.
  10. Kim, J., N. Kim, and Y. Cho, "User-Perspective Issue Clustering Using Multi-Layered Two-Mode Network Analysis", Journal of Intelligence and Information Systems, Vol.20, No.2, 2014.
  11. Lee, K.M., H. Namgoong, E.H. Kim, G.Y. Lee, and H.K. Kim, "Analysis of Multi-dimensional Interaction among SNS Users", Journal of Korean Society for Internet Information, Vol.12, No.2, 2010, 113-122.
  12. Ma, J., Y., Wang, H., Zhu, and Y Shen, "Research on Method of Adaptive Topic Tracking Based on Evolution of Public Opinion Ontology", ACEEE International Journal on Information Technology, Vol.4, No.1, 2014.
  13. McKinsey Global Institute, Big Data : The next Frontier for Innovation, Competition, and Productivity, McKinsey and Company, 2011.
  14. Provost, F. and T. Fawcett, Data Science for Business, O'Reilly, 2013.
  15. Salton, G. and M.J. McGill, Introduction to Modern Information Retrieval, McGraw Hill, 1983.
  16. Salton, G., A. Wong, and C.S. Yang, "A Vector Space Model for Automatic Indexing", Communications of the ACM, Vol.18, No.11, 1975, 613-620. https://doi.org/10.1145/361219.361220
  17. Weiss, S.M., N. Indurkhya, and T. Zhang, Fundamentals of Predictive Text Mining, Springer, 2010.
  18. Xu, P., Y. Wu, E. Wei, T. Peng, S. Liu, J.H. Zhu, and H. Qu, "Visual Analysis of Topic Competition on Social Media", IEEE Transactions on Visualization and Computer Graphics, Vol.19, No.12, 2013, 2012-2021. https://doi.org/10.1109/TVCG.2013.221
  19. Yin, Z., L. Cao, and J. Han, "Latent Community Topic Analysis : Integration of Community Discovery with Topic Modeling", Journal of ACM Transactions on Intelligent Systems and Technology, Vol.3, No.4, 2012.