DOI QR코드

DOI QR Code

Applying Centrality Analysis to Solve the Cold-Start and Sparsity Problems in Collaborative Filtering

협업필터링의 신규고객추천 및 희박성 문제 해결을 위한 중심성분석의 활용

  • Cho, Yoon-Ho (College of Business Administration, Kookmin University) ;
  • Bang, Joung-Hae (College of Business Administration, Kookmin University)
  • Received : 2011.08.01
  • Accepted : 2011.08.16
  • Published : 2011.09.30

Abstract

Collaborative Filtering (CF) suffers from two major problems:sparsity and cold-start recommendation. This paper focuses on the cold-start problem for new customers with no purchase records and the sparsity problem for the customers with very few purchase records. For the purpose, we propose a method for the new customer recommendation by using a combined measure based on three well-used centrality measures to identify the customers who are most likely to become neighbors of the new customer. To alleviate the sparsity problem, we also propose a hybrid approach that applies our method to customers with very few purchase records and CF to the other customers with sufficient purchases. To evaluate the effectiveness of our method, we have conducted several experiments using a data set from a department store in Korea. The experiment results show that the combination of two measures makes better recommendations than not only a single measure but also the best-seller-based method and that the performance is improved when applying the hybrid approach.

본 연구에서는 협업필터링의 두 가지 근본적인 문제인 신규고객 추천(cold-start recommendation)과 희박성(sparsity) 문제를 해결하고자 한다. 먼저, 사회 네트워크 분석에서 가장 많이 활용 되고 있는 세 가지 중심성 지표인 연결중심성(degree centrality), 근접중심성(closeness centrality), 매개중심성(betweenness centrality)을 결합한 다양한 중심성 지표들을 만든 후 이를 기반으로 신규고객의 잠재 이웃고객을 찾고 그 이웃고객들의 구매정보를 이용하여 신규고객에게 상품을 추천하는 새로운 방법을 제시한다. 다음으로 희박성 문제를 해결하기 위하여, 구매정보가 충분한 고객에게는 협업필터링을, 그렇지 않은 고객에게는 협업필터링 대신 제시한 신규고객 추천방법을 적용하는 하이브리드 추천 방법을 제안한다. 제시한 추천 방법의 효과성을 평가하기 위하여 국내 유명 백화점 중의 하나인 H백화점의 구매 트랜잭션 데이터를 사용하여 실험하였다. 실험결과로부터 근접중심성과 매개중심성을 결합한 지표를 신규고객 추천 시에 사용할 경우 추천 성능이 가장 우수한 것으로 판명되었으며, 제안한 하이브리드 추천 방법이 기존의 협업필터링의 성능을 상당히 개선함으로써 희박성 문제를 해결할 수 있는 새로운 대안임이 입증되었다.

Keywords

References

  1. Adomavicius, G. and A. Tuzhilin, "Toward the next generation of recommender systems: A survey of the state-of-the-art and possible extensions", IEEE Transactions on Knowledge and Data Engineering, Vol.17, No.6 (2005), 734-749. https://doi.org/10.1109/TKDE.2005.99
  2. Aggarwal, C. C., Z. Sun, and P. S. Yu, "Online Algorithms for Finding Profile Association Rules", Proceedings of the seventh international conference on Information and Knowledge Management, (1998), 86-95.
  3. Aggarwal, C. C. and P. S. Yu, "Data mining techniques for personalization", Bulletin of the IEEE Computer Society Technical Committee on Data Engineering, Vol.23(2000), 4-9.
  4. Billsus, D. and M. J. Pazzani, "Learning collaborative information filters", Proceedings of the 15th International Conference on Machine Learning, (1998), 46-54.
  5. Bonacich, P., "Power and Centrality:A Family of Measures", American Journal of Sociology, Vol. 92(1987), 1170-1182. https://doi.org/10.1086/228631
  6. Borgatti, S. P., "Centrality and network flow", Social Networks, Vol.27, No.1(2005), 55-71. https://doi.org/10.1016/j.socnet.2004.11.008
  7. Borgatti, S. P. and M. G. Everett, "A Graph-theoretic perspective on centrality", Social Networks, Vol.28(2006), 466-484. https://doi.org/10.1016/j.socnet.2005.11.005
  8. Cho, Y. H. and J. H. Bang, "Social Network Analysis for New Product Recommendation", Asia Pacific Journal of Intelligent Technology and Management, Vol.15, No.4(2009), 123-140.
  9. Cho, Y. H. and I. H. Kim, "Predicting the Performance of Recommender Systems through Social Network Analysis and Artificial Neural Network", Asia Pacific Journal of Intelligent Technology and Management, Vol.16, No.4 (2010), 159-172.
  10. Cho, Y. H. and J. K. Kim, "Application of Web usage mining and product taxonomy to collaborative recommendations in e-commerce", Expert Systems with Applications, Vol.26, No.3(2004), 234-246.
  11. Choi, S. S., S. H. Cha, and C. Tappert, "A Survey of Binary Similarity and Distance Measures", Journal on Systemics, Cybernetics and Informatics, Vol.8, No.1(2010), 43-48.
  12. Costenbader, E. and T. W. Valente, "The stability of centrality measures when networks are sampled", Social Networks, Vol.25(2003), 283-307. https://doi.org/10.1016/S0378-8733(03)00012-1
  13. Frank, O., "Using centrality modeling in network surveys", Social Networks, Vol.24(2002), 385-394. https://doi.org/10.1016/S0378-8733(02)00014-X
  14. Freeman, L., "Centrality in Social Networks: Conceptual clarification", Social Networks, Vol.1(1979), 215-239.
  15. Huang, Z., H. Chen, and D. Zeng, "Applying Associative Retrieval Techniques to Alleviate the Sparsity Problem in Collaborative Filtering", ACM Transactions on Information Systems, Vol.22, No.1(2004), 116-142. https://doi.org/10.1145/963770.963775
  16. Human, S. E. and K. G. Provan, "Legitimacy Building in the Evolution of Small-Firm Multilateral Networks:A Comparative Study of Success and Demise", Administrative Science Quarterly, Vol.45, No.2(2000), 327-365. https://doi.org/10.2307/2667074
  17. Kauffiman, S., The Origins of Order, Oxford University Press, 1993.
  18. Kim, H. K., Y. U. Ryu, Y. H. Cho, and J. K. Kim, "Customer-Driven Content Recommendation over a Network of Customers", IEEE Transactions on Systems, Man, and Cybernetics-Part A:Systems and Humans, Forthcoming, 2011.
  19. Kim, J. K. and Y. H. Cho, "Using Web Usage Mining and SVD to Improve E-commerce Recommendation Quality", LNAI, Vol.2891 (2003), 86-97.
  20. Krulwich, B., "Lifestyle Finder:Intelligent User Profiling Using Large-Scale Demographic Data", Artificial Intelligence Magazine, Vol. 18, No.2(1997), 37-45.
  21. Kukkonen, H. O., K. Lyytinen, and Y. J. Yoo, "Social Networks and Information System s:Ongoing and Future Research Streams", Journal of the Association for Information Systems, Vol.11(2010), 61-68.
  22. Lee, S. K., Y. H. Cho, and S. H. Kim, "Collaborative filtering with ordinal scale-based implicit ratings for mobile music recommendations", Information Sciences, Vol.180, No.11 (2010), 2142-2155. https://doi.org/10.1016/j.ins.2010.02.004
  23. Melville, P., R. J. Mooney, and R. Nagarajan, "Content-boosted Collaborative Filtering", Proceeding SIGIR 2001 Workshop on Recommender Systems, 2001.
  24. Park, J. H., Y. H. Cho, and J. K. Kim, "Social Network:A Novel Approach to New Customer Recommendations", Asia Pacific Journal of Intelligent Technology and Management, Vol.15, No.1(2009), 123-140.
  25. Park, S. T., Pennock, O. Madani, N. Good, and D. DeCoste, "Naive Filterbots for Robust Cold-Start Recommendations", KDD, 2006.
  26. Ryu, Y. U., H. K. Kim, Y. H. Cho, and J. K. Kim, "Peer-oriented content recommendation in a social network", Proceedings of the Sixteenth Workshop on Information Technologies and Systems, (2006), 115-120.
  27. Sarwar, B., G. Karypis, J. A. Konstan, and J. Riedl, "Analysis of recommendation algorithms for e-commerce", Proceedings of ACM E-commerce 2000 conference, (2000), 158-167.
  28. Sarwar, B., G. Karypis, J. A. Konstan, and J. Riedl, "Application of Dimensionality Reduction in Recommender Systems-A Case Study", Proceedings of ACM WebKDD Workshop, 2000.
  29. Schein, A. I., A. Popescul, D. M. Pennock, and L. H. Ungar, "Methods and Metrics for Cold-Start Recommendations", SIGIR, 2002.
  30. Scott, J., Social Network Analysis:A Handbook. Thousand Oaks, CA:Sage, 2000.
  31. Wasserman, S. and K. Faust, Social network analysis: methods and applications, Cambridge University Press, 1994.
  32. Weare, C., W. E. Loges, and N. Oztas, "Email Effects on the Structure of Local Associations: A Social Network Analysis", Social Science Quarterly, Vol.88, No.1(2007), 222-243. https://doi.org/10.1111/j.1540-6237.2007.00455.x
  33. Yu, K., A. Schwaighofer, V. Tresp, X. Xu, and H. Kriegel, "Probabilistic Memory-based Collaborative Filtering", IEEE Transactions on Knowledge and Data Engineering, Vol. 16, No.1(2004), 56-69. https://doi.org/10.1109/TKDE.2004.1264822
  34. Zemljic, B. and V. Hlebec, "Reliability of measures of centrality and prominence", Social Networks, Vol.27(2005), 73-88. https://doi.org/10.1016/j.socnet.2004.11.010

Cited by

  1. The Effect of the Personalized Settings for CF-Based Recommender Systems vol.18, pp.2, 2012, https://doi.org/10.13088/jiis.2012.18.2.131
  2. Clustering Method based on Genre Interest for Cold-Start Problem in Movie Recommendation vol.19, pp.1, 2011, https://doi.org/10.13088/jiis.2013.19.1.057
  3. 구조적 공백과 협업필터링을 이용한 추천시스템 vol.20, pp.4, 2014, https://doi.org/10.13088/jiis.2014.20.4.107
  4. 이차원 고객충성도 세그먼트 기반의 고객이탈예측 방법론 vol.26, pp.4, 2020, https://doi.org/10.13088/jiis.2020.26.4.111
  5. 네트워크 중심성 척도가 추천 성능에 미치는 영향에 대한 연구 vol.27, pp.1, 2021, https://doi.org/10.13088/jiis.2021.27.1.023