DOI QR코드

DOI QR Code

A Study on the Intellectual Structure of Data Science Using Co-Word Analysis

동시출현단어분석을 통한 데이터과학 분야의 지적구조에 관한 연구

  • 김현정 (서울여자대학교 사회과학대학 문헌정보학과)
  • Received : 2017.11.18
  • Accepted : 2017.12.13
  • Published : 2017.12.30

Abstract

Data Science is emerging as a closely related field of study to Library and Information Science (LIS), and as an interdisciplinary subject combining LIS, statistics and computer science in an attempt to understand the value of data by applying what LIS has been doing for collecting, storing, organizing, analyzing, and utilizing information. To investigate which subject fields other than LIS, statistics, and computer science are related to Data Science, this study retrieved 667 materials from Web of Science Core Collection, extracted terms representing Web of Science Categories, examined subject fields that are studying Data Science using descriptive analysis, analyzed the intellectual structure of the field by co-word analysis and network analysis, and visualized the results as a Pathfinder network with clustering created with the PNNC clustering algorithm. The result of this study might help to understand the intellectual structure of the Data Science field, and may be helpful to give an idea for developing relatively new curriculum.

최근 문헌정보학의 관련 분야로 주목받고 있는 데이터과학은 오랫동안 문헌정보학에서 해오던 정보의 수집, 저장, 조직, 분석, 활용 등의 활동을 데이터에 적용하여 그 가치를 이해하려는 학문이며, 통계학과 컴퓨터공학 등 다른 학문분야와의 연계가 필요한 분야이다. 이러한 데이터과학 분야의 연구 영역을 파악하기 위하여 동시출현단어 분석을 사용하여 Web of Science 핵심컬렉션에 수록된 문헌들 중 데이터 과학 관련 자료들을 수집하고, 그 주제범주를 활용하여 네트워크분석을 실시하였다. 총 667건의 자료에 대한 159개의 주제범주를 기술분석하여 데이터과학 관련 연구가 많이 이루어지고 있는 학문분야를 조사하였고, 네트워크분석을 통해 데이터과학 분야 연구영역의 지적구조를 시각적으로 파악하였다. 분석결과, 데이터과학 분야의 연구들은 2개 영역 9개 군집으로 구분되었으며, 주제범주의 용어들 중 중심성이 높은 용어들을 통해 각 군집의 대표적인 주제들을 선정하였다. 연구의 결과는 데이터과학 분야의 연구들에 대한 지적구조를 파악하는데 도움이 될 수 있고, 문헌정보학과의 연계융합전공으로서의 데이터과학 교과과정 개발에 방향성을 제시할 수도 있을 것이다.

Keywords

Acknowledgement

Supported by : 서울여자대학교

References

  1. 강범일, 박지홍 (2013). 프로파일링 분석과 동시출현단어 분석을 이용한 한국어교육학의 정체성 분석. 정보관리학회지, 30(4), 195-213. http://dx.doi.org/10.3743/KOSIM.30.4.195 (Kang, Beomil, & Park, Ji-Hong (2013). Profiling and co-word analysis of teaching Korean as a foreign language domain. Journal of the Korean Society for Information Management, 30(4), 195-213. http://dx.doi.org/10.3743/KOSIM.30.4.195)
  2. 강범일, 이재윤 (2014). 트위터 관련 연구에 대한 계량정보학적 분석. 정보관리학회지, 31(3), 293-311. http://dx.doi.org/10.3743/KOSIM.31.3.293 (Kang, Beomil, & Lee, Jae Yun (2014). A bibliometric analysis on twitter research. Journal of the Korean Society for Information Management, 31(3), 293-311. http://dx.doi.org/10.3743/KOSIM.31.3.293)
  3. 강지혜 (2016). 문헌정보학과의 데이터 사이언스 커리큘럼 개발 실태와 방향성 고찰. 한국도서관.정보학회지, 47(3), 343-363. http://dx.doi.org/10.16981/kliss.47.201609.343 (Kang, Ji Hei (2016). Study on the current status of Data Science curriculum in Library and Information Science and its direction. Journal of Korean Library and Information Science Society, 47(3), 343-363. http://dx.doi.org/10.16981/kliss.47.201609.343)
  4. 김선회, 윤순진 (2015). 한국 환경사회학의 지적 구조: ECO 논문 제목의 동시출현단어분석을 중심으로. 환경사회학연구 ECO, 19(2), 165-211. (Kim, Sun-Hoi, & Yun, Sun-Jin (2015). The knowledge structure of environmental sociology in Korea: Based on the co-word analysis of article titles in ECO. ECO, 19(2), 165-211.)
  5. 김판준 (2015a). 국외 독서 및 독서교육 연구동향 분석. 정보관리학회지, 32(3), 69-97. http://dx.doi.org/10.3743/KOSIM.2015.32.3.069 (Kim, Pan Jun (2015a). An analytical study on research trends of reading and reading instruction in overseas: Focused on Library and Information Science. Journal of the Korean Society for Information Management, 32(3), 69-97. http://dx.doi.org/10.3743/KOSIM.2015.32.3.069)
  6. 김판준 (2015b). 디지털 큐레이션 연구동향 분석과 과제: 문헌정보학 분야를 중심으로. 정보관리학회지, 32(1), 265-295. http://dx.doi.org/10.3743/KOSIM.2015.32.1.265 (Kim, Pan Jun (2015b). An analytical study on research trends of digital curation: Focused on Library and Information Science. Journal of the Korean Society for Information Management, 32(1), 265-295. http://dx.doi.org/10.3743/KOSIM.2015.32.1.265)
  7. 김하진, 송민 (2014). 동시출현단어 분석을 통한 국내외 정보학 학회지 연구동향 파악. 정보관리학회지, 31(1), 99-118. http://dx.doi.org/10.3743/KOSIM.2014.31.1.099 (Kim, Ha Jin, & Song, Min (2014). A study on the research trends in domestic/international information science articles by co-word analysis. Journal of the Korean Society for Information Management, 31(1), 99-118. http://dx.doi.org/10.3743/KOSIM.2014.31.1.099
  8. 서선경, 정은경 (2013). 동시출현단어 분석 기반 오픈 액세스 분야 지적구조에 관한 연구. 한국비블리아학회지, 24(1), 207-228. https://doi.org/10.14699/kbiblia.2013.24.1.207 (Seo, SunKyung, & Chung, EunKyung (2013). Domain analysis on the field of open access by co-word analysis. Journal of the Korean Biblia Society for Library and Information Science, 24(1), 207-228. https://doi.org/10.14699/kbiblia.2013.24.1.207)
  9. 이명호 (2016). 데이터 사이언스 교과과정에 대한 연구. 한국비블리아학회지, 27(1), 263-290. http://dx.doi.org/10.14699/kbiblia.2016.27.1.263 (Yi, Myongho (2016). A study on the curriculums of Data Science. Journal of the Korean Biblia Society for Library and Information Science, 27(1), 263-290. http://dx.doi.org/10.14699/kbiblia.2016.27.1.263)
  10. 이수상 (2014). 언어 네트워크 분석 방법을 활용한 학술논문의 내용분석. 정보관리학회지, 31(4), 49-68. http://dx.doi.org/10.3743/KOSIM.31.4.049 (Lee, Soo-Sang (2014). A content analysis of journal articles using the language network analysis methods. Journal of the Korean Society for Information Management, 31(4), 49-68. http://dx.doi.org/10.3743/KOSIM.31.4.049)
  11. 이재윤 (2006). 지적 구조 분석을 위한 새로운 클러스터링 기법에 관한 연구. 정보관리학회지, 23(4), 215-231. https://doi.org/10.3743/kosim.2006.23.4.215 (Lee, Jae Yun (2006). A novel clustering method for examining and analyzing the intellectual structure of a scholarly field. Journal of the Korean Society for Information Management, 23(4), 215-231. https://doi.org/10.3743/kosim.2006.23.4.215)
  12. 이재윤 (2013). tnet과 WNET의 가중 네트워크 중심성 지수 비교 연구. 정보관리학회지, 30(4), 241-264. http://dx.doi.org/10.3743/KOSIM.2013.30.4.241 (Lee, Jae Yun (2013). A comparison study on the weighted network centrality measures of tnet and WNET. Journal of the Korean Society for Information Management, 30(4), 241-264. http://dx.doi.org/10.3743/KOSIM.2013.30.4.241)
  13. 이재윤 (2015). 데이터 사이언스와 데이터 리터러시. 한국정보관리학회 학술대회 논문집, 11-15. (Lee, Jae Yun (2015). Data Science and data literacy. Proceedings of the 22nd Conference of Korean Society for Information Management, 11-15.)
  14. 이재윤, 김수정 (2016). 국내 재난 관련 연구 동향에 대한 계량정보학적 분석. 정보관리학회지, 33(3), 103-124. http://dx.doi.org/10.3743/KOSIM.2016.33.4.103 (Lee, Jae Yun, & Kim, Soojung (2016). A bibliometric analysis of research trends on disaster in Korea. Journal of the Korean Society for Information Management, 33(3), 103-124. http://dx.doi.org/10.3743/KOSIM.2016.33.4.103)
  15. 이정규, 이용구 (2013). 동시출현단어 분석을 이용한 도서관경영 분야의 지적구조 분석. 한국정보관리학회 학술대회 논문집, 23-26. (Lee, Jung-Gyu, & Lee, Yong-Gu (2013). A study on the analysis of intellectual structure of library management studies using co-word analysis. Proceedings of the 20th Conference of Korean Society for Information Management, 23-26.)
  16. 장령령, 홍현진 (2014). 학술지 중요도와 키워드 순서를 고려한 단어동시출현분석을 이용한 독서분야의 지적구조 분석. 한국비블리아학회지, 25(1), 295-318. http://dx.doi.org/10.14699/kbiblia.2014.25.1.295 (Zhang, Ling Ling, & Hong, Hyun Jin (2014). Examining the intellectual structure of reading studies with co-word analysis based on the importance of journals and sequence of keywords. Journal of the Korean Biblia Society for Library and Information Science, 25(1), 295-318. http://dx.doi.org/10.14699/kbiblia.2014.25.1.295)
  17. 최상희 (2014). 동시출현단어분석을 이용한 연관영화정보 분석 연구. 정보관리학회지, 31(4), 161-178. http://dx.doi.org/10.3743/KOSIM.2014.31.4.161 (Choi, Sanghee (2014). An analysis of related movie information using the co-word method. Journal of the Korean Society for Information Management, 31(4), 161-178. http://dx.doi.org/10.3743/KOSIM.2014.31.4.161)
  18. 최예진, 정연경 (2016). 동시출현단어 분석에 기반한 메타데이터 분야의 지적구조에 관한 연구. 정보관리학회지, 33(3), 63-83. http://dx.doi.org/10.3743/KOSIM.2016.33.3.063 (Choi, Ye-Jin, & Chung, Yeon-Kyoung (2016). A study on the intellectual structure of metadata research by using co-word analysis. Journal of the Korean Society for Information Management, 33(3), 63-83. http://dx.doi.org/10.3743/KOSIM.2016.33.3.063)
  19. 최형욱, 정은경 (2017). 사회학 분야의 연구데이터 특성과 지적구조 규명에 관한 연구. 정보관리학회지, 34(3), 109-124. http://dx.doi.org/10.3743/KOSIM.2017.34.3.109 (Choi, Hyung Wook, & Chung, EunKyung (2017). An investigation on characteristics and intellectual structure of sociology by analyzing cited data. Journal of the Korean Society for Information Management, 34(3), 109-124. http://dx.doi.org/10.3743/KOSIM.2017.34.3.109)
  20. 허고은, 송민 (2013). 저자동시인용 분석과 동시출현단어 분석을 이용한 의료정보학 저널의 지적구조분석. 정보관리학회지, 30(2), 207-225. https://doi.org/10.3743/kosim.2013.30.2.207 (Heo, Go Eun, & Song, Min (2013). Examining the intellectual structure of a medical informatics journal with author co-citation analysis and co-word analysis. Journal of the Korean Society for Information Management, 30(2), 207-225. https://doi.org/10.3743/kosim.2013.30.2.207)
  21. Cervone, H. F. (2015). Informatics and data science: An overview for the information professional. Digital Library Perspectives, 32(1), 7-10. http://dx.doi.org/10.1108/DLP-10-2015-0022
  22. Cho, J. (2014). Intellectual structure of the institutional repository field: A co-word analysis. Journal of Information Science, 40(3), 386-397. http://dx.doi.org/10.1177/0165551514524686
  23. Danowski, J. (1993). Network analysis of message content. In W. D. Richards Jr. & G. A. Barnett (Eds.), Progress in Communication Sciences IV (pp. 197-221). Norwood, NJ: Ablex.
  24. Gan, C., & Wang, W. (2015). Research characteristics and status on social media in China: A bibliometric and co-word analysis. Scientometrics, 105(2), 1167-1182. https://doi.org/10.1007/s11192-015-1723-2
  25. He, Q. (1999). Knowledge discovery through co-word analysis. Library Trends, 48(1), 133-159.
  26. Hu, C. P., Hu, J. M., Deng, S. L., & Liu, Y. (2013). A co-word analysis of library and information science in China. Scientometrics, 97(2), 369-382. https://doi.org/10.1007/s11192-013-1076-7
  27. Kuhn, T. (1996). The Structure of Scientific Revolutions (3rd ed.). Chicago: The University of Chicago Press.
  28. Leydesdorff, L., & Nerghes, A. (2017). Co-word maps and topic modeling: A comparison using small and medium-sized corpora (N<1,000). Journal of the Association for Information Science and Technology, 68(4), 1024-1035. http://dx.doi.org/10.1002/asi.23740
  29. Liu, G. Y., Hu, J. M., & Wang, H. L. (2012). A co-word analysis of digital library field in China. Scientometrics, 91(1), 203-217. https://doi.org/10.1007/s11192-011-0586-4
  30. Nguyen, L. T. K., & Keip, M. (2018). A data-driven approach to nonlinear elasticity. Computers and Structures, 194, 97-115. http://dx.doi.org/10.1016/j.compstruc.2017.07.031
  31. Ravikumar, S., Agrahari, A., & Singh, S. N. (2015). Mapping the intellectual structure of scientometrics: A co-word analysis of the journal Scientometrics (2005-2010). Scientometrics, 102, 929-955. http://dx.doi.org/10.1007/s11192-014-1402-8
  32. Tang, R., & Sae-Lim, W. (2016). Data Science programs in U.S. higher education: An exploratory content analysis of program description, curriculum structure, and course focus. Education for Information, 32, 269-290. http://dx.doi.org/10.3233/EFI-160977
  33. Wang, Z. Y., Li, G., Li, C. Y., & Li, A. (2012). Research on the semantic-based co-word analysis. Scientometrics, 90(3), 855-875. https://doi.org/10.1007/s11192-011-0563-y
  34. Wang, Z., Zhao, H., & Wang, Y. (2015). Social networks in marketing research 2001-2014: A co-word analysis. Scientometrics, 105(1), 65-82. https://doi.org/10.1007/s11192-015-1672-9
  35. Web of Science (2017, November 24). Web of Science Core Collection Help. Retrieved from http://images.webofknowledge.com/WOKRS526R11/help/WOS/contents.html
  36. White, H. D. (2003). Pathfinder networks and author cocitation analysis: A remapping of paradigmatic information scientists. Journal of the American Society for Information Science and Technology, 54(5), 423-434. https://doi.org/10.1002/asi.10228
  37. Yang, Y., Wu, M., & Cui, L. (2012). Integration of three visualization methods based on co-word analysis. Scientometrics, 90(2), 659-673. https://doi.org/10.1007/s11192-011-0541-4
  38. Zong, Q. J., Shen, H. Z., Yuan, Q. J., Hu, X. W., Hou, Z. P., & Deng, S. G. (2013). Doctoral dissertations of library and information science in China: A co-word analysis. Scientometrics, 94(2), 781-799. https://doi.org/10.1007/s11192-012-0799-1