DOI QR코드

DOI QR Code

Improved Multidimensional Scaling Techniques Considering Cluster Analysis: Cluster-oriented Scaling

클러스터링을 고려한 다차원척도법의 개선: 군집 지향 척도법

  • Received : 2012.05.07
  • Accepted : 2012.05.30
  • Published : 2012.06.30

Abstract

There have been many methods and algorithms proposed for multidimensional scaling to mapping the relationships between data objects into low dimensional space. But traditional techniques, such as PROXSCAL or ALSCAL, were found not effective for visualizing the proximities between objects and the structure of clusters of large data sets have more than 50 objects. The CLUSCAL(CLUster-oriented SCALing) technique introduced in this paper differs from them especially in that it uses cluster structure of input data set. The CLUSCAL procedure was tested and evaluated on two data sets, one is 50 authors co-citation data and the other is 85 words co-occurrence data. The results can be regarded as promising the usefulness of CLUSCAL method especially in identifying clusters on MDS maps.

개체들 사이의 관계를 저차원 공간에 매핑하는 다차원척도법을 수행하기 위한 다양한 방법과 알고리즘이 개발되어왔다. 그러나 PROXSCAL이나 ALSCAL과 같은 기존의 기법들은 50개 이상의 개체를 포함하는 데이터 집합을 대상으로 개체 간의 관계와 군집 구조를 시각화하는데 있어서 효과적이지 못한 것으로 나타났다. 이 연구에서 제안하는 군집 지향 척도법 CLUSCAL(CLUster-oriented SCALing)은 기존 방법과 달리 입력되는 데이터의 군집 구조를 고려하도록 고안되었다. 50명의 저자동시인용 데이터와 85개 단어의 동시출현 데이터에 대해서 적용해본 결과 제안한 CLUSCAL 기법은 군집 구조를 잘 식별할 수 있는 MDS 지도를 생성하는 유용한 기법임이 확인되었다.

Keywords

References

  1. 곽선영, 정은경 (2012). 복수저자기반 동시인용분석을 활용한 지적구조 분석. 정보관리학회지, 29(1), 115-134. http://dx.doi.org/10.3743/KOSIM.2012.29.1.115(Kwak, Sun-Young, & Chung, Eunkyung (2012). Domain analysis on economics by utilizing cocitation analysis of multiple authorship. Journal of the Korean Society for Information Management, 29(1), 115-134. http://dx.doi.org/10.3743/KOSIM.2012.29.1.115)
  2. 김판준 (2011). 저자 프로파일링 기법을 이용한 국내 독서 연구 영역 분석. 한국비블리아학회지, 22(4), 21-44.(Kim, Pan Jun (2011). Domain analysis of reading research in Korea using author profiling. Journal of the Korean Biblia Society for Library and Information Science, 22(4), 21-44.)
  3. 김희전, 조현양 (2010). 저자동시인용분석과 저자서지결합분석에 의한 지적 구조 분석. 정보관리학회지, 27(3), 283-306. http://dx.doi.org/10.3743/KOSIM.2010.27.3.283(Kim, Hee-Jeon, &, Cho, Hyun-Yang (2010). A study on intellectual structure using author co-citation analysis and author bibliographic coupling analysis in the field of social welfare science. Journal of the Korean Society for Information Management, 27(3), 283-306. http://dx.doi.org/10.3743/KOSIM.2010.27.3.283)
  4. 문주영 (2011). 2000년대 비서학연구의 저자동시인용분석. 비서학논총, 20(1), 25-44.(Moon, Ju Young (2011). Study on intellectual structure of secretarial studies using author co-citation analysis. Journal of Secretarial Science, 20(1), 25-44.)
  5. 박재신, 정영미 (2010). 지구적 환경문제 해결을 위한 학술활동과 환경운동 경향 연구. 정보관리학회지, 27(3), 83-102. http://dx.doi.org/10.3743/KOSIM.2010.27.3.083(Park, Jae-Shin, & Chung, Young Mee (2010). An informetric study on academic activities and environmental movements in solving global environmental problems. Journal of the Korean Society for Information Management, 27(3), 83-102. http://dx.doi.org/10.3743/KOSIM.2010.27.3.083)
  6. 유종덕, 최은주 (2011). 저자프로파일링분석과 저자동시인용분석의 유용성 비교 검증. 정보관리학회지, 28(1), 123-144. http://dx.doi.org/10.3743/KOSIM.2011.28.1.123(Ryoo, Jong-duk, & Choi, Eun-Ju (2011). A comparison test on the potential utility between Author Profiling Analysis (APA) and Author Co-Citation Analysis (ACA). Journal of the Korean Society for Information Management, 28(1), 123-144. http://dx.doi.org/10.3743/KOSIM.2011.28.1.123)
  7. 이은숙, 정영미 (2002). 복수저자를 고려한 동시인용분석 연구: 정보학과 컴퓨터과학을 대상으로. 지식처리연구, 3(2), 1-26. Retrieved from http://jkpm.yonsei.ac.kr/fulltext/v3n2a1.pdf(Lee, Eun Suk, & Chung, Young Mee (2002). A co-citation analysis of multiple authorship in the subject fields of information science and computer science. Journal of Knowledge Processing, 3(2), 1-26. Retrieved from http://jkpm.yonsei.ac.kr/fulltext/v3n2a1.pdf)
  8. 이재윤 (2007). 지적 구조 분석을 위한 MDS 지도 작성 방식의 비교 분석. 한국문헌정보학회지, 41(2), 335-357. http://dx.doi.org/10.4275/KSLIS.2007.41.2.335(Lee, Jae Yun (2007). A comparison analysis of various approaches to multidimensional scaling in mapping a knowledge domain's intellectual structure. Journal of the Korean Society for Library and Information Science, 41(2), 335-357. http://dx.doi.org/10.4275/KSLIS.2007.41.2.335)
  9. 이재윤, 김희전, 유종덕 (2010). 저자프로파일링과 요인분석을 이용한 국내 주거학 분야의 지적 구조 분석. 한국문헌정보학회지, 44(2), 285-308. http://dx.doi.org/10.4275/KSLIS.2010.44.2.285(Lee, Jae Yun, Kim, Hee-Jeon, & Ryoo, Jong-Duk (2010). Examining the intellectual structure of housing studies in Korea with text mining and factor Analysis. Journal of the Korean Society for Library and Information Science, 44(2), 285-308. http://dx.doi.org/10.4275/KSLIS.2010.44.2.285)
  10. 이재윤, 정주희 (2006). 연구자 소속과 표제어 분석을 통한 국내 인지과학 분야의 학제적 구조 파악. 제13회 한국정보관리학회 학술대회 논문집, 127-134.(Lee, Jae Yun, & Jung, Ju Hee (2006). Examining the interdisciplinary structure of Korean cognitive science through analyzing author affiliations and title words. Proceedings of the 13th Annual Conference of Korean Society for Information Management, 127-134.)
  11. 정영미, 이재윤 (2001). 지식 분류의 자동화를 위한 클러스터링 모형 연구. 정보관리학회지, 18(2), 203-230.(Chung, Young Mee, & Lee, Jae Yun (2001). Development of a clustering model for automatic knowledge classification. Journal of the Korean Society for Information Management, 18(2), 203-230.)
  12. 조선례, 이재윤 (2012). 약학 분야 학술정보서비스를 위한 학술지 동시인용 분석. 정보관리연구, 43(1), 159-185. http://dx.doi.org/10.1633/JIM.2012.43.1.159(Joe, Seon-Rye, & Lee, Jae Yun (2012). Journal co-citation analysis for library services in pharmaceutics. Journal of Information Management, 43(1), 159-185. http://dx.doi.org/10.1633/JIM.2012.43.1.159)
  13. 조재인 (2011). 네트워크 텍스트 분석을 통한 문헌정보학 최근 연구 동향 분석. 정보관리학회지, 28(4), 65-83. http://dx.doi.org/10.3743/KOSIM.2011.28.4.065(Cho, Jane (2011). A study for research area of library and information science by network text analysis. Journal of the Korean Society for Information Management, 28(4), 65-83. http://dx.doi.org/10.3743/KOSIM.2011.28.4.065)
  14. Borg, I., & Groenen, P. J. F. (2005). Modern multidimensional scaling: Theory and Applications (2nd ed). New York: Springer Science+Business Media, Inc.
  15. Borner, K., Chen, C., & Boyack, K. (2003). Visualizing knowledge domains. Annual Review of Information Science & Technology, 37, 179-255.
  16. Chen, C. (2006). Information visualization: Beyond the horizon (2nd ed.). Springer-Verlag London Limited.
  17. Choi, Sanghee (2011). An informetric analysis on intellectual structures with multiple features of academic library research papers. Journal of the Korean Society for Information Management, 28(2), 65-78. http://dx.doi.org/10.3743/KOSIM.2011.28.2.065
  18. Deerwester, S., Dumais, S. T., Furnas, G. W., Landauer, T. K., & Harshman, R. (1990). Indexing by latent semantic analysis. Journal of the American Society for Information Science, 41(6), 391-407. http://dx.doi.org/10.1002/(SICI)1097-4571(199009)41:6<391::AID-ASI1>3.0.CO;2-9
  19. Green, P. E., & Rao, V. (1972). Applied multidimensional scaling. Hinsdal, IL: Dryden.
  20. Lee, Jae Yun, & Choi, Sanghee (2011). Intellectual structure and infrastructure of informetrics. Journal of the Korean Society for Information Management, 28(2), 11-36. http://dx.doi.org/10.3743/KOSIM.2011.28.2.011
  21. Park, Myung-Kyu, & Kim, Heejung (2011). A bibliometric analysis of the literature on information literacy. Journal of the Korean Society for Information Management, 28(2), 53-63. http://dx.doi.org/10.3743/KOSIM.2011.28.2.053
  22. Small, H. (1973). Co-citation in the scientific literature: A new measure of the relationship between publications. Journal of the American Society for Information Science, 24(4), 265-269. https://doi.org/10.1002/asi.4630240406
  23. White, H. D., & Griffith, B. C. (1981). Author cocitation: A literature measure of intellectual structure. Journal of the American Society for Information Science, 32(3), 163-171. http://dx.doi.org/10.1002/asi.4630320302