An Experimental Study on the Automatic Interlinking of Meaning for the LOD Construction of Record Information

기록정보 LOD 구축을 위한 의미 상호연결 자동화 실험 연구

  • Received : 2017.10.31
  • Accepted : 2017.11.16
  • Published : 2017.11.30

Abstract

In new technological environments such as big data and AI, LOD will connect record information resources with a wide range of internal and external data. Interlinking technology is at the heart of this connection, and interlinked LOD will make it possible to open record information at the highest level of open data. Given the ever-increasing volume of records, automation through interlinking algorithms is essential when building LOD. This paper therefore analyzes the structure through which record information is interlinked with external data and the characteristics of record information that must be considered during interlinking. We collected samples from the CAMS data of the National Archives of Korea and constructed an LOD of record information. We then ran a testbed that automatically interlinks the person information in the record metadata with DBpedia. Through the testbed, we verified the automated interlinking process and examined the performance and accuracy of the automation technology. From the implications of the testbed, we identified considerations for the interlinking process of record information LOD.

빅데이터, 인공지능 등 신기술 환경에서 LOD는 기록정보자원을 내외부의 다양한 데이터들과 연결되도록 할 것이다. 이러한 연결의 중심에는 상호연결(Interlinking) 기술이 존재하며, 상호연결된 LOD는 기록정보 개방을 데이터 개방(Open Data)의 최상위 단계로 실현할 것이다. 지속적으로 증가하는 기록의 양을 감안하면, LOD 구축 시 상호연결 알고리즘을 통한 자동화는 필수적이다. 이에 본 연구는 기록정보가 외부 데이터와 상호연결되는 구조와 상호연결 시 고려해야 할 기록정보의 특성을 분석하였다. 또한 국가기록원 CAMS 데이터의 샘플을 수집하여 기록정보 LOD를 구축한 뒤, 기록물 메타데이터의 인물정보를 DBPedia와 자동으로 상호연결하는 테스트베드를 진행하였다. 이를 통해 상호연결 자동화 프로세스를 확인하고, 자동화 기술의 성능과 정확도를 확인하였다. 그리고 테스트베드를 통해 얻은 시사점을 통해 기록정보 LOD 상호연결 과정의 고려사항을 파악하였다.
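The abstract does not say which matching algorithm or tool the testbed used, so the following is only a minimal sketch of what automatic interlinking of person names could look like. It assumes a plain string-similarity comparison (Python's `difflib`) as a stand-in metric, a hypothetical threshold of 0.9, and made-up sample records and candidate labels. Matches are written as `owl:sameAs` statements in N-Triples form.

```python
from difflib import SequenceMatcher

OWL_SAME_AS = "http://www.w3.org/2002/07/owl#sameAs"


def normalize(name: str) -> str:
    """Lowercase and collapse whitespace so trivial variants compare equal."""
    return " ".join(name.lower().split())


def similarity(a: str, b: str) -> float:
    """Character-level similarity in [0, 1]; a stand-in metric, not the paper's."""
    return SequenceMatcher(None, normalize(a), normalize(b)).ratio()


def interlink(records, candidates, threshold=0.9):
    """Link each record person to its best-scoring candidate at or above threshold.

    records:    {local_uri: person_name}
    candidates: {external_uri: label}
    Returns a list of N-Triples lines asserting owl:sameAs.
    """
    triples = []
    for local_uri, name in records.items():
        best_uri, best_score = None, 0.0
        for ext_uri, label in candidates.items():
            score = similarity(name, label)
            if score > best_score:
                best_uri, best_score = ext_uri, score
        if best_uri is not None and best_score >= threshold:
            triples.append(f"<{local_uri}> <{OWL_SAME_AS}> <{best_uri}> .")
    return triples


# Hypothetical sample data; the local URIs and names are illustrative only.
records = {
    "http://example.org/person/1": "Tim Berners-Lee",
    "http://example.org/person/2": "Unknown Official",
}
candidates = {
    "http://dbpedia.org/resource/Tim_Berners-Lee": "Tim Berners-Lee",
    "http://dbpedia.org/resource/Ada_Lovelace": "Ada Lovelace",
}

links = interlink(records, candidates)
for line in links:
    print(line)
```

Name similarity alone cannot distinguish different persons who share a name, so a real pipeline would typically compare additional attributes before asserting a link.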
