Classification and Conceptualization of Clinical Documents using Formal Concept Analysis

Korean title (translated): Classification and Systematization of Clinical Forms Using FCA

Kim, Myeong-Gi; Hwang, Seok-Hyeong; Kim, Hong-Gi; Kang, Yu-Gyeong; Choe, Hui-Cheol; Kim, Dong-Sun
김명기;황석형;김홍기;강유경;최희철;김동순

  • Published: 2006

Abstract

Objective: Ontology is becoming a core research field in medical informatics. The objective of our ongoing research is to explore the potential role of Formal Concept Analysis (FCA) in supporting context-based ontology building in the medical domain. The concept hierarchy serves as the backbone of an ontology, but constructing it is a complex and time-consuming process. We present a novel approach to the automatic acquisition of taxonomies, or concept hierarchies, from clinical documents.

Methods: Our approach is based on FCA, a mathematical tool used in data analysis and knowledge engineering. FCA provides methods to group objects and attributes into concepts, that is, pairs of object sets (clinical documents) and attribute sets (fields contained in the clinical documents), such that the binary relation between them can be presented as a concept lattice. We applied our approach to 8 clinical documents used in a university hospital. From these experiments, we extracted 15 concepts, with 7 common fields that can be shared among the 8 clinical documents.

Results: We show how FCA can be used to classify clinical documents and to acquire a concept hierarchy for the medical domain from those documents with maximal property factorization.

Conclusion: Our work is based on the concept lattice, which allows a "well-defined" ontological concept hierarchy to be constructed. As an application of this approach, we presented classification results for clinical documents with maximally factorized common fields. We have shown that FCA can be a useful method for classifying and analyzing various medical data by constructing a concept hierarchy. From that concept hierarchy, well-structured facts and knowledge in the medical domain can be acquired.
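The grouping of documents and fields into concepts described in the Methods can be illustrated with a small sketch. The context below is a hypothetical toy example (three invented documents and five invented field names), not the hospital data used in the study, and the brute-force enumeration is chosen only for readability; it is not claimed to be the algorithm or tool the authors used. A formal concept is a pair (extent, intent) in which the extent is exactly the set of documents containing every field of the intent, and the intent is exactly the set of fields common to every document of the extent.

```python
from itertools import combinations

# Hypothetical toy context (NOT the paper's data): document -> fields it contains.
CONTEXT = {
    "admission_note": {"patient_id", "name", "diagnosis"},
    "discharge_summary": {"patient_id", "name", "diagnosis", "discharge_date"},
    "nursing_record": {"patient_id", "name", "vital_signs"},
}


def common_fields(docs, context):
    """Fields shared by every document in `docs` (all fields if `docs` is empty)."""
    shared = set().union(*context.values())
    for doc in docs:
        shared &= context[doc]
    return frozenset(shared)


def docs_with(fields, context):
    """Documents that contain every field in `fields`."""
    return frozenset(d for d, f in context.items() if fields <= f)


def formal_concepts(context):
    """Enumerate all formal concepts by closing every subset of documents.

    Exponential in the number of documents, so suitable only for tiny contexts.
    """
    concepts = set()
    docs = list(context)
    for size in range(len(docs) + 1):
        for subset in combinations(docs, size):
            intent = common_fields(subset, context)
            extent = docs_with(intent, context)
            concepts.add((extent, intent))
    return concepts


def is_subconcept(lower, upper):
    """Lattice order: `lower` is below `upper` if its extent is contained in upper's."""
    return lower[0] <= upper[0]


if __name__ == "__main__":
    # Print concepts from the most general (largest extent) to the most specific.
    for extent, intent in sorted(formal_concepts(CONTEXT), key=lambda c: -len(c[0])):
        print(sorted(extent), "<->", sorted(intent))
```

On this toy context the enumeration yields five concepts. The most general one pairs all three documents with the fields they all share (`patient_id`, `name`), and ordering the concepts by extent inclusion gives the hierarchy that a concept lattice depicts.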
