DOI QR코드

DOI QR Code

Bridge Damage Factor Recognition from Inspection Reports Using Deep Learning

딥러닝 기반 교량 점검보고서의 손상 인자 인식

  • 정세환 (서울대학교 건설환경공학부) ;
  • 문성현 (서울대학교 건설환경공학부) ;
  • 지석호 (서울대학교 건설환경공학부, 서울대학교 건설환경종합연구소)
  • Received : 2018.04.18
  • Accepted : 2018.05.08
  • Published : 2018.08.01

Abstract

This paper proposes a method for bridge damage factor recognition from inspection reports using deep learning. Bridge inspection reports contains inspection results including identified damages and causal analysis results. However, collecting such information from inspection reports manually is limited due to their considerable amount. Therefore, this paper proposes a model for recognizing bridge damage factor from inspection reports applying Named Entity Recognition (NER) using deep learning. Named Entity Recognition, Word Embedding, Recurrent Neural Network, one of deep learning methods, were applied to construct the proposed model. Experimental results showed that the proposed model has abilities to 1) recognize damage and damage factor included in a training data, 2) distinguish a specific word as a damage or a damage factor, depending on its context, and 3) recognize new damage words not included in a training data.

본 연구는 딥러닝을 활용하여 교량 점검보고서에서 손상 및 손상 인자를 자동으로 식별하는 방법을 제안한다. 교량 점검보고서에는 점검 결과 발견된 손상 및 원인 분석 결과가 기록되어 있다. 그러나 점검보고서의 양이 방대하여 인력으로 보고서로부터 정보를 수집하는 데 한계가 있다. 따라서 본 연구에서는 딥러닝 기반 개체명 인식 방법을 활용하여 교량 점검보고서 텍스트로부터 손상 및 손상 인자에 해당하는 단어들을 식별할 수 있는 모델을 제안한다. 모델 구현의 주요 방법론으로는 개체명 인식(Named Entity Recognition), 워드 임베딩(Word Embedding), 딥러닝의 일종인 순환신경망(Recurrent Neural Network)을 활용하였다. 실험 결과 제안된 모델은 1)훈련 데이터에 포함된 손상 및 손상 인자 단어들을 잘 식별할 수 있고, 2)단어 주변 맥락에 따라 특정 단어가 손상에 해당하는지 손상 인자에 해당하는지 잘 판별할 수 있을 뿐만 아니라, 3)훈련 데이터에 포함되지 않은 새로운 종류의 손상 단어도 잘 인식할 수 있는 것으로 확인되었다.

Keywords

References

  1. Jeong, C. W. and Kim, J. J. (2012). "Analysis of trend in construction using textmining method." Journal of the Korean Digital Architecture Interior Association, Vol. 12, No. 2, pp. 53-60 (in Korean).
  2. Lee, I. K., Moon, M. K, Park, H. S., Jeon, J. C. and Lee, H. H. (2014). "Statistical analysis of damages in expressway bridges." Magazine of the Korea Institute for Structural Maintenance and Inspection, Vol. 18, No. 2, pp. 2-9 (in Korean).
  3. Lee, J. H., Yi, J. S. and Son, J. (2016). "Unstructured construction data analytics using R programming - focused on overseas construction adjudication cases." Journal of the Architectural Institute of Korea Structure & Construction, Vol. 32, No. 5, pp. 37-44 (in Korean). https://doi.org/10.5659/JAIK_SC.2016.32.5.37
  4. Liu, K. and El-Gohary, N. (2017). "Ontology-based semi-supervised conditional random fields for automated information extraction from bridge inspection reports." Automation in Construction, Vol. 81, pp. 313-327. https://doi.org/10.1016/j.autcon.2017.02.003
  5. Lokuge, W., Gamage, N. and Setunge, S. (2016). "Fault tree analysis method for deterioration of timber bridges using an Australian case study." Built Environment Project and Asset Management, Vol. 6, No. 3, pp. 332-344. https://doi.org/10.1108/BEPAM-01-2016-0001
  6. Mikolov, T., Chen, K., Corrado, G. and Dean, J. (2013). "Efficient estimation of word representations in vector space." Available at: http://arxiv.org/abs/1301.3781.
  7. Peris-Sayol, G., Paya-Zaforteza, I., Balasch-Parisi, S. and Alos-Moya, J. (2017). "Detailed analysis of the causes of bridge fires and their associated damage levels." Journal of Performance of Constructed Facilities, Vol. 31, No. 3, 04016108. https://doi.org/10.1061/(ASCE)CF.1943-5509.0000977
  8. Ryu, J. M. and Shin, E. C. (2014). "Database construction plan of infrastructure safety inspection and in-depth inspection results." Journal of Korean Geosynthetics Society, Vol. 13, No. 4, pp. 133-141 (in Korean). https://doi.org/10.12814/jkgss.2014.13.4.133
  9. Schuster, M. and Paliwal, K. K. (1997). "Bidirectional recurrent neural networks." IEEE Transactions on Signal Processing, Vol. 45, No. 11, pp. 2673-2681. https://doi.org/10.1109/78.650093
  10. Tanabe, L., Xie, N., Thom, L. H., Matten, W. and Wilbur, W. J. (2005). "GENETAG: A tagged corpus for gene/protein named entity recognition." BMC Bioinformatics, Vol. 6 (Suppl 1), pp. 1-7. https://doi.org/10.1186/1471-2105-6-1
  11. Zhu, F., Patumcharoenpol, P., Zhang, C., Yang, Y., Chan, J., Meechai, A., Vongsangnak, W. and Shen, B. (2013). "Biomedical text mining and its applications in cancer research." Journal of Biomedical Informatics, Vol. 46, No. 2, pp. 200-211. https://doi.org/10.1016/j.jbi.2012.10.007