DOI QR코드

DOI QR Code

Text Analytics for Classifying Types of Accident Occurrence Using Accident Report Documents

사고보고문서를 이용한 텍스트 기반 사고발생 유형 및 관계 분석

  • Kim, Beom Soo (Department of Safety Engineering, Pukyong National University) ;
  • Chang, Seongrok (Department of Safety Engineering, Pukyong National University) ;
  • Suh, Yongyoon (Department of Safety Engineering, Pukyong National University)
  • Received : 2018.04.03
  • Accepted : 2018.06.19
  • Published : 2018.06.30

Abstract

Recently, a lot of accident report documents have accumulated in almost all of industries, including critical information of accidents. Accordingly, text data contained in accident report documents are considered useful information for understanding accident processes. However, there has been a lack of systematic approaches to analyzing accident report documents. In this respect, this paper aims at proposing text analytics approach to extracting critical information on accident processes. To be specific, major causes of the accident occurrence are classified based on text information contained in accident report documents by using both textmining and latent Dirichlet allocation (LDA) algorithms. The textmining algorithm is used to structure the document-term matrix and the LDA algorithm is applied to extract latent topics included in a lot of accident report documents. We extract ten topics of accidents as accident types and related keywords of accidents with respect to each accident type. The cause-and-effect diagram is then depicted as a tool for navigating processes of the accident occurrence by structuring causes extracted from LDA. Further, the trends of accidents are identified to explore patterns of accident occurrence in each of types. Three patterns of increasing to decreasing, decreasing to increasing, or only increasing are presented in the case of a chemical plant. The proposed approach helps safety managers systematically supervise the causes and processes of accidents through analysis of text information contained in accident report documents.

Keywords

References

  1. H. S. Lee and J. P. Yim, "A Study on Prevention Measure Establishment through Cause Analysis of Chemical-Accidents", J. Korean Soc. Saf., Vol. 32, No. 3, pp. 21-27, 2017. https://doi.org/10.14346/JKOSOS.2017.32.3.21
  2. D. H. Choi, J. W. Choi and W. C. Shin, "Accident Analysis and Research on Risk of the Actual Conditions", J. Korean Soc. Saf., Vol. 27, No. 5, pp. 111-116, 2012. https://doi.org/10.14346/JKOSOS.2012.27.5.111
  3. G. H. Choi, "Cause Analysis of Accidents Associated with Industrial Machines and Devices", J. Korean Soc. Saf., Vol. 33, No. 1, pp. 16-21, 2018. https://doi.org/10.14346/JKOSOS.2018.33.1.16
  4. F. Abdat, S. Leclercq, X. Cuny and C. Tissot, Extracting Recurrent Scenarios from Narrative Texts Using a Bayesian Network: Application to Serious Occupational Aaccidents with Movement Disturbance, Accident Analysis and Prevention, Vol. 70, pp. 155-166. 2014. https://doi.org/10.1016/j.aap.2014.04.004
  5. R. Moura, M. Beer, E. Patelli and J. Lewis, "Learning from Major Accidents: Graphical Representation and Analysis of Multi-attribute Events to Enhance Risk Communication", Safety Science, Vol. 99, pp. 58-70, 2017. https://doi.org/10.1016/j.ssci.2017.03.005
  6. G. Ahn, M. Seo and S. Hur, "Development of Accident Classification Model and Ontology for Effective Industrial Accident Analysis based on Textmining", J. Korean Soc. Saf., Vol. 32, No. 5, pp. 179-185, 2017. https://doi.org/10.14346/JKOSOS.2017.32.5.179
  7. Y. Suh, "Data Analytics for Social Risk Forecasting and Assessment of New Technology, J. Korean Soc. Saf., Vol. 32, No. 3, pp. 83-89, 2017. https://doi.org/10.14346/JKOSOS.2017.32.3.83
  8. Q. Fang, H. Li, X. Luo, L. Ding, T. M. Rose, W. An and Y. Yu, "A Deep Learning-Based Method for Detecting Non-certified Work on Construction Sites", Advanced Engineering Informatics, Vol. 35, pp. 56-68, 2018. https://doi.org/10.1016/j.aei.2018.01.001
  9. F. Palamara, F. Piglione and N. Piccinini, "Self-Organizing Map and Clustering Algorithms for the Analysis of Occupational Accident Databases", Safety Science, Vol. 49, pp. 1215-1230, 2011. https://doi.org/10.1016/j.ssci.2011.04.003
  10. H. Kwon, J. Kim and Y. Park, "Applying LSA Text Mining Technique in Envisioning Social Impacts of Emerging Technologies: The Case of Drone Technology", Technovation, Vol. 60-61, pp. 15-28, 2017. https://doi.org/10.1016/j.technovation.2017.01.001
  11. D. M. Blei, A. Y. Ng and M. I. Jordan, "Latent Dirichlet Allocation". Journal of Machine Learning Research, Vol. 3, pp. 993-1022, 2003.
  12. S. Moro, P. Cortez and P. Rita, "Business Intelligence in Banking: A Literature Analysis from 2002 to 2013 Using Text Mining and Latent Dirichlet Allocation", Expert Systems with Applications, Vol. 42, No. 3, pp. 1314-1324, 2015. https://doi.org/10.1016/j.eswa.2014.09.024
  13. S. Tirunillai and G. J. Tellis, "Mining Marketing Meaning from Online Chatter: Strategic Brand Analysis of Big Data Using Latent Dirichlet Allocation". Journal of Marketing Research, Vol. 51, No. 4, pp. 463-479, 2014. https://doi.org/10.1509/jmr.12.0106
  14. D. E. Brown, "Text Mining the Contributors to Rail Accidents", IEEE Transactions on Intelligent Transportation Systems, Vol. 17, No. 2, pp. 346-355, 2016. https://doi.org/10.1109/TITS.2015.2472580
  15. Wei, Xing, and W. Bruce Croft., "LDA-Based Document Models for Ad-hoc Retrieval", Proceedings of the 29th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 178-185, 2006.