Issue |
MATEC Web Conf.
Volume 251, 2018
VI International Scientific Conference “Integration, Partnership and Innovation in Construction Science and Education” (IPICSE-2018)
|
|
---|---|---|
Article Number | 06020 | |
Number of page(s) | 8 | |
Section | Risk Management in Construction | |
DOI | https://doi.org/10.1051/matecconf/201825106020 | |
Published online | 14 December 2018 |
An exploration of text mining of narrative reports of injury incidents to assess risk
1
Penn State University, Workforce Education and Development, 305D Keller, University Park, Pennsylvania, USA 16802
2
Penn State University, Applied Cognitive Science Lab, E365 IST Building, University Park, Pennsylvania, USA 16802
3
Federal State Educational Institution of Higher Education, National Research Moscow State University of Civil Engineering, Yaroslavskoye Shosse 26, Moscow, 129337 Russian Federation
4
University of North Texas, Learning Technologies, Discovery Park, G150, 3940 North Elm Street, Denton, Texas USA 76207
5
University of Georgia, Lifelong Education, Administration, and Policy, 850 College Station Road, Athens, Georgia, USA 30602
* Corresponding author: passmore.david@gmail.com
A topic model was explored using unsupervised machine learning to summarized free-text narrative reports of 77,215 injuries that occurred in coal mines in the USA between 2000 and 2015. Latent Dirichlet Allocation modeling processes identified six topics from the free-text data. One topic, a theme describing primarily injury incidents resulting in strains and sprains of musculoskeletal systems, revealed differences in topic emphasis by the location of the mine property at which injuries occurred, the degree of injury, and the year of injury occurrence. Text narratives clustered around this topic refer most frequently to surface or other locations rather than underground locations that resulted in disability and that, also, increased secularly over time. The modeling success enjoyed in this exploratory effort suggests that additional topic mining of these injury text narratives is justified, especially using a broad set of covariates to explain variations in topic emphasis and for comparison of surface mining injuries with injuries occurring during site preparation for construction.
© The Authors, published by EDP Sciences, 2018
This is an open access article distributed under the terms of the Creative Commons Attribution License 4.0 (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
Current usage metrics show cumulative count of Article Views (full-text article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.
Data correspond to usage on the plateform after 2015. The current usage metrics is available 48-96 hours after online publication and is updated daily on week days.
Initial download of the metrics may take a while.