NLP-derived attributes from construction incident reports remain strongly predictive of independently labeled safety outcomes even after removing potential label leakage, with injury severity now well predicted on a dataset of more than 90,000 reports.
Automatically Learning Construction Injury Precursors from Text
1 Pith paper cite this work. Polarity classification is still indexing.
abstract
In light of the increasing availability of digitally recorded safety reports in the construction industry, it is important to develop methods to exploit these data to improve our understanding of safety incidents and ability to learn from them. In this study, we compare several approaches to automatically learn injury precursors from raw construction accident reports. More precisely, we experiment with two state-of-the-art deep learning architectures for Natural Language Processing (NLP), Convolutional Neural Networks (CNN) and Hierarchical Attention Networks (HAN), and with the established Term Frequency - Inverse Document Frequency representation (TF-IDF) + Support Vector Machine (SVM) approach. For each model, we provide a method to identify (after training) the textual patterns that are, on average, the most predictive of each safety outcome. We show that among those pieces of text, valid injury precursors can be found. The proposed methods can also be used by the user to visualize and understand the models' predictions.
fields
cs.LG 1years
2019 1verdicts
UNVERDICTED 1representative citing papers
citing papers explorer
-
AI-based Prediction of Independent Construction Safety Outcomes from Universal Attributes
NLP-derived attributes from construction incident reports remain strongly predictive of independently labeled safety outcomes even after removing potential label leakage, with injury severity now well predicted on a dataset of more than 90,000 reports.