Skip to main content
Vehicle involvements in hydroplaning crashes: Applying interpretable machine learning
Transportation Research Interdisciplinary Perspectives (2020)
  • Subasish Das, Texas A&M Transportation Institute
  • Anandi Dutta, University of Texas at San Antonio
  • Kakan Dey, West Virginia University
  • Mohammad Jalayer, Rowan University
  • Abhisek Mudgal, Indian Institute of Technology (BHU) Varanasi
Although hydroplaning is a major contributor to roadway crashes, it is not typically reported in conventional crash databases. Hence, a framework to classify various crash attributes from police reports and to identify hydroplaning crashes is strongly needed. This study applied natural language processing (NLP) tools to seven years (2010–2016) of crash data from the Louisiana traffic crash database to identify hydroplaning related crashes. This research focused on the development of a framework to apply interpretable machine learning models to unstructured textual content in order to classify the number of vehicle involvements in a crash. This approach evaluated the effectiveness of keywords in determining the classification. This study used three machine learning algorithms. Of these algorithms, the eXtreme Gradient Boosting (XGBoost) model was found to be the most effective classifier. This research provided a platform to understand the application of interpretability in machine learning models. The outcomes of this study prove that underlying trends or precursors can be revealed and analyzed through these models. Furthermore, this indicates that quantitative modeling techniques can be used to address safety concerns.
Publication Date
July 1, 2020
Citation Information
Subasish Das, Anandi Dutta, Kakan Dey, Mohammad Jalayer, et al.. "Vehicle involvements in hydroplaning crashes: Applying interpretable machine learning" Transportation Research Interdisciplinary Perspectives Vol. 6 (2020) p. 100176
Available at: