A Review of Data Analytic Applications in Road Traffic Safety. Part 1: Descriptive and Predictive Modeling

Sensors (Basel). 2020 Feb 18;20(4):1107. doi: 10.3390/s20041107.

Authors

Amir Mehdizadeh¹, Miao Cai², Qiong Hu¹, Mohammad Ali Alamdar Yazdi³, Nasrin Mohabbati-Kalejahi⁴, Alexander Vinel¹, Steven E Rigdon², Karen C Davis⁵, Fadel M Megahed⁶

Affiliations

¹ Department of Industrial and Systems Engineering, Auburn University, Auburn, AL 36849, USA.
² College for Public Health and Social Justice, Saint Louis University, St. Louis, MO 63103, USA.
³ Carey Business School, Johns Hopkins University, Baltimore, MD 21202, USA.
⁴ Jack H. Brown College of Business and Public Administration, California State University at San Bernardino, San Bernardino, CA 92407, USA, nasrin.mohabbati@csusb.edu.
⁵ Department of Computer Science and Software Engineering, Miami University, Oxford, OH 45056, USA.
⁶ Farmer School of Business, Miami University, Oxford, OH 45056, USA.

Abstract

This part of the review aims to reduce the start-up burden of data collection and descriptive analytics for statistical modeling and route optimization of risk associated with motor vehicles. From a data-driven bibliometric analysis, we show that the literature is divided into two disparate research streams: (a) predictive or explanatory models that attempt to understand and quantify crash risk based on different driving conditions, and (b) optimization techniques that focus on minimizing crash risk through route/path-selection and rest-break scheduling. Translation of research outcomes between these two streams is limited. To overcome this issue, we present publicly available high-quality data sources (different study designs, outcome variables, and predictor variables) and descriptive analytic techniques (data summarization, visualization, and dimension reduction) that can be used to achieve safer-routing and provide code to facilitate data collection/exploration by practitioners/researchers. Then, we review the statistical and machine learning models used for crash risk modeling. We show that (near) real-time crash risk is rarely considered, which might explain why the optimization models (reviewed in Part 2) have not capitalized on the research outcomes from the first stream.

Keywords: crash risk modeling; data visualization; descriptive analytics; highway safety; predictive analytics.

Publication types

Review

Abstract

Publication types

Grants and funding