Analytical Model for Traffic Congestion and Accident Analysis

Traffic congestion and road accidents have been important public challenges that impose a big burden on society. According to the statistics report from the Association for Safe International Travel (ASIRT), an average American lost $1,377 and 99 hours sitting in traffic in 2019, over 37,000 people die in road crashes each year, and 2.35 million are injured or disabled that costs $230.6 billion per year. The analysis of traffic congestion and road accidents is very complex as they not only affect each other, but they are also affected by many other factors. In this research, we use the US Accidents data from Kaggle that consists of 3 million records of car accidents from February 2016 to December 2019, and 49 variables for the study. We propose to use statistical techniques and machine learning algorithms including Logistic Regression, KNN, Decision Tree Classifier, and Random Forest Classifier to process and train a large amount of data to obtain predictive models for traffic congestion and road accidents. The proposed predictive models will incorporate multiple environmental parameters and are expected to be more accurate with the input of real-time information. The proposed models will assist people in making smart real-time transportation decisions to improve mobility and reduce accidents.


Mineta Consortium for Transportation Mobility
San José State University

Principal Investigator: 

Hongrui Liu, PhD

PI Contact Information:
San José State University

Funding Source(s) and Amounts Provided (by each agency or organization): 

U.S. Department of Transportation, Office of the Assistant Secretary for Research and Technology – $5,008

Total Project Cost: 


Agency ID or Contract Number: 



September 2020 to August 2021

Implementation of Research Outcomes: 

The research will investigate the association between traffic congestion and road accidents so that it can assist policymakers to implement effective transportation policies or transportation decision tools.

Impacts/Benefits of Implementation: 

The proposed models/tools incorporate multiple environmental parameters and together with real-time data are expected to be more accurate in predicting road traffic conditions. The proposed research will assist people in making smart real time transportation decisions to improve roadway safety and relieve traffic congestion.

Project Number: