This document summarizes a student's analysis of the 2015 Traffic Fatalities dataset released by the National Highway Traffic Safety Administration. The dataset contains 15 tables with information on traffic accidents, vehicles, people involved, and environmental factors. The student cleaned the data, created visualizations, and applied classifiers to predict drunk driving accidents based on time of day. Their analysis aims to provide insights to reduce traffic fatalities by identifying high-risk groups and behaviors. Future work could include fusing multiple tables, analyzing gender bias and distraction effects, and clustering pedestrian accidents to identify at-risk regions. Source codes and visualizations are available on Kaggle where the student is currently ranked 3rd in contributions.
Related topics: