This document discusses predictive analytics for transportation in a high dimensional heterogeneous data world. It covers several topics:
1) The transportation world is generating large amounts of high dimensional data from sources like cameras, GPS, cell phones, and probe vehicles that needs to be combined and analyzed.
2) Connected and automated vehicles will generate huge amounts of detailed data on vehicle movements, passenger activities and intentions that can be used to infer travel patterns and predict crashes.
3) Applying machine learning, advanced computation and domain knowledge is necessary to make sense of this non-standardized, high volume data.