- The document describes a project that uses KNN and regression tree algorithms to predict flight delays using flight data from 1987-2008.
- KNN performed poorly for predicting flight delay status, with an error rate over 20% for all values of k tested. Regression tree had better results, correctly predicting arrival delay 88% of the time.
- Key factors identified by the regression tree model as contributing to arrival delays were departure delays and shorter taxi/air times. The model was found to correlate well (0.88) with actual test data, suggesting it is a good candidate for predicting delays at JFK airport.