This document examines the K-nearest neighbor (KNN) algorithm on two datasets. For the Challenger Space Shuttle O-rings dataset, removing attributes did not change the result: the error rate stayed at 22.22%. For the larger El Nino dataset, removing the latitude, longitude, and temperature attributes gave the largest improvement, reducing the error rate from 31.55% to 20.77%. The optimal K was 1 for both datasets, meaning that classifying by the single nearest neighbor gave the lowest error.
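The kind of experiment summarized here (dropping attributes and varying K, then comparing error rates) can be sketched as follows. This is a minimal illustration, not the document's actual procedure: the toy data, the function names `knn_predict` and `loo_error`, the use of Euclidean distance, and the choice of leave-one-out evaluation are all assumptions made for the example.

```python
import math
from collections import Counter


def knn_predict(train_X, train_y, query, k):
    """Return the majority label among the k training points nearest to query."""
    # Rank training points by Euclidean distance to the query.
    order = sorted(range(len(train_X)), key=lambda i: math.dist(train_X[i], query))
    votes = Counter(train_y[i] for i in order[:k])
    return votes.most_common(1)[0][0]


def loo_error(X, y, k, keep):
    """Leave-one-out error rate using only the attribute indices listed in keep."""
    # Project every row onto the retained attributes.
    Xs = [[row[j] for j in keep] for row in X]
    wrong = 0
    for i in range(len(Xs)):
        # Hold out point i and classify it from all remaining points.
        train_X = Xs[:i] + Xs[i + 1:]
        train_y = y[:i] + y[i + 1:]
        if knn_predict(train_X, train_y, Xs[i], k) != y[i]:
            wrong += 1
    return wrong / len(Xs)


# Hypothetical toy data (not from either dataset): attribute 0 separates the
# classes, attribute 1 is large-scale noise that misleads the distance measure.
X = [[1.0, 50.0], [1.2, -40.0], [0.9, 30.0], [1.1, -20.0],
     [3.0, 45.0], [3.2, -35.0], [2.9, 25.0], [3.1, -25.0]]
y = ["a", "a", "a", "a", "b", "b", "b", "b"]

# Compare error rates across attribute subsets and odd values of K.
for keep in ([0, 1], [0], [1]):
    for k in (1, 3, 5):
        print(f"attributes={keep} k={k} error={loo_error(X, y, k, keep):.2%}")
```

In this toy setup, removing the noisy attribute lowers the error because it no longer dominates the distance computation, which is one plausible reason attribute removal can help KNN on real data.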