This document discusses data preprocessing techniques for data mining. It covers why preprocessing is important for obtaining quality mining results from quality data. The major tasks of data preprocessing are described, including data cleaning, integration, transformation, reduction, and discretization. Specific techniques for handling missing data, noisy data, and data integration are also outlined. The goals of data reduction strategies like dimensionality and numerosity reduction are explained.