This document discusses various techniques for data processing including data cleaning, integration, transformation, reduction, and similarity/dissimilarity measures. It describes common types of dirty or incomplete data like missing values, noisy data, and inconsistent data. It also discusses techniques for handling different types of dirty data, such as filling in missing values, smoothing noisy data, and resolving inconsistencies. Major tasks in data processing include data cleaning, integration, transformation, and reduction. Specific techniques discussed include binning, clustering, regression, normalization, aggregation, and dimensionality reduction. The document also provides details on various similarity and dissimilarity measures that can be used to calculate the proximity between data objects.