1) The document proposes an unsupervised, online approach called UDD for detecting duplicate records across query results from multiple web databases.
2) Two classifiers, a weighted component similarity summing (WCSS) classifier and a support vector machine (SVM) classifier, cooperate in an iterative process to identify duplicate record pairs without requiring labeled training data.
3) The approach assigns a weight to each record field based on that field's similarity values in duplicate and non-duplicate record pairs, and uses the weighted sum of the field (component) similarities to decide whether two records are duplicates.
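
The weighted-sum decision in point 3 can be illustrated with a minimal Python sketch. It is not the paper's implementation: the string similarity measure (`difflib.SequenceMatcher`), the weighting formula in `estimate_weights`, the mixing parameter `a`, and the `threshold` value are all assumptions chosen for illustration. The only idea taken from the summary is that fields which are similar in duplicate pairs and dissimilar in non-duplicate pairs should receive more weight.

```python
from difflib import SequenceMatcher


def field_similarity(a: str, b: str) -> float:
    """Similarity in [0, 1] between two field values (stand-in measure, an assumption)."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()


def similarity_vector(r1, r2):
    """Per-field (component) similarities for two records with aligned fields."""
    return [field_similarity(x, y) for x, y in zip(r1, r2)]


def _normalize(values):
    """Scale values to sum to 1; fall back to uniform weights if the sum is 0."""
    total = sum(values)
    if total == 0:
        return [1.0 / len(values)] * len(values)
    return [v / total for v in values]


def estimate_weights(dup_vectors, nondup_vectors, a=0.5):
    """Hypothetical weighting scheme (an assumption, not the paper's formula).

    A field earns weight for being similar in duplicate pairs and
    for being dissimilar in non-duplicate pairs.
    """
    n = len(dup_vectors[0])
    from_dups = _normalize([sum(v[i] for v in dup_vectors) for i in range(n)])
    from_nondups = _normalize([sum(1 - v[i] for v in nondup_vectors) for i in range(n)])
    return [a * d + (1 - a) * nd for d, nd in zip(from_dups, from_nondups)]


def weighted_similarity(vector, weights):
    """Weighted sum of component similarities."""
    return sum(w * s for w, s in zip(weights, vector))


def is_duplicate(r1, r2, weights, threshold=0.85):
    """Declare a duplicate when the weighted sum reaches the (assumed) threshold."""
    return weighted_similarity(similarity_vector(r1, r2), weights) >= threshold
```

In an iterative setup like the one in point 2, the pairs accepted by this rule could serve as positive examples for the SVM, whose output would in turn be used to re-estimate the weights. That loop is omitted here to keep the sketch short.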