This document discusses data matching techniques. It begins with an introduction to data matching and its goals. The rest of the document is outlined as follows: problem definition, rule-based matching, learning-based matching, matching by clustering, probabilistic approaches, collective matching, and scaling data matching. Bayesian networks are presented as a probabilistic model for data matching. They provide a compact representation of probability distributions and allow reasoning under uncertainty. Learning Bayesian networks from training data is also discussed.