This document discusses mining software engineering data to uncover hidden patterns and trends. It describes various types of software repositories that can be mined, including source control systems, bug tracking databases, mailing lists, and execution traces. The document outlines how data from these repositories has been mined to predict bug-introducing changes, understand how teams structure code, and detect code clones, among other applications. It emphasizes the need to evaluate results statistically and consider limitations from the data studied.