The document provides an introduction to statistical learning theory. It describes the supervised learning setting, where the goal is to select a predictor from a set of candidates that minimizes the expected loss on new data, given training data, candidates, and a loss function. It discusses how empirical risk minimization (ERM), such as by minimizing error on the training set, can approximate this goal. One sufficient condition for ERM consistency is uniform convergence of the empirical risk to the true risk as more data is observed.