Bayesian learning uses prior knowledge and observed training data to determine the probability of hypotheses. Each training example can incrementally increase or decrease the estimated probability of a hypothesis. Prior knowledge is provided by assigning initial probabilities to hypotheses and probability distributions over possible observations for each hypothesis. New instances can be classified by combining the predictions of multiple hypotheses, weighted by their probabilities. Even when computationally intractable, Bayesian methods provide an optimal standard for decision making.