This document provides an introduction to automatic speech recognition (ASR). It defines ASR and outlines key components like feature selection, early template-based models, modern statistical approaches using hidden Markov models, and the current state and challenges of ASR. The document describes how speech is produced, common speech features, and explains techniques like dynamic time warping, Gaussian mixture models, and n-gram language models that are used in modern statistical ASR systems.