This document introduces automatic speech recognition (ASR). It defines ASR and outlines key components such as acoustic models, pronunciation models, and language models. It explains that ASR performance depends on many factors, including the microphone, the speaker, the language, and the type of output. It also summarizes common techniques used in ASR systems, such as Gaussian mixture models, hidden Markov models, and decoding. Finally, it discusses the current capabilities and limitations of ASR and potential areas for future work.