The document is a textbook on text-to-speech synthesis. It provides an overview of the key topics and challenges in developing text-to-speech systems. These include analyzing the differences between written and spoken language, understanding human communication processes, organizing the input text into linguistic units like words and sentences, decoding and interpreting the text, predicting prosody from the text, converting text to speech sounds, and techniques for synthesizing speech. The textbook contains 18 chapters that delve deeper into each of these areas from both theoretical and engineering perspectives.