This document provides an overview of generative adversarial networks (GANs) and their applications to speech processing and natural language processing. It begins with a basic explanation of GANs and then discusses several types of conditional GANs including text-to-image, sound-to-image, image-to-image, image-to-label, video generation, and unsupervised domain translation using techniques like CycleGAN. It provides examples of how GANs have been applied to tasks such as speech synthesis, language modeling, and multi-label image classification. The document concludes by discussing challenges with CycleGAN consistency and shared latent space approaches for unsupervised domain translation.