This document discusses the challenges of multimodal information retrieval and how natural language processing (NLP) can improve accessibility through translation across languages and modalities. It highlights the encoder-decoder modeling framework for tasks such as text simplification and illustrates this with case studies on various methodologies including reinforcement learning. The take-home message emphasizes the effectiveness of sequence-to-sequence models in achieving task-specific objectives while acknowledging the limitations of training data.