This slide deck is from a deep learning workshop that gives an overview of deep learning for multimedia tasks. It covers applications including image classification with convolutional neural networks, visual sentiment analysis with adjective-noun pairs, neural machine translation, image captioning with attention mechanisms, visual question answering, lipreading from video, generating image descriptions from text, and learning joint audio-visual representations. The slides give examples and references for many state-of-the-art deep learning papers in multimedia and computer vision.