This document provides an overview of deep learning and computer vision. It discusses how vision is an ambiguous problem affected by small variations, noise, and other factors. It then shows how teaching a computer what a cat is can be challenging. The document outlines areas of machine learning like computer vision, sound analytics, and language analytics. It also discusses concepts like vectors, matrices, tensors, perceptrons, and generative adversarial networks. Finally, it discusses how Microsoft is investing in technologies like FPGAs, CPUs, and GPUs to enable AI training at large scale in the cloud.
Related topics: