Computer vision aims to build machines that can see as humans do. The document introduces the field, describing how images captured by cameras are analyzed in software to understand a scene. Although deep learning is popular, understanding first principles remains important for three reasons: it is needed for tasks where data is limited, it helps explain why a system fails, and it satisfies the human curiosity about how things work. The document then outlines the modules to be covered: imaging, features, 3D reconstruction from single and multiple views, and perception.