LinkedIn and 3rd parties use essential and non-essential cookies to provide, secure, analyze and improve our Services, and to show you relevant ads (including professional and job ads) on and off LinkedIn. Learn more in our Cookie Policy.

Select Accept to consent or Reject to decline non-essential cookies for this use. You can update your choices at any time in your settings.

Start free trial Sign in

From the course: Deep Learning with Python: Convolutional Neural Networks

What is computer vision? - Python Tutorial

From the course: Deep Learning with Python: Convolutional Neural Networks

Start my 1-month free trial Buy for my team

What is computer vision?

“

- [Instructor] Computer vision is a fascinating field at the intersection of computer science and artificial intelligence. At its core, computer vision focuses on enabling machines to interpret and understand visual data, such as images and videos, in a way that mimics human perception. This includes recognizing objects, interpreting scenes, and making decisions based on what they observe. Today, computer vision is widely used across many industries. It powers facial recognition systems on smartphones, helps autonomous vehicles detect pedestrians and traffic signs, assists doctors in analyzing medical scans, and enables robots to navigate and interact with their surroundings. Virtually any field that involves visual input can benefit from computer vision technologies. Computer vision tasks can be categorized into six main areas. One of the most basic is image classification, which involves determining what object or category appears in an image. Object detection builds on this by not only identifying objects, but also locating them within the image using bounding boxes. Segmentation goes even further by dividing an image into distinct regions and labeling each pixel. Other key tasks include recognition and identification, such as matching a face to a known identity or interpreting handwritten digits. Motion analysis focuses on understanding how objects move across frames in a video. Finally, 3D scene reconstruction attempts to infer depth and spatial relationships from two-dimensional images. Computer vision is transforming the way machines interact with the world by giving them the ability to see, interpret, and respond to visual information. As the technology continues to advance, its impact will only grow, enabling smarter systems across countless domains.

Contents

- (Locked)
  
  Continuing with deep learning
  
  1m 49s