SlideShare a Scribd company logo
COMP 776: Computer Vision
Basic Info
• Instructor:
Svetlana Lazebnik (lazebnik@cs.unc.edu)
• Office hours:
By appointment, FB 244
• Textbook (recommended):
Forsyth & Ponce, Computer Vision: A Modern Approach
• Class webpage:
http://www.cs.unc.edu/~lazebnik/spring09
Today
• Introduction to computer vision
• Course overview
• Course requirements
The goal of computer vision
• To perceive the story behind the picture
What we see What a computer sees
Source: S. Narasimhan
The goal of computer vision
• To perceive the story behind the picture
• What exactly does this mean?
– Vision as a source of metric 3D information
– Vision as a source of semantic information
Vision as measurement device
Real-time stereo Structure from motion
NASA Mars Rover
Pollefeys et al.
Multi-view stereo for
community photo collections
Goesele et al.
Vision as a source of semantic information
slide credit: Fei-Fei, Fergus & Torralba
Object categorization
sky
building
flag
wall
banner
bus
cars
bus
face
street lamp
slide credit: Fei-Fei, Fergus & Torralba
Scene and context categorization
• outdoor
• city
• traffic
• …
slide credit: Fei-Fei, Fergus & Torralba
Qualitative spatial information
slanted
rigid moving
object
horizontal
vertical
slide credit: Fei-Fei, Fergus & Torralba
rigid moving
object
non-rigid moving
object
Why study computer vision?
Personal photo albums
Surveillance and security
Movies, news, sports
Medical and scientific images
• Vision is useful: Images and video are everywhere!
Why study computer vision?
• Vision is useful
• Vision is interesting
• Vision is difficult
– Half of primate cerebral cortex is devoted to visual processing
– Achieving human-level visual perception is probably “AI-complete”
Why is computer vision difficult?
Challenges: viewpoint variation
Michelangelo 1475-1564 slide credit: Fei-Fei, Fergus & Torralba
Challenges: illumination
image credit: J. Koenderink
Challenges: scale
slide credit: Fei-Fei, Fergus & Torralba
Challenges: deformation
Xu, Beihong 1943
slide credit: Fei-Fei, Fergus & Torralba
Challenges: occlusion
Magritte, 1957 slide credit: Fei-Fei, Fergus & Torralba
Challenges: background clutter
Challenges: Motion
Challenges: object intra-class
variation
slide credit: Fei-Fei, Fergus & Torralba
Challenges: local ambiguity
slide credit: Fei-Fei, Fergus & Torralba
Challenges or opportunities?
• Images are confusing, but they also reveal the structure of
the world through numerous cues
• Our job is to interpret the cues!
Image source: J. Koenderink
Depth cues: Linear perspective
Depth cues: Aerial perspective
Depth ordering cues: Occlusion
Source: J. Koenderink
Shape cues: Texture gradient
Position and lighting cues: Cast shadows
Source: J. Koenderink
Grouping cues: Similarity (color, texture,
proximity)
Grouping cues: “Common fate”
Image credit: Arthus-Bertrand (via F. Durand)
Bottom line
• Perception is an inherently ambiguous problem
– Many different 3D scenes could have given rise to a particular 2D picture
Image source: F. Durand
Bottom line
• Perception is an inherently ambiguous problem
– Many different 3D scenes could have given rise to a particular 2D picture
• Possible solutions
– Bring in more constraints (more images)
– Use prior knowledge about the structure of the world
• Need a combination of different methods
Image source: F. Durand
Connections to other disciplines
Computer Vision
Image Processing
Machine Learning
Artificial Intelligence
Robotics
Psychology
Neuroscience
Computer Graphics
Origins of computer vision
L. G. Roberts, Machine Perception
of Three Dimensional Solids,
Ph.D. thesis, MIT Department of
Electrical Engineering, 1963.
Progress to date
The next slides show some examples of what
current vision systems can do
Earth viewers (3D modeling)
Image from Microsoft’s Virtual Earth
(see also: Google Earth)
Source: S. Seitz
Photosynth
http://labs.live.com/photosynth/
NOTE: Noah Snavely talk at GLUNCH tomorrow!
Source: S. Seitz
Optical character recognition (OCR)
Digit recognition, AT&T labs
http://www.research.att.com/~yann/
Technology to convert scanned docs to text
• If you have a scanner, it probably came with OCR software
License plate readers
http://en.wikipedia.org/wiki/Automatic_number_plate_recognition
Source: S. Seitz
Face detection
Many new digital cameras now detect faces
• Canon, Sony, Fuji, …
Source: S. Seitz
Smile detection?
Sony Cyber-shot® T70 Digital Still Camera Source: S. Seitz
Object recognition (in supermarkets)
LaneHawk by EvolutionRobotics
“A smart camera is flush-mounted in the checkout lane, continuously watching
for items. When an item is detected and recognized, the cashier verifies the
quantity of items that were found under the basket, and continues to close the
transaction. The item can remain under the basket, and with LaneHawk,you are
assured to get paid for it… “
Source: S. Seitz
Face recognition
Who is she? Source: S. Seitz
Vision-based biometrics
“How the Afghan Girl was Identified by Her Iris Patterns” Read the story
Source: S. Seitz
Login without a password…
Fingerprint scanners on
many new laptops,
other devices
Face recognition systems now
beginning to appear more widely
http://www.sensiblevision.com/
Source: S. Seitz
Object recognition (in mobile phones)
This is becoming real:
• Microsoft Research
• Point & Find
Source: S. Seitz
iPhone Apps: (www.kooaba.com)
iPhone Apps: (www.snaptell.com)
The Matrix movies, ESC Entertainment, XYZRGB, NRC
Special effects: shape capture
Source: S. Seitz
Pirates of the Carribean, Industrial Light and Magic
Special effects: motion capture
Source: S. Seitz
Sports
Sportvision first down line
Nice explanation on www.howstuffworks.com
Source: S. Seitz
Smart cars
Mobileye
• Vision systems currently in high-end BMW, GM, Volvo models
• By 2010: 70% of car manufacturers.
Slide content courtesy of Amnon Shashua
Source: S. Seitz
Vision-based interaction (and games)
Nintendo Wii has camera-based IR
tracking built in. See Lee’s work at
CMU on clever tricks on using it to
create a multi-touch display!
Source: S. Seitz
Assistive technologies
Sony EyeToy
Vision in space
Vision systems (JPL) used for several tasks
• Panorama stitching
• 3D terrain modeling
• Obstacle detection, position tracking
• For more, read “Computer Vision on Mars” by Matthies et al.
NASA'S Mars Exploration Rover Spirit captured this westward view from atop
a low plateau where Spirit spent the closing months of 2007.
Source: S. Seitz
Robotics
http://www.robocup.org/NASA’s Mars Spirit Rover
http://en.wikipedia.org/wiki/Spirit_rover
Source: S. Seitz
The computer vision industry
• A list of companies here:
http://www.cs.ubc.ca/spider/lowe/vision.html
Course overview
I. Early vision: Image formation and processing
II. Mid-level vision: Grouping and fitting
III. Multi-view geometry
IV. Recognition
V. Advanced topics
I. Early vision
Cameras and sensors
Light and color
Linear filtering
Edge detection
* =
Feature extraction: corner and blob detection
• Basic image formation and processing
II. “Mid-level vision”
Fitting: Least squares
Hough transform
RANSAC
Alignment
• Fitting and grouping
III. Multi-view geometry
Projective structure from motion
Stereo
Affine structure from motion
Tomasi & Kanade (1993)
Epipolar geometry
IV. Recognition
Patch description and matching Clustering and visual vocabularies
Bag-of-features models Classification
Sources: D. Lowe, L. Fei-Fei
V. Advanced Topics
• Time permitting…
Segmentation
Articulated models
Face detection
Motion and tracking
Course requirements
• Philosophy: computer vision is best experienced hands-on
• Programming assignments: 50%
– Three or four assignments
– Expect the first one in the next couple of classes
– Brush up on your MATLAB skills (see web page for tutorial)
• Final assignment: 30%
– Recognition competition
– Winner gets a prize!
• Participation: 20%
– Come to class regularly
– Ask questions
– Answer questions
Collaboration policy
• Feel free to discuss assignments with each other, but coding
must be done individually
• Feel free to incorporate code or tips you find on the Web,
provided this doesn’t make the assignment trivial and you
explicitly acknowledge your sources
• Remember: I can Google too!
For next time
• Self-study: MATLAB tutorial
• Reading: cameras and image formation (F&P chapter 1)

More Related Content

PPTX
Graphic Design Final 01 | Elements & Principles of Design
PPTX
Machine Learning
PPTX
01Introduction.pptx - C280, Computer Vision
PPTX
Computer vision introduction
PPT
Introduction
PPTX
Computer Vision Crash Course
PPT
vision.ppt
PPT
vision.ppt
Graphic Design Final 01 | Elements & Principles of Design
Machine Learning
01Introduction.pptx - C280, Computer Vision
Computer vision introduction
Introduction
Computer Vision Crash Course
vision.ppt
vision.ppt

Similar to Lec01 introduction (20)

PPT
vision_2.ppt
PPTX
I have not done hard tests for this, but you should gain about
PPTX
IntroComputerVision23.pptx
PPTX
1_Intro2ssssssssssssssssssssssssssssss2.pptx
PPTX
Computer Vision Crash Course
PPTX
Computer vision introduction - What is computer vision
PPTX
01 cie552 introduction
PDF
1.pdf
PDF
Computer vision basics
PPT
Application of image processing.ppt
PDF
Computer Vision – From traditional approaches to deep neural networks
PDF
Lecture 1 computer vision introduction
PPTX
Overview of Computer Vision For Footwear Industry
PPT
vision-1.ppt
PPT
cs498-1-1-CourseIntro.ppt
PDF
PPT s01-machine vision-s2
PPTX
Chapter 1: Computer Vision Introduction.pptx
PPTX
Chapter 1: Computer Vision Introduction.pptx
PPT
Lecture 1, 2 - An Introduction ot Computer Vision
vision_2.ppt
I have not done hard tests for this, but you should gain about
IntroComputerVision23.pptx
1_Intro2ssssssssssssssssssssssssssssss2.pptx
Computer Vision Crash Course
Computer vision introduction - What is computer vision
01 cie552 introduction
1.pdf
Computer vision basics
Application of image processing.ppt
Computer Vision – From traditional approaches to deep neural networks
Lecture 1 computer vision introduction
Overview of Computer Vision For Footwear Industry
vision-1.ppt
cs498-1-1-CourseIntro.ppt
PPT s01-machine vision-s2
Chapter 1: Computer Vision Introduction.pptx
Chapter 1: Computer Vision Introduction.pptx
Lecture 1, 2 - An Introduction ot Computer Vision
Ad

More from BaliThorat1 (20)

PDF
Lec15 sfm
PDF
Lec14 multiview stereo
PPTX
Lec13 stereo converted
PDF
Lec12 epipolar
PPTX
Lec11 single view-converted
PDF
Lec10 alignment
PDF
Lec09 hough
PDF
Lec08 fitting
PDF
8 operating system concept
PDF
7 processor
PDF
6 input output devices
PDF
2 windows operating system
PDF
5 computer memory
PDF
4 computer languages
PDF
1 fundamentals of computer
PDF
1 fundamentals of computer system
PDF
Computer generation and classification
PDF
Algorithm and flowchart
PDF
6 cpu scheduling
PDF
5 process synchronization
Lec15 sfm
Lec14 multiview stereo
Lec13 stereo converted
Lec12 epipolar
Lec11 single view-converted
Lec10 alignment
Lec09 hough
Lec08 fitting
8 operating system concept
7 processor
6 input output devices
2 windows operating system
5 computer memory
4 computer languages
1 fundamentals of computer
1 fundamentals of computer system
Computer generation and classification
Algorithm and flowchart
6 cpu scheduling
5 process synchronization
Ad

Recently uploaded (20)

PDF
Chapter 2 Heredity, Prenatal Development, and Birth.pdf
PDF
Microbial disease of the cardiovascular and lymphatic systems
PDF
01-Introduction-to-Information-Management.pdf
PPTX
Introduction to Child Health Nursing – Unit I | Child Health Nursing I | B.Sc...
PPTX
Pharma ospi slides which help in ospi learning
PPTX
Cell Structure & Organelles in detailed.
PPTX
Institutional Correction lecture only . . .
PPTX
Microbial diseases, their pathogenesis and prophylaxis
PDF
grade 11-chemistry_fetena_net_5883.pdf teacher guide for all student
PDF
VCE English Exam - Section C Student Revision Booklet
PPTX
Pharmacology of Heart Failure /Pharmacotherapy of CHF
PDF
ANTIBIOTICS.pptx.pdf………………… xxxxxxxxxxxxx
PDF
O7-L3 Supply Chain Operations - ICLT Program
PDF
FourierSeries-QuestionsWithAnswers(Part-A).pdf
PDF
BÀI TẬP BỔ TRỢ 4 KỸ NĂNG TIẾNG ANH 9 GLOBAL SUCCESS - CẢ NĂM - BÁM SÁT FORM Đ...
PPTX
PPT- ENG7_QUARTER1_LESSON1_WEEK1. IMAGERY -DESCRIPTIONS pptx.pptx
PDF
Supply Chain Operations Speaking Notes -ICLT Program
PDF
3rd Neelam Sanjeevareddy Memorial Lecture.pdf
PDF
Origin of periodic table-Mendeleev’s Periodic-Modern Periodic table
PPTX
BOWEL ELIMINATION FACTORS AFFECTING AND TYPES
Chapter 2 Heredity, Prenatal Development, and Birth.pdf
Microbial disease of the cardiovascular and lymphatic systems
01-Introduction-to-Information-Management.pdf
Introduction to Child Health Nursing – Unit I | Child Health Nursing I | B.Sc...
Pharma ospi slides which help in ospi learning
Cell Structure & Organelles in detailed.
Institutional Correction lecture only . . .
Microbial diseases, their pathogenesis and prophylaxis
grade 11-chemistry_fetena_net_5883.pdf teacher guide for all student
VCE English Exam - Section C Student Revision Booklet
Pharmacology of Heart Failure /Pharmacotherapy of CHF
ANTIBIOTICS.pptx.pdf………………… xxxxxxxxxxxxx
O7-L3 Supply Chain Operations - ICLT Program
FourierSeries-QuestionsWithAnswers(Part-A).pdf
BÀI TẬP BỔ TRỢ 4 KỸ NĂNG TIẾNG ANH 9 GLOBAL SUCCESS - CẢ NĂM - BÁM SÁT FORM Đ...
PPT- ENG7_QUARTER1_LESSON1_WEEK1. IMAGERY -DESCRIPTIONS pptx.pptx
Supply Chain Operations Speaking Notes -ICLT Program
3rd Neelam Sanjeevareddy Memorial Lecture.pdf
Origin of periodic table-Mendeleev’s Periodic-Modern Periodic table
BOWEL ELIMINATION FACTORS AFFECTING AND TYPES

Lec01 introduction