Computer Vision: Extracting Data from the Visual World
A Brief Example...
Steven Mitchell, Ph.D.
Componica, LLC
About us.
Componica, LLC (http://www.componica.com/)

Strong Background in Computer Vision
Copyright 2011 - Componica, LLC (http://www.componica.com/)
About us.
Componica combines machine learning, computer vision, and mobile
development, applying the latest vision technology to mobile media.

Words for Spanish / Russian / French

Why is computer vision relevant?
How do these things work?

Should I be concerned?
In this slideshow:
Facial Detection - Find me a face.

Facial Recognition - Whose face is it?

Image Registration - Aligning pictures together.

...which leads to augmented reality.

QR Codes - They’re everywhere.

Optical Character Recognition - Reading Stuff.
Face Detection
This is NOT facial recognition.

Developed by Viola and Jones, published in 2001. A major breakthrough in
image recognition: real-time face detection was not practical before this.

How much does a cow weigh?

An army of simple face detectors.

"Robust Real-time Object Detection"
Paul Viola and Michael Jones
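
To make the "army of simple face detectors" idea concrete, here is a minimal sketch of two building blocks that Viola-Jones style detectors rely on: an integral image, and a two-rectangle feature evaluated with a handful of lookups. The function names, the toy 4x4 image, and the threshold are illustrative assumptions, not code from the paper; a real detector combines many thousands of such features with boosting and a cascade.

```python
def integral_image(img):
    """Summed-area table with an extra zero row and column on the top/left."""
    h, w = len(img), len(img[0])
    ii = [[0] * (w + 1) for _ in range(h + 1)]
    for y in range(h):
        row_sum = 0
        for x in range(w):
            row_sum += img[y][x]
            ii[y + 1][x + 1] = ii[y][x + 1] + row_sum
    return ii


def rect_sum(ii, x, y, w, h):
    """Sum of the pixels in the w-by-h rectangle whose top-left is (x, y)."""
    return ii[y + h][x + w] - ii[y][x + w] - ii[y + h][x] + ii[y][x]


def two_rect_feature(ii, x, y, w, h):
    """Left half minus right half: a large value suggests a vertical edge."""
    half = w // 2
    return rect_sum(ii, x, y, half, h) - rect_sum(ii, x + half, y, half, h)


def weak_detector(ii, x, y, w, h, threshold):
    """One 'simple detector': a thresholded feature (threshold is hypothetical)."""
    return two_rect_feature(ii, x, y, w, h) > threshold


# Toy image: bright on the left, dark on the right.
image = [
    [9, 9, 1, 1],
    [9, 9, 1, 1],
    [9, 9, 1, 1],
    [9, 9, 1, 1],
]
ii = integral_image(image)
total = rect_sum(ii, 0, 0, 4, 4)
edge = two_rect_feature(ii, 0, 0, 4, 4)
fired = weak_detector(ii, 0, 0, 4, 4, threshold=10)
```

Each weak detector on its own is barely better than a guess; the strength comes from combining many of them, much like averaging many individual guesses of a cow's weight.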
BTW, It’s how the Kinect sees people.
"Real-Time Human Pose Recognition in Parts from Single Depth Images"
Shotton, Fitzgibbon, Cook, Sharp, Finocchio, Moore, Kipman, Blake
Microsoft Research Cambridge & Xbox Incubation
Facial Recognition
Remove effects caused by lighting and
perspective.

After you find a face, reduce it to numbers.
"Statistical Models of Appearance for Computer Vision"
T.F. Cootes and C.J. Taylor
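
As a rough illustration of "remove effects caused by lighting", one common normalization shifts a face's pixel values to zero mean and scales them to unit variance, so that a uniformly brighter or higher-contrast copy of the same face reduces to the same numbers. This is a minimal sketch under that assumption; the function name and pixel values are hypothetical, and it does not address perspective.

```python
import math


def normalize_lighting(pixels):
    """Shift a flat list of pixel values to zero mean and unit variance."""
    n = len(pixels)
    mean = sum(pixels) / n
    variance = sum((p - mean) ** 2 for p in pixels) / n
    std = math.sqrt(variance)
    if std == 0:
        # A perfectly flat patch carries no information after normalization.
        return [0.0] * n
    return [(p - mean) / std for p in pixels]


face = [10, 20, 30, 40]                 # made-up pixel values
brighter = [2 * p + 50 for p in face]   # same face, more gain and offset

a = normalize_lighting(face)
b = normalize_lighting(brighter)
same = all(abs(x - y) < 1e-9 for x, y in zip(a, b))
```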
Facial Recognition
Let’s mix some paint...

Comparing numbers in hyperspace
k-Nearest Neighbor, Wikipedia
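
"Comparing numbers in hyperspace" can be sketched as a k-nearest-neighbor vote: each known face is a point (a list of numbers) with a name attached, and a new face takes the name held by most of its k closest points. The three-number vectors below are made up for illustration (think of them as paint mixes of three colors); real face descriptors have many more dimensions.

```python
import math
from collections import Counter


def knn_classify(query, gallery, k=3):
    """Return the majority label among the k gallery vectors nearest to query.

    gallery is a list of (vector, label) pairs; distance is Euclidean.
    """
    ranked = sorted(gallery, key=lambda item: math.dist(query, item[0]))
    votes = Counter(label for _, label in ranked[:k])
    return votes.most_common(1)[0][0]


# Hypothetical gallery of already-identified faces.
gallery = [
    ([0.9, 0.1, 0.2], "alice"),
    ([1.0, 0.0, 0.3], "alice"),
    ([0.1, 0.8, 0.9], "bob"),
    ([0.0, 0.9, 1.0], "bob"),
    ([0.2, 0.7, 0.8], "bob"),
]

who = knn_classify([0.95, 0.05, 0.25], gallery, k=3)
```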
Image Registration - Interesting Points
The most common way to register images: find the most
interesting points in the two images.

Compare all the interesting points from one image with those of the other,
forming matching pairs of points between the images.
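
The matching step can be sketched as follows, assuming each interesting point has already been summarized as a short list of numbers (a descriptor). A pair is kept only when each point is the other's nearest neighbor. The descriptors and the mutual-nearest-neighbor rule are illustrative assumptions; practical systems use richer descriptors and further outlier rejection.

```python
import math


def match_points(desc_a, desc_b):
    """Return (i, j) index pairs that are mutual nearest neighbors."""

    def nearest(d, pool):
        return min(range(len(pool)), key=lambda j: math.dist(d, pool[j]))

    a_to_b = [nearest(d, desc_b) for d in desc_a]
    b_to_a = [nearest(d, desc_a) for d in desc_b]
    # Keep a pair only if the match agrees in both directions.
    return [(i, j) for i, j in enumerate(a_to_b) if b_to_a[j] == i]


# Made-up two-number descriptors for three points in each image.
desc_a = [[0.0, 0.0], [5.0, 5.0], [9.0, 1.0]]
desc_b = [[5.1, 4.9], [0.2, -0.1], [20.0, 20.0]]

pairs = match_points(desc_a, desc_b)
```

The third point in each image has no partner in the other, so it is dropped rather than forced into a bad match.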
Augmented Reality
Table 1. Timings for the stages of our approach on a dataset with
images taken from within the range of trained viewpoints.

Stage                            Time
FAST interest point detection    0.55 ms
Building query bit masks         0.12 ms
Matching into database           0.35 ms
Robust pose estimation           0.1 ms
Total frame time                 1.12 ms

Figure 5. Increasing the range of viewpoint bins in the training set
allows more viewpoint invariance to be added in a straightforward
manner.

[...] suggests that the bit count dissimilarity score provides a reasonable
way of scoring matches. To confirm this we computed the average number of
inlier and outlier matches over all of the frames in the two sequences, and
plotted these against the dissimilarity score obtained for the match in
Figure 4. For the sequence on the left where the viewpoints are included in
the training set many good matches are found in [...]

Once you have correspondence, you
can compute 3D geometry.
http://mi.eng.cam.ac.uk/~er258/work/fast.html
http://nghiaho.com
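
As a simplified stand-in for "once you have correspondence, you can compute geometry", the sketch below recovers a 2D scale, rotation, and translation from matched point pairs by least squares. This is a deliberate simplification: full 3D pose estimation for augmented reality also needs a camera model and robust outlier handling. The function name and sample points are assumptions for illustration.

```python
import cmath


def fit_similarity(src, dst):
    """Least-squares scale, rotation, and translation mapping src onto dst.

    Points are (x, y) tuples treated as complex numbers, so the model is
    dst = a * src + t with complex a (scale and rotation) and t (shift).
    Needs at least two distinct source points.
    """
    p = [complex(x, y) for x, y in src]
    q = [complex(x, y) for x, y in dst]
    p_mean = sum(p) / len(p)
    q_mean = sum(q) / len(q)
    num = sum((pi - p_mean).conjugate() * (qi - q_mean) for pi, qi in zip(p, q))
    den = sum(abs(pi - p_mean) ** 2 for pi in p)
    a = num / den
    t = q_mean - a * p_mean
    return abs(a), cmath.phase(a), (t.real, t.imag)


# Unit square, and the same square rotated 90 degrees counter-clockwise,
# scaled by 2, and shifted by (3, 4).
src = [(0, 0), (1, 0), (0, 1), (1, 1)]
dst = [(3, 4), (3, 6), (1, 4), (1, 6)]

scale, angle, shift = fit_similarity(src, dst)
```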
QR Codes
http://en.wikipedia.org/wiki/QR_Code

"Quick Response code" invented by Toyota subsidiary Denso Wave in 1994.

Open License

Up to about 2.9 KB of binary data (2,953 bytes at the largest size)

Error Correction

Easy to read and generate:

ZXing library
Optical Character Recognition
iPhone 4th Gen
iPod Touch 4th Gen
Commentary
Ubiquitous Surveillance...extreme dislike.

Birthday Paradox: the probability that, in a set of
n randomly chosen people, some pair of them will
have the same birthday is surprisingly high (over 50% at n = 23).
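
The birthday paradox is easy to check numerically. The sketch below computes the chance that at least two of n people share a birthday, assuming 365 equally likely days. The `days` parameter is an illustrative generalization: the same arithmetic presumably applies when "birthdays" are replaced by the number of distinguishable face signatures in a surveillance database, which is why accidental matches become likely as the crowd grows.

```python
def shared_birthday_probability(n, days=365):
    """Probability that at least two of n people share a birthday."""
    if n > days:
        return 1.0  # pigeonhole: a collision is guaranteed
    p_all_unique = 1.0
    for i in range(n):
        p_all_unique *= (days - i) / days
    return 1.0 - p_all_unique


p23 = shared_birthday_probability(23)   # just over one half
p50 = shared_birthday_probability(50)   # about 0.97
```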
Commentary
Video cameras may meet the criteria for legal blindness.
Computer vision technology and society:
opportunities and possibilities

Smartphones that ID diseases, plants, insects.

Robotic lawnmowers that don’t run over the
neighbor’s cat.

Computers that judge emotions by reading your
face.

Keyless entry based on face, iris.

Automated inspection of manufactured parts.

Conclusion
