Extracting, harnessing and
generating visual information.
COMPUTER VISION
● It’s ready for prime time
● It’s evolving faster than you think
● It’s a gateway capability
● It’s changing everything
● It creates a richer experience
● It’s easier to use than you think
COMPUTER VISION
OUR JOURNEY
IT’S READY FOR PRIME TIME
And what better way to demonstrate that it’s ready for
prime time than to show it live...
PS — this is an open source application written by IBM
THE DEMO
gigaom.com/2017/05/08/computers-are-opening-their-eyes-and-theyre-already-better-at-seeing-than-we-are
visual-recognition-demo.mybluemix.net
Photo: youtube.com/watch?v=tE2ihx0nJMI
Three human faces
identified
Attempt to guess ages
and gender of identified
faces
List of objects and colors in
the image and the scene
context
Hierarchical breakdown of
the scene context
CREATE A CUSTOM MODEL
visual-recognition-demo.mybluemix.net/train
HOW IT WORKS
[Diagram: Production Images, Cloud Visual API (face, object & scene recognition), Classifier Application]
PRICING
FEATURE | 1 - 1,000 UNITS/MO | 1,001 - 1,000,000 UNITS/MO | 1,000,001 - 5,000,000 UNITS/MO | 5,000,001 - 20,000,000 UNITS/MO
Label Detection | Free | $1.50 / 1,000 units | $1.50 / 1,000 units | $1.00 / 1,000 units
OCR | Free | $1.50 / 1,000 units | $1.50 / 1,000 units | $0.60 / 1,000 units
Explicit Content Detection | Free | Free with Label Detection* | Free with Label Detection* | Free with Label Detection*
Facial Detection | Free | $1.50 / 1,000 units | $1.50 / 1,000 units | $0.60 / 1,000 units
Landmark Detection | Free | $1.50 / 1,000 units | $1.50 / 1,000 units | $0.60 / 1,000 units
Logo Detection | Free | $1.50 / 1,000 units | $1.50 / 1,000 units | $0.60 / 1,000 units
Image Properties | Free | $1.50 / 1,000 units | $1.50 / 1,000 units | $0.60 / 1,000 units
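To make the tiers concrete, here is a rough Python sketch of a monthly cost estimate for one feature. It assumes the tiers are graduated (each price applies only to the units that fall inside that tier), which the slide does not state. The names `TIERS_DEFAULT`, `TIERS_LABEL`, and `monthly_cost` are illustrative, and the prices are the ones listed in the pricing table.

```python
# Upper bound of each tier (units per month) and price per 1,000 units, from the slide.
TIERS_DEFAULT = [(1_000, 0.00), (1_000_000, 1.50), (5_000_000, 1.50), (20_000_000, 0.60)]
# Label Detection differs only in the last tier.
TIERS_LABEL = [(1_000, 0.00), (1_000_000, 1.50), (5_000_000, 1.50), (20_000_000, 1.00)]


def monthly_cost(units, tiers=TIERS_DEFAULT):
    """Estimate the monthly bill in USD for one feature.

    Assumption: pricing is graduated, so each tier's price applies only to
    the units that fall inside that tier.
    """
    if units < 0:
        raise ValueError("units must be non-negative")
    if units > tiers[-1][0]:
        raise ValueError("volume is beyond the tiers listed on the slide")
    cost = 0.0
    lower = 0
    for upper, price_per_1000 in tiers:
        # Units that land between the previous tier's bound and this one.
        units_in_tier = max(0, min(units, upper) - lower)
        cost += units_in_tier / 1000 * price_per_1000
        lower = upper
    return cost


if __name__ == "__main__":
    for volume in (500, 50_000, 6_000_000):
        print(volume, round(monthly_cost(volume), 2))
```

Passing `TIERS_LABEL` as the second argument gives the Label Detection estimate. Treat the result as a planning number only and check the provider's current price list before budgeting.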
BONUS DEMO
Vision-explorer.reactive.ai
gigaom.com/2017/02/06/harnessing-visual-data-using-google-cloud
API RESPONSE
{
  "responses": [
    {
      "labelAnnotations": [
        {
          "mid": "/m/01yrx",
          "description": "cat",
          "score": 0.92562944
        },
        {
          "mid": "/m/04rky",
          "description": "mammal",
          "score": 0.90815818
        },
        {
          "mid": "/m/01l7qd",
          "description": "whiskers",
          "score": 0.79939437
        },
        {
          "mid": "/m/07k6w8",
          "description": "small to medium sized cats",
          "score": 0.66373962
        },
        {
          "mid": "/m/0307l",
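A response like this is ordinary JSON, so an application can read it with any JSON parser. The sketch below is one possible way to keep only the labels the service is fairly sure about. It uses the four complete entries from the slide (the fifth is cut off there and is left out); the function name `confident_labels` and the 0.8 threshold are illustrative assumptions, not part of the API.

```python
import json

# The four complete entries from the slide; the truncated fifth entry is omitted.
SAMPLE = """
{"responses": [{"labelAnnotations": [
  {"mid": "/m/01yrx", "description": "cat", "score": 0.92562944},
  {"mid": "/m/04rky", "description": "mammal", "score": 0.90815818},
  {"mid": "/m/01l7qd", "description": "whiskers", "score": 0.79939437},
  {"mid": "/m/07k6w8", "description": "small to medium sized cats", "score": 0.66373962}
]}]}
"""


def confident_labels(response_text, min_score=0.8):
    """Return (description, score) pairs at or above min_score, best first."""
    payload = json.loads(response_text)
    labels = []
    for response in payload.get("responses", []):
        for annotation in response.get("labelAnnotations", []):
            if annotation["score"] >= min_score:
                labels.append((annotation["description"], annotation["score"]))
    # Highest score first, so the caller can take the top entry directly.
    return sorted(labels, key=lambda pair: pair[1], reverse=True)


if __name__ == "__main__":
    print(confident_labels(SAMPLE))
```

The right threshold depends on the use case: a product search can tolerate weaker guesses than a compliance check can.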
IMAGE SIZING
VISION API FEATURE | RECOMMENDED SIZE | NOTES
FACE_DETECTION | 1600 x 1200 | Distance between eyes is most important
LANDMARK_DETECTION | 640 x 480 |
LOGO_DETECTION | 640 x 480 |
LABEL_DETECTION | 640 x 480 |
TEXT_DETECTION | 1024 x 768 | OCR requires more resolution to detect characters
SAFE_SEARCH_DETECTION | 640 x 480 |
cloud.google.com/vision/docs/supported-files
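A quick pre-flight check can flag images that fall below these recommendations before they are sent. This is a minimal sketch: it treats each recommended size as a minimum for both width and height in landscape orientation, which is a simplifying assumption, and `RECOMMENDED_SIZE` and `meets_recommendation` are illustrative names.

```python
# Recommended sizes (width, height) in pixels, copied from the slide.
RECOMMENDED_SIZE = {
    "FACE_DETECTION": (1600, 1200),
    "LANDMARK_DETECTION": (640, 480),
    "LOGO_DETECTION": (640, 480),
    "LABEL_DETECTION": (640, 480),
    "TEXT_DETECTION": (1024, 768),
    "SAFE_SEARCH_DETECTION": (640, 480),
}


def meets_recommendation(feature, width, height):
    """Return True if the image is at least the recommended size for the feature.

    Assumption: both dimensions must reach the recommended values.
    """
    if feature not in RECOMMENDED_SIZE:
        raise KeyError(f"no recommendation listed for {feature}")
    recommended_width, recommended_height = RECOMMENDED_SIZE[feature]
    return width >= recommended_width and height >= recommended_height


if __name__ == "__main__":
    print(meets_recommendation("LABEL_DETECTION", 800, 600))
    print(meets_recommendation("FACE_DETECTION", 1920, 1080))
```

The sizes are recommendations, not hard limits, so a failed check is a reason to expect weaker results rather than an error.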
WHERE IS THE A.I.?
In the system’s ability to determine objects and
context within each image.
Compared to having people view and judge images
manually.
Remember: A.I. is subjective
WHAT IS COMPUTER VISION?
And just to clarify…
Visual recognition — exploring face detection,
emotion recognition, text extraction, damage
identification, context awareness, and more.
EXAMPLES IN USE TODAY
Manufacturing
● Ensure products are positioned correctly on an
assembly line
Visual auditing
● Monitor for compliance or deterioration in a fleet of
trucks, planes, or windmills
● Train classifiers to understand what defects look
like
Insurance
● Quickly process claims by classifying images of
claims
Social listening
● Track buzz about your company on social media
Security
● Monitor for activity, instantly classify objects as
threat or not
Social commerce
● Use an image of a food dish to find out which
restaurant serves it and find reviews
● Use a travel photo to find vacation suggestions
based on similar experiences
● Use a house image to find similar homes that are
for sale
Retail
● Take a photo of a favorite outfit to find stores with
those clothes in stock or on sale
● Use a travel image to find retail suggestions in that
area
Education
● Create image-based applications to educate about
taxonomies
● Use pictures to find educational material on similar
subjects
RESOURCES
Computers have already matched — and exceeded —
human capabilities when it comes to understanding
visual information.
Now it’s just a question of finding uses and applying
this new capability in productive ways.
READY FOR PRIME TIME
RECAP
IT’S EVOLVING FASTER
THAN YOU THINK
ALREADY BETTER THAN HUMANS
February, 2015 — “...researchers say their system
achieved a 4.94% error rate... In previous experiments,
humans have achieved an estimated 5.1% error rate.”
microsoft.com/en-us/research/blog/microsoft-researchers-algorithm-sets-imagenet-challenge-milestone
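The error rates quoted here are, to the best of my knowledge, top-5 error on the ImageNet classification task: a prediction counts as correct when the true label appears among the system's five highest-ranked guesses. The sketch below shows how that metric is computed on made-up data; `top_k_error` and the toy labels are illustrative only and are not taken from the benchmark.

```python
def top_k_error(ranked_predictions, true_labels, k=5):
    """Fraction of examples whose true label is not among the first k guesses.

    ranked_predictions: one list of labels per example, best guess first.
    true_labels: the correct label for each example, in the same order.
    """
    if len(ranked_predictions) != len(true_labels):
        raise ValueError("predictions and labels must have the same length")
    if not true_labels:
        raise ValueError("at least one example is required")
    misses = sum(
        1
        for guesses, truth in zip(ranked_predictions, true_labels)
        if truth not in guesses[:k]
    )
    return misses / len(true_labels)


if __name__ == "__main__":
    # Toy data: four images, five ranked guesses each.
    predictions = [
        ["cat", "lynx", "dog", "fox", "tiger"],
        ["truck", "bus", "van", "car", "tram"],
        ["apple", "pear", "peach", "plum", "fig"],
        ["oak", "elm", "pine", "fir", "ash"],
    ]
    truths = ["cat", "car", "banana", "ash"]
    print(top_k_error(predictions, truths, k=5))
```

With k=1 the same function gives the stricter top-1 error, where only the single best guess counts.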
AND COMING SOON...CREATE
Write a text description of an
existing image.
Given a text description,
generate an image from scratch.
petapixel.com/2016/09/23/googles-image-captioning-ai-can-describe-photos-94-accuracy
youtu.be/rAbhypxs1qQ?t=5s
[ 2016 ]
PLUS...
[ 2017 ]
Sketch a wireframe
...and A.I. finishes
it for you.
Fotogenerator.npocloud.nl
Computer vision is already better than humans at
identifying objects within images.
And computers have only just started learning how to create images
from scratch...and they’re already pretty good at it.
FASTER THAN YOU THINK
RECAP
IT’S A GATEWAY CAPABILITY
If you’re looking for a place to begin your A.I. journey,
this is a good starting point...
THE PROBLEM
We live in a visual world, yet capturing useful
information from images has historically required
human vision — which can be slow and costly.
THE GOAL
But if we could extract that useful information
through computer vision, it could provide
invaluable insight for business.
THE SOLUTION
An intelligent visual recognition service that
automatically analyzes and identifies objects and
scenes in image files (video, etc.).
A HIGH PROFILE EXAMPLE
Facial recognition
systems are coming on
strong and being used
in a wide variety of
applications.
youtu.be/K4u4Dpl6NKk?t=1m9s
Visual recognition is 1 of 2 critical capabilities that will
allow artificial intelligence to integrate into and
empower our world in ways we’ve only dreamed of.
It allows computers to interact with humans on our
own terms — to integrate into our daily lives.
GATEWAY CAPABILITY
RECAP
Speech is the other one.
IT’S CHANGING EVERYTHING
We’re unlocking the entire visual world for computers.
What can they do with it?
VEHICLE NAVIGATION
nvidia.com/object/drive-px.html
ROBOTS
engadget.com/2014/09/08/google-details-object-recognition-tech
FARMING
Automatically sorting
crops.
cloud.google.com/blog/big-data/2016/08/how-a-japanese-cucumber-farmer-is-using-deep-learning-and-tensorflow
VISUAL PRODUCT SEARCH
slyce.it/product
INFRINGEMENT MONITORING
Keep an eye out for
intellectual property
infringement.
Photo credit: flickr.com/photos/inthe-arena/12939586573
MONITORING BRAND EXPOSURE
Track how
often your
advertising is
shown.
Photo:sportvision.com/baseball/virtual-advertisements
IT’S EVEN CHANGING ‘PEOPLE
WATCHING’
medium.com/homeland-security/no-longer-just-another-face-in-the-crowd-15e1c74fe24
Giving computers access to the visual world will
empower our work and lives.
Amplifying human productivity in endless ways.
IT’S CHANGING EVERYTHING
RECAP
IT CREATES A RICHER
EXPERIENCE
“Customer
experience is the
new battlefield.”
~ Gartner, 2015
accenture.com/us-en/insight-artificial-intelligence-ui
EXPERIENCE ABOVE ALL
VIP TREATMENT
A top customer walks into your store.
The system instantly recognizes them, issues a
personalized greeting, and alerts an attendant.
And/or…
Allows you to track store visits just as we track web
visits in Google Analytics (faces vs. IP addresses).
VIP TREATMENT
consumerreports.org/privacy/facial-recognition-who-is-tracking-you-in-public1
EMOTION RECOGNITION
popularmechanics.com/technology/a18636/facewatch-facial-recognition-identify-criminals
Analyze how
people react to
your product.
Or dynamically
adjust a system
based on
people’s
emotions.
A RICHER EXPERIENCE
Computers can now
see, read — and act
upon — the full
spectrum of our
communication.
Text Only (7%): language sentiment analysis
Text + Speech (adding 38%): voice tone, voice inflection
Text + Speech + Vision (adding 55%): facial expressions, body language
en.wikipedia.org/wiki/Albert_Mehrabian
RECAP
It’s often said that the verbal and audible elements of
communication only make up 45% of what is being
said.
A RICHER EXPERIENCE
IT’S EASIER THAN YOU THINK
PICK YOUR STARTING POINT
A.I. Maturity
No training data required!
Purpose-Built Platform, Their Training Data
Commercial Platform, Their Training Data
Commercial Platform, Your Training Data
In-House Platform, Your Training Data
NOTHING NEW HERE
Most of the major cloud providers
have purpose-built A.I.-powered APIs.
Many have a free tier.
And they work exactly like
every other API you’re already using.
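As an illustration of how ordinary these APIs are, the sketch below builds the JSON body for an image-annotation request using only the Python standard library. The body shape follows the Google Cloud Vision `images:annotate` REST request as I understand it (a base64-encoded image plus a list of feature types); treat the field names as something to confirm against the provider's documentation. The function name `build_annotate_request` is hypothetical, and the code does not send anything over the network.

```python
import base64
import json


def build_annotate_request(image_bytes, feature_types, max_results=5):
    """Build a request body: one base64-encoded image and the features to run on it."""
    encoded = base64.b64encode(image_bytes).decode("ascii")
    features = [
        {"type": feature_type, "maxResults": max_results}
        for feature_type in feature_types
    ]
    return {"requests": [{"image": {"content": encoded}, "features": features}]}


if __name__ == "__main__":
    # Placeholder bytes stand in for the contents of a real image file.
    body = build_annotate_request(
        b"not a real image", ["LABEL_DETECTION", "FACE_DETECTION"]
    )
    print(json.dumps(body, indent=2))
```

In practice the body would be sent as an HTTPS POST with whatever credentials the provider requires, exactly as with any other JSON API.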
RESOURCES
LOTS OF COMMERCIAL OPTIONS
Jumping back to our demo above, here are some
alternative commercial APIs…
● IBM Watson Visual Recognition
● Microsoft Computer Vision API
● Clarifai
PLUS OPEN SOURCE DATASETS
● archive.ics.uci.edu/ml
● deeplearning.net/datasets
● mldata.org
● grouplens.org/datasets
● cs.toronto.edu/~kriz/cifar.html
● cs.cornell.edu/people/pabo/movie-review-data
● yann.lecun.com/exdb/mnist (handwriting)
● kdnuggets.com/datasets/index.html (long list)
● image-net.org (competition)
AND PRE-BUILT M.L. MODELS
● gallery.cortanaintelligence.com
● bigml.com/gallery/models
● predictionio.incubator.apache.org/gallery/template-gallery
● github.com/tensorflow/models/tree/master/inception
● algorithmia.com/algorithms *
Start with a pre-trained cloud API (no training data
required).
Most cloud providers offer a free tier. So start thinking
about the different ways to use computer vision.
Then just start testing and see what you can do.
EASIER THAN YOU THINK
RECAP
Don't get me wrong, there's an insane amount of complexity behind the scenes.
But fortunately, you don't need to know that stuff to take advantage of A.I.
NEXT STEPS
HOW-TO GUIDES
● Building Voice-Enabled Products With Amazon Alexa
● Cognitive Customer Engagement Using IBM Watson
● Harnessing Visual Data Using Google Cloud
● Building a Recommendation Engine Using Microsoft Azure
● Predicting Marketing Campaign Response Using Amazon Machine Learning
● Unleashing A.I.-Powered Conversation With IBM Watson
● Get into the Mind of Your Customer Using Google’s Sentiment Analysis Tools
● Discover Your Customers’ Deepest Feelings Using Microsoft Facial Recognition
● Give Your Products the Power of Speech Using Amazon Polly
● Computers Are Opening Their Eyes — and They’re Already Better at Seeing Than We Are
● How to Predict When You’re Going to Lose a Subscriber
● The Future of Business is a Digital Spokesperson — Let’s Build a Preview Using Microsoft’s Bot Framework
● Predicting Personality Traits from Content Using IBM Watson
RESOURCES
How to build the demo
app in this session
● Computer vision is ready for prime time
● It’s coming faster than you think
● It’s a gateway capability
● It’s changing everything
● It creates a richer experience
● It’s easier to use than you think
JOURNEY’S END
RECAP
COMING UP...
Laying the foundation
● Cutting Through the Hype
2 A.I. Technologies that will have the greatest impact
● Computer Speech
● Computer Vision
2 A.I. Applications with the quickest R.O.I.
● Predictive Engagement
● Predictive Personalization
STAY TUNED
QUESTIONS OR COMMENTS?
Gigaom A.I. Team: ai@gigaom.com
Workshop Facilitator: chris.mohritz@10xeffect.com
CONTACT
THANK YOU


A.I. in the Enterprise: Computer Vision
