SlideShare a Scribd company logo
Gaining ML insight with
Google Vision API and
MongoDB on Google Cloud
Used across flagship products:
Google is an AI company
Uniqueprojectdirectories
Time
Bringing state-of-the-art AI to the enterprise
Retail
Financial
Services
Manufacturing Healthcare &
Life Sciences
Government
Media &
Entertainment
Technology
Energy
GamingMarketing
AI building blocks
Sight Language
AutoML Video Intelligence AutoML Translation
Conversation Structured Data
Vision API including
Vision Product Search
Natural Language API
Dialogflow
Enterprise Edition
AutoML Tables
AutoML Vision
for cloud + edge models
AutoML Natural Language Cloud Text-to-Speech Recommendation AI
Video Intelligence API Translation API Cloud Speech-to-Text Cloud Inference API
Two types of AI building blocks
API
Pre-trained ML models
AutoML
Custom ML models
Leverage Google’s predefined dataset to automatically
detect a vast number of objects, landmarks, logos, etc.
No model training required
Train your own custom model with labels you
define with an easy-to-use graphical interface
No coding required
Vision API
Detect popular places
and landmarks
Classify content with
predefined labels
OCR support for
50+ languages
Detect brands and
product logos
Identify products from
your catalog
Identify image
properties (colors, etc.)
Get hints for best
image cropping
Detect faces and
emotions
Moderate explicit
content
Find similar images on
the web
Detect objects and
retrieve coordinates
Extract printed and
handwritten text
Gives journalists a new way to search, access,
and analyze millions of historic photos
NYT digitized more than a century of perishable photographs
and other materials. With the Vision API, Times reporters can
now easily search millions of high-res scans to enhance their
reporting with even more visual storytelling.
Bringing historic content to life
Preserves a priceless chronicle of more than
100 years of events that have shaped our world
MEDIA & ENTERTAINMENT
A small team empowers 38 media brands
MEDIA & ENTERTAINMENT
Reduced cost and the need to rely on
outside vendors
Enabling marketing, ad, sales teams and
more to take full advantage of all content
With just 7 people, CBS Interactive is using Video,
Natural Language, and Vision APIs to serve 38 digital
media brands with content discovery and
recommendation solutions.
Box is using Vision API to help their customers
manage and gain insights from their image files, and
speed up image-centric processes and workflows.
TECHNOLOGY
Bringing image recognition and
OCR to cloud content management
Improved extensive content management
for customers in every industry
Intelligent structure for 30 billion files
managed with powerful capabilities
Image source: https://www.box.com/skills
Use-case
● TBD
MongoDB World 2019: Gaining ML Insight with Google Vision API and MongoDB
● end to end security defaults that cannot be
disabled:
○ always-on authentication
○ network isolation for dedicated clusters
○ TLS / SSL
○ encryption for data at rest
○ granular role-based access controls
● supports multi-region clusters
● sharding option for high-throughput
● managed backups
● Free Tier available on Google Cloud Platform
Demo
We will demonstrate how easy it is to use the Google Vision API to gain additional insights
from a batch of photos that have no prior metadata attached. By using this workflow, we
will be able to quickly build a descriptive metadata database that can be leveraged for a
variety of business use-cases.
● TBD
Demo
Let’s review...
● TBD
Can I try this out myself?...
Find all code and steps for this demo
here:
Or visit cloud.google.com/community and search for “MongoDB”
https://cloud.google.com/community/tutorials/mongodb-atlas-appengineflex-nodejs-app
Next steps...
01
Visit us at our booth
Chat with our team, learn
more about GCP + Atlas
02
Sign up for a new GCP
account and get
credits
Get a 12-month, $300 credit
free trial when you sign up
for GCP with a new account.
03
Create a new
MongoDB Atlas
cluster on GCP
Create a free MongoDB Atlas
database on GCP
18
Thank you
19
[End of Slides]
04
Vision AI
Solutions
Document Understanding AI
You have a goldmine of documents that is
hard, expensive or impossible to tap into.
Document Understanding AI lets you easily
and cost-effectively extract valuable
insights from your documents.
Process documents & extract insights automatically
See what’s there
01
Make it useful
03
Understand it
02
Detecting objects and
understanding language
within documents
Document Understanding AI:
How it works
Image Search:
How it works
Ensures
privacy &
compliance
Engage customers in new and exciting ways
Enable shoppers to find products simply by sharing a photo
Help reduce friction with product search and purchase
Empower retail sales associates with information
Detect multiple products in one image
Additional use cases
Enhance recommendations for similar and complementary products
Analyze style trends and competitive pricing
Vision Product Search
Vision Product Search:
How it works
Customer uploads image

More Related Content

PDF
Microsoft & Machine Learning / Artificial Intelligence
PDF
Turkish Airlines Hackathon & Microsoft
PDF
MongoDB .local London 2019: Gaining ML insight on Google Cloud with Google Vi...
PDF
MongoDB .local London 2019: Gaining ML insight on Google Cloud with Google Vi...
PDF
Easy path to machine learning (2023-2024)
PDF
Google Cloud: Data Analysis and Machine Learningn Technologies
PPTX
AI services in google
PPTX
Google Cloud Vision API
Microsoft & Machine Learning / Artificial Intelligence
Turkish Airlines Hackathon & Microsoft
MongoDB .local London 2019: Gaining ML insight on Google Cloud with Google Vi...
MongoDB .local London 2019: Gaining ML insight on Google Cloud with Google Vi...
Easy path to machine learning (2023-2024)
Google Cloud: Data Analysis and Machine Learningn Technologies
AI services in google
Google Cloud Vision API

Similar to MongoDB World 2019: Gaining ML Insight with Google Vision API and MongoDB (20)

PDF
Design Day Workshop
PDF
Easy path to machine learning (2022)
PPTX
Google Cloud Platform: Prototype ->Production-> Planet scale
PDF
Google Cloud for Data Crunchers - Strata Conf 2011
PDF
Cloud-Native Roadshow Google Cloud Platform - Los Angeles
PDF
Easy path to machine learning (Spring 2021)
PDF
Want to integrate your business phone system or contact center with your CRM?
PDF
Introduction to Cloud Computing and Google Cloud Platform.
PPTX
Getting started with cloud
PPTX
A Kickstart to Google Cloud
PPTX
Hands-On with Google’s Machine Learning APIs, 12/3/2017
PDF
Introduction to Google Cloud Platform and APIs
PPTX
CloudMile Product & Service (EN)
PDF
Gears: Hipster as a Service
PDF
Machine Learning for Any Size of Data, Any Type of Data
PDF
Automatic multi-modal metadata annotation based on trained cognitive solution...
PPTX
UNIT III_Cloud APIs for CV_unit III power point
PDF
Cloud computing for image processing and bio informatics
KEY
CloudOps evening presentation from Google
PDF
Google Analytics Konferenz 2018_Machine Learning / AI mit Google_Lukman Ramse...
Design Day Workshop
Easy path to machine learning (2022)
Google Cloud Platform: Prototype ->Production-> Planet scale
Google Cloud for Data Crunchers - Strata Conf 2011
Cloud-Native Roadshow Google Cloud Platform - Los Angeles
Easy path to machine learning (Spring 2021)
Want to integrate your business phone system or contact center with your CRM?
Introduction to Cloud Computing and Google Cloud Platform.
Getting started with cloud
A Kickstart to Google Cloud
Hands-On with Google’s Machine Learning APIs, 12/3/2017
Introduction to Google Cloud Platform and APIs
CloudMile Product & Service (EN)
Gears: Hipster as a Service
Machine Learning for Any Size of Data, Any Type of Data
Automatic multi-modal metadata annotation based on trained cognitive solution...
UNIT III_Cloud APIs for CV_unit III power point
Cloud computing for image processing and bio informatics
CloudOps evening presentation from Google
Google Analytics Konferenz 2018_Machine Learning / AI mit Google_Lukman Ramse...
Ad

More from MongoDB (20)

PDF
MongoDB SoCal 2020: Migrate Anything* to MongoDB Atlas
PDF
MongoDB SoCal 2020: Go on a Data Safari with MongoDB Charts!
PDF
MongoDB SoCal 2020: Using MongoDB Services in Kubernetes: Any Platform, Devel...
PDF
MongoDB SoCal 2020: A Complete Methodology of Data Modeling for MongoDB
PDF
MongoDB SoCal 2020: From Pharmacist to Analyst: Leveraging MongoDB for Real-T...
PDF
MongoDB SoCal 2020: Best Practices for Working with IoT and Time-series Data
PDF
MongoDB SoCal 2020: MongoDB Atlas Jump Start
PDF
MongoDB .local San Francisco 2020: Powering the new age data demands [Infosys]
PDF
MongoDB .local San Francisco 2020: Using Client Side Encryption in MongoDB 4.2
PDF
MongoDB .local San Francisco 2020: Using MongoDB Services in Kubernetes: any ...
PDF
MongoDB .local San Francisco 2020: Go on a Data Safari with MongoDB Charts!
PDF
MongoDB .local San Francisco 2020: From SQL to NoSQL -- Changing Your Mindset
PDF
MongoDB .local San Francisco 2020: MongoDB Atlas Jumpstart
PDF
MongoDB .local San Francisco 2020: Tips and Tricks++ for Querying and Indexin...
PDF
MongoDB .local San Francisco 2020: Aggregation Pipeline Power++
PDF
MongoDB .local San Francisco 2020: A Complete Methodology of Data Modeling fo...
PDF
MongoDB .local San Francisco 2020: MongoDB Atlas Data Lake Technical Deep Dive
PDF
MongoDB .local San Francisco 2020: Developing Alexa Skills with MongoDB & Golang
PDF
MongoDB .local Paris 2020: Realm : l'ingrédient secret pour de meilleures app...
PDF
MongoDB .local Paris 2020: Upply @MongoDB : Upply : Quand le Machine Learning...
MongoDB SoCal 2020: Migrate Anything* to MongoDB Atlas
MongoDB SoCal 2020: Go on a Data Safari with MongoDB Charts!
MongoDB SoCal 2020: Using MongoDB Services in Kubernetes: Any Platform, Devel...
MongoDB SoCal 2020: A Complete Methodology of Data Modeling for MongoDB
MongoDB SoCal 2020: From Pharmacist to Analyst: Leveraging MongoDB for Real-T...
MongoDB SoCal 2020: Best Practices for Working with IoT and Time-series Data
MongoDB SoCal 2020: MongoDB Atlas Jump Start
MongoDB .local San Francisco 2020: Powering the new age data demands [Infosys]
MongoDB .local San Francisco 2020: Using Client Side Encryption in MongoDB 4.2
MongoDB .local San Francisco 2020: Using MongoDB Services in Kubernetes: any ...
MongoDB .local San Francisco 2020: Go on a Data Safari with MongoDB Charts!
MongoDB .local San Francisco 2020: From SQL to NoSQL -- Changing Your Mindset
MongoDB .local San Francisco 2020: MongoDB Atlas Jumpstart
MongoDB .local San Francisco 2020: Tips and Tricks++ for Querying and Indexin...
MongoDB .local San Francisco 2020: Aggregation Pipeline Power++
MongoDB .local San Francisco 2020: A Complete Methodology of Data Modeling fo...
MongoDB .local San Francisco 2020: MongoDB Atlas Data Lake Technical Deep Dive
MongoDB .local San Francisco 2020: Developing Alexa Skills with MongoDB & Golang
MongoDB .local Paris 2020: Realm : l'ingrédient secret pour de meilleures app...
MongoDB .local Paris 2020: Upply @MongoDB : Upply : Quand le Machine Learning...
Ad

Recently uploaded (20)

PDF
Approach and Philosophy of On baking technology
PDF
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
PDF
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
PDF
Chapter 3 Spatial Domain Image Processing.pdf
PDF
Per capita expenditure prediction using model stacking based on satellite ima...
PPTX
MYSQL Presentation for SQL database connectivity
PPTX
Spectroscopy.pptx food analysis technology
PDF
Building Integrated photovoltaic BIPV_UPV.pdf
PPTX
Programs and apps: productivity, graphics, security and other tools
PDF
Unlocking AI with Model Context Protocol (MCP)
PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
PPTX
A Presentation on Artificial Intelligence
PPT
“AI and Expert System Decision Support & Business Intelligence Systems”
PDF
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
DOCX
The AUB Centre for AI in Media Proposal.docx
PDF
Spectral efficient network and resource selection model in 5G networks
PDF
Reach Out and Touch Someone: Haptics and Empathic Computing
PDF
The Rise and Fall of 3GPP – Time for a Sabbatical?
PDF
Encapsulation theory and applications.pdf
PDF
Encapsulation_ Review paper, used for researhc scholars
Approach and Philosophy of On baking technology
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
Chapter 3 Spatial Domain Image Processing.pdf
Per capita expenditure prediction using model stacking based on satellite ima...
MYSQL Presentation for SQL database connectivity
Spectroscopy.pptx food analysis technology
Building Integrated photovoltaic BIPV_UPV.pdf
Programs and apps: productivity, graphics, security and other tools
Unlocking AI with Model Context Protocol (MCP)
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
A Presentation on Artificial Intelligence
“AI and Expert System Decision Support & Business Intelligence Systems”
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
The AUB Centre for AI in Media Proposal.docx
Spectral efficient network and resource selection model in 5G networks
Reach Out and Touch Someone: Haptics and Empathic Computing
The Rise and Fall of 3GPP – Time for a Sabbatical?
Encapsulation theory and applications.pdf
Encapsulation_ Review paper, used for researhc scholars

MongoDB World 2019: Gaining ML Insight with Google Vision API and MongoDB

  • 1. Gaining ML insight with Google Vision API and MongoDB on Google Cloud
  • 2. Used across flagship products: Google is an AI company Uniqueprojectdirectories Time
  • 3. Bringing state-of-the-art AI to the enterprise Retail Financial Services Manufacturing Healthcare & Life Sciences Government Media & Entertainment Technology Energy GamingMarketing
  • 4. AI building blocks Sight Language AutoML Video Intelligence AutoML Translation Conversation Structured Data Vision API including Vision Product Search Natural Language API Dialogflow Enterprise Edition AutoML Tables AutoML Vision for cloud + edge models AutoML Natural Language Cloud Text-to-Speech Recommendation AI Video Intelligence API Translation API Cloud Speech-to-Text Cloud Inference API
  • 5. Two types of AI building blocks API Pre-trained ML models AutoML Custom ML models Leverage Google’s predefined dataset to automatically detect a vast number of objects, landmarks, logos, etc. No model training required Train your own custom model with labels you define with an easy-to-use graphical interface No coding required
  • 6. Vision API Detect popular places and landmarks Classify content with predefined labels OCR support for 50+ languages Detect brands and product logos Identify products from your catalog Identify image properties (colors, etc.) Get hints for best image cropping Detect faces and emotions Moderate explicit content Find similar images on the web Detect objects and retrieve coordinates Extract printed and handwritten text
  • 7. Gives journalists a new way to search, access, and analyze millions of historic photos NYT digitized more than a century of perishable photographs and other materials. With the Vision API, Times reporters can now easily search millions of high-res scans to enhance their reporting with even more visual storytelling. Bringing historic content to life Preserves a priceless chronicle of more than 100 years of events that have shaped our world MEDIA & ENTERTAINMENT
  • 8. A small team empowers 38 media brands MEDIA & ENTERTAINMENT Reduced cost and the need to rely on outside vendors Enabling marketing, ad, sales teams and more to take full advantage of all content With just 7 people, CBS Interactive is using Video, Natural Language, and Vision APIs to serve 38 digital media brands with content discovery and recommendation solutions.
  • 9. Box is using Vision API to help their customers manage and gain insights from their image files, and speed up image-centric processes and workflows. TECHNOLOGY Bringing image recognition and OCR to cloud content management Improved extensive content management for customers in every industry Intelligent structure for 30 billion files managed with powerful capabilities Image source: https://www.box.com/skills
  • 12. ● end to end security defaults that cannot be disabled: ○ always-on authentication ○ network isolation for dedicated clusters ○ TLS / SSL ○ encryption for data at rest ○ granular role-based access controls ● supports multi-region clusters ● sharding option for high-throughput ● managed backups ● Free Tier available on Google Cloud Platform
  • 13. Demo We will demonstrate how easy it is to use the Google Vision API to gain additional insights from a batch of photos that have no prior metadata attached. By using this workflow, we will be able to quickly build a descriptive metadata database that can be leveraged for a variety of business use-cases. ● TBD
  • 14. Demo
  • 16. Can I try this out myself?... Find all code and steps for this demo here: Or visit cloud.google.com/community and search for “MongoDB” https://cloud.google.com/community/tutorials/mongodb-atlas-appengineflex-nodejs-app
  • 17. Next steps... 01 Visit us at our booth Chat with our team, learn more about GCP + Atlas 02 Sign up for a new GCP account and get credits Get a 12-month, $300 credit free trial when you sign up for GCP with a new account. 03 Create a new MongoDB Atlas cluster on GCP Create a free MongoDB Atlas database on GCP
  • 21. Document Understanding AI You have a goldmine of documents that is hard, expensive or impossible to tap into. Document Understanding AI lets you easily and cost-effectively extract valuable insights from your documents.
  • 22. Process documents & extract insights automatically See what’s there 01 Make it useful 03 Understand it 02
  • 23. Detecting objects and understanding language within documents
  • 26. Ensures privacy & compliance Engage customers in new and exciting ways Enable shoppers to find products simply by sharing a photo Help reduce friction with product search and purchase Empower retail sales associates with information Detect multiple products in one image Additional use cases Enhance recommendations for similar and complementary products Analyze style trends and competitive pricing Vision Product Search
  • 27. Vision Product Search: How it works Customer uploads image