SlideShare a Scribd company logo
AI Capabilities in
Image Recognition
Technology and Ideas we can use in our app
Agenda
Overview
Introduction to Google Vision
Capabilities of Google Vision
How it can benefit Fastra app
A look at Amazon Rekognition
Ideas?
Overview
What is AI?
What is the role of AI in Image processing?
Google Vision
“Now, users can naturally find GIFs based on popular movie lines, music lyrics and catchphrases.
This has resulted in click-through rates increasing up to 32%.”
- Nick Hasty, Director, GIPHY
What it can do
01 Understand the content of an image. It's a boat! It's a tiger! It's a bird!
02 Can detect monuments, objects, faces.
03 Can detect company logos, texts, offensive content.
What it can do
04 Using capabilities of Google, can search the image content from web and detect
the source and links.
05 Can detect copyright material, celebrities, news events etc.
06 Can detect facial expression and other things related to face recognition.
Detecting Faces
It can find human faces in photos, videos or live streams.
Can also detect facial landmarks like nose, eyes and
mouth.
Detecting objects
It can easily detect broad sets of objects in your images,
from flowers, animals, or transportation to thousands of
other object categories commonly found within images.
AI in image recognition
AI in image recognition
AI in image recognition
Detecting inappropriate content
Powered by Google SafeSearch, easily moderate content
from your crowd sourced images. Vision API enables you
to detect different types of inappropriate content from
adult to violent content.
AI in image recognition
AI in image recognition
Power of Google Search
Vision API uses the power of Google Image Search to find
topical entities like celebrities, logos, or news events.
Combine this with Visually Similar Search to find similar
images on the web.
AI in image recognition
AI in image recognition
AI in image recognition
Reads Text
Optical Character Recognition (OCR) enables you to
detect text within your images, along with automatic
language identification. Vision API supports a broad set
of languages.
AI in image recognition
AI in image recognition
Emotion Detection
Detects every kind of emotion on faces.
AI in image recognition
AI in image recognition
Logos and Landmark Detection
Detects various objects, landmarks, monuments or logos.
AI in image recognition
AI in image recognition
AI in image recognition
AI in image recognition
AI in image recognition
AI in image recognition
AI in image recognition
AI in image recognition
Google Vision Pricing
Google Vision SDKs
● Mobile SDKs - iOS and Android
● Backend SDKs
What we can do in Fastra
● Automatic tag creation
● Suggestion to user on the basis of emotion
detected in Selfies
● Different Slefie/Fun Categories on the basis of
background landmark
● Filters
● Celebrity recognition
● Linking to current affairs/trends
● Text Detection in Fun
● Filter out objectionable images
Amazon Rekognition
Amazon Rekognition Video also allows you to easily and
quickly review hours of video footage to search for
persons of interest, track their movement, and detect
their activities.
Amazon Rekognition - Capabilities
● Image moderation
● Facial analysis
● Celebrity Recognition
● Face comparison
● Text In Image
● Video analysis
AI in image recognition
Ideas?

More Related Content

PDF
AI SEO, GEO, AEO, AI's Impact on the Future of SEO – Prepare Your Website for...
PPTX
What Is Google Lens?
PDF
Find the Perfect Image Image Search Made Easy in 2023
PDF
How can you get started with machine learning
PDF
A.I. in the Enterprise: Computer Vision
PDF
Top AI-Driven SEO Strategies to Rank High in 2025
PPTX
8 facial recognition apps that will rule 2020!
KEY
The Rise of the Personal Intelligent Search Agent
AI SEO, GEO, AEO, AI's Impact on the Future of SEO – Prepare Your Website for...
What Is Google Lens?
Find the Perfect Image Image Search Made Easy in 2023
How can you get started with machine learning
A.I. in the Enterprise: Computer Vision
Top AI-Driven SEO Strategies to Rank High in 2025
8 facial recognition apps that will rule 2020!
The Rise of the Personal Intelligent Search Agent

Similar to AI in image recognition (20)

PPTX
Video search tool
PPTX
Recreating t-800's vision
PPTX
Artificial intelligence.pptx
PPTX
Imagine Cup Junior 2020
PPTX
PDF
googlelens-180321163044.pdf
PPTX
With just a few clicks, you can generate wonderful slideshows that suit your ...
DOCX
Computer Vision and Amazon Rekognition
PDF
How Deep Learning Changes the Design Process #NEXT17
PPTX
How To Make Google Love Your AI Content.pptx
PDF
Visual Search : Top Digital marketing trend of 2021
PDF
How to Use AI for a More Effective Social Media Strategy.pdf
PDF
Automatic multi-modal metadata annotation based on trained cognitive solution...
PPT
digital marketing agency | ecommerce SEO packages | ecommerce SEO | digital a...
PPTX
How to Integrate Google Video Intelligence API
PDF
Artificial intelligence in android development
PPTX
The Creative Ai storm
PPTX
Mobile-First Indexing: Re-thinking Position Zero
PPTX
SHIVANGI_26_10A.pptx
PPT
Trace Criminal using IBM Watson
Video search tool
Recreating t-800's vision
Artificial intelligence.pptx
Imagine Cup Junior 2020
googlelens-180321163044.pdf
With just a few clicks, you can generate wonderful slideshows that suit your ...
Computer Vision and Amazon Rekognition
How Deep Learning Changes the Design Process #NEXT17
How To Make Google Love Your AI Content.pptx
Visual Search : Top Digital marketing trend of 2021
How to Use AI for a More Effective Social Media Strategy.pdf
Automatic multi-modal metadata annotation based on trained cognitive solution...
digital marketing agency | ecommerce SEO packages | ecommerce SEO | digital a...
How to Integrate Google Video Intelligence API
Artificial intelligence in android development
The Creative Ai storm
Mobile-First Indexing: Re-thinking Position Zero
SHIVANGI_26_10A.pptx
Trace Criminal using IBM Watson
Ad

More from Paramvir Singh (13)

PDF
Ai and using ml in mobile apps
PDF
Android gps, location services, camera and sensors - Paramvir Singh
PDF
Dependency injection and dagger2 in android paramvir singh
PDF
Android: Network optimization by Paramvir Singh
PPTX
Android Session 6 - UI Part 1
PPTX
Android ui part 2
PPTX
Android Connecting to internet Part 2
PPTX
Android Connecting to Internet
PPTX
Android Starting App Development
PPTX
Android one, why it is important for Android developers in India
PPTX
Clean code, Better coding practices
PPTX
Android enterprise application development
PPTX
Near field communication
Ai and using ml in mobile apps
Android gps, location services, camera and sensors - Paramvir Singh
Dependency injection and dagger2 in android paramvir singh
Android: Network optimization by Paramvir Singh
Android Session 6 - UI Part 1
Android ui part 2
Android Connecting to internet Part 2
Android Connecting to Internet
Android Starting App Development
Android one, why it is important for Android developers in India
Clean code, Better coding practices
Android enterprise application development
Near field communication
Ad

Recently uploaded (20)

PDF
Unlocking AI with Model Context Protocol (MCP)
PDF
Encapsulation theory and applications.pdf
PDF
Building Integrated photovoltaic BIPV_UPV.pdf
PDF
Mobile App Security Testing_ A Comprehensive Guide.pdf
PPTX
Cloud computing and distributed systems.
PDF
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
PDF
NewMind AI Weekly Chronicles - August'25 Week I
PDF
Encapsulation_ Review paper, used for researhc scholars
PPTX
ACSFv1EN-58255 AWS Academy Cloud Security Foundations.pptx
PDF
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
PDF
Machine learning based COVID-19 study performance prediction
PDF
Advanced methodologies resolving dimensionality complications for autism neur...
PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
PDF
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
PDF
KodekX | Application Modernization Development
PDF
Review of recent advances in non-invasive hemoglobin estimation
PPTX
Big Data Technologies - Introduction.pptx
PPTX
sap open course for s4hana steps from ECC to s4
PPTX
Digital-Transformation-Roadmap-for-Companies.pptx
PDF
Empathic Computing: Creating Shared Understanding
Unlocking AI with Model Context Protocol (MCP)
Encapsulation theory and applications.pdf
Building Integrated photovoltaic BIPV_UPV.pdf
Mobile App Security Testing_ A Comprehensive Guide.pdf
Cloud computing and distributed systems.
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
NewMind AI Weekly Chronicles - August'25 Week I
Encapsulation_ Review paper, used for researhc scholars
ACSFv1EN-58255 AWS Academy Cloud Security Foundations.pptx
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
Machine learning based COVID-19 study performance prediction
Advanced methodologies resolving dimensionality complications for autism neur...
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
KodekX | Application Modernization Development
Review of recent advances in non-invasive hemoglobin estimation
Big Data Technologies - Introduction.pptx
sap open course for s4hana steps from ECC to s4
Digital-Transformation-Roadmap-for-Companies.pptx
Empathic Computing: Creating Shared Understanding

AI in image recognition

  • 1. AI Capabilities in Image Recognition Technology and Ideas we can use in our app
  • 2. Agenda Overview Introduction to Google Vision Capabilities of Google Vision How it can benefit Fastra app A look at Amazon Rekognition Ideas?
  • 3. Overview What is AI? What is the role of AI in Image processing?
  • 4. Google Vision “Now, users can naturally find GIFs based on popular movie lines, music lyrics and catchphrases. This has resulted in click-through rates increasing up to 32%.” - Nick Hasty, Director, GIPHY
  • 5. What it can do 01 Understand the content of an image. It's a boat! It's a tiger! It's a bird! 02 Can detect monuments, objects, faces. 03 Can detect company logos, texts, offensive content.
  • 6. What it can do 04 Using capabilities of Google, can search the image content from web and detect the source and links. 05 Can detect copyright material, celebrities, news events etc. 06 Can detect facial expression and other things related to face recognition.
  • 7. Detecting Faces It can find human faces in photos, videos or live streams. Can also detect facial landmarks like nose, eyes and mouth.
  • 8. Detecting objects It can easily detect broad sets of objects in your images, from flowers, animals, or transportation to thousands of other object categories commonly found within images.
  • 12. Detecting inappropriate content Powered by Google SafeSearch, easily moderate content from your crowd sourced images. Vision API enables you to detect different types of inappropriate content from adult to violent content.
  • 15. Power of Google Search Vision API uses the power of Google Image Search to find topical entities like celebrities, logos, or news events. Combine this with Visually Similar Search to find similar images on the web.
  • 19. Reads Text Optical Character Recognition (OCR) enables you to detect text within your images, along with automatic language identification. Vision API supports a broad set of languages.
  • 22. Emotion Detection Detects every kind of emotion on faces.
  • 25. Logos and Landmark Detection Detects various objects, landmarks, monuments or logos.
  • 35. Google Vision SDKs ● Mobile SDKs - iOS and Android ● Backend SDKs
  • 36. What we can do in Fastra ● Automatic tag creation ● Suggestion to user on the basis of emotion detected in Selfies ● Different Slefie/Fun Categories on the basis of background landmark ● Filters ● Celebrity recognition ● Linking to current affairs/trends ● Text Detection in Fun ● Filter out objectionable images
  • 37. Amazon Rekognition Amazon Rekognition Video also allows you to easily and quickly review hours of video footage to search for persons of interest, track their movement, and detect their activities.
  • 38. Amazon Rekognition - Capabilities ● Image moderation ● Facial analysis ● Celebrity Recognition ● Face comparison ● Text In Image ● Video analysis