SlideShare a Scribd company logo
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Adrian Hornsby, Technical Evangelist @ AWS
Building Serverless AI-powered Applications
on AWS
@adhorn
• Technical Evangelist, Developer Advocate,
… Software Engineer
• Own bed in Finland
• Previously:
• Solutions Architect @AWS
• Lead Cloud Architect @Dreambroker
• Director of Engineering, Software Engineer, DevOps, Manager, ... @Hdm
• Researcher @Nokia Research Center
• and a bunch of other stuff.
• Climber, like Ginger shots.
What to Expect from the Session
1. A little bit history & theory never kills
2. AI in AWS
3. Building AI-powered apps x3
Building AI-powered Serverless Applications on AWS
No servers to provision
or manage
Scales with usage
Never pay for idle Availability and fault
tolerance built in
Serverless means…
EVENT DRIVEN CONTINUOUS SCALING PAY BY USAGE
Serverless means…
Exposing functionality rather than the whole
server(s).
Serverless means…
The rise of AI
Artificial Intelligence
At Amazon
Artificial Intelligence
At Amazon
Data
GPUs
& Acceleration
Cloud
Computing
Algorithms
AWS
The Advent Of Deep
Learning
Building AI-powered Serverless Applications on AWS
Machine Learning In The Hands Of Every Developer
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Machine Learning On AWS Today
Machine Learning In The Hands Of Every Developer
Text In, Life-like Speech Out
Amazon Polly
“Today in Seattle, WA
it’s 11°F”
“Today in Seattle Washington
it’s 11 degrees Fahrenheit”
47 lifelike voices spread across 24 languages
“Today in Seattle, WA, it’s 11°F”
‘"We live for the music" live from the Madison Square Garden.’
1. Automatic, Accurate Text Processing
A Focus On Voice Quality & Pronunciation
2. Intelligible and Easy to Understand
1. Automatic, Accurate Text Processing
A Focus On Voice Quality & Pronunciation
2. Intelligible and Easy to Understand
3. Add Semantic Meaning to Text
“Richard’s number is 2122341237“
“Richard’s number is 2122341237“
Telephone Number
A Focus On Voice Quality & Pronunciation
1. Automatic, Accurate Text Processing
2. Intelligible and Easy to Understand
3. Add Semantic Meaning to Text
4. Customized Pronunciation
“My daughter’s name is Kaja.”
“My daughter’s name is Kaja.”
A Focus On Voice Quality & Pronunciation
1. Automatic, Accurate Text Processing
A Focus On Voice Quality & Pronunciation
https://www.w3.org/TR/speech-synthesis/<speak>
The spelling of my last name is
<prosody rate='x-slow'>
<say-as interpret-as="characters">Adrian</say-as>
</prosody></speak>
Duolingo voices its language learning service Using Polly
Duolingo is a free language learning service where
users help translate the web and rate translations.
With Amazon Polly our users
benefit from the most lifelike
Text-to-Speech voices
available on the market.
Severin Hacker
CTO, Duolingo
”
“
• Spoken language crucial for
language learning
• Accurate pronunciation matters
• Faster iteration thanks to TTS
• As good as natural human speech
Building AI-powered Serverless Applications on AWS
<API>
Amazon Polly
</API>
aws polly synthesize-speech
--text "It was nice to live such a wonderful live show"
--output-format mp3
--voice-id Joanna
--text-type text johanna.mp3
Pollycast
Building AI-powered Serverless Applications on AWS
<demo>
Amazon PollyCast
</demo>
* Initial project by James Siri, Piotr Lewalski
https://github.com/adhorn/pollycast
Amazon Rekognition
Deep learning-based image recognition service
Search, verify, and organize millions of images
Object and Scene
Detection
Facial
Analysis
Face
Comparison
Facial
Recognition
Object & Scene Detection
Object & Scene Detection
Facial Analysis
Facial Analysis
Facial Search
Facial Search
Collections
Amazon Rekognition
Customers
• Digital Asset Management
• Media and Entertainment
• Travel and Hospitality
• Influencer Marketing
• Systems Integration
• Digital Advertising
• Consumer Storage
• Law Enforcement
• Public Safety
• eCommerce
• Education
<API>
Amazon Rekognition
</API>
aws rekognition detect-faces
--image '{"S3Object":{"Bucket":"adhorn-reko","Name":"horse.jpg"}}'
--attributes "ALL"
aws rekognition detect-labels
--image '{"S3Object":{"Bucket":"adhorn-reko","Name":"horse.jpg"}}'
Poliko
http://poliko.adhorn.me
Poliko
Take Pic
Amazon Cognito
2. Detect Labels
4. Synthesize-speech
Amazon Rekognition
Amazon Polly
3. Detect Faces
Amazon S3
“Static website hosting” enabled
Cognito support for Identity
Username
Password
Sign In
SAML
Identity Provider
Amazon Cognito2. Get AWS credentials
API Gateway
DynamoDB S3
Lambda
Cognito User Pools
Rekognition
Polly
Policies
CognitoAmazon S3
<demo>
Poliko
powered by Amazon Polly & Rekognition
</demo>
https://github.com/adhorn/poliko
http://poliko.adhorn.me
* Initial project by Martin Elwin
Backend powered by Step
Functions
Start
Sequential Steps
U p l o a d R AW f i l e
D e l e t e R AW f i l e
End
AWS Step Functions
S e l e c t i m a g e
c o n v e rt e r
RA W t o J P E G RA W t o P NGRA W t o TI FF
L o a d i n Da t a b a se
Start
End
Un s u p p or te d i m a g e
t yp eParallel Steps
AWS Step Functions
P r o c e s s p h o t o
Re s i ze i m a g e
Start
End
E xt r a c t m e t a d a ta Fa c i a l r e c o g n it i on
L o a d i n Da t a b a se
Branching Steps
AWS Step Functions
Building AI-powered Serverless Applications on AWS
AWS Step Functions
<demo>
Image Recognition and Processing Backend
Step Functions
</demo>
https://github.com/awslabs/lambda-refarch-imagerecognition
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Questions?
Adrian Hornsby, Technical Evangelist
@adhorn
adhorn@amazon.com

More Related Content

PPTX
re:Invent re:Cap - An overview of Artificial Intelligence and Machine Learnin...
PDF
10 Lessons from 10 Years of AWS
PDF
An Introduction to AWS IoT - Web Summit Lisbon
PPTX
Design, Build, and Modernize Your Web Applications with AWS
PDF
Innovations and The Cloud
PPTX
Devoxx: Building AI-powered applications on AWS
PPTX
Serverless in Action on AWS
PPTX
Innovations and the Cloud
re:Invent re:Cap - An overview of Artificial Intelligence and Machine Learnin...
10 Lessons from 10 Years of AWS
An Introduction to AWS IoT - Web Summit Lisbon
Design, Build, and Modernize Your Web Applications with AWS
Innovations and The Cloud
Devoxx: Building AI-powered applications on AWS
Serverless in Action on AWS
Innovations and the Cloud

Similar to Building AI-powered Serverless Applications on AWS (20)

PPTX
Building Serverless AI-powered Apps on AWS
PDF
Developing Sophisticated Serverless Applications with AI
PDF
Artificial Intelligence on the AWS Platform
PDF
Amazon AI (February 2017)
PPTX
Building AI-powered Apps on AWS
PDF
An Overview to Artificial Intelligence Services at AWS
PDF
An Introduction to AI Services on AWS - Web Summit Lisbon
PDF
AI Services on AWS - CTO Club JLM
PDF
Ai services AWS - Taglit
PPTX
AI and Innovations on AWS
PPTX
AI on a PI
PDF
Harnessing Artificial Intelligence_Alastair Cousins
PDF
An Introduction to Amazon AI Services
PDF
AIM301 - Breaking Language Barriers With AI - Tel Aviv Summit 2019
PDF
Breaking language barriers with AI | AWS Summit Tel Aviv 2019
PDF
AI Today
PDF
AWS의 새로운 언어, 음성, 텍스트 처리 인공지능 서비스::Vikram Anbazhagan::AWS Summit Seoul 2018
PDF
Ai Services on AWS - AWS IL Meetup
PDF
Artificial Intelligence on the AWS Platform
PDF
Breaking Language Barriers with AI - Web Summit 2018
Building Serverless AI-powered Apps on AWS
Developing Sophisticated Serverless Applications with AI
Artificial Intelligence on the AWS Platform
Amazon AI (February 2017)
Building AI-powered Apps on AWS
An Overview to Artificial Intelligence Services at AWS
An Introduction to AI Services on AWS - Web Summit Lisbon
AI Services on AWS - CTO Club JLM
Ai services AWS - Taglit
AI and Innovations on AWS
AI on a PI
Harnessing Artificial Intelligence_Alastair Cousins
An Introduction to Amazon AI Services
AIM301 - Breaking Language Barriers With AI - Tel Aviv Summit 2019
Breaking language barriers with AI | AWS Summit Tel Aviv 2019
AI Today
AWS의 새로운 언어, 음성, 텍스트 처리 인공지능 서비스::Vikram Anbazhagan::AWS Summit Seoul 2018
Ai Services on AWS - AWS IL Meetup
Artificial Intelligence on the AWS Platform
Breaking Language Barriers with AI - Web Summit 2018
Ad

More from Adrian Hornsby (20)

PPTX
How can your business benefit from going serverless?
PDF
Can Automotive be as agile as Unicorns?
PDF
Moving Forward with AI - as presented at the Prosessipäivät 2018
PPTX
Chaos Engineering: Why Breaking Things Should Be Practised.
PPTX
Chaos Engineering: Why Breaking Things Should Be Practised.
PPTX
Model Serving for Deep Learning
PDF
AI in Finance: Moving forward!
PPTX
Building a Multi-Region, Active-Active Serverless Backends.
PDF
Moving Forward with AI
PPTX
AI: State of the Union
PPTX
Serverless Architectural Patterns
PPTX
re:Invent re:Cap - Big Data & IoT at Any Scale
PPTX
AWS Startup Day Bangalore: Being Well-Architected in the Cloud
PPTX
Journey Towards Scaling Your API to 10 Million Users
PDF
AWSome Day - Opening Keynote
PPTX
Innovations fueled by IoT and the Cloud
PPTX
AWS Batch: Simplifying batch computing in the cloud
PPTX
Being Well Architected in the Cloud (Updated)
PPTX
Deep Dive on Object Storage: Amazon S3 and Amazon Glacier
PPTX
Serverless Streaming Data Processing using Amazon Kinesis Analytics
How can your business benefit from going serverless?
Can Automotive be as agile as Unicorns?
Moving Forward with AI - as presented at the Prosessipäivät 2018
Chaos Engineering: Why Breaking Things Should Be Practised.
Chaos Engineering: Why Breaking Things Should Be Practised.
Model Serving for Deep Learning
AI in Finance: Moving forward!
Building a Multi-Region, Active-Active Serverless Backends.
Moving Forward with AI
AI: State of the Union
Serverless Architectural Patterns
re:Invent re:Cap - Big Data & IoT at Any Scale
AWS Startup Day Bangalore: Being Well-Architected in the Cloud
Journey Towards Scaling Your API to 10 Million Users
AWSome Day - Opening Keynote
Innovations fueled by IoT and the Cloud
AWS Batch: Simplifying batch computing in the cloud
Being Well Architected in the Cloud (Updated)
Deep Dive on Object Storage: Amazon S3 and Amazon Glacier
Serverless Streaming Data Processing using Amazon Kinesis Analytics
Ad

Recently uploaded (20)

PDF
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
PDF
Diabetes mellitus diagnosis method based random forest with bat algorithm
PDF
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
PDF
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
PDF
KodekX | Application Modernization Development
PDF
Encapsulation_ Review paper, used for researhc scholars
PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
PPTX
Big Data Technologies - Introduction.pptx
PDF
Review of recent advances in non-invasive hemoglobin estimation
PPTX
MYSQL Presentation for SQL database connectivity
PPTX
Cloud computing and distributed systems.
DOCX
The AUB Centre for AI in Media Proposal.docx
PPTX
Programs and apps: productivity, graphics, security and other tools
PDF
cuic standard and advanced reporting.pdf
PDF
Network Security Unit 5.pdf for BCA BBA.
PDF
Dropbox Q2 2025 Financial Results & Investor Presentation
PDF
NewMind AI Weekly Chronicles - August'25 Week I
PDF
Empathic Computing: Creating Shared Understanding
PPT
Teaching material agriculture food technology
PPTX
sap open course for s4hana steps from ECC to s4
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
Diabetes mellitus diagnosis method based random forest with bat algorithm
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
KodekX | Application Modernization Development
Encapsulation_ Review paper, used for researhc scholars
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
Big Data Technologies - Introduction.pptx
Review of recent advances in non-invasive hemoglobin estimation
MYSQL Presentation for SQL database connectivity
Cloud computing and distributed systems.
The AUB Centre for AI in Media Proposal.docx
Programs and apps: productivity, graphics, security and other tools
cuic standard and advanced reporting.pdf
Network Security Unit 5.pdf for BCA BBA.
Dropbox Q2 2025 Financial Results & Investor Presentation
NewMind AI Weekly Chronicles - August'25 Week I
Empathic Computing: Creating Shared Understanding
Teaching material agriculture food technology
sap open course for s4hana steps from ECC to s4

Building AI-powered Serverless Applications on AWS

  • 1. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Adrian Hornsby, Technical Evangelist @ AWS Building Serverless AI-powered Applications on AWS @adhorn
  • 2. • Technical Evangelist, Developer Advocate, … Software Engineer • Own bed in Finland • Previously: • Solutions Architect @AWS • Lead Cloud Architect @Dreambroker • Director of Engineering, Software Engineer, DevOps, Manager, ... @Hdm • Researcher @Nokia Research Center • and a bunch of other stuff. • Climber, like Ginger shots.
  • 3. What to Expect from the Session 1. A little bit history & theory never kills 2. AI in AWS 3. Building AI-powered apps x3
  • 5. No servers to provision or manage Scales with usage Never pay for idle Availability and fault tolerance built in Serverless means…
  • 6. EVENT DRIVEN CONTINUOUS SCALING PAY BY USAGE Serverless means…
  • 7. Exposing functionality rather than the whole server(s). Serverless means…
  • 13. Machine Learning In The Hands Of Every Developer
  • 14. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Machine Learning On AWS Today
  • 15. Machine Learning In The Hands Of Every Developer
  • 16. Text In, Life-like Speech Out Amazon Polly “Today in Seattle, WA it’s 11°F” “Today in Seattle Washington it’s 11 degrees Fahrenheit” 47 lifelike voices spread across 24 languages
  • 17. “Today in Seattle, WA, it’s 11°F” ‘"We live for the music" live from the Madison Square Garden.’ 1. Automatic, Accurate Text Processing A Focus On Voice Quality & Pronunciation
  • 18. 2. Intelligible and Easy to Understand 1. Automatic, Accurate Text Processing A Focus On Voice Quality & Pronunciation
  • 19. 2. Intelligible and Easy to Understand 3. Add Semantic Meaning to Text “Richard’s number is 2122341237“ “Richard’s number is 2122341237“ Telephone Number A Focus On Voice Quality & Pronunciation 1. Automatic, Accurate Text Processing
  • 20. 2. Intelligible and Easy to Understand 3. Add Semantic Meaning to Text 4. Customized Pronunciation “My daughter’s name is Kaja.” “My daughter’s name is Kaja.” A Focus On Voice Quality & Pronunciation 1. Automatic, Accurate Text Processing
  • 21. A Focus On Voice Quality & Pronunciation https://www.w3.org/TR/speech-synthesis/<speak> The spelling of my last name is <prosody rate='x-slow'> <say-as interpret-as="characters">Adrian</say-as> </prosody></speak>
  • 22. Duolingo voices its language learning service Using Polly Duolingo is a free language learning service where users help translate the web and rate translations. With Amazon Polly our users benefit from the most lifelike Text-to-Speech voices available on the market. Severin Hacker CTO, Duolingo ” “ • Spoken language crucial for language learning • Accurate pronunciation matters • Faster iteration thanks to TTS • As good as natural human speech
  • 24. <API> Amazon Polly </API> aws polly synthesize-speech --text "It was nice to live such a wonderful live show" --output-format mp3 --voice-id Joanna --text-type text johanna.mp3
  • 27. <demo> Amazon PollyCast </demo> * Initial project by James Siri, Piotr Lewalski https://github.com/adhorn/pollycast
  • 28. Amazon Rekognition Deep learning-based image recognition service Search, verify, and organize millions of images Object and Scene Detection Facial Analysis Face Comparison Facial Recognition
  • 29. Object & Scene Detection
  • 30. Object & Scene Detection
  • 36. Amazon Rekognition Customers • Digital Asset Management • Media and Entertainment • Travel and Hospitality • Influencer Marketing • Systems Integration • Digital Advertising • Consumer Storage • Law Enforcement • Public Safety • eCommerce • Education
  • 37. <API> Amazon Rekognition </API> aws rekognition detect-faces --image '{"S3Object":{"Bucket":"adhorn-reko","Name":"horse.jpg"}}' --attributes "ALL" aws rekognition detect-labels --image '{"S3Object":{"Bucket":"adhorn-reko","Name":"horse.jpg"}}'
  • 39. http://poliko.adhorn.me Poliko Take Pic Amazon Cognito 2. Detect Labels 4. Synthesize-speech Amazon Rekognition Amazon Polly 3. Detect Faces Amazon S3 “Static website hosting” enabled
  • 40. Cognito support for Identity Username Password Sign In SAML Identity Provider Amazon Cognito2. Get AWS credentials API Gateway DynamoDB S3 Lambda Cognito User Pools Rekognition Polly
  • 42. <demo> Poliko powered by Amazon Polly & Rekognition </demo> https://github.com/adhorn/poliko http://poliko.adhorn.me * Initial project by Martin Elwin
  • 43. Backend powered by Step Functions
  • 44. Start Sequential Steps U p l o a d R AW f i l e D e l e t e R AW f i l e End AWS Step Functions
  • 45. S e l e c t i m a g e c o n v e rt e r RA W t o J P E G RA W t o P NGRA W t o TI FF L o a d i n Da t a b a se Start End Un s u p p or te d i m a g e t yp eParallel Steps AWS Step Functions
  • 46. P r o c e s s p h o t o Re s i ze i m a g e Start End E xt r a c t m e t a d a ta Fa c i a l r e c o g n it i on L o a d i n Da t a b a se Branching Steps AWS Step Functions
  • 49. <demo> Image Recognition and Processing Backend Step Functions </demo> https://github.com/awslabs/lambda-refarch-imagerecognition
  • 50. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Questions? Adrian Hornsby, Technical Evangelist @adhorn adhorn@amazon.com

Editor's Notes

  • #4: you have a lot to cover and you are happy to field questions after the talk.
  • #12: A trillion is 1,000,000,000,000, also know as 10 to the 12th power, or one million million. It’s such a large number it’s hard to get your head around it, so sometimes trillion just means “wow, a lot.”
  • #13: AWS is an AI enabler .. For all the reason mentioned here – When AWS was established in 2006, one core premise was to allow anyone, even a student in his door-room, to get access to the same technologies that Fortune 500 companies have – we called it democratization of technology.
  • #16: And the result of this is that we see a ton of machine learning up on AWS today, literally from A through to Z. So everything from Ancestry, who are using machine learning and deep learning to be able to process genomic information and build out family trees, all the way through to Zillow, who use machine learning to do house-price estimation up on the website.
  • #17: Amazon Web Services provides a rich ecosystem to help you build smarter applications. In this context, it is worth highlighting the higher level AI services based on deep learning algorithms, like Amazon Rekognition, an image recognition service, Amazon Polly, a text to speech synthesizer, and Amazon Lex, a voice and text chatbot service. We also provide the infrastructure including GPU EC2 instances for fast parallel processing which you can use in combination with any of the popular deep learning libraries like Apache mxnet, Tensorflow, Theano, etc, all of which are available on the AWS deep learning AMI. For your general machine learning purposes, you can also use EC2, Amazon Elastic MapReduce and Spark with SparkML to run any machine learning algorithm. Another popular library is the python scikit-learn, which you can deploy on AWS Lambda or containers, or EC2. So what I am trying to convey is that there is a lot of choice, which basically boils down to picking the right tool for the right job, where you can make trade-offs between ‘do your own’ with all the flexibility, or picking a managed solution which allows you to get results fast without having to do the heavy lifting.
  • #18: The basics are pretty simple, but the service has deep functionality. You can send the service a simple string of text, and it will generate the life like voice in your choice of 47 different voices. But it’s not naive of the context of the text. For example, the text here - ‘WA’ and ‘degree F’, that would sound strange if it were spoken out loud. Instead, Polly will automatically expand the text strings ‘WA’ and ‘degree F’, to ‘Washington’ and ‘degrees fahrenheit’, to create more life like speech. The developer doesn’t have to do anything - just send the text, and get life like voice back.
  • #26: 30
  • #29: 24
  • #30: a fully managed deep learning based image recognition service. Designed from the get-go to run at scale. It comprehends scenes, objects, concepts and faces. Given an image, it will return a list of labels. Given an image with one or more faces,it will return bounding boxes for each face, along with face attributes. Given two images with faces, it will compare the largest face from the source image and find similarity with faces found in the tagret image. Rekognition provides quality face recognition at scale, and supports creation of collection of millions of faces and search of similar faces in the collection. Now lets dive into each of these features and look at the API that support these features.
  • #34: Image moderation Rekognition automatically detects explicit or suggestive adult content in your images, and provides confidence scores.
  • #39: 26
  • #44: 24
  • #51: 24