SlideShare a Scribd company logo
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
Breaking Language Barriers with AI
Boaz Ziniman
Technical Evangelist - Amazon Web Service
@ziniman ziniman
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
Natural language processing (NLP)
• Automatic Speech Recognition (ASR)
• Natural Language Understanding (NLU)
• Text to Speech
• Translation
1970 1980 1990 2000 2010 2020
HUMAN ACCURACY
50% 55%
60% 62%
70%
95%
Source: MindMeld
Breaking Language Barriers with AI - AWS Summit
Breaking Language Barriers with AI - AWS Summit
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
Common Language Use Cases
Information Bots
Education
Accessibility
Knowledge Management
Voice of Customer
Applications
Customer Service/
Call Centers
Enterprise
Digital Assistant
Semantic Search
Captioning Workflows
LocalizationPersonalization
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
M L F R A M E W O R K S &
I N F R A S T R U C T U R E
A I S E R V I C E S
R E K O G N I T I O N
I M A G E
P O L L Y T R A N S C R I B E T R A N S L A T E C O M P R E H E N D
& C O M P R E H E N D
M E D I C A L
L E XR E K O G N I T I O N
V I D E O
Vision Speech Chatbots
A M A Z O N
S A G E M A K E R
B U I L D T R A I N
F O R E C A S TT E X T R A C T P E R S O N A L I Z E
D E P L O Y
Pre-built algorithms & notebooks
Data labeling (G R O U N D T R U T H )
One-click model training & tuning
Optimization (N E O )
One-click deployment & hosting
M L S E R V I C E S
F r a m e w o r k s I n t e r f a c e s I n f r a s t r u c t u r e
E C 2 P 3
& P 3 d n
E C 2 C 5 F P G A s G R E E N G R A S S E L A S T I C
I N F E R E N C E
Reinforcement learningAlgorithms & models ( A W S M A R K E T P L A C E
F O R M A C H I N E L E A R N I N G )
Language Forecasting Recommendations
The Amazon ML Stack: Broadest & Deepest Set of
Capabilities
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
AI Services
Pre-trained AI services that require
no ML skills or training
Easily add intelligence to your
existing apps and workflows
Quality and accuracy from
continuously-learning APIs
A I S E R V I C E S
R E K O G N I T I O N
I M A G E
P O L L Y T R A N S C R I B E T R A N S L A T E C O M P R E H E N D
& C O M P R E H E N D
M E D I C A L
L E XR E K O G N I T I O N
V I D E O
Vision Speech Chatbots
F O R E C A S TT E X T R A C T P E R S O N A L I Z E
Language Forecasting Recommendations
S U M M I T © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Turn text into lifelike speech using deep learning
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
Amazon Polly – Use Cases
Contact
Centers
Special Needs
AI Assistant
Voiced videos
and presentations
Language
learning
Amazon Polly
Navigation
Podcasting,
Voiced blogs
and news articles
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
“Today in Seattle, WA
it’s 11°F”
Amazon Polly: Text In, Life-like Speech Out
54 voices across 27 languages
“Today in Seattle Washington
it’s 11 degrees Fahrenheit”
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
Amazon Polly: Text In, Life-like Speech Out
“Today in Mumbai,
India it’s 32°C”
“Today in Mumbai, India it’s 32
degrees Celcius”
54 voices across 27 languages
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
Amazon Polly: Text In, Life-like Speech Out
“Das ist mein 1. Mal in
der AWS Summit
Berlin.”
“Das ist mein erstes Mal
in der AWS Summit
Berlin.”
54 voices across 27 languages
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
A Focus On Voice Quality & Pronunciation
1. Automatic, Accurate Text Processing
“Today in Seattle, WA, it’s 11°F”
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
A Focus On Voice Quality & Pronunciation
1. Automatic, Accurate Text Processing
2. Intelligible and Easy to Understand
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
A Focus On Voice Quality & Pronunciation
1. Automatic, Accurate Text Processing
2. Intelligible and Easy to Understand
3. Add Semantic Meaning to Text
S U M M I T © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.
A Focus On Voice Quality & Pronunciation
https://www.w3.org/TR/speech-synthesis/
<speak>
The spelling of my name is
<prosody rate='x-slow'>
<say-as interpret-as="characters">Boaz</say-as>
</prosody>
</speak>
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
A Focus On Voice Quality & Pronunciation
1. Automatic, Accurate Text Processing
2. Intelligible and Easy to Understand
3. Add Semantic Meaning to Text
“Richard’s number is 2122341237“
<say-as interpret-as="telephone">
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
Voice Modification: Vocal Tract Length
<speak>
This is Brian without any voice modifications.
<amazon:effect vocal-tract-length="+15%"> Imagine now that I got bigger… </amazon:effect>
<amazon:effect vocal-tract-length="+25%"> Suppose that I got even bigger still… </amazon:effect>
Now let's go back and hear the effect when I go in the opposite direction.
<amazon:effect vocal-tract-length="-15%"> Can you tell that I'm getting smaller? </amazon:effect>
<amazon:effect vocal-tract-length="-25%"> Now I'm even smaller than before. </amazon:effect>
</speak>
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
Breaking Language Barriers with AI - AWS Summit
S U M M I T © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Polly API example
aws polly synthesize-speech 
--output-format mp3 --voice-id Matthew --text-type ssml 
--text '<speak>
<amazon:auto-breaths>
<prosody rate="x-slow" pitch="low">Here is my little secret.</prosody>
<amazon:breath duration="long" volume="x-loud"/>
<amazon:effect name="whispered">
<prosody rate="x-slow">
<prosody pitch="x-low">I</prosody>
killed Mufasa!
</prosody>
</amazon:effect>
</amazon:auto-breaths>
</speak>' 
mufasa.mp3
S U M M I T © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Automatic speech recognition
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
Automatic speech recognition service
“Hello, this is Allan
speaking”
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
Amazon Transcribe – Key Features
Channel
Identification
Custom
vocabulary
Speaker
Identification
Word-level
time stamps
Punctuation and
capitalization
Word-level
confidence scores
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
ringDNA
RingDNA is an end-to-end communications
platform for sales teams.
Hundreds of enterprise organizations use
RingDNA to increase productivity, engage in
smarter sales conversations, gain predictive
sales insights and improve their win rate.
Speech to Text
"A critical component of RingDNA’s Conversation
AI requires best of breed speech-to-text to deliver
transcriptions of every phone call. RingDNA is
excited about Amazon Transcribe since it provides
high-quality speech recognition at scale, helping us
to better transcribe every call to text"
Howard Brown, CEO & Founder, RingDNA
https://www.youtube.com/watch?v=1ZJ_f1bDdog
S U M M I T © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Natural and accurate language translation
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
21 Languages
417 Combinations
Key Features
Real-time
< 500ms / sentence on average
< 150ms / conversational / short form
Tag Handling
XML tags placement maintains
styling and formatting through
translation
< / >
Data Security
Data ownership
Encryption
Access Management
Ease of Use
Simple API calls and partner
solutions
$15/1M characters
Or $0.000075 per word;
Pay as you go, 2M characters
monthly free tier
HIPAA Eligible
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
Amazon Translate
Natural and fluent language translation
“Hello, what’s up? Do
you want to go see a
movie tonight?”
" Hallo, was gibt's? Willst
du heute Abend einen
Film sehen? "
S U M M I T © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Translate API example
boazz: ~$ aws translate translate-text 
--text "Hello, what’s up? Do you want to go see a movie tonight?" 
--source-language-code auto --target-language-code de
{
"TargetLanguageCode": "de",
"TranslatedText": " Hallo, was gibt's? Willst du heute Abend einen Film
sehen?",
"SourceLanguageCode": "en"
}
S U M M I T © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Translate API example
import boto3
translate = boto3.client("translate")
lang_flag_pairs = [("fr", "!"), ("de", """), ("es", "#"),
("pt", "$"),("zh", "%"), ("ja", "&"),
("ru", "'"),("it", "("), ("zh-TW", ")"),
("tr", "*"), ("cs", "+"), (”he", ",")]
for lang, flag in lang_flag_pairs:
print(flag)
print(translate.translate_text(
Text="Hello, World",
SourceLanguageCode="en",
TargetLanguageCode=lang
)['TranslatedText'])
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
Translate API example
!
Bonjour, Monde
"
Hallo, Welt
#
Hola, Mundo
$
Olá, Mundo
%
&
'
Привет, Мир
(
Ciao, Mondo
)
,
*
Merhaba, Dünya.
+
Ahoj, světe.
,
‫ש‬‫ל‬‫ו‬‫ם‬,‫ע‬‫ו‬‫ל‬‫ם‬.
https://github.com/ziniman/aws-translate-demo
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
Scaling
real-time translation
Using Amazon Translate, Lionbridge is able to
scale machine translation in order to localize
content faster and in more languages. Using
Translate, Lionbridge was able to reduce
translation costs by 20 percent.
S U M M I T © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.
https://github.com/ziniman/aws-translate-demo
S U M M I T © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Discover insights and relationships in text
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
Amazon Comprehend
Di s c o v e r i n s i g h t s a n d r e l a t i o n s h i p s i n t e x t
Entities
Key Phrases
Language
Sentiment
Syntax
Grouping
S U M M I T © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Accurately extract health information from patient
notes, clinical trial reports, and other electronic
health records using Amazon Comprehend
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
Amazon Comprehend Medical
Entities
Medication
Medical condition
Test, treatments, and
procedures anatomy
Protected Health
Information (PHI)
Relationship extraction
Medication
Test, treatments, and procedures
Entity traits
Negation
Diagnosis signs and symptom
S U M M I T © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Run Amazon Comprehend on S3 Bucket
import boto3
import json
s3 = boto3.resource('s3’)
bucket_name = ‘my_bucket’
region_name = ‘us-east-1’
bucket = s3.Bucket(bucket_name)
comprehend = boto3.client(service_name='comprehend', region_name=region)
for obj in bucket.objects.all():
body = obj.get()['Body'].read()
text = body
sentiment_response = comprehend.detect_sentiment(Text=text, LanguageCode='en’)
print(json.dumps(sentiment_response, sort_keys=True, indent=4))
S U M M I T © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Conversational interfaces for your applications
powered by the same deep learning technologies as
Alexa
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
Advent of conversational interactions
2nd gen:
Pointers & sliders
3nd gen:
Conversational
interfaces
1st gen:
Punch cards & memory
registers
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
Amazon Lex – use cases
CONTACT CENTER BOTS
Customer service IVR
Account inquiries
Bill payments
Service updates
Single Sign On
Users / Roles
Groups
Auditing / Monitoring
Risk & Compliancy
Insights
SECURITY
INFORMATIONAL BOTS
Answer questions
News updates
Weather information
Game scores
APPLICATION BOTS
Conversational interfaces
Book tickets
Order food
Manage bank accounts
Single Sign On
Users / Roles
Groups
Auditing / Monitoring
Risk & Compliancy
Insights
SECURITY
PRODUCTIVITY BOTS
Enterprise efficiencies
Check sales numbers
Inventory status
Expense reports
IoT BOTS
Device interactions
Kiosks
Appliances
Auto
A service for building conversational interfaces into your applications using voice and text
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
Lex Use Case: Digital Assistant to Book a Hotel
Book hotel
NYC
“Book a hotel in
NYC”
Automatic speech
recognition
Hotel booking
New York City
Natural language
understanding
Intent/slot
model
UtterancesHotel booking
City New York City
Check in November 30
Check out December 2
“Your hotel is booked for
November 30.”
Amazon Polly
Confirmation: “Your hotel is
booked for November 30.”
“Can I go ahead
with the booking?”
a
in
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
Put AI to work for your business
Modernize your contact center to improve customer service
conversational chat bots | call transcription | intelligent routing | sentiment analysis | VoC analytics
text-to speech | multilingual omni-channel communication
P O L L Y T R A N S C R I B E T R A N S L A T E C O M P R E H E N D L E X
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
Amazon Connect
Real time and
historical analytics
Skills-based routing
[Automatic Call Distribution (ACD)]
Call
recording
High-quality
voice capability
Easy to use, cloud-based contact center solution
that scales to support businesses of any size
With tools that grow with your needs
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
Improving Contact Centers With Artificial
Intelligence
A m a z o n
L e x
A m a z o n
T r a n s c r i b e
A m a z o n
C o m p r e h e n d
T R A N S C R I P T
A m a z o n
C o n n e c t
Analytics
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
Improving Contact Centers With Artificial
Intelligence
A m a z o n
L e x
A m a z o n
T r a n s c r i b e
A m a z o n
C o m p r e h e n d
T R A N S C R I P T
A m a z o n
C o n n e c t
Analytics
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
Connect Demo
+49 800 5052761
+49 800 5052761
Next session for Boaz Ziniman is…
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
Babel fish
"The Babel fish is small, yellow, leech-like - and probably the oddest thing in the universe. It
feeds on brain wave energy, absorbing all unconscious frequencies and then excreting
telepathically a matrix formed from the conscious frequencies and nerve signals picked up from
the speech centres of the brain, the practical upshot of which is that if you stick one in your ear,
you can instantly understand anything said to you in any form of language: the speech you hear
decodes the brain wave matrix."
The Hitchhiker's Guide to the Galaxy, Douglas Adams
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
Build your own Babel fish
Person Speaks in
English
Transcribe voice to text in
English
Translate text to
German
Speak in
German
S U M M I T © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.
chat.boaz.cloud
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
Babel Chat
chat.boaz.cloud
{
"room":"/",
"message":{
"user":"Boaz",
"text":"I'm doing great",
"timestamp":1540978160578
},
"clientId":"07958771059731382"
}
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
Put AI to work for your business
Reduce localization costs & improve accuracy
custom vocabulary | timestamp generation | secure real-time translation | language identification
P O L L Y T R A N S C R I B E T R A N S L A T E C O M P R E H E N D
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
Put AI to work for your business
Understand the voice of your customer
sentiment analysis | app localization | translation services | transcription services | cataloging media | accessibility
T R A N S C R I B E T R A N S L A T E C O M P R E H E N DR E K O G N I T I O N
I M A G E
R E K O G N I T I O N
V I D E O
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
M L F R A M E W O R K S &
I N F R A S T R U C T U R E
A I S E R V I C E S
R E K O G N I T I O N
I M A G E
P O L L Y T R A N S C R I B E T R A N S L A T E C O M P R E H E N D
& C O M P R E H E N D
M E D I C A L
L E XR E K O G N I T I O N
V I D E O
Vision Speech Chatbots
A M A Z O N
S A G E M A K E R
B U I L D T R A I N
F O R E C A S TT E X T R A C T P E R S O N A L I Z E
D E P L O Y
Pre-built algorithms & notebooks
Data labeling (G R O U N D T R U T H )
One-click model training & tuning
Optimization (N E O )
One-click deployment & hosting
M L S E R V I C E S
F r a m e w o r k s I n t e r f a c e s I n f r a s t r u c t u r e
E C 2 P 3
& P 3 d n
E C 2 C 5 F P G A s G R E E N G R A S S E L A S T I C
I N F E R E N C E
Reinforcement learningAlgorithms & models ( A W S M A R K E T P L A C E
F O R M A C H I N E L E A R N I N G )
Language Forecasting Recommendations
The Amazon ML Stack: Broadest & Deepest Set of
Capabilities
Thank you!
S U M M I T © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Boaz Ziniman
Technical Evangelist - Amazon Web Service
@ziniman ziniman

More Related Content

PDF
AIM301 - Breaking Language Barriers With AI - Tel Aviv Summit 2019
PDF
Breaking Voice and Language Barriers with AI - Chatbot Summit Tel Aviv
PDF
Breaking language barriers with AI | AWS Summit Tel Aviv 2019
PDF
Introduction to AI/ML with AWS
PDF
Breaking Language Barriers with AI - Web Summit 2018
PDF
Artificial Intelligence for Developers - OOP Munich
PPTX
Ai Services on AWS
PDF
The future of AI - IDC Herzelia
AIM301 - Breaking Language Barriers With AI - Tel Aviv Summit 2019
Breaking Voice and Language Barriers with AI - Chatbot Summit Tel Aviv
Breaking language barriers with AI | AWS Summit Tel Aviv 2019
Introduction to AI/ML with AWS
Breaking Language Barriers with AI - Web Summit 2018
Artificial Intelligence for Developers - OOP Munich
Ai Services on AWS
The future of AI - IDC Herzelia

More from Boaz Ziniman (20)

PDF
AWS Cost Optimization - JLM
PDF
What can you do with Serverless in 2020
PDF
Six ways to reduce your AWS bill
PDF
From Cloud to Edge & back again
PDF
Modern Applications Development on AWS
PDF
Enriching your app with Image recognition and AWS AI services Hebrew Webinar
PDF
AI Services and Serverless Workshop
PDF
Drive Down the Cost of your Data Lake by Using the Right Data Tiering
PDF
Serverless Beyond Functions - CTO Club Made in JLM
PDF
Websites Go Serverless - ServerlessDays TLV 2019
PDF
SKL208 - Turbocharge your Business with AI and Machine Learning - Tel Aviv Su...
PDF
Websites go Serverless - AWS Summit Berlin
PDF
AWS Lambda updates from re:Invent
PDF
Introduction to Serverless Computing - OOP Munich
PDF
IoT from Cloud to Edge & Back Again - WebSummit 2018
PDF
How Websites go Serverless - WebSummit Lisbon 2018
PDF
Introduction to Serverless computing and AWS Lambda - Floor28
PDF
AI Services for Developers - Floor28
PDF
Building Alexa Skills - Floor28
PDF
Websites go Serverless - Floor28
AWS Cost Optimization - JLM
What can you do with Serverless in 2020
Six ways to reduce your AWS bill
From Cloud to Edge & back again
Modern Applications Development on AWS
Enriching your app with Image recognition and AWS AI services Hebrew Webinar
AI Services and Serverless Workshop
Drive Down the Cost of your Data Lake by Using the Right Data Tiering
Serverless Beyond Functions - CTO Club Made in JLM
Websites Go Serverless - ServerlessDays TLV 2019
SKL208 - Turbocharge your Business with AI and Machine Learning - Tel Aviv Su...
Websites go Serverless - AWS Summit Berlin
AWS Lambda updates from re:Invent
Introduction to Serverless Computing - OOP Munich
IoT from Cloud to Edge & Back Again - WebSummit 2018
How Websites go Serverless - WebSummit Lisbon 2018
Introduction to Serverless computing and AWS Lambda - Floor28
AI Services for Developers - Floor28
Building Alexa Skills - Floor28
Websites go Serverless - Floor28
Ad

Recently uploaded (20)

PDF
Mobile App Security Testing_ A Comprehensive Guide.pdf
PPTX
Cloud computing and distributed systems.
PDF
GamePlan Trading System Review: Professional Trader's Honest Take
PDF
Bridging biosciences and deep learning for revolutionary discoveries: a compr...
PPT
“AI and Expert System Decision Support & Business Intelligence Systems”
PDF
Per capita expenditure prediction using model stacking based on satellite ima...
PDF
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
PDF
Advanced IT Governance
PDF
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
PDF
NewMind AI Monthly Chronicles - July 2025
PDF
GDG Cloud Iasi [PUBLIC] Florian Blaga - Unveiling the Evolution of Cybersecur...
PDF
Advanced methodologies resolving dimensionality complications for autism neur...
PDF
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
PDF
CIFDAQ's Market Insight: SEC Turns Pro Crypto
PPTX
breach-and-attack-simulation-cybersecurity-india-chennai-defenderrabbit-2025....
PPTX
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
PDF
Modernizing your data center with Dell and AMD
PDF
NewMind AI Weekly Chronicles - August'25 Week I
PPTX
Big Data Technologies - Introduction.pptx
Mobile App Security Testing_ A Comprehensive Guide.pdf
Cloud computing and distributed systems.
GamePlan Trading System Review: Professional Trader's Honest Take
Bridging biosciences and deep learning for revolutionary discoveries: a compr...
“AI and Expert System Decision Support & Business Intelligence Systems”
Per capita expenditure prediction using model stacking based on satellite ima...
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
Advanced IT Governance
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
NewMind AI Monthly Chronicles - July 2025
GDG Cloud Iasi [PUBLIC] Florian Blaga - Unveiling the Evolution of Cybersecur...
Advanced methodologies resolving dimensionality complications for autism neur...
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
CIFDAQ's Market Insight: SEC Turns Pro Crypto
breach-and-attack-simulation-cybersecurity-india-chennai-defenderrabbit-2025....
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
Modernizing your data center with Dell and AMD
NewMind AI Weekly Chronicles - August'25 Week I
Big Data Technologies - Introduction.pptx
Ad

Breaking Language Barriers with AI - AWS Summit

  • 1. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T Breaking Language Barriers with AI Boaz Ziniman Technical Evangelist - Amazon Web Service @ziniman ziniman
  • 2. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T Natural language processing (NLP) • Automatic Speech Recognition (ASR) • Natural Language Understanding (NLU) • Text to Speech • Translation 1970 1980 1990 2000 2010 2020 HUMAN ACCURACY 50% 55% 60% 62% 70% 95% Source: MindMeld
  • 5. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T Common Language Use Cases Information Bots Education Accessibility Knowledge Management Voice of Customer Applications Customer Service/ Call Centers Enterprise Digital Assistant Semantic Search Captioning Workflows LocalizationPersonalization
  • 6. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T M L F R A M E W O R K S & I N F R A S T R U C T U R E A I S E R V I C E S R E K O G N I T I O N I M A G E P O L L Y T R A N S C R I B E T R A N S L A T E C O M P R E H E N D & C O M P R E H E N D M E D I C A L L E XR E K O G N I T I O N V I D E O Vision Speech Chatbots A M A Z O N S A G E M A K E R B U I L D T R A I N F O R E C A S TT E X T R A C T P E R S O N A L I Z E D E P L O Y Pre-built algorithms & notebooks Data labeling (G R O U N D T R U T H ) One-click model training & tuning Optimization (N E O ) One-click deployment & hosting M L S E R V I C E S F r a m e w o r k s I n t e r f a c e s I n f r a s t r u c t u r e E C 2 P 3 & P 3 d n E C 2 C 5 F P G A s G R E E N G R A S S E L A S T I C I N F E R E N C E Reinforcement learningAlgorithms & models ( A W S M A R K E T P L A C E F O R M A C H I N E L E A R N I N G ) Language Forecasting Recommendations The Amazon ML Stack: Broadest & Deepest Set of Capabilities
  • 7. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T AI Services Pre-trained AI services that require no ML skills or training Easily add intelligence to your existing apps and workflows Quality and accuracy from continuously-learning APIs A I S E R V I C E S R E K O G N I T I O N I M A G E P O L L Y T R A N S C R I B E T R A N S L A T E C O M P R E H E N D & C O M P R E H E N D M E D I C A L L E XR E K O G N I T I O N V I D E O Vision Speech Chatbots F O R E C A S TT E X T R A C T P E R S O N A L I Z E Language Forecasting Recommendations
  • 8. S U M M I T © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved. Turn text into lifelike speech using deep learning
  • 9. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T Amazon Polly – Use Cases Contact Centers Special Needs AI Assistant Voiced videos and presentations Language learning Amazon Polly Navigation Podcasting, Voiced blogs and news articles
  • 10. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T “Today in Seattle, WA it’s 11°F” Amazon Polly: Text In, Life-like Speech Out 54 voices across 27 languages “Today in Seattle Washington it’s 11 degrees Fahrenheit”
  • 11. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T Amazon Polly: Text In, Life-like Speech Out “Today in Mumbai, India it’s 32°C” “Today in Mumbai, India it’s 32 degrees Celcius” 54 voices across 27 languages
  • 12. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T Amazon Polly: Text In, Life-like Speech Out “Das ist mein 1. Mal in der AWS Summit Berlin.” “Das ist mein erstes Mal in der AWS Summit Berlin.” 54 voices across 27 languages
  • 13. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T A Focus On Voice Quality & Pronunciation 1. Automatic, Accurate Text Processing “Today in Seattle, WA, it’s 11°F”
  • 14. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T A Focus On Voice Quality & Pronunciation 1. Automatic, Accurate Text Processing 2. Intelligible and Easy to Understand
  • 15. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T A Focus On Voice Quality & Pronunciation 1. Automatic, Accurate Text Processing 2. Intelligible and Easy to Understand 3. Add Semantic Meaning to Text
  • 16. S U M M I T © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved. A Focus On Voice Quality & Pronunciation https://www.w3.org/TR/speech-synthesis/ <speak> The spelling of my name is <prosody rate='x-slow'> <say-as interpret-as="characters">Boaz</say-as> </prosody> </speak>
  • 17. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T A Focus On Voice Quality & Pronunciation 1. Automatic, Accurate Text Processing 2. Intelligible and Easy to Understand 3. Add Semantic Meaning to Text “Richard’s number is 2122341237“ <say-as interpret-as="telephone">
  • 18. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T Voice Modification: Vocal Tract Length <speak> This is Brian without any voice modifications. <amazon:effect vocal-tract-length="+15%"> Imagine now that I got bigger… </amazon:effect> <amazon:effect vocal-tract-length="+25%"> Suppose that I got even bigger still… </amazon:effect> Now let's go back and hear the effect when I go in the opposite direction. <amazon:effect vocal-tract-length="-15%"> Can you tell that I'm getting smaller? </amazon:effect> <amazon:effect vocal-tract-length="-25%"> Now I'm even smaller than before. </amazon:effect> </speak>
  • 19. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
  • 20. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
  • 22. S U M M I T © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved. Polly API example aws polly synthesize-speech --output-format mp3 --voice-id Matthew --text-type ssml --text '<speak> <amazon:auto-breaths> <prosody rate="x-slow" pitch="low">Here is my little secret.</prosody> <amazon:breath duration="long" volume="x-loud"/> <amazon:effect name="whispered"> <prosody rate="x-slow"> <prosody pitch="x-low">I</prosody> killed Mufasa! </prosody> </amazon:effect> </amazon:auto-breaths> </speak>' mufasa.mp3
  • 23. S U M M I T © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved. Automatic speech recognition
  • 24. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T Automatic speech recognition service “Hello, this is Allan speaking”
  • 25. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T Amazon Transcribe – Key Features Channel Identification Custom vocabulary Speaker Identification Word-level time stamps Punctuation and capitalization Word-level confidence scores
  • 26. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T ringDNA RingDNA is an end-to-end communications platform for sales teams. Hundreds of enterprise organizations use RingDNA to increase productivity, engage in smarter sales conversations, gain predictive sales insights and improve their win rate. Speech to Text "A critical component of RingDNA’s Conversation AI requires best of breed speech-to-text to deliver transcriptions of every phone call. RingDNA is excited about Amazon Transcribe since it provides high-quality speech recognition at scale, helping us to better transcribe every call to text" Howard Brown, CEO & Founder, RingDNA https://www.youtube.com/watch?v=1ZJ_f1bDdog
  • 27. S U M M I T © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved. Natural and accurate language translation
  • 28. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T 21 Languages 417 Combinations Key Features Real-time < 500ms / sentence on average < 150ms / conversational / short form Tag Handling XML tags placement maintains styling and formatting through translation < / > Data Security Data ownership Encryption Access Management Ease of Use Simple API calls and partner solutions $15/1M characters Or $0.000075 per word; Pay as you go, 2M characters monthly free tier HIPAA Eligible
  • 29. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T Amazon Translate Natural and fluent language translation “Hello, what’s up? Do you want to go see a movie tonight?” " Hallo, was gibt's? Willst du heute Abend einen Film sehen? "
  • 30. S U M M I T © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved. Translate API example boazz: ~$ aws translate translate-text --text "Hello, what’s up? Do you want to go see a movie tonight?" --source-language-code auto --target-language-code de { "TargetLanguageCode": "de", "TranslatedText": " Hallo, was gibt's? Willst du heute Abend einen Film sehen?", "SourceLanguageCode": "en" }
  • 31. S U M M I T © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved. Translate API example import boto3 translate = boto3.client("translate") lang_flag_pairs = [("fr", "!"), ("de", """), ("es", "#"), ("pt", "$"),("zh", "%"), ("ja", "&"), ("ru", "'"),("it", "("), ("zh-TW", ")"), ("tr", "*"), ("cs", "+"), (”he", ",")] for lang, flag in lang_flag_pairs: print(flag) print(translate.translate_text( Text="Hello, World", SourceLanguageCode="en", TargetLanguageCode=lang )['TranslatedText'])
  • 32. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T Translate API example ! Bonjour, Monde " Hallo, Welt # Hola, Mundo $ Olá, Mundo % & ' Привет, Мир ( Ciao, Mondo ) , * Merhaba, Dünya. + Ahoj, světe. , ‫ש‬‫ל‬‫ו‬‫ם‬,‫ע‬‫ו‬‫ל‬‫ם‬. https://github.com/ziniman/aws-translate-demo
  • 33. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T Scaling real-time translation Using Amazon Translate, Lionbridge is able to scale machine translation in order to localize content faster and in more languages. Using Translate, Lionbridge was able to reduce translation costs by 20 percent.
  • 34. S U M M I T © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved. https://github.com/ziniman/aws-translate-demo
  • 35. S U M M I T © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved. Discover insights and relationships in text
  • 36. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T Amazon Comprehend Di s c o v e r i n s i g h t s a n d r e l a t i o n s h i p s i n t e x t Entities Key Phrases Language Sentiment Syntax Grouping
  • 37. S U M M I T © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved. Accurately extract health information from patient notes, clinical trial reports, and other electronic health records using Amazon Comprehend
  • 38. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T Amazon Comprehend Medical Entities Medication Medical condition Test, treatments, and procedures anatomy Protected Health Information (PHI) Relationship extraction Medication Test, treatments, and procedures Entity traits Negation Diagnosis signs and symptom
  • 39. S U M M I T © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved. Run Amazon Comprehend on S3 Bucket import boto3 import json s3 = boto3.resource('s3’) bucket_name = ‘my_bucket’ region_name = ‘us-east-1’ bucket = s3.Bucket(bucket_name) comprehend = boto3.client(service_name='comprehend', region_name=region) for obj in bucket.objects.all(): body = obj.get()['Body'].read() text = body sentiment_response = comprehend.detect_sentiment(Text=text, LanguageCode='en’) print(json.dumps(sentiment_response, sort_keys=True, indent=4))
  • 40. S U M M I T © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved. Conversational interfaces for your applications powered by the same deep learning technologies as Alexa
  • 41. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T Advent of conversational interactions 2nd gen: Pointers & sliders 3nd gen: Conversational interfaces 1st gen: Punch cards & memory registers
  • 42. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T Amazon Lex – use cases CONTACT CENTER BOTS Customer service IVR Account inquiries Bill payments Service updates Single Sign On Users / Roles Groups Auditing / Monitoring Risk & Compliancy Insights SECURITY INFORMATIONAL BOTS Answer questions News updates Weather information Game scores APPLICATION BOTS Conversational interfaces Book tickets Order food Manage bank accounts Single Sign On Users / Roles Groups Auditing / Monitoring Risk & Compliancy Insights SECURITY PRODUCTIVITY BOTS Enterprise efficiencies Check sales numbers Inventory status Expense reports IoT BOTS Device interactions Kiosks Appliances Auto A service for building conversational interfaces into your applications using voice and text
  • 43. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T Lex Use Case: Digital Assistant to Book a Hotel Book hotel NYC “Book a hotel in NYC” Automatic speech recognition Hotel booking New York City Natural language understanding Intent/slot model UtterancesHotel booking City New York City Check in November 30 Check out December 2 “Your hotel is booked for November 30.” Amazon Polly Confirmation: “Your hotel is booked for November 30.” “Can I go ahead with the booking?” a in
  • 44. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T Put AI to work for your business Modernize your contact center to improve customer service conversational chat bots | call transcription | intelligent routing | sentiment analysis | VoC analytics text-to speech | multilingual omni-channel communication P O L L Y T R A N S C R I B E T R A N S L A T E C O M P R E H E N D L E X
  • 45. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T Amazon Connect Real time and historical analytics Skills-based routing [Automatic Call Distribution (ACD)] Call recording High-quality voice capability Easy to use, cloud-based contact center solution that scales to support businesses of any size With tools that grow with your needs
  • 46. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T Improving Contact Centers With Artificial Intelligence A m a z o n L e x A m a z o n T r a n s c r i b e A m a z o n C o m p r e h e n d T R A N S C R I P T A m a z o n C o n n e c t Analytics
  • 47. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T Improving Contact Centers With Artificial Intelligence A m a z o n L e x A m a z o n T r a n s c r i b e A m a z o n C o m p r e h e n d T R A N S C R I P T A m a z o n C o n n e c t Analytics
  • 48. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T Connect Demo +49 800 5052761 +49 800 5052761 Next session for Boaz Ziniman is…
  • 49. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T Babel fish "The Babel fish is small, yellow, leech-like - and probably the oddest thing in the universe. It feeds on brain wave energy, absorbing all unconscious frequencies and then excreting telepathically a matrix formed from the conscious frequencies and nerve signals picked up from the speech centres of the brain, the practical upshot of which is that if you stick one in your ear, you can instantly understand anything said to you in any form of language: the speech you hear decodes the brain wave matrix." The Hitchhiker's Guide to the Galaxy, Douglas Adams
  • 50. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T Build your own Babel fish Person Speaks in English Transcribe voice to text in English Translate text to German Speak in German
  • 51. S U M M I T © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved. chat.boaz.cloud
  • 52. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T Babel Chat chat.boaz.cloud { "room":"/", "message":{ "user":"Boaz", "text":"I'm doing great", "timestamp":1540978160578 }, "clientId":"07958771059731382" }
  • 53. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T Put AI to work for your business Reduce localization costs & improve accuracy custom vocabulary | timestamp generation | secure real-time translation | language identification P O L L Y T R A N S C R I B E T R A N S L A T E C O M P R E H E N D
  • 54. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T Put AI to work for your business Understand the voice of your customer sentiment analysis | app localization | translation services | transcription services | cataloging media | accessibility T R A N S C R I B E T R A N S L A T E C O M P R E H E N DR E K O G N I T I O N I M A G E R E K O G N I T I O N V I D E O
  • 55. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T M L F R A M E W O R K S & I N F R A S T R U C T U R E A I S E R V I C E S R E K O G N I T I O N I M A G E P O L L Y T R A N S C R I B E T R A N S L A T E C O M P R E H E N D & C O M P R E H E N D M E D I C A L L E XR E K O G N I T I O N V I D E O Vision Speech Chatbots A M A Z O N S A G E M A K E R B U I L D T R A I N F O R E C A S TT E X T R A C T P E R S O N A L I Z E D E P L O Y Pre-built algorithms & notebooks Data labeling (G R O U N D T R U T H ) One-click model training & tuning Optimization (N E O ) One-click deployment & hosting M L S E R V I C E S F r a m e w o r k s I n t e r f a c e s I n f r a s t r u c t u r e E C 2 P 3 & P 3 d n E C 2 C 5 F P G A s G R E E N G R A S S E L A S T I C I N F E R E N C E Reinforcement learningAlgorithms & models ( A W S M A R K E T P L A C E F O R M A C H I N E L E A R N I N G ) Language Forecasting Recommendations The Amazon ML Stack: Broadest & Deepest Set of Capabilities
  • 56. Thank you! S U M M I T © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved. Boaz Ziniman Technical Evangelist - Amazon Web Service @ziniman ziniman