SlideShare a Scribd company logo
How to build voice
interfaces professionally
Fabian Dill, DieProduktMacher
Grand Event about Chatbots and Voice Interfaces
June 23rd
Agenda
When to consider voice user interfaces (VUIs)
How to design great experiences for VUIs
How to develop VUIs
2
1
2
3
When to consider VUIs?
3
Conversational is the new normal
MESSAGING APPS HAVE SURPASSED
SOCIAL NETWORKS (Monthly active users for top 4 social networks and messaging apps)
Big 4 Social
Network Apps
Big 4 Messaging Apps
4
Why now? Because we can
0
5
10
15
20
25
30
2010 2011 2012 2013 2014 2015
Error rate
0
2
4
6
8
10
12
14
16
Error rate
1. Deep Learning
Algorithm
5
ERROR RATE IN IMAGE RECOGNITION
(ILSVRC RESULTS)
ERROR RATE IN SPEECH RECOGNITION
Two types of conversational interfaces are on the rise.
6
What kind of conversational interfaces?
TEXT | Chatbots VOICE | existing platforms custom build
7
• Siri (Apple)
• Google Assistent (Google)
• Google Home (Google)
• Cortana (Microsoft)
• Alexa (Amazon)
• Bragi
Voice interfaces are not restricted to Alexa skills
In addition to these voice interface
platforms custom build solutions
are possible:
• Matrix Labs
• Google Speech API
• Alexa Services / Amazon Lex
8
You can also built your custom voice interfaces
9
Half of the Germans already used a voice interface
10
Before Alexa VUIs were tied to Smartphones
11
As you can see in the typical applications
12
In the following I will focus on Alexa skills
OPERATORS &
EXECUTORS
• Commerce
(fashion, flowers,
food delivery)
• Booking
(Tickets, Travel, …)
• Banking
DOMAIN
KNOWLEDGE EXPERTS
• Customer Support
• Advice and
Recommendations
• Domain-specific
knowledge:
• Sports
• News
• Travel
• Insurance
13
Three types of Conversational UIs (Chatbots)
PERSONAL
ASSISTANT
• Google Assistant
• Facebook M
• Apple Siri
• Microsoft Cortana
• Amazon Alexa
14
(SHORT)
INFORMATION
Types of Voice Interfaces
TV program, weather,
news, knowledge,
status information
ENTERTAINMENT
Jokes, Games,
…
ADVICE AND
RECOMMENDATION
Fashion,
Travel, ...
TRANSACTIONAL
Shopping,
Booking
CUSTOMER CARE?
15
Smart Home
Physical Device
Game
Complex Interaction
Handsfree environment (e.g. car, climbing, assembly, …)
Your user experience can significantly improve
When to consider Voice Interfaces?
16
How to design great
experiences for VUIs
17
PRODUCT
ATTRIBUTES
• Goal
• Target Group
• Core Value
PERSONALITY
• Age and gender
• Adjective
• Name
EXAMPLE
DIALOG
• Text
• Talk
• Record
Our current best practices
18
ASK
TELL
REMEMBER
… keeps the dialog going, listens to further user input
… use it for questions, reprompts, error messages
Endpoint of the dialog, shuts down the session
Use it for results, stop and goodbye
Creating a DynamoDB storage is just one line of code!
Create dialogs
19
How to develop VUIs
20
The usual Alexa skill environment 
21
• Use your favorite IDE (code completions, shortcuts, etc.)
• Realise parallel development (>1 developer)
• Extend live product with new features
• Be able to rollback (in the unlikely case someone messed it up)
Why professional development?
22
Serverless to the rescue
23
Deploying with Serverless
Our road to a professional development
enviroment for VUIs (Alexa)
AUTOM.
TESTING
How to create
unit
and integration
tests
MULTI-
LANGUAGE
SUPPORT
Separate
language from
functionality
STAGING
Create stages
for interaction
model and
backend
lambdas
LOCAL
DEVELOPMENT
Set up a local
development
environment
25
• Develop a skill in several languages (English, German)
• Separate utterances from functionality
• Require the correct language file at runtime
Multi-language Support Preparation
26
Multi-language Support
Input / Output
language based on
user settings in Alexa
App
locale: 'de-DE'
handler.js
i18n/de-DE.js
i18n/en-US.js
i18n/en-UK.js
locale: 'de-DE'
// require specific language file
const languages =
require(`./locales/${lang}.js`)
// get text message according to locale
const getText = (alexa, message,
attributes, index) => {
return Language.getText(
alexa.event.request.locale,
message,
attributes,
index)
}
ALEXA
AWS
lambda
27
Separate language from functionality (DEMO)
SERVERLESS
INDIVIDUAL DEVELOPMENT INTEGRATION PRODUCTION
INTERACTION
MODEL / SKILL
BACKEND /
LAMBDA
GIT REPO /
GITFLOW
static arn
28
Parallel Development,
Staging and releases with Alexa
manual copy manual copy manual copy
FEATURE
DEVELOP
MASTER
RELEASE
29
Copying interaction models =
manual 5-step process 
30
• Error prone  developer has to be on the correct branch
• Next step: automate or use continuous integration system
31
We use Serverless to connect correct branch with
referring stage
• Install as a serverless plugin
• Start a local server
• Share localhost with the internet
• Configure Alexa skill to use your HTTPS
endpoint
32
We are working on a serverless plugin to allow for
local development
33
Develop and test locally until commit
• Several use cases applicable for voice interfaces
• Design dialogs by prototyping them extensively
• Alexa infrastructure is not yet ready for a professional setup
• There are several ways to optimize your development process (we shared one of them)
• We are on our journey to continuously improve this process
• We are looking for input, comments, other experiences and new team members 
34
Conclusion
Contact:
35
Fabian Dill
CEO
Email: fabian.dill@produktmacher.com
Telefon: 089 / 189 46 54

More Related Content

PPTX
20140207 xamarin-red fabriq-microsoft-techdays-nativemobileappdevelopmentwith...
PDF
Eindhoven Mobile Development First Meetup Slides
PPTX
Introduction to BOT Framework- Global Azure Bootcamp 2017
PDF
FirstMeetupSlides
PDF
Solvion Trendwerkstatt - Microsoft Azure + Bots
PDF
Building a New Generation of Mobile Games with Speech
PPTX
X-Platform native apps in C# and .NET using Xamarin tools (iOS/WP/Android)
PPTX
Where Design and Development Meet: Cross Platform Mobile Experiences
20140207 xamarin-red fabriq-microsoft-techdays-nativemobileappdevelopmentwith...
Eindhoven Mobile Development First Meetup Slides
Introduction to BOT Framework- Global Azure Bootcamp 2017
FirstMeetupSlides
Solvion Trendwerkstatt - Microsoft Azure + Bots
Building a New Generation of Mobile Games with Speech
X-Platform native apps in C# and .NET using Xamarin tools (iOS/WP/Android)
Where Design and Development Meet: Cross Platform Mobile Experiences

Similar to How to build voice interfaces in a professional way (20)

PDF
Build Alexa Skills for Hindi - Workshop
PPSX
Presentation final
PDF
Как да станем софтуерни инженери и да стартираме ИТ бизнес?
PDF
Flutter vs. MAUI - Techorama BE 2022
PDF
Flutter vs. MAUI - what should you pick and why?
PPT
Multi-platform Mobile apps with Phonegap
PDF
OpenNTF Updates 03/05/13
PDF
White paper native, web or hybrid mobile app development
PPTX
Маргарита Остапчук (Microsoft Украина) «Разработка на универсальной платформе...
PPTX
Абрамович Максим, "Rad studio xe4"
PDF
The Great Mobile Debate: Native vs. Hybrid App Development
PDF
Top 10 Cross-Platform Frameworks of 2025.pdf
PDF
Xamarin the good, the bad and the ugly
PPTX
AI and Python: Developing a Conversational Interface using Python
PPTX
AppStudio for ArcGIS: The Basics - Esri FedGIS 2016
PPTX
Guest Lecture _ Python Basics _ Alexa Skill Dev _ by Shivam Dutt Sharma
PDF
Info Session GDSC USICT
PPTX
Stronger than Chuck Norris: SharePoint in clouds, on earth, in subway and in ...
PPTX
Mobile App Landscape for the Non-Technical
PDF
Embarcadero's Connected Development
Build Alexa Skills for Hindi - Workshop
Presentation final
Как да станем софтуерни инженери и да стартираме ИТ бизнес?
Flutter vs. MAUI - Techorama BE 2022
Flutter vs. MAUI - what should you pick and why?
Multi-platform Mobile apps with Phonegap
OpenNTF Updates 03/05/13
White paper native, web or hybrid mobile app development
Маргарита Остапчук (Microsoft Украина) «Разработка на универсальной платформе...
Абрамович Максим, "Rad studio xe4"
The Great Mobile Debate: Native vs. Hybrid App Development
Top 10 Cross-Platform Frameworks of 2025.pdf
Xamarin the good, the bad and the ugly
AI and Python: Developing a Conversational Interface using Python
AppStudio for ArcGIS: The Basics - Esri FedGIS 2016
Guest Lecture _ Python Basics _ Alexa Skill Dev _ by Shivam Dutt Sharma
Info Session GDSC USICT
Stronger than Chuck Norris: SharePoint in clouds, on earth, in subway and in ...
Mobile App Landscape for the Non-Technical
Embarcadero's Connected Development
Ad

Recently uploaded (20)

PPTX
Programs and apps: productivity, graphics, security and other tools
PDF
Spectral efficient network and resource selection model in 5G networks
PDF
A comparative analysis of optical character recognition models for extracting...
PDF
Mobile App Security Testing_ A Comprehensive Guide.pdf
PDF
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
PPTX
20250228 LYD VKU AI Blended-Learning.pptx
PDF
Network Security Unit 5.pdf for BCA BBA.
PPTX
MYSQL Presentation for SQL database connectivity
PDF
MIND Revenue Release Quarter 2 2025 Press Release
PDF
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
PPTX
ACSFv1EN-58255 AWS Academy Cloud Security Foundations.pptx
PDF
Reach Out and Touch Someone: Haptics and Empathic Computing
PDF
Assigned Numbers - 2025 - Bluetooth® Document
PDF
Building Integrated photovoltaic BIPV_UPV.pdf
PDF
The Rise and Fall of 3GPP – Time for a Sabbatical?
PDF
Machine learning based COVID-19 study performance prediction
PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
PDF
Approach and Philosophy of On baking technology
PPTX
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
PDF
Unlocking AI with Model Context Protocol (MCP)
Programs and apps: productivity, graphics, security and other tools
Spectral efficient network and resource selection model in 5G networks
A comparative analysis of optical character recognition models for extracting...
Mobile App Security Testing_ A Comprehensive Guide.pdf
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
20250228 LYD VKU AI Blended-Learning.pptx
Network Security Unit 5.pdf for BCA BBA.
MYSQL Presentation for SQL database connectivity
MIND Revenue Release Quarter 2 2025 Press Release
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
ACSFv1EN-58255 AWS Academy Cloud Security Foundations.pptx
Reach Out and Touch Someone: Haptics and Empathic Computing
Assigned Numbers - 2025 - Bluetooth® Document
Building Integrated photovoltaic BIPV_UPV.pdf
The Rise and Fall of 3GPP – Time for a Sabbatical?
Machine learning based COVID-19 study performance prediction
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
Approach and Philosophy of On baking technology
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
Unlocking AI with Model Context Protocol (MCP)
Ad

How to build voice interfaces in a professional way

  • 1. How to build voice interfaces professionally Fabian Dill, DieProduktMacher Grand Event about Chatbots and Voice Interfaces June 23rd
  • 2. Agenda When to consider voice user interfaces (VUIs) How to design great experiences for VUIs How to develop VUIs 2 1 2 3
  • 4. Conversational is the new normal MESSAGING APPS HAVE SURPASSED SOCIAL NETWORKS (Monthly active users for top 4 social networks and messaging apps) Big 4 Social Network Apps Big 4 Messaging Apps 4
  • 5. Why now? Because we can 0 5 10 15 20 25 30 2010 2011 2012 2013 2014 2015 Error rate 0 2 4 6 8 10 12 14 16 Error rate 1. Deep Learning Algorithm 5 ERROR RATE IN IMAGE RECOGNITION (ILSVRC RESULTS) ERROR RATE IN SPEECH RECOGNITION
  • 6. Two types of conversational interfaces are on the rise. 6 What kind of conversational interfaces? TEXT | Chatbots VOICE | existing platforms custom build
  • 7. 7 • Siri (Apple) • Google Assistent (Google) • Google Home (Google) • Cortana (Microsoft) • Alexa (Amazon) • Bragi Voice interfaces are not restricted to Alexa skills
  • 8. In addition to these voice interface platforms custom build solutions are possible: • Matrix Labs • Google Speech API • Alexa Services / Amazon Lex 8 You can also built your custom voice interfaces
  • 9. 9 Half of the Germans already used a voice interface
  • 10. 10 Before Alexa VUIs were tied to Smartphones
  • 11. 11 As you can see in the typical applications
  • 12. 12 In the following I will focus on Alexa skills
  • 13. OPERATORS & EXECUTORS • Commerce (fashion, flowers, food delivery) • Booking (Tickets, Travel, …) • Banking DOMAIN KNOWLEDGE EXPERTS • Customer Support • Advice and Recommendations • Domain-specific knowledge: • Sports • News • Travel • Insurance 13 Three types of Conversational UIs (Chatbots) PERSONAL ASSISTANT • Google Assistant • Facebook M • Apple Siri • Microsoft Cortana • Amazon Alexa
  • 14. 14 (SHORT) INFORMATION Types of Voice Interfaces TV program, weather, news, knowledge, status information ENTERTAINMENT Jokes, Games, … ADVICE AND RECOMMENDATION Fashion, Travel, ... TRANSACTIONAL Shopping, Booking CUSTOMER CARE?
  • 15. 15 Smart Home Physical Device Game Complex Interaction Handsfree environment (e.g. car, climbing, assembly, …) Your user experience can significantly improve When to consider Voice Interfaces?
  • 16. 16 How to design great experiences for VUIs
  • 17. 17 PRODUCT ATTRIBUTES • Goal • Target Group • Core Value PERSONALITY • Age and gender • Adjective • Name EXAMPLE DIALOG • Text • Talk • Record Our current best practices
  • 18. 18 ASK TELL REMEMBER … keeps the dialog going, listens to further user input … use it for questions, reprompts, error messages Endpoint of the dialog, shuts down the session Use it for results, stop and goodbye Creating a DynamoDB storage is just one line of code! Create dialogs
  • 20. 20 The usual Alexa skill environment 
  • 21. 21 • Use your favorite IDE (code completions, shortcuts, etc.) • Realise parallel development (>1 developer) • Extend live product with new features • Be able to rollback (in the unlikely case someone messed it up) Why professional development?
  • 24. Our road to a professional development enviroment for VUIs (Alexa) AUTOM. TESTING How to create unit and integration tests MULTI- LANGUAGE SUPPORT Separate language from functionality STAGING Create stages for interaction model and backend lambdas LOCAL DEVELOPMENT Set up a local development environment
  • 25. 25 • Develop a skill in several languages (English, German) • Separate utterances from functionality • Require the correct language file at runtime Multi-language Support Preparation
  • 26. 26 Multi-language Support Input / Output language based on user settings in Alexa App locale: 'de-DE' handler.js i18n/de-DE.js i18n/en-US.js i18n/en-UK.js locale: 'de-DE' // require specific language file const languages = require(`./locales/${lang}.js`) // get text message according to locale const getText = (alexa, message, attributes, index) => { return Language.getText( alexa.event.request.locale, message, attributes, index) } ALEXA AWS lambda
  • 27. 27 Separate language from functionality (DEMO)
  • 28. SERVERLESS INDIVIDUAL DEVELOPMENT INTEGRATION PRODUCTION INTERACTION MODEL / SKILL BACKEND / LAMBDA GIT REPO / GITFLOW static arn 28 Parallel Development, Staging and releases with Alexa manual copy manual copy manual copy FEATURE DEVELOP MASTER RELEASE
  • 29. 29 Copying interaction models = manual 5-step process 
  • 30. 30
  • 31. • Error prone  developer has to be on the correct branch • Next step: automate or use continuous integration system 31 We use Serverless to connect correct branch with referring stage
  • 32. • Install as a serverless plugin • Start a local server • Share localhost with the internet • Configure Alexa skill to use your HTTPS endpoint 32 We are working on a serverless plugin to allow for local development
  • 33. 33 Develop and test locally until commit
  • 34. • Several use cases applicable for voice interfaces • Design dialogs by prototyping them extensively • Alexa infrastructure is not yet ready for a professional setup • There are several ways to optimize your development process (we shared one of them) • We are on our journey to continuously improve this process • We are looking for input, comments, other experiences and new team members  34 Conclusion