SlideShare a Scribd company logo
2
Most read
4
Most read
8
Most read
© 2019 Nagarro – All rights reserved 1© 2019 Nagarro – All rights reserved
Chatbot Testing:
Challenges and approach
for testing Chatbot
applications
Author: Rajni Singh & Neeraj Jain
© 2019 Nagarro – All rights reserved 2
Agenda
Context
Understanding Chatbots and real time examples of uncertainities
Challenges
Challenges in testing chatbot
Testing Approach
Scope, Approach & Tools and Use case
© 2019 Nagarro – All rights reserved 3© 2019 Nagarro – All rights reserved
Understanding Chatbots, the evolution
and its challenges
Context
01
© 2019 Nagarro – All rights reserved 4
How it works?
Chatbot
application
(Virtual assistant)
Cognitive
Services
Business
applications and
Services
http(s)
Integration
User input in
plain language:
• Text
• Speech
Channels allows user
to connect to
chatbot application
Chatbot is a “computer program” which
• Maintains a sensible and contextual conversation with
a human
• Operates on cognitive services such as NLP is used to
make it more intelligent.
• Processes business logic and prepare response to user
input
• Can take actions
© 2019 Nagarro – All rights reserved 5
Types of Chatbot
Chabot can be broadly categorized in two categories.
AI-BASED
RULE BASED
Built with artificial intelligence technique to
understand the human language together with
sentiments that does not rely on scripted
conversation. It accepts free form of input and
responds based on the existing domain knowledge
with self-learning improvement with maturity.
Chatbot behavior is based on a set of rules,
flows, and triggers to respond to very specific
commands being asked by the user. The
conversation is usually scripted and chatbot
responds to each question with a predefined
rule and each step is picked with an explicit
option. A simple example might be a chatbot
that tells you the stock price on a given date.
© 2019 Nagarro – All rights reserved 6
Uncertainties: Why chatbot needs to be tested?
Facebook Inc.’s chatbots hit a70% failure
rate and zero emotional intelligence
“Tay's responses have turned the bot
into a joke, but they raise serious
questions”
In 2016, users were finding it
impossible to unsubscribe WSJ
Ticketmaster Breach Traces to
Embedded Chatbot Software
© 2019 Nagarro – All rights reserved 7© 2019 Nagarro – All rights reserved
Challenges
02
What could go wrong in
Chatbot testing ?
© 2019 Nagarro – All rights reserved 8
03
0201
06
04
05
Dependency on
humungous amount of
data
There is no defined input /
output
Continuous learning from
past behavior
Difficult to predict all
scenarios
AI applications are non-
deterministic and
probabilistic
Continuous monitoring
Shift from traditional testing to AI testing
© 2019 Nagarro – All rights reserved 9
Chatbot testing challenges
Multichannel user experience
Domain specific validations
Uncertainty of user conversation
Lack of robust testing strategy
Chatbot intelligence not being questioned
Chatbot security being compromised
© 2019 Nagarro – All rights reserved 10© 2019 Nagarro – All rights reserved
Scope, parameters, and
testing tools
Approach
03
© 2019 Nagarro – All rights reserved 11
Chatbot testing scope
API
Regression
Specialized testing
Context remembrance
and switching
On Boarding &
Personality
Conversation flow
Domain Specific
Crowd
Performance &
Security
NLP & Cognitive
Services
EnhancedUserExperience
© 2019 Nagarro – All rights reserved 12
Parameters for testing
Navigation
Error Management
Speed Accuracy
Onboarding
Intelligence
PersonalitySecurity
© 2019 Nagarro – All rights reserved 13© 2019 Nagarro – All rights reserved
Meet Ginger
Our in-house customized
personal assistant
© 2019 Nagarro – All rights reserved 14
 Any question
about Nagarro
Ask Ginger:
Ginger
 Actionable insight
that is customized
to your role
Ginger bytes:
 Notification about
important event,
conferences,
announcement
Ginger says:
© 2019 Nagarro – All rights reserved 15
Ask Ginger
© 2019 Nagarro – All rights reserved 16© 2019 Nagarro – All rights reserved
Beyond traditional
testing for AI-infused
Chatbots
© 2019 Nagarro – All rights reserved 17
End-to-end automation testing framework
Cloud/On-Premise Solution
Conversational
Management
API
Admin
Linguistic Framework
Intent
Entity
Synonyms
Action Binding
NLP
Model
Request
Response
Context
Session
User
Configuration Security Training
Integration
LiveConnectors
EnterpriseServices
Training Set
Data import-export
© 2019 Nagarro – All rights reserved 18
Goal completion rate
Self-service rate
AI and ML rates
User retention rate
Fallback rate
Measure the performance of the bot
© 2019 Nagarro – All rights reserved 19
Data Model Manager
RTMBAF
Performance Monitoring
Performance Tuning
Performance MonitoringTraining
Re-Train
Measure Measure
Enterprise
Chatbot Model
Read Client Interaction
Watson
LUIS Dialog flow RASA
others
Client Using Chatbot
Client Questions
NLP-based model life-cycle of the Chatbot
© 2019 Nagarro – All rights reserved 20
Advanced security testing
1
2
3
4
End-to-end encryption
Two-factor authentication
User authentication
Intent authorization
5 Channel authentication
6 Compliance validation
7 Authentication timeout
8 Self-destructing messages
© 2019 Nagarro – All rights reserved 21
Domain-specific testing
This testing should start with identifying the
domain of the chatbot and then identify the
keywords of that domain.
A Chatbot built for IT should treat the word
“Selenium” as an automation tool while a chatbot
built for pharma industry should treat the
“Selenium” as chemical.
© 2019 Nagarro – All rights reserved 22
Recommendations
Monitor, measure
and update
(NLP, conversational
etc.)
Automate end to
end
(text, voice, etc.)
Non-functional
testing
Domain-specific
testing
© 2019 Nagarro – All rights reserved 23
HELPING YOU CREATE AN
INTELLIGENT ENTERPRISE
Thank You

More Related Content

PPTX
PPTX
Chatbot Technology
PDF
How do Chatbots Work? A Guide to Chatbot Architecture
PDF
BOTS TESTING BOTS: From manual to automated testing for conversational AI
PDF
ChatGPT - AI.pdf
PPTX
AI Agent and Chatbot Trends For Enterprises
PDF
ChatGPT Use- Cases
PPTX
ChatGPT.pptx
Chatbot Technology
How do Chatbots Work? A Guide to Chatbot Architecture
BOTS TESTING BOTS: From manual to automated testing for conversational AI
ChatGPT - AI.pdf
AI Agent and Chatbot Trends For Enterprises
ChatGPT Use- Cases
ChatGPT.pptx

What's hot (20)

PDF
Robotic Process Automation (RPA)
PPTX
How Chatbots Empower Healthcare Ecosystem?
PDF
Contextual chatbots and voicebot solutions for your business
PDF
ChatGPT ChatBot
PPTX
Artificial Intelligence: Predictions for 2017
DOC
Internship report writing_criteria_for_gmd
PPTX
Robotic Process Automation Webinar Slides
DOCX
Synopsis for student interaction portal
PDF
Mother of Language`s Langchain
PPT
Chat bots and AI
PDF
Generative Models and ChatGPT
PDF
Generative AI at the edge.pdf
PDF
Deep Learning - The Past, Present and Future of Artificial Intelligence
PPSX
Chatbot
PPTX
AI Based Personal Virtual Assistant
PDF
The Chatbots Are Coming: A Guide to Chatbots, AI and Conversational Interfaces
 
PPTX
HyperAutomation (3).pptx
PDF
OpenAI’s GPT 3 Language Model - guest Steve Omohundro
PPTX
Artificially Intelligent chatbot Implementation
PPTX
Google BARD v/s ChatGPT _ A review
Robotic Process Automation (RPA)
How Chatbots Empower Healthcare Ecosystem?
Contextual chatbots and voicebot solutions for your business
ChatGPT ChatBot
Artificial Intelligence: Predictions for 2017
Internship report writing_criteria_for_gmd
Robotic Process Automation Webinar Slides
Synopsis for student interaction portal
Mother of Language`s Langchain
Chat bots and AI
Generative Models and ChatGPT
Generative AI at the edge.pdf
Deep Learning - The Past, Present and Future of Artificial Intelligence
Chatbot
AI Based Personal Virtual Assistant
The Chatbots Are Coming: A Guide to Chatbots, AI and Conversational Interfaces
 
HyperAutomation (3).pptx
OpenAI’s GPT 3 Language Model - guest Steve Omohundro
Artificially Intelligent chatbot Implementation
Google BARD v/s ChatGPT _ A review
Ad

Similar to Chatbot testing (20)

PPTX
Test strategy for Conversational AI
PPTX
What is Chatbot Testing? How to Perform Chatbot Testing?
PDF
Test your bot_x_conf
PDF
Cocolevio AI Chatbot 9.20.19
PDF
The rise of Chatbots and Virtual Assistants in Customer Experience
PDF
AI Chatbot Development – A Comprehensive Guide  .pdf
PPTX
ATAGTR2017 ChatBot
PDF
Mutation testing for DSLs - The case of task-oriented chatbots
PDF
antraaa-181127090143.pdf
PPTX
Chatbot Abstract
PDF
HealthCare ChatBot Using Machine Learning
PDF
deloitte-nl-chatbots-moving-beyond-the-hype.pdf
PDF
deloitte-nl-chatbots-moving-beyond-the-hype.pdf
PPTX
ISTA 2017: Practical Chatbots - Technology Overview with Real-Life Stories
PDF
Enhancing The Capability of Chatbots
PDF
How to build an AI-powered chatbot.pdf
PDF
How to build an AI-powered chatbot.pdf
PDF
How to build an AI-powered chatbot.pdf
PDF
A Review on the Determinants of a suitable Chatbot Framework- Empirical evide...
PDF
IRJET- Interactive Interview Chatbot
Test strategy for Conversational AI
What is Chatbot Testing? How to Perform Chatbot Testing?
Test your bot_x_conf
Cocolevio AI Chatbot 9.20.19
The rise of Chatbots and Virtual Assistants in Customer Experience
AI Chatbot Development – A Comprehensive Guide  .pdf
ATAGTR2017 ChatBot
Mutation testing for DSLs - The case of task-oriented chatbots
antraaa-181127090143.pdf
Chatbot Abstract
HealthCare ChatBot Using Machine Learning
deloitte-nl-chatbots-moving-beyond-the-hype.pdf
deloitte-nl-chatbots-moving-beyond-the-hype.pdf
ISTA 2017: Practical Chatbots - Technology Overview with Real-Life Stories
Enhancing The Capability of Chatbots
How to build an AI-powered chatbot.pdf
How to build an AI-powered chatbot.pdf
How to build an AI-powered chatbot.pdf
A Review on the Determinants of a suitable Chatbot Framework- Empirical evide...
IRJET- Interactive Interview Chatbot
Ad

More from Nagarro (20)

PPTX
How Generative AI is shaping a sustainable future in Energy & Utilities
PDF
Testing the Migration of Monolithic Applications to Microservices on the Cloud
PDF
Intelligent automation beyond test execution
PDF
Flutter: An open-source UI software development kit
PDF
Remote Collaboration: Working Canvas
PDF
Remote Collaboration: Working and Leading from Home
PDF
10 Gründe, warum Ihre Testautomatisierung zum Scheitern verurteilt ist
PDF
Integrating AI in software quality in absence of a well-defined requirements
PDF
Intelligent Digital Mesh Testing
PDF
Software Quality without Testing
PPTX
Advanced Test Automation: Agile Model
PDF
Testing @ digital speed
PDF
How to get started? Digital Transformation: A Down-to-Earth Approach
PDF
Connecting the dots – Industrial IoT is more than just sensor deployment
PDF
A walk through the AI Use Cases in the Connected Enterprise
PDF
Cloud-enabled analytics
PDF
Why Cloud Computing is mandatory for Connected Enterprise
PDF
Testing Microservices
PDF
Are Your Mobile Apps Secure? (Part I)
PDF
Mobile Apps and Security Attacks: An Introduction
How Generative AI is shaping a sustainable future in Energy & Utilities
Testing the Migration of Monolithic Applications to Microservices on the Cloud
Intelligent automation beyond test execution
Flutter: An open-source UI software development kit
Remote Collaboration: Working Canvas
Remote Collaboration: Working and Leading from Home
10 Gründe, warum Ihre Testautomatisierung zum Scheitern verurteilt ist
Integrating AI in software quality in absence of a well-defined requirements
Intelligent Digital Mesh Testing
Software Quality without Testing
Advanced Test Automation: Agile Model
Testing @ digital speed
How to get started? Digital Transformation: A Down-to-Earth Approach
Connecting the dots – Industrial IoT is more than just sensor deployment
A walk through the AI Use Cases in the Connected Enterprise
Cloud-enabled analytics
Why Cloud Computing is mandatory for Connected Enterprise
Testing Microservices
Are Your Mobile Apps Secure? (Part I)
Mobile Apps and Security Attacks: An Introduction

Recently uploaded (20)

PDF
Architecture types and enterprise applications.pdf
PDF
A novel scalable deep ensemble learning framework for big data classification...
PPTX
The various Industrial Revolutions .pptx
PDF
Enhancing emotion recognition model for a student engagement use case through...
PDF
ENT215_Completing-a-large-scale-migration-and-modernization-with-AWS.pdf
PDF
Zenith AI: Advanced Artificial Intelligence
PDF
2021 HotChips TSMC Packaging Technologies for Chiplets and 3D_0819 publish_pu...
PDF
From MVP to Full-Scale Product A Startup’s Software Journey.pdf
PDF
DASA ADMISSION 2024_FirstRound_FirstRank_LastRank.pdf
PDF
1 - Historical Antecedents, Social Consideration.pdf
PDF
NewMind AI Weekly Chronicles - August'25-Week II
PDF
Video forgery: An extensive analysis of inter-and intra-frame manipulation al...
PDF
August Patch Tuesday
PDF
How ambidextrous entrepreneurial leaders react to the artificial intelligence...
PPTX
Modernising the Digital Integration Hub
PPTX
MicrosoftCybserSecurityReferenceArchitecture-April-2025.pptx
PDF
NewMind AI Weekly Chronicles – August ’25 Week III
PPTX
Tartificialntelligence_presentation.pptx
PDF
Assigned Numbers - 2025 - Bluetooth® Document
PPTX
Programs and apps: productivity, graphics, security and other tools
Architecture types and enterprise applications.pdf
A novel scalable deep ensemble learning framework for big data classification...
The various Industrial Revolutions .pptx
Enhancing emotion recognition model for a student engagement use case through...
ENT215_Completing-a-large-scale-migration-and-modernization-with-AWS.pdf
Zenith AI: Advanced Artificial Intelligence
2021 HotChips TSMC Packaging Technologies for Chiplets and 3D_0819 publish_pu...
From MVP to Full-Scale Product A Startup’s Software Journey.pdf
DASA ADMISSION 2024_FirstRound_FirstRank_LastRank.pdf
1 - Historical Antecedents, Social Consideration.pdf
NewMind AI Weekly Chronicles - August'25-Week II
Video forgery: An extensive analysis of inter-and intra-frame manipulation al...
August Patch Tuesday
How ambidextrous entrepreneurial leaders react to the artificial intelligence...
Modernising the Digital Integration Hub
MicrosoftCybserSecurityReferenceArchitecture-April-2025.pptx
NewMind AI Weekly Chronicles – August ’25 Week III
Tartificialntelligence_presentation.pptx
Assigned Numbers - 2025 - Bluetooth® Document
Programs and apps: productivity, graphics, security and other tools

Chatbot testing

  • 1. © 2019 Nagarro – All rights reserved 1© 2019 Nagarro – All rights reserved Chatbot Testing: Challenges and approach for testing Chatbot applications Author: Rajni Singh & Neeraj Jain
  • 2. © 2019 Nagarro – All rights reserved 2 Agenda Context Understanding Chatbots and real time examples of uncertainities Challenges Challenges in testing chatbot Testing Approach Scope, Approach & Tools and Use case
  • 3. © 2019 Nagarro – All rights reserved 3© 2019 Nagarro – All rights reserved Understanding Chatbots, the evolution and its challenges Context 01
  • 4. © 2019 Nagarro – All rights reserved 4 How it works? Chatbot application (Virtual assistant) Cognitive Services Business applications and Services http(s) Integration User input in plain language: • Text • Speech Channels allows user to connect to chatbot application Chatbot is a “computer program” which • Maintains a sensible and contextual conversation with a human • Operates on cognitive services such as NLP is used to make it more intelligent. • Processes business logic and prepare response to user input • Can take actions
  • 5. © 2019 Nagarro – All rights reserved 5 Types of Chatbot Chabot can be broadly categorized in two categories. AI-BASED RULE BASED Built with artificial intelligence technique to understand the human language together with sentiments that does not rely on scripted conversation. It accepts free form of input and responds based on the existing domain knowledge with self-learning improvement with maturity. Chatbot behavior is based on a set of rules, flows, and triggers to respond to very specific commands being asked by the user. The conversation is usually scripted and chatbot responds to each question with a predefined rule and each step is picked with an explicit option. A simple example might be a chatbot that tells you the stock price on a given date.
  • 6. © 2019 Nagarro – All rights reserved 6 Uncertainties: Why chatbot needs to be tested? Facebook Inc.’s chatbots hit a70% failure rate and zero emotional intelligence “Tay's responses have turned the bot into a joke, but they raise serious questions” In 2016, users were finding it impossible to unsubscribe WSJ Ticketmaster Breach Traces to Embedded Chatbot Software
  • 7. © 2019 Nagarro – All rights reserved 7© 2019 Nagarro – All rights reserved Challenges 02 What could go wrong in Chatbot testing ?
  • 8. © 2019 Nagarro – All rights reserved 8 03 0201 06 04 05 Dependency on humungous amount of data There is no defined input / output Continuous learning from past behavior Difficult to predict all scenarios AI applications are non- deterministic and probabilistic Continuous monitoring Shift from traditional testing to AI testing
  • 9. © 2019 Nagarro – All rights reserved 9 Chatbot testing challenges Multichannel user experience Domain specific validations Uncertainty of user conversation Lack of robust testing strategy Chatbot intelligence not being questioned Chatbot security being compromised
  • 10. © 2019 Nagarro – All rights reserved 10© 2019 Nagarro – All rights reserved Scope, parameters, and testing tools Approach 03
  • 11. © 2019 Nagarro – All rights reserved 11 Chatbot testing scope API Regression Specialized testing Context remembrance and switching On Boarding & Personality Conversation flow Domain Specific Crowd Performance & Security NLP & Cognitive Services EnhancedUserExperience
  • 12. © 2019 Nagarro – All rights reserved 12 Parameters for testing Navigation Error Management Speed Accuracy Onboarding Intelligence PersonalitySecurity
  • 13. © 2019 Nagarro – All rights reserved 13© 2019 Nagarro – All rights reserved Meet Ginger Our in-house customized personal assistant
  • 14. © 2019 Nagarro – All rights reserved 14  Any question about Nagarro Ask Ginger: Ginger  Actionable insight that is customized to your role Ginger bytes:  Notification about important event, conferences, announcement Ginger says:
  • 15. © 2019 Nagarro – All rights reserved 15 Ask Ginger
  • 16. © 2019 Nagarro – All rights reserved 16© 2019 Nagarro – All rights reserved Beyond traditional testing for AI-infused Chatbots
  • 17. © 2019 Nagarro – All rights reserved 17 End-to-end automation testing framework Cloud/On-Premise Solution Conversational Management API Admin Linguistic Framework Intent Entity Synonyms Action Binding NLP Model Request Response Context Session User Configuration Security Training Integration LiveConnectors EnterpriseServices Training Set Data import-export
  • 18. © 2019 Nagarro – All rights reserved 18 Goal completion rate Self-service rate AI and ML rates User retention rate Fallback rate Measure the performance of the bot
  • 19. © 2019 Nagarro – All rights reserved 19 Data Model Manager RTMBAF Performance Monitoring Performance Tuning Performance MonitoringTraining Re-Train Measure Measure Enterprise Chatbot Model Read Client Interaction Watson LUIS Dialog flow RASA others Client Using Chatbot Client Questions NLP-based model life-cycle of the Chatbot
  • 20. © 2019 Nagarro – All rights reserved 20 Advanced security testing 1 2 3 4 End-to-end encryption Two-factor authentication User authentication Intent authorization 5 Channel authentication 6 Compliance validation 7 Authentication timeout 8 Self-destructing messages
  • 21. © 2019 Nagarro – All rights reserved 21 Domain-specific testing This testing should start with identifying the domain of the chatbot and then identify the keywords of that domain. A Chatbot built for IT should treat the word “Selenium” as an automation tool while a chatbot built for pharma industry should treat the “Selenium” as chemical.
  • 22. © 2019 Nagarro – All rights reserved 22 Recommendations Monitor, measure and update (NLP, conversational etc.) Automate end to end (text, voice, etc.) Non-functional testing Domain-specific testing
  • 23. © 2019 Nagarro – All rights reserved 23 HELPING YOU CREATE AN INTELLIGENT ENTERPRISE Thank You