SlideShare a Scribd company logo
Empowering Conversational Agents with Situated Natural Language
Communication Skills by Exploiting Deep Reinforcement Learning
Techniques
Alessandro Suglia
Heriot-Watt University, Edinburgh Centre for
Robotics, Edinburgh, Scotland, UK
D o c t o ra l C o n s o r t i u m A I * I A 2 0 1 7 , B a r i
Supervisor: Prof. Oliver Lemon
Director of the Interaction Lab,
Heriot-Watt University
Dynamic Temporal Contextualized
The answer provided
by one of the
interlocutor affects
the state of the
dialogue
What is dialogue?
The interpretation of
each utterance
depends incredibly on
different environment
conditions
Different topics are
discussed in a given
conversation without
a specific order
[1]: Rieser, V., and O. Lemon. Reinforcement learning for adaptive dialogue systems: a data-driven methodology
for dialogue management and natural language generation. Springer Science & Business Media, 2011.
Goal-directed learning from interaction with an
environment by learning to map situations to actions
by exploiting Deep Neural Networks models
Deep Reinforcement Learning
Deep Reinforcement Learning
Successfully applied to different games such as Go [2] or Poker [3]
Incrementally abstract representations learned from training data
Learning to behave by interacting with the environment
Learning in an incremental and online fashion
Inability to generalise due to the domain specific design
Handsome quantity of data required to effectively learn a policy
[2]: Silver, D., Huang, A., Maddison, C.J., Guez, A., Sifre, L., Van Den Driessche, G., Schrittwieser, J., Antonoglou, I.,
Panneershelvam, V., Lanctot, M., et al.: Mastering the game of Go with deep neural networks and tree search. Nature
529(7587) (2016) 484–489
[3]: Moravčík, Matej, et al. "Deepstack: Expert-level artificial intelligence in no-limit poker." arXiv preprint arXiv:
1701.01724 (2017).
RQ1 : Is it possible to train a conversational agent to interact
within real world contexts consisting of embodied agents and
situated objects?
RQ2: Is it possible to train a conversational agent to generate
accurate contextualized responses for the user?
RQ3: Is the system able to adapt seemly to different domains by
exploiting what it has previously learned?
Natural Language
interaction
Multi-modal
interaction
Agent
embodiment
Thanks!
You can find me at the poster session
@ale_suglia
as247@hw.ac.uk
Any questions?
References
[1]: Rieser, V., and O. Lemon. Reinforcement learning for adaptive dialogue systems: a data-driven
methodology for dialogue management and natural language generation. Springer Science &
Business Media, 2011.
[2]: Silver, D., Huang, A., Maddison, C.J., Guez, A., Sifre, L., Van Den Driessche, G., Schrittwieser, J.,
Antonoglou, I., Panneershelvam, V., Lanctot, M., et al.: Mastering the game of Go with deep neural
networks and tree search. Nature 529(7587) (2016) 484–489
[3]: Moravčík, Matej, et al. "Deepstack: Expert-level artificial intelligence in no-limit poker." arXiv
preprint arXiv:1701.01724 (2017).

More Related Content

PPTX
Google Duplex AI
PDF
Deep learning 1
PPTX
Google Duplex
PPTX
Group duplex
PPTX
An Introduction to Recent Advances in the Field of NLP
PDF
Li Deng at AI Frontiers: Three Generations of Spoken Dialogue Systems (Bots)
PDF
Harm van Seijen, Research Scientist, Maluuba at MLconf SF 2016
PDF
Deep Reinforcement Learning and Its Applications
Google Duplex AI
Deep learning 1
Google Duplex
Group duplex
An Introduction to Recent Advances in the Field of NLP
Li Deng at AI Frontiers: Three Generations of Spoken Dialogue Systems (Bots)
Harm van Seijen, Research Scientist, Maluuba at MLconf SF 2016
Deep Reinforcement Learning and Its Applications

Similar to Empowering Conversational Agents with Situated Natural Language Communication Skills by Exploiting Deep Reinforcement Learning Techniques (20)

PDF
An introduction to deep reinforcement learning
PDF
Dilek Hakkani-Tur at AI Frontiers: Conversational machines: Deep Learning for...
PDF
Rasa Developer Summit - Bing Liu - Interactive Learning of Task-Oriented Dial...
PDF
PDF
#1 Berlin Students in AI, Machine Learning & NLP presentation
PDF
孫民/從電腦視覺看人工智慧 : 下一件大事
PDF
Introduction to reinforcement learning
PPTX
pptvuvubhbhaszvgsgsvxhbughbghbgbhhhhhhh.pptx
PDF
Dyslexic Reading Assistance with Language Processing Algorithms
PDF
DYSLEXICREADING ASSISTANCE WITH LANGUAGEPROCESSING ALGORITHMS
PDF
2017 Tutorial - Deep Learning for Dialogue Systems
PDF
An introduction to reinforcement learning
PDF
Deep Learning for NLP (without Magic) - Richard Socher and Christopher Manning
PPTX
Reinforcement learning
PDF
Horizon: Deep Reinforcement Learning at Scale
PPTX
Adversarial learning for neural dialogue generation
PPTX
What Can RL do.pptx
PPTX
Introduction: Asynchronous Methods for Deep Reinforcement Learning
PDF
Introduction to Deep Learning Lecture 20 Large Language Models
PDF
MILA DL & RL summer school highlights
An introduction to deep reinforcement learning
Dilek Hakkani-Tur at AI Frontiers: Conversational machines: Deep Learning for...
Rasa Developer Summit - Bing Liu - Interactive Learning of Task-Oriented Dial...
#1 Berlin Students in AI, Machine Learning & NLP presentation
孫民/從電腦視覺看人工智慧 : 下一件大事
Introduction to reinforcement learning
pptvuvubhbhaszvgsgsvxhbughbghbgbhhhhhhh.pptx
Dyslexic Reading Assistance with Language Processing Algorithms
DYSLEXICREADING ASSISTANCE WITH LANGUAGEPROCESSING ALGORITHMS
2017 Tutorial - Deep Learning for Dialogue Systems
An introduction to reinforcement learning
Deep Learning for NLP (without Magic) - Richard Socher and Christopher Manning
Reinforcement learning
Horizon: Deep Reinforcement Learning at Scale
Adversarial learning for neural dialogue generation
What Can RL do.pptx
Introduction: Asynchronous Methods for Deep Reinforcement Learning
Introduction to Deep Learning Lecture 20 Large Language Models
MILA DL & RL summer school highlights
Ad

Recently uploaded (20)

PDF
Introduction to the R Programming Language
PPTX
IBA_Chapter_11_Slides_Final_Accessible.pptx
PPTX
The THESIS FINAL-DEFENSE-PRESENTATION.pptx
PPT
DATA COLLECTION METHODS-ppt for nursing research
PPTX
climate analysis of Dhaka ,Banglades.pptx
PDF
Mega Projects Data Mega Projects Data
PPTX
mbdjdhjjodule 5-1 rhfhhfjtjjhafbrhfnfbbfnb
PPTX
Market Analysis -202507- Wind-Solar+Hybrid+Street+Lights+for+the+North+Amer...
PPTX
SAP 2 completion done . PRESENTATION.pptx
PDF
[EN] Industrial Machine Downtime Prediction
PPTX
Leprosy and NLEP programme community medicine
PDF
Oracle OFSAA_ The Complete Guide to Transforming Financial Risk Management an...
PDF
Capcut Pro Crack For PC Latest Version {Fully Unlocked 2025}
PPTX
Pilar Kemerdekaan dan Identi Bangsa.pptx
PDF
Data Engineering Interview Questions & Answers Batch Processing (Spark, Hadoo...
PPTX
Managing Community Partner Relationships
PPTX
modul_python (1).pptx for professional and student
PPTX
Microsoft-Fabric-Unifying-Analytics-for-the-Modern-Enterprise Solution.pptx
PPTX
Introduction-to-Cloud-ComputingFinal.pptx
Introduction to the R Programming Language
IBA_Chapter_11_Slides_Final_Accessible.pptx
The THESIS FINAL-DEFENSE-PRESENTATION.pptx
DATA COLLECTION METHODS-ppt for nursing research
climate analysis of Dhaka ,Banglades.pptx
Mega Projects Data Mega Projects Data
mbdjdhjjodule 5-1 rhfhhfjtjjhafbrhfnfbbfnb
Market Analysis -202507- Wind-Solar+Hybrid+Street+Lights+for+the+North+Amer...
SAP 2 completion done . PRESENTATION.pptx
[EN] Industrial Machine Downtime Prediction
Leprosy and NLEP programme community medicine
Oracle OFSAA_ The Complete Guide to Transforming Financial Risk Management an...
Capcut Pro Crack For PC Latest Version {Fully Unlocked 2025}
Pilar Kemerdekaan dan Identi Bangsa.pptx
Data Engineering Interview Questions & Answers Batch Processing (Spark, Hadoo...
Managing Community Partner Relationships
modul_python (1).pptx for professional and student
Microsoft-Fabric-Unifying-Analytics-for-the-Modern-Enterprise Solution.pptx
Introduction-to-Cloud-ComputingFinal.pptx
Ad

Empowering Conversational Agents with Situated Natural Language Communication Skills by Exploiting Deep Reinforcement Learning Techniques

  • 1. Empowering Conversational Agents with Situated Natural Language Communication Skills by Exploiting Deep Reinforcement Learning Techniques Alessandro Suglia Heriot-Watt University, Edinburgh Centre for Robotics, Edinburgh, Scotland, UK D o c t o ra l C o n s o r t i u m A I * I A 2 0 1 7 , B a r i Supervisor: Prof. Oliver Lemon Director of the Interaction Lab, Heriot-Watt University
  • 2. Dynamic Temporal Contextualized The answer provided by one of the interlocutor affects the state of the dialogue What is dialogue? The interpretation of each utterance depends incredibly on different environment conditions Different topics are discussed in a given conversation without a specific order [1]: Rieser, V., and O. Lemon. Reinforcement learning for adaptive dialogue systems: a data-driven methodology for dialogue management and natural language generation. Springer Science & Business Media, 2011.
  • 3. Goal-directed learning from interaction with an environment by learning to map situations to actions by exploiting Deep Neural Networks models Deep Reinforcement Learning
  • 4. Deep Reinforcement Learning Successfully applied to different games such as Go [2] or Poker [3] Incrementally abstract representations learned from training data Learning to behave by interacting with the environment Learning in an incremental and online fashion Inability to generalise due to the domain specific design Handsome quantity of data required to effectively learn a policy [2]: Silver, D., Huang, A., Maddison, C.J., Guez, A., Sifre, L., Van Den Driessche, G., Schrittwieser, J., Antonoglou, I., Panneershelvam, V., Lanctot, M., et al.: Mastering the game of Go with deep neural networks and tree search. Nature 529(7587) (2016) 484–489 [3]: Moravčík, Matej, et al. "Deepstack: Expert-level artificial intelligence in no-limit poker." arXiv preprint arXiv: 1701.01724 (2017).
  • 5. RQ1 : Is it possible to train a conversational agent to interact within real world contexts consisting of embodied agents and situated objects?
  • 6. RQ2: Is it possible to train a conversational agent to generate accurate contextualized responses for the user?
  • 7. RQ3: Is the system able to adapt seemly to different domains by exploiting what it has previously learned?
  • 9. Thanks! You can find me at the poster session @ale_suglia as247@hw.ac.uk Any questions?
  • 10. References [1]: Rieser, V., and O. Lemon. Reinforcement learning for adaptive dialogue systems: a data-driven methodology for dialogue management and natural language generation. Springer Science & Business Media, 2011. [2]: Silver, D., Huang, A., Maddison, C.J., Guez, A., Sifre, L., Van Den Driessche, G., Schrittwieser, J., Antonoglou, I., Panneershelvam, V., Lanctot, M., et al.: Mastering the game of Go with deep neural networks and tree search. Nature 529(7587) (2016) 484–489 [3]: Moravčík, Matej, et al. "Deepstack: Expert-level artificial intelligence in no-limit poker." arXiv preprint arXiv:1701.01724 (2017).