SlideShare a Scribd company logo
Challenging Reading Comprehension on Daily Conversation:
Passage Completion on Multiparty Dialog
Kaixin Ma, Tomasz Jurczyk, Jinho D. Choi
Department of Mathematics and Computer Science, Emory University
• Test machine’s ability to comprehend daily conversation
• Given a dialog and a passage describing the dialog, the machine is asked to
predict the missing word in the passage.
Objective
• Transcripts from TV show Friends are used, which have 10 seasons, 236
episodes and 3,107 scenes in total.
• Each scene is treated as a separate dialog.
• Collect plot summaries from fan sites.
• A second set of passages are created without using dominate characters.
• In total, 1,681 dialogs are used, 4,646 passages are generated, in which
2,994 come from plot summaries, 615 are different descriptions and 1,037
are ones without using major characters.
Sample dialog from our dataset
Passages associated with this dialog
Queries generated from passages
Experimental splits of our dataset
Corpus Creation
Overall Structure of the model
Utterance level attention from similarity matrix
• A
Dialog-level attention to optimize global view
Results of all of our experiments
• To test models’ performance on longer dialogs, 3 more datasets are created
by adding utterances from consecutive scenes with the same set of queries.
• On all 4 datasets, our models converge in fewer epochs and as dialog gets
longer our models outperform Bi-LSTM by a larger margin.
Attention matrix in the dialog level attention
• Keywords in the query such as misses, take, and good time get relatively
high attention. Utterances 14, 15, 17 which give the answer also got more
attention. Showing the effectiveness of our attention mechanism.
• During error analysis, it is found that Bi-LSTM is better at capturing exact
word or phrase matches, whereas our models are better at answering
questions that require inference from multiple utterances.
Results
• We introduced a new corpus consisting multiparty dialogs and crowdsource
annotations for the task of passage completion
• We presented a neural architecture combining CNN, RNN and two levels of
attention. Models trained on this architecture outperform ones trained on
pure LSTM, especially on longer dialogs.
• To the best of our knowledge, this is the first time the passage completion
task is thoroughly examines with a challenging dataset on multiparty dialog
using deep learning models.
• For the future work, it is interesting to expand annotation for other entity
types and add an entity linker to automatically link mentions with respect to
their entities.
Conclusion
Approaches
Convolution
U1
Convolution
Uk
LSTM↓d LSTM↓d
LSTM↑d LSTM↑d
· · ·
· · ·
Q
LSTM↓q LSTM↓q
LSTM↑q LSTM↑q
· · ·
· · ·
D
· · ·
~u1 ~uk
~h #d
~h #q
~h "q
~h "d
Softmax
Q
Ui
Si
U0
i
⌦
Vi
A
DQ ⌦
⌦ ⌦
Q0
D0
P
~pc
~pr~aq ~ad
1D Convolution
Sum

More Related Content

PPTX
Word embedding
PDF
Text-based Speaker Identification on Multiparty Dialogues Using Multi-documen...
DOCX
A neural probabilistic language model
PPTX
Sequence to sequence model speech recognition
PDF
Open vocabulary problem
PDF
Monte Carlo Modelling of Confidence Intervals in Translation Quality Evaluati...
PDF
Conversational transfer learning for emotion recognition
PDF
histogram-based-emotion
Word embedding
Text-based Speaker Identification on Multiparty Dialogues Using Multi-documen...
A neural probabilistic language model
Sequence to sequence model speech recognition
Open vocabulary problem
Monte Carlo Modelling of Confidence Intervals in Translation Quality Evaluati...
Conversational transfer learning for emotion recognition
histogram-based-emotion

What's hot (19)

PPTX
Speaker recognition using MFCC
PDF
Recurrent Convolutional Neural Networks for Text Classification
PDF
seq2seq learning for end-to-end dialogue systems
PDF
LEPOR: an augmented machine translation evaluation metric - Thesis PPT
PPTX
Chatbot
PPTX
Natural Language Processing Advancements By Deep Learning - A Survey
PDF
Deep Learning in practice : Speech recognition and beyond - Meetup
PDF
cushLEPOR uses LABSE distilled knowledge to improve correlation with human tr...
PDF
Nlp research presentation
PPTX
Understanding GloVe
PPTX
#8 (Java Message Service)
PDF
Semi-supervised Prosody Modeling Using Deep Gaussian Process Latent Variable...
PPT
Bachelor's thesis defence presentation
PDF
Deep Learning for Speech Recognition - Vikrant Singh Tomar
PPTX
Natural Language Processing Advancements By Deep Learning: A Survey
PDF
Significance of Speech Intelligibility Assessors in Medium Classroom Using An...
PPT
M sc thesis_presentation_
PDF
(Deep) Neural Networks在 NLP 和 Text Mining 总结
PPTX
Text Mining for Lexicography
Speaker recognition using MFCC
Recurrent Convolutional Neural Networks for Text Classification
seq2seq learning for end-to-end dialogue systems
LEPOR: an augmented machine translation evaluation metric - Thesis PPT
Chatbot
Natural Language Processing Advancements By Deep Learning - A Survey
Deep Learning in practice : Speech recognition and beyond - Meetup
cushLEPOR uses LABSE distilled knowledge to improve correlation with human tr...
Nlp research presentation
Understanding GloVe
#8 (Java Message Service)
Semi-supervised Prosody Modeling Using Deep Gaussian Process Latent Variable...
Bachelor's thesis defence presentation
Deep Learning for Speech Recognition - Vikrant Singh Tomar
Natural Language Processing Advancements By Deep Learning: A Survey
Significance of Speech Intelligibility Assessors in Medium Classroom Using An...
M sc thesis_presentation_
(Deep) Neural Networks在 NLP 和 Text Mining 总结
Text Mining for Lexicography
Ad

Similar to Challenging Reading Comprehension on Daily Conversation: Passage Completion on Multiparty Dialog (20)

PDF
Transformers to Learn Hierarchical Contexts in Multiparty Dialogue
PDF
NEURAL DISCOURSE MODELLING OF CONVERSATIONS
PDF
NEURAL DISCOURSE MODELLING OF CONVERSATIONS
PDF
Transformers to Learn Hierarchical Contexts in Multiparty Dialogue for Span-b...
PDF
EXTENDING OUTPUT ATTENTIONS IN RECURRENT NEURAL NETWORKS FOR DIALOG GENERATION
PDF
NLP Project: Machine Comprehension Using Attention-Based LSTM Encoder-Decoder...
PDF
FriendsQA: Open-domain Question Answering on TV Show Transcripts
PPTX
NeurIPS_2018_ConvAI2_ParticipantSlides.pptx
PDF
Reading Comprehension on Multi-party Dialog
PDF
MULTILINGUAL SPEECH TO TEXT CONVERSION USING HUGGING FACE FOR DEAF PEOPLE
PDF
Graph-to-Text Generation and its Applications to Dialogue
PDF
X-RECOSA: MULTI-SCALE CONTEXT AGGREGATION FOR MULTI-TURN DIALOGUE GENERATION
PDF
End-to-end Speech Recognition with Recurrent Neural Networks (D3L6 Deep Learn...
PPTX
Conversation with-search-engines (Ren et al. 2020)
PDF
IRJET - Speech to Speech Translation using Encoder Decoder Architecture
PDF
Let Android dream electric sheep: Making emotion model for chat-bot with Pyth...
PDF
End-to-End Plural Coreference Resolution on TV Show Transcripts
PDF
SP Study1018 Paper Reading
PDF
Improving neural question generation using answer separation
PDF
Cleveree: an artificially intelligent web service for Jacob voice chatbot
Transformers to Learn Hierarchical Contexts in Multiparty Dialogue
NEURAL DISCOURSE MODELLING OF CONVERSATIONS
NEURAL DISCOURSE MODELLING OF CONVERSATIONS
Transformers to Learn Hierarchical Contexts in Multiparty Dialogue for Span-b...
EXTENDING OUTPUT ATTENTIONS IN RECURRENT NEURAL NETWORKS FOR DIALOG GENERATION
NLP Project: Machine Comprehension Using Attention-Based LSTM Encoder-Decoder...
FriendsQA: Open-domain Question Answering on TV Show Transcripts
NeurIPS_2018_ConvAI2_ParticipantSlides.pptx
Reading Comprehension on Multi-party Dialog
MULTILINGUAL SPEECH TO TEXT CONVERSION USING HUGGING FACE FOR DEAF PEOPLE
Graph-to-Text Generation and its Applications to Dialogue
X-RECOSA: MULTI-SCALE CONTEXT AGGREGATION FOR MULTI-TURN DIALOGUE GENERATION
End-to-end Speech Recognition with Recurrent Neural Networks (D3L6 Deep Learn...
Conversation with-search-engines (Ren et al. 2020)
IRJET - Speech to Speech Translation using Encoder Decoder Architecture
Let Android dream electric sheep: Making emotion model for chat-bot with Pyth...
End-to-End Plural Coreference Resolution on TV Show Transcripts
SP Study1018 Paper Reading
Improving neural question generation using answer separation
Cleveree: an artificially intelligent web service for Jacob voice chatbot
Ad

More from Jinho Choi (20)

PDF
Adaptation of Multilingual Transformer Encoder for Robust Enhanced Universal ...
PDF
Analysis of Hierarchical Multi-Content Text Classification Model on B-SHARP D...
PDF
Competence-Level Prediction and Resume & Job Description Matching Using Conte...
PDF
The Myth of Higher-Order Inference in Coreference Resolution
PDF
Noise Pollution in Hospital Readmission Prediction: Long Document Classificat...
PDF
Abstract Meaning Representation
PDF
Semantic Role Labeling
PDF
CKY Parsing
PDF
CS329 - WordNet Similarities
PDF
CS329 - Lexical Relations
PDF
Automatic Knowledge Base Expansion for Dialogue Management
PDF
Attention is All You Need for AMR Parsing
PDF
Real-time Coreference Resolution for Dialogue Understanding
PDF
Topological Sort
PDF
Tries - Put
PDF
Multi-modal Embedding Learning for Early Detection of Alzheimer's Disease
PDF
Building Widely-Interpretable Semantic Networks for Dialogue Contexts
PDF
How to make Emora talk about Sports Intelligently
PDF
Text-to-SQL with Data-Driven Templates
PDF
Resume Classification with Term Attention Embeddings
Adaptation of Multilingual Transformer Encoder for Robust Enhanced Universal ...
Analysis of Hierarchical Multi-Content Text Classification Model on B-SHARP D...
Competence-Level Prediction and Resume & Job Description Matching Using Conte...
The Myth of Higher-Order Inference in Coreference Resolution
Noise Pollution in Hospital Readmission Prediction: Long Document Classificat...
Abstract Meaning Representation
Semantic Role Labeling
CKY Parsing
CS329 - WordNet Similarities
CS329 - Lexical Relations
Automatic Knowledge Base Expansion for Dialogue Management
Attention is All You Need for AMR Parsing
Real-time Coreference Resolution for Dialogue Understanding
Topological Sort
Tries - Put
Multi-modal Embedding Learning for Early Detection of Alzheimer's Disease
Building Widely-Interpretable Semantic Networks for Dialogue Contexts
How to make Emora talk about Sports Intelligently
Text-to-SQL with Data-Driven Templates
Resume Classification with Term Attention Embeddings

Recently uploaded (20)

PDF
Spectral efficient network and resource selection model in 5G networks
PDF
cuic standard and advanced reporting.pdf
PDF
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
PDF
The Rise and Fall of 3GPP – Time for a Sabbatical?
PPTX
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
PDF
Building Integrated photovoltaic BIPV_UPV.pdf
PDF
Mobile App Security Testing_ A Comprehensive Guide.pdf
PPTX
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
PDF
Reach Out and Touch Someone: Haptics and Empathic Computing
PDF
Diabetes mellitus diagnosis method based random forest with bat algorithm
DOCX
The AUB Centre for AI in Media Proposal.docx
PPTX
MYSQL Presentation for SQL database connectivity
PDF
Assigned Numbers - 2025 - Bluetooth® Document
PDF
Review of recent advances in non-invasive hemoglobin estimation
PPTX
sap open course for s4hana steps from ECC to s4
PDF
gpt5_lecture_notes_comprehensive_20250812015547.pdf
PDF
Encapsulation_ Review paper, used for researhc scholars
PDF
Unlocking AI with Model Context Protocol (MCP)
PDF
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
PPTX
A Presentation on Artificial Intelligence
Spectral efficient network and resource selection model in 5G networks
cuic standard and advanced reporting.pdf
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
The Rise and Fall of 3GPP – Time for a Sabbatical?
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
Building Integrated photovoltaic BIPV_UPV.pdf
Mobile App Security Testing_ A Comprehensive Guide.pdf
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
Reach Out and Touch Someone: Haptics and Empathic Computing
Diabetes mellitus diagnosis method based random forest with bat algorithm
The AUB Centre for AI in Media Proposal.docx
MYSQL Presentation for SQL database connectivity
Assigned Numbers - 2025 - Bluetooth® Document
Review of recent advances in non-invasive hemoglobin estimation
sap open course for s4hana steps from ECC to s4
gpt5_lecture_notes_comprehensive_20250812015547.pdf
Encapsulation_ Review paper, used for researhc scholars
Unlocking AI with Model Context Protocol (MCP)
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
A Presentation on Artificial Intelligence

Challenging Reading Comprehension on Daily Conversation: Passage Completion on Multiparty Dialog

  • 1. Challenging Reading Comprehension on Daily Conversation: Passage Completion on Multiparty Dialog Kaixin Ma, Tomasz Jurczyk, Jinho D. Choi Department of Mathematics and Computer Science, Emory University • Test machine’s ability to comprehend daily conversation • Given a dialog and a passage describing the dialog, the machine is asked to predict the missing word in the passage. Objective • Transcripts from TV show Friends are used, which have 10 seasons, 236 episodes and 3,107 scenes in total. • Each scene is treated as a separate dialog. • Collect plot summaries from fan sites. • A second set of passages are created without using dominate characters. • In total, 1,681 dialogs are used, 4,646 passages are generated, in which 2,994 come from plot summaries, 615 are different descriptions and 1,037 are ones without using major characters. Sample dialog from our dataset Passages associated with this dialog Queries generated from passages Experimental splits of our dataset Corpus Creation Overall Structure of the model Utterance level attention from similarity matrix • A Dialog-level attention to optimize global view Results of all of our experiments • To test models’ performance on longer dialogs, 3 more datasets are created by adding utterances from consecutive scenes with the same set of queries. • On all 4 datasets, our models converge in fewer epochs and as dialog gets longer our models outperform Bi-LSTM by a larger margin. Attention matrix in the dialog level attention • Keywords in the query such as misses, take, and good time get relatively high attention. Utterances 14, 15, 17 which give the answer also got more attention. Showing the effectiveness of our attention mechanism. • During error analysis, it is found that Bi-LSTM is better at capturing exact word or phrase matches, whereas our models are better at answering questions that require inference from multiple utterances. Results • We introduced a new corpus consisting multiparty dialogs and crowdsource annotations for the task of passage completion • We presented a neural architecture combining CNN, RNN and two levels of attention. Models trained on this architecture outperform ones trained on pure LSTM, especially on longer dialogs. • To the best of our knowledge, this is the first time the passage completion task is thoroughly examines with a challenging dataset on multiparty dialog using deep learning models. • For the future work, it is interesting to expand annotation for other entity types and add an entity linker to automatically link mentions with respect to their entities. Conclusion Approaches Convolution U1 Convolution Uk LSTM↓d LSTM↓d LSTM↑d LSTM↑d · · · · · · Q LSTM↓q LSTM↓q LSTM↑q LSTM↑q · · · · · · D · · · ~u1 ~uk ~h #d ~h #q ~h "q ~h "d Softmax Q Ui Si U0 i ⌦ Vi A DQ ⌦ ⌦ ⌦ Q0 D0 P ~pc ~pr~aq ~ad 1D Convolution Sum