SlideShare a Scribd company logo
Classifying Non-Referential It for
Question Answer Pairs
Timothy Lee, Alex Lutz, and Jinho D. Choi

Department of Mathematics and Computer Science, Emory University
• Classifying non-referential it is an important task in
coreference resolution
• But no such attempts in this classification has been done
for question answer pairs
• The style of English used in question answer documents
differs greatly from standard documents such as the Wall
Street Journal
• Our task is to classify non-referential it for question
answer pairs and introduce the dataset
• Initially Evans classified it into seven categories which
effectively captures all types of it
• We propose a more efficient set of rules to classify it for
coreference specifically for question answer pairs
Introduction
• “Politics and Government” and “Society and Culture” had the
highest proportion of non-referential instances due to their
abstract ideas
• “Computers and Internet” and “Science and Mathematics”
had the most referential-nominal cases because these dealt
with tangible objects
• One large problem while annotating was resolving
ambiguous references of it
• If it were $1,700.00 … and let it go but for $170,000...
• Here, it can be either idiomatic, or refer to the “post dated
cheque” or the “process of receiving the post dated cheque”
• Contextual information proved vital to solve this ambiguity
• Q: Regarding IT, what are the fastest ways of getting
superich? A: …. with maintenece or service of systems or
with old programming languages.
• Since Yahoo! Answers isn’t standard english, IT could have
been capitalized for emphasis or to mean Information
Technology. With the help of contextual information, such
ambiguity could be avoided
Corpus Analytics
Experimental Setup
• Used stochastic adaptive gradient descent with mini-batch
and L1 regularization
• Tested new sets of features including brown clusters, word
embeddings, and dependency derived relationships
The following features were used to classify instances of
it:
• POS and dependency of current word
• POS and lemma for dependency head of current word
• POS and lemma for succeeding token and POS of 2nd
succeeding token
• POS of succeeding token with lemma of 2nd succeeding
token
• POS of 1st and 2nd succeeding token with lemma of 3rd
succeeding token
• m0 only used the baseline features
• m1 uses additional features based on the relative position
of it, the relative distance from preceding noun, and relative
position of sentence within document
• m2 discard the annotations for errors
• m3 merges referential-nominal and referential-other during
the training set
• m4 merges referential-nominal and referential-other during
the evaluation set
Results
• We introduced a new corpus from Yahoo! Answers which
classified instances of it into four categories
• Using a mixture of old and new features, we were able to
achieve promising results despite a challenging dataset
• In the future, we plan to increase the size of our dataset by
adding more genres from Yahoo! Answers
• We plan to use a recurrent neural network with our dataset
in the future to see if that will yield better results
Conclusion
Nominal Anaphoric
Clause Anaphoric
Pleonastic
Cataphoric
Idiomatic /
Stereotypic
Proaction
Discourse Topic
Non-Referential
1. Extra positional
2. Cleft
3. Weather/Time

Condition/Place
4. Idiomatic
Referential
Non-Referential
1. Cataphoric
2. Proaction
3. Discourse Topic
4. Clause Anaphoric
5. Pleonastic
6. Idiomatic/

Stereotypic
1. Referential - Nominal
1. Nominal Anaphoric
2. Referential - Others
1. Proaction
2. Discourse Topic
3. Clause Anaphoric
3. Non-Referential
1. Cataphoric

2. Pleonastic
3. Idiomatic/Stereotypic
4. Errors
Evans (2001) Boyd (2005) Bergsma (2008) This Work (2016)
Referential - Noun
1. Nominal Anaphoric
Referential - Nominal
1. Nominal Anaphoric
Non-Referential
1. Cataphoric
2. Proaction
3. Discourse Topic
4. Pleonastic
5. Idiomatic/

Stereotypic
Referential - Clause
1. Clause Anaphoric
Li (2009)
Genre Doc Sen Tok C1 C2 C3 C4 C⇤
1. Computers and Internet 100 918 11,586 222 31 24 3 280
2. Science and Mathematics 100 801 11,589 164 35 18 3 220
3. Yahoo! Products 100 1,027 11,803 176 36 25 3 240
4. Education and Reference 100 831 11,520 148 55 36 2 241
5. Business and Finance 100 817 11,267 139 57 37 0 233
6. Entertainment and Music 100 946 11,656 138 68 30 5 241
7. Society and Culture 100 864 11,589 120 57 47 2 226
8. Health 100 906 11,305 142 97 32 0 271
9. Politics and Government 100 876 11,482 99 81 51 0 231
Total 900 7,986 103,797 1,348 517 300 18 2,183
1
Model Development Set Evaluation Set
ACC C1 C2 C3 C4 ACC C1 C2 C3 C4
M0 72.73 82.43 35.48 57.14 0.00 74.05 82.65 49.20 71.07 0.00
M1 73.21 82.56 50.00 62.50 0.00 74.68 82.93 53.14 73.33 0.00
M2 73.08 82.56 49.41 60.00 - 75.21 83.39 51.23 73.95 -
M3 76.44 82.31 64.75 - 77.14 82.26 67.87 -
M4 76.92 83.45 61.90 - 78.21 83.39 68.32 -
Table 1: Accuracies achieved by each model (in %). ACC: overall accuracy,
C1..4: F1 scores for 4 categories in Section ??. The highest accuracies are
highlighted in bold.
Proportion
0
0.2
0.4
0.6
0.8
Genre
G1 G2 G3 G4 G5 G6 G7 G8 G9
C1 C2 C3
Type Description
C1 Anaphoric instances of it which refer to
nouns, noun phrases, or gerunds
e.g. John bought a book. It was about
space.
C2 Anaphoric instances of it which do not re-
fer to nominals such as proaction, clause
anaphoras, and discourse topic.
e.g. Always use tools correctly. If it feels
very awkward, stop.
C3 Contains the most common instances of
pleonastic it including extraposition, cleft,
atmospheric, and idiomatic
e.g. It is sunny outside.
C4 Non-pronoun forms of it including disflue-
ncies, abbreviations, and misspellings
e.g. Why did my email move it self?

More Related Content

DOCX
Computational methods couurseout line
PPTX
First-Character Filtering Method in Syllable Segmentation using Data Dictiona...
DOCX
PPT
Algorithms for the thematic analysis of twitter datasets
PPTX
AsiaCALL 2017 presentation
PDF
Mth263 lecture 1
PDF
Unsupervised Software-Specific Morphological Forms Inference from Informal Di...
PPTX
Apa style
Computational methods couurseout line
First-Character Filtering Method in Syllable Segmentation using Data Dictiona...
Algorithms for the thematic analysis of twitter datasets
AsiaCALL 2017 presentation
Mth263 lecture 1
Unsupervised Software-Specific Morphological Forms Inference from Informal Di...
Apa style

What's hot (18)

PPTX
pptphrase-tagset-mapping-for-french-and-english-treebanks-and-its-application...
PDF
Word Frequency Dominance and L2 Word Recognition
PDF
Evaluation Dataset and System for Japanese Lexical Simplification
PPTX
Lect03
PDF
Conceptual Plurality in Japanese EFL Learners' Online Sentence Processing: A ...
PPT
Moore_slides.ppt
PDF
Coordinated NPs agree with singular or plural in there-constructions?: A comp...
PDF
Conceptual and Grammatical Plurality of Conjoined NPs in L2 Sentence Comprehe...
PPTX
Semantic Patterns for Sentiment Analysis of Twitter
PPTX
Mechanics of style
PPTX
Lect06
PPTX
Tweets Classification
PPT
Eurolan 2005 Pedersen
PPTX
Lect01
PDF
Learning from Noisy Label Distributions (ICANN2017)
PPTX
IIT-TUDA at SemEval-2016 Task 5: Beyond Sentiment Lexicon: Combining Domain ...
PPTX
Weakly Supervised Machine Reading
PDF
Grade 6 e_parcc_ig_2017
pptphrase-tagset-mapping-for-french-and-english-treebanks-and-its-application...
Word Frequency Dominance and L2 Word Recognition
Evaluation Dataset and System for Japanese Lexical Simplification
Lect03
Conceptual Plurality in Japanese EFL Learners' Online Sentence Processing: A ...
Moore_slides.ppt
Coordinated NPs agree with singular or plural in there-constructions?: A comp...
Conceptual and Grammatical Plurality of Conjoined NPs in L2 Sentence Comprehe...
Semantic Patterns for Sentiment Analysis of Twitter
Mechanics of style
Lect06
Tweets Classification
Eurolan 2005 Pedersen
Lect01
Learning from Noisy Label Distributions (ICANN2017)
IIT-TUDA at SemEval-2016 Task 5: Beyond Sentiment Lexicon: Combining Domain ...
Weakly Supervised Machine Reading
Grade 6 e_parcc_ig_2017
Ad

Similar to Classifying Non-Referential It for Question Answer Pairs (20)

PDF
IRJET- A Novel Approch Automatically Categorizing Software Technologies
PDF
Selected Papers From The 2006 Cyprus Syntaxfest 1st Edition Kleanthes K Grohm...
PDF
Question Focus Recognition in Question Answering Systems
PDF
Natural Language Processing
PDF
Dynamic Lexical Acquisition in Chinese Sentence Analysis
PPT
Adaptive Parser-Centric Text Normalization
DOC
Coreference Resolution using Hybrid Approach
PDF
Karlgren
PPTX
NLP in Practice - Part II
PDF
Temporal Semantic Techniques for Text Analysis and Applications
PDF
Biemann ibm cog_comp_jan2015_noanim
PPTX
Search, Signals & Sense: An Analytics Fueled Vision
PPTX
Authorship analysis using function words forensic linguistics
PDF
Automated Abstracts and Big Data
PDF
Make it simple with paraphrases: Automated paraphrasing for authoring aids an...
PPTX
LDL 2012 - Linking to ISOcat Data Categories
PDF
Recent Advances In Natural Language Processing Iv Selected Papers From Ranlp ...
PPT
The role of linguistic information for shallow language processing
PDF
Temporality universals and variation 1st Edition Bittner
PDF
A Bridge Not too Far
IRJET- A Novel Approch Automatically Categorizing Software Technologies
Selected Papers From The 2006 Cyprus Syntaxfest 1st Edition Kleanthes K Grohm...
Question Focus Recognition in Question Answering Systems
Natural Language Processing
Dynamic Lexical Acquisition in Chinese Sentence Analysis
Adaptive Parser-Centric Text Normalization
Coreference Resolution using Hybrid Approach
Karlgren
NLP in Practice - Part II
Temporal Semantic Techniques for Text Analysis and Applications
Biemann ibm cog_comp_jan2015_noanim
Search, Signals & Sense: An Analytics Fueled Vision
Authorship analysis using function words forensic linguistics
Automated Abstracts and Big Data
Make it simple with paraphrases: Automated paraphrasing for authoring aids an...
LDL 2012 - Linking to ISOcat Data Categories
Recent Advances In Natural Language Processing Iv Selected Papers From Ranlp ...
The role of linguistic information for shallow language processing
Temporality universals and variation 1st Edition Bittner
A Bridge Not too Far
Ad

More from Jinho Choi (20)

PDF
Adaptation of Multilingual Transformer Encoder for Robust Enhanced Universal ...
PDF
Analysis of Hierarchical Multi-Content Text Classification Model on B-SHARP D...
PDF
Competence-Level Prediction and Resume & Job Description Matching Using Conte...
PDF
Transformers to Learn Hierarchical Contexts in Multiparty Dialogue for Span-b...
PDF
The Myth of Higher-Order Inference in Coreference Resolution
PDF
Noise Pollution in Hospital Readmission Prediction: Long Document Classificat...
PDF
Abstract Meaning Representation
PDF
Semantic Role Labeling
PDF
CKY Parsing
PDF
CS329 - WordNet Similarities
PDF
CS329 - Lexical Relations
PDF
Automatic Knowledge Base Expansion for Dialogue Management
PDF
Attention is All You Need for AMR Parsing
PDF
Graph-to-Text Generation and its Applications to Dialogue
PDF
Real-time Coreference Resolution for Dialogue Understanding
PDF
Topological Sort
PDF
Tries - Put
PDF
Multi-modal Embedding Learning for Early Detection of Alzheimer's Disease
PDF
Building Widely-Interpretable Semantic Networks for Dialogue Contexts
PDF
How to make Emora talk about Sports Intelligently
Adaptation of Multilingual Transformer Encoder for Robust Enhanced Universal ...
Analysis of Hierarchical Multi-Content Text Classification Model on B-SHARP D...
Competence-Level Prediction and Resume & Job Description Matching Using Conte...
Transformers to Learn Hierarchical Contexts in Multiparty Dialogue for Span-b...
The Myth of Higher-Order Inference in Coreference Resolution
Noise Pollution in Hospital Readmission Prediction: Long Document Classificat...
Abstract Meaning Representation
Semantic Role Labeling
CKY Parsing
CS329 - WordNet Similarities
CS329 - Lexical Relations
Automatic Knowledge Base Expansion for Dialogue Management
Attention is All You Need for AMR Parsing
Graph-to-Text Generation and its Applications to Dialogue
Real-time Coreference Resolution for Dialogue Understanding
Topological Sort
Tries - Put
Multi-modal Embedding Learning for Early Detection of Alzheimer's Disease
Building Widely-Interpretable Semantic Networks for Dialogue Contexts
How to make Emora talk about Sports Intelligently

Recently uploaded (20)

PDF
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
PDF
Assigned Numbers - 2025 - Bluetooth® Document
PDF
Encapsulation theory and applications.pdf
PDF
Approach and Philosophy of On baking technology
PDF
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
PPTX
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
PDF
A comparative analysis of optical character recognition models for extracting...
PDF
Chapter 3 Spatial Domain Image Processing.pdf
PDF
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
PDF
Advanced methodologies resolving dimensionality complications for autism neur...
PPTX
Digital-Transformation-Roadmap-for-Companies.pptx
PDF
Dropbox Q2 2025 Financial Results & Investor Presentation
PDF
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
PPTX
Big Data Technologies - Introduction.pptx
PDF
Per capita expenditure prediction using model stacking based on satellite ima...
PPTX
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
PDF
Electronic commerce courselecture one. Pdf
PPTX
A Presentation on Artificial Intelligence
PDF
The Rise and Fall of 3GPP – Time for a Sabbatical?
PDF
Reach Out and Touch Someone: Haptics and Empathic Computing
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
Assigned Numbers - 2025 - Bluetooth® Document
Encapsulation theory and applications.pdf
Approach and Philosophy of On baking technology
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
A comparative analysis of optical character recognition models for extracting...
Chapter 3 Spatial Domain Image Processing.pdf
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
Advanced methodologies resolving dimensionality complications for autism neur...
Digital-Transformation-Roadmap-for-Companies.pptx
Dropbox Q2 2025 Financial Results & Investor Presentation
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
Big Data Technologies - Introduction.pptx
Per capita expenditure prediction using model stacking based on satellite ima...
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
Electronic commerce courselecture one. Pdf
A Presentation on Artificial Intelligence
The Rise and Fall of 3GPP – Time for a Sabbatical?
Reach Out and Touch Someone: Haptics and Empathic Computing

Classifying Non-Referential It for Question Answer Pairs

  • 1. Classifying Non-Referential It for Question Answer Pairs Timothy Lee, Alex Lutz, and Jinho D. Choi
 Department of Mathematics and Computer Science, Emory University • Classifying non-referential it is an important task in coreference resolution • But no such attempts in this classification has been done for question answer pairs • The style of English used in question answer documents differs greatly from standard documents such as the Wall Street Journal • Our task is to classify non-referential it for question answer pairs and introduce the dataset • Initially Evans classified it into seven categories which effectively captures all types of it • We propose a more efficient set of rules to classify it for coreference specifically for question answer pairs Introduction • “Politics and Government” and “Society and Culture” had the highest proportion of non-referential instances due to their abstract ideas • “Computers and Internet” and “Science and Mathematics” had the most referential-nominal cases because these dealt with tangible objects • One large problem while annotating was resolving ambiguous references of it • If it were $1,700.00 … and let it go but for $170,000... • Here, it can be either idiomatic, or refer to the “post dated cheque” or the “process of receiving the post dated cheque” • Contextual information proved vital to solve this ambiguity • Q: Regarding IT, what are the fastest ways of getting superich? A: …. with maintenece or service of systems or with old programming languages. • Since Yahoo! Answers isn’t standard english, IT could have been capitalized for emphasis or to mean Information Technology. With the help of contextual information, such ambiguity could be avoided Corpus Analytics Experimental Setup • Used stochastic adaptive gradient descent with mini-batch and L1 regularization • Tested new sets of features including brown clusters, word embeddings, and dependency derived relationships The following features were used to classify instances of it: • POS and dependency of current word • POS and lemma for dependency head of current word • POS and lemma for succeeding token and POS of 2nd succeeding token • POS of succeeding token with lemma of 2nd succeeding token • POS of 1st and 2nd succeeding token with lemma of 3rd succeeding token • m0 only used the baseline features • m1 uses additional features based on the relative position of it, the relative distance from preceding noun, and relative position of sentence within document • m2 discard the annotations for errors • m3 merges referential-nominal and referential-other during the training set • m4 merges referential-nominal and referential-other during the evaluation set Results • We introduced a new corpus from Yahoo! Answers which classified instances of it into four categories • Using a mixture of old and new features, we were able to achieve promising results despite a challenging dataset • In the future, we plan to increase the size of our dataset by adding more genres from Yahoo! Answers • We plan to use a recurrent neural network with our dataset in the future to see if that will yield better results Conclusion Nominal Anaphoric Clause Anaphoric Pleonastic Cataphoric Idiomatic / Stereotypic Proaction Discourse Topic Non-Referential 1. Extra positional 2. Cleft 3. Weather/Time
 Condition/Place 4. Idiomatic Referential Non-Referential 1. Cataphoric 2. Proaction 3. Discourse Topic 4. Clause Anaphoric 5. Pleonastic 6. Idiomatic/
 Stereotypic 1. Referential - Nominal 1. Nominal Anaphoric 2. Referential - Others 1. Proaction 2. Discourse Topic 3. Clause Anaphoric 3. Non-Referential 1. Cataphoric
 2. Pleonastic 3. Idiomatic/Stereotypic 4. Errors Evans (2001) Boyd (2005) Bergsma (2008) This Work (2016) Referential - Noun 1. Nominal Anaphoric Referential - Nominal 1. Nominal Anaphoric Non-Referential 1. Cataphoric 2. Proaction 3. Discourse Topic 4. Pleonastic 5. Idiomatic/
 Stereotypic Referential - Clause 1. Clause Anaphoric Li (2009) Genre Doc Sen Tok C1 C2 C3 C4 C⇤ 1. Computers and Internet 100 918 11,586 222 31 24 3 280 2. Science and Mathematics 100 801 11,589 164 35 18 3 220 3. Yahoo! Products 100 1,027 11,803 176 36 25 3 240 4. Education and Reference 100 831 11,520 148 55 36 2 241 5. Business and Finance 100 817 11,267 139 57 37 0 233 6. Entertainment and Music 100 946 11,656 138 68 30 5 241 7. Society and Culture 100 864 11,589 120 57 47 2 226 8. Health 100 906 11,305 142 97 32 0 271 9. Politics and Government 100 876 11,482 99 81 51 0 231 Total 900 7,986 103,797 1,348 517 300 18 2,183 1 Model Development Set Evaluation Set ACC C1 C2 C3 C4 ACC C1 C2 C3 C4 M0 72.73 82.43 35.48 57.14 0.00 74.05 82.65 49.20 71.07 0.00 M1 73.21 82.56 50.00 62.50 0.00 74.68 82.93 53.14 73.33 0.00 M2 73.08 82.56 49.41 60.00 - 75.21 83.39 51.23 73.95 - M3 76.44 82.31 64.75 - 77.14 82.26 67.87 - M4 76.92 83.45 61.90 - 78.21 83.39 68.32 - Table 1: Accuracies achieved by each model (in %). ACC: overall accuracy, C1..4: F1 scores for 4 categories in Section ??. The highest accuracies are highlighted in bold. Proportion 0 0.2 0.4 0.6 0.8 Genre G1 G2 G3 G4 G5 G6 G7 G8 G9 C1 C2 C3 Type Description C1 Anaphoric instances of it which refer to nouns, noun phrases, or gerunds e.g. John bought a book. It was about space. C2 Anaphoric instances of it which do not re- fer to nominals such as proaction, clause anaphoras, and discourse topic. e.g. Always use tools correctly. If it feels very awkward, stop. C3 Contains the most common instances of pleonastic it including extraposition, cleft, atmospheric, and idiomatic e.g. It is sunny outside. C4 Non-pronoun forms of it including disflue- ncies, abbreviations, and misspellings e.g. Why did my email move it self?