SlideShare a Scribd company logo
Error analysis ofWord
Sense Disambiguation
Ruben Izquierdo
Marten Postma
PiekVossen
Izquierdo,PostmaandVossen
VUAmsterdam
Motivation
 Word Sense Disambiguation is still an unsolved problem
2 Izquierdo, Postma and Vossen VU Amsterdam
Error Analysis
 Perform error analysis on previousWSD evaluations to prove
our hypothesis
 Senseval-2: all-words task
 Senseval-3: all-words task
 Semeval2007: all-words task (#17)
 Semeval2010: all-words on specific domain (#17)
 Semeval2013: multilingual all-wordsWSD and entity linking
(#12)
3 Izquierdo, Postma and Vossen VU Amsterdam
Motivation
 Some “propagated” errors
 Errors on monosemous
 Errors because pos-tags
 Multiwords and phrasal verbs
 Little attention has been paid to the real problem
 WSD is not 1 problem but N problems
 Our hypothesis
 Context is not modeled properly in general
 System rely too much on the most frequent sense
4 Izquierdo, Postma and Vossen VU Amsterdam
Monosemous errors
5 Izquierdo, Postma and Vossen VU Amsterdam
Monosemous errors
6 Izquierdo, Postma and Vossen VU Amsterdam
Competition Monosemous Wrong Examples
Senseval2 499 (20.9%) 37.5% gene.n (suppressor_gene.n), chance.a
(chance.n) next.r (next.a)
Senseval3 334 (16.6%) 44.1% Datum.n (data.n) making.n (make.v)
out_of_sight (sight)
Semeval2007 25 (5.5%) 11.1% get_stuck.v, lack.v, write_about.v
Semeval2010 31 (2.2%) 97.9% Tidal_zone.n pine_marten.n roe_deer.n
cordgrass.n
Semeval2013
(lemmas)
348 (21.1%) 1.9% Private_enterprise, developing_country,
narrow_margin
Most Frequent Sense
7 Izquierdo, Postma and Vossen VU Amsterdam
Most Frequent Sense
 When the correct sense is NOT the most frequent sense
 Systems still assign mostly the MFS
 Senseval2
 799 tokens are not MFS
 84% systems still assign the MFS
 Most “failed” words due to MFS bias
 Senseval2, senseval3
 Say.v find.v take.v have.v cell.n church.n
 Semeval2010
 Area.n nature.n connection.n water.n population.n
8 Izquierdo, Postma and Vossen VU Amsterdam
Analysis per PoS-tag
9 Izquierdo, Postma and Vossen VU Amsterdam
Analysis per polysemy class
10 Izquierdo, Postma and Vossen VU Amsterdam
2Senses
Poly. C.
6 15
Low Medium High
Analysis per frequency class
11 Izquierdo, Postma and Vossen VU Amsterdam
Most difficult words
12 Izquierdo, Postma and Vossen VU Amsterdam
Expected vs. Observed
difficulties
 Calculate per sentence
 The “expected” difficulty
 Average polysemy, sentence length, average word length
13 Izquierdo, Postma and Vossen VU Amsterdam
 Calculate per sentence
 The “expected” difficulty
 Average polysemy, sentence length, average word length
14 Izquierdo, Postma and Vossen VU Amsterdam
Expected vs. Observed
difficulties
 Calculate per sentence
 The “expected” difficulty
 Average polysemy, sentence length, average wor length
 The “observed” difficulty
 From the real participant outputs, average error rate
 We should expect:
harder sentences  higher error rate
easier sentences   lower error rate
15 Izquierdo, Postma and Vossen VU Amsterdam
Expected vs. Observed
difficulties
16 Izquierdo, Postma and Vossen VU Amsterdam
Expected vs. Observed
difficulties
17 Izquierdo, Postma and Vossen VU Amsterdam
Expected vs. Observed
difficulties
• The context is not (probably) exploited properly
• Expected “easy” sentences SHOULD show low error rates
• Occurrences of the same word in different contexts have similar error
rate
• The difficulty of a word depends more on its polysemy than on the
context where it appears
18 Izquierdo, Postma and Vossen VU Amsterdam
Expected vs. Observed
difficulties
WSD Corpora
http://github.com/rubenIzquierdo/wsd_corpora
19 Izquierdo, Postma and Vossen VU Amsterdam
WSD Corpora
20 Izquierdo, Postma and Vossen VU Amsterdam
System Outputs
https://github.com/rubenIzquierdo/sval_systems
21 Izquierdo, Postma and Vossen VU Amsterdam
System Outputs
22 Izquierdo, Postma and Vossen VU Amsterdam
Error analysis of
Word Sense Disambiguation
Ruben Izquierdo
Marten Postma
PiekVossen
ruben.izquierdobevia@vu.nl
http://github.com/rubenIzquierdo/wsd_corpora
http://github.com/rubenIzquierdo/sval_systems
23
Analysis per PoS-tag
24 Izquierdo, Postma and Vossen VU Amsterdam

More Related Content

PDF
Word Sense Disambiguation and Induction
PDF
Word sense disambiguation a survey
PPT
Similarity based methods for word sense disambiguation
PDF
Topic Modeling for Information Retrieval and Word Sense Disambiguation tasks
PDF
Lecture: Word Sense Disambiguation
PPTX
An Improved Approach to Word Sense Disambiguation
PDF
BibleTech2011
ODP
Word sense dissambiguation
Word Sense Disambiguation and Induction
Word sense disambiguation a survey
Similarity based methods for word sense disambiguation
Topic Modeling for Information Retrieval and Word Sense Disambiguation tasks
Lecture: Word Sense Disambiguation
An Improved Approach to Word Sense Disambiguation
BibleTech2011
Word sense dissambiguation

Viewers also liked (20)

DOC
Draft programme 15 09-2015
PPTX
A word sense disambiguation technique for sinhala
PDF
Graph-based Word Sense Disambiguation
PDF
COLING 2014 - An Enhanced Lesk Word Sense Disambiguation Algorithm through a ...
PDF
Usage of word sense disambiguation in concept identification in ontology cons...
PPT
Similarity based methods for word sense disambiguation
PPT
Amharic WSD using WordNet
PPTX
Zoological nomenclature
PDF
PhD defense Koen Deschacht
PPTX
Word-sense disambiguation
PPTX
Biomedical Word Sense Disambiguation presentation [Autosaved]
PDF
Ontology-Based Word Sense Disambiguation for Scientific Literature
PPTX
CMSC 723: Computational Linguistics I
PPTX
Semantic annotation of biomedical data
PPT
presentation on Rotavator
PDF
Babelfy: Entity Linking meets Word Sense Disambiguation.
PDF
Words - Morphology Presentation- Dr. Shadia Y. Banjar
PDF
Sifting Social Data: Word Sense Disambiguation Using Machine Learning
Draft programme 15 09-2015
A word sense disambiguation technique for sinhala
Graph-based Word Sense Disambiguation
COLING 2014 - An Enhanced Lesk Word Sense Disambiguation Algorithm through a ...
Usage of word sense disambiguation in concept identification in ontology cons...
Similarity based methods for word sense disambiguation
Amharic WSD using WordNet
Zoological nomenclature
PhD defense Koen Deschacht
Word-sense disambiguation
Biomedical Word Sense Disambiguation presentation [Autosaved]
Ontology-Based Word Sense Disambiguation for Scientific Literature
CMSC 723: Computational Linguistics I
Semantic annotation of biomedical data
presentation on Rotavator
Babelfy: Entity Linking meets Word Sense Disambiguation.
Words - Morphology Presentation- Dr. Shadia Y. Banjar
Sifting Social Data: Word Sense Disambiguation Using Machine Learning
Ad

More from Rubén Izquierdo Beviá (17)

PPTX
ULM-1 Understanding Languages by Machines: The borders of Ambiguity
PDF
DutchSemCor workshop: Domain classification and WSD systems
PDF
RANLP2013: DutchSemCor, in Quest of the Ideal Sense Tagged Corpus
PPTX
Topic modeling and WSD on the Ancora corpus
PDF
Information Extraction
PPTX
Juan Calvino y el Calvinismo
PPTX
KafNafParserPy: a python library for parsing/creating KAF and NAF files
PDF
CLTL python course: Object Oriented Programming (3/3)
PDF
CLTL python course: Object Oriented Programming (2/3)
PDF
CLTL python course: Object Oriented Programming (1/3)
PPTX
CLTL Software and Web Services
PPTX
Thesis presentation (WSD and Semantic Classes)
PDF
ULM1 - The borders of Ambiguity
PPTX
CLTL: Description of web services and sofware. Nijmegen 2013
PPTX
CLTL presentation: training an opinion mining system from KAF files using CRF
PPT
CLIN 2012: DutchSemCor Building a semantically annotated corpus for Dutch
PPTX
RANLP 2013: DutchSemcor in quest of the ideal corpus
ULM-1 Understanding Languages by Machines: The borders of Ambiguity
DutchSemCor workshop: Domain classification and WSD systems
RANLP2013: DutchSemCor, in Quest of the Ideal Sense Tagged Corpus
Topic modeling and WSD on the Ancora corpus
Information Extraction
Juan Calvino y el Calvinismo
KafNafParserPy: a python library for parsing/creating KAF and NAF files
CLTL python course: Object Oriented Programming (3/3)
CLTL python course: Object Oriented Programming (2/3)
CLTL python course: Object Oriented Programming (1/3)
CLTL Software and Web Services
Thesis presentation (WSD and Semantic Classes)
ULM1 - The borders of Ambiguity
CLTL: Description of web services and sofware. Nijmegen 2013
CLTL presentation: training an opinion mining system from KAF files using CRF
CLIN 2012: DutchSemCor Building a semantically annotated corpus for Dutch
RANLP 2013: DutchSemcor in quest of the ideal corpus
Ad

Recently uploaded (20)

PPTX
Primary and secondary sources, and history
PPTX
lesson6-211001025531lesson plan ppt.pptx
PDF
Presentation1 [Autosaved].pdf diagnosiss
PPTX
Hydrogel Based delivery Cancer Treatment
DOCX
ENGLISH PROJECT FOR BINOD BIHARI MAHTO KOYLANCHAL UNIVERSITY
PPTX
Effective_Handling_Information_Presentation.pptx
PPTX
Presentation for DGJV QMS (PQP)_12.03.2025.pptx
PPT
The Effect of Human Resource Management Practice on Organizational Performanc...
PPTX
Impressionism_PostImpressionism_Presentation.pptx
PPTX
The spiral of silence is a theory in communication and political science that...
PPTX
Relationship Management Presentation In Banking.pptx
PPTX
Intro to ISO 9001 2015.pptx wareness raising
PPTX
INTERNATIONAL LABOUR ORAGNISATION PPT ON SOCIAL SCIENCE
PPTX
chapter8-180915055454bycuufucdghrwtrt.pptx
PPTX
Human Mind & its character Characteristics
PPTX
Self management and self evaluation presentation
PPTX
BIOLOGY TISSUE PPT CLASS 9 PROJECT PUBLIC
DOC
学位双硕士UTAS毕业证,墨尔本理工学院毕业证留学硕士毕业证
PPTX
Tour Presentation Educational Activity.pptx
PDF
Parts of Speech Prepositions Presentation in Colorful Cute Style_20250724_230...
Primary and secondary sources, and history
lesson6-211001025531lesson plan ppt.pptx
Presentation1 [Autosaved].pdf diagnosiss
Hydrogel Based delivery Cancer Treatment
ENGLISH PROJECT FOR BINOD BIHARI MAHTO KOYLANCHAL UNIVERSITY
Effective_Handling_Information_Presentation.pptx
Presentation for DGJV QMS (PQP)_12.03.2025.pptx
The Effect of Human Resource Management Practice on Organizational Performanc...
Impressionism_PostImpressionism_Presentation.pptx
The spiral of silence is a theory in communication and political science that...
Relationship Management Presentation In Banking.pptx
Intro to ISO 9001 2015.pptx wareness raising
INTERNATIONAL LABOUR ORAGNISATION PPT ON SOCIAL SCIENCE
chapter8-180915055454bycuufucdghrwtrt.pptx
Human Mind & its character Characteristics
Self management and self evaluation presentation
BIOLOGY TISSUE PPT CLASS 9 PROJECT PUBLIC
学位双硕士UTAS毕业证,墨尔本理工学院毕业证留学硕士毕业证
Tour Presentation Educational Activity.pptx
Parts of Speech Prepositions Presentation in Colorful Cute Style_20250724_230...

Error analysis of Word Sense Disambiguation

  • 1. Error analysis ofWord Sense Disambiguation Ruben Izquierdo Marten Postma PiekVossen Izquierdo,PostmaandVossen VUAmsterdam
  • 2. Motivation  Word Sense Disambiguation is still an unsolved problem 2 Izquierdo, Postma and Vossen VU Amsterdam
  • 3. Error Analysis  Perform error analysis on previousWSD evaluations to prove our hypothesis  Senseval-2: all-words task  Senseval-3: all-words task  Semeval2007: all-words task (#17)  Semeval2010: all-words on specific domain (#17)  Semeval2013: multilingual all-wordsWSD and entity linking (#12) 3 Izquierdo, Postma and Vossen VU Amsterdam
  • 4. Motivation  Some “propagated” errors  Errors on monosemous  Errors because pos-tags  Multiwords and phrasal verbs  Little attention has been paid to the real problem  WSD is not 1 problem but N problems  Our hypothesis  Context is not modeled properly in general  System rely too much on the most frequent sense 4 Izquierdo, Postma and Vossen VU Amsterdam
  • 5. Monosemous errors 5 Izquierdo, Postma and Vossen VU Amsterdam
  • 6. Monosemous errors 6 Izquierdo, Postma and Vossen VU Amsterdam Competition Monosemous Wrong Examples Senseval2 499 (20.9%) 37.5% gene.n (suppressor_gene.n), chance.a (chance.n) next.r (next.a) Senseval3 334 (16.6%) 44.1% Datum.n (data.n) making.n (make.v) out_of_sight (sight) Semeval2007 25 (5.5%) 11.1% get_stuck.v, lack.v, write_about.v Semeval2010 31 (2.2%) 97.9% Tidal_zone.n pine_marten.n roe_deer.n cordgrass.n Semeval2013 (lemmas) 348 (21.1%) 1.9% Private_enterprise, developing_country, narrow_margin
  • 7. Most Frequent Sense 7 Izquierdo, Postma and Vossen VU Amsterdam
  • 8. Most Frequent Sense  When the correct sense is NOT the most frequent sense  Systems still assign mostly the MFS  Senseval2  799 tokens are not MFS  84% systems still assign the MFS  Most “failed” words due to MFS bias  Senseval2, senseval3  Say.v find.v take.v have.v cell.n church.n  Semeval2010  Area.n nature.n connection.n water.n population.n 8 Izquierdo, Postma and Vossen VU Amsterdam
  • 9. Analysis per PoS-tag 9 Izquierdo, Postma and Vossen VU Amsterdam
  • 10. Analysis per polysemy class 10 Izquierdo, Postma and Vossen VU Amsterdam 2Senses Poly. C. 6 15 Low Medium High
  • 11. Analysis per frequency class 11 Izquierdo, Postma and Vossen VU Amsterdam
  • 12. Most difficult words 12 Izquierdo, Postma and Vossen VU Amsterdam
  • 13. Expected vs. Observed difficulties  Calculate per sentence  The “expected” difficulty  Average polysemy, sentence length, average word length 13 Izquierdo, Postma and Vossen VU Amsterdam
  • 14.  Calculate per sentence  The “expected” difficulty  Average polysemy, sentence length, average word length 14 Izquierdo, Postma and Vossen VU Amsterdam Expected vs. Observed difficulties
  • 15.  Calculate per sentence  The “expected” difficulty  Average polysemy, sentence length, average wor length  The “observed” difficulty  From the real participant outputs, average error rate  We should expect: harder sentences  higher error rate easier sentences   lower error rate 15 Izquierdo, Postma and Vossen VU Amsterdam Expected vs. Observed difficulties
  • 16. 16 Izquierdo, Postma and Vossen VU Amsterdam Expected vs. Observed difficulties
  • 17. 17 Izquierdo, Postma and Vossen VU Amsterdam Expected vs. Observed difficulties
  • 18. • The context is not (probably) exploited properly • Expected “easy” sentences SHOULD show low error rates • Occurrences of the same word in different contexts have similar error rate • The difficulty of a word depends more on its polysemy than on the context where it appears 18 Izquierdo, Postma and Vossen VU Amsterdam Expected vs. Observed difficulties
  • 20. WSD Corpora 20 Izquierdo, Postma and Vossen VU Amsterdam
  • 22. System Outputs 22 Izquierdo, Postma and Vossen VU Amsterdam
  • 23. Error analysis of Word Sense Disambiguation Ruben Izquierdo Marten Postma PiekVossen ruben.izquierdobevia@vu.nl http://github.com/rubenIzquierdo/wsd_corpora http://github.com/rubenIzquierdo/sval_systems 23
  • 24. Analysis per PoS-tag 24 Izquierdo, Postma and Vossen VU Amsterdam

Editor's Notes

  • #12: Relative freq (norvig method) <0.01  low 0.01 -= 0.05  medium > 0.05 high