SlideShare a Scribd company logo
Context-Aware Derivation
Prediction
Ekaterina Vylomova, Ryan Cotterell, Tim Baldwin, Trevor Cohn
1
SIGMORPHON SHARED TASK ON INFLECTION
run + PRESENT PARTICIPLE running
2
Ekaterina Vylomova, evylomova@gmail.com
SIGMORPHON SHARED TASK ON INFLECTION
run + PRESENT PARTICIPLE running
ran + PAST + PRESENT PARTICIPLE running
3
Ekaterina Vylomova, evylomova@gmail.com
SIGMORPHON SHARED TASK ON INFLECTION
run + PRESENT PARTICIPLE running
ran + PAST + PRESENT PARTICIPLE running
4
Ekaterina Vylomova, evylomova@gmail.com
GRAMMAR TAGS
SIGMORPHON SHARED TASK ON INFLECTION
run + PRESENT PARTICIPLE running
ran + PAST + PRESENT PARTICIPLE running
5
Ekaterina Vylomova, evylomova@gmail.com
GRAMMAR TAGS
❏ Well-studied
❏ Regular
❏ High Accuracy
WHAT ABOUT DERIVATION?
6
Ekaterina Vylomova, evylomova@gmail.com
STEM AGENT PATIENT RESULT ABILITY
bring bringer
stand standee
simulate simulator simulation simulatable
employ employer employee employment employable
WHAT ABOUT DERIVATION?
7
Ekaterina Vylomova, evylomova@gmail.com
STEM AGENT PATIENT RESULT ABILITY
bring bringer
stand standee
simulate simulator simulation simulatable
employ employer employee employment employable
Derivational slots?
Empty cells
DISTRIBUTIONAL SEMANTICS
8
Ekaterina Vylomova, evylomova@gmail.com
J.R.FIRTH
Z. HARRISL. WITTGENSTEIN
MOTIVATION
How well we can predict derivations directly from the
context?
9
Ekaterina Vylomova, evylomova@gmail.com
. . the ergometer ’s inability to properly SIMULATE
the larger rowers drag on a boat . . .
. . . this SIMULATE package is based on Simula 's object
oriented features and its coroutine concept . . .
. . . Bay pilots trained for the visit on a SIMULATE
at the California Maritime Academy . . .
MOTIVATION
How well we can predict derivations directly from the
context?
10
Ekaterina Vylomova, evylomova@gmail.com
. . the ergometer ’s inability to properly SIMULATE
the larger rowers drag on a boat . . .
. . . this SIMULATE package is based on Simula 's object
oriented features and its coroutine concept . . .
. . . Bay pilots trained for the visit on a SIMULATE
at the California Maritime Academy . . .
MOTIVATION
How well we can predict derivations directly from the
context?
11
Ekaterina Vylomova, evylomova@gmail.com
. . the ergometer ’s inability to properly SIMULATE
the larger rowers drag on a boat . . .
. . . this SIMULATION package is based on Simula 's object
oriented features and its coroutine concept . . .
. . . Bay pilots trained for the visit on a SIMULATE
at the California Maritime Academy . . .
MOTIVATION
How well we can predict derivations directly from the
context?
12
Ekaterina Vylomova, evylomova@gmail.com
. . the ergometer ’s inability to properly SIMULATE
the larger rowers drag on a boat . . .
. . . this SIMULATION package is based on Simula 's object
oriented features and its coroutine concept . . .
. . . Bay pilots trained for the visit on a SIMULATOR
at the California Maritime Academy . . .
BASELINE : 3-gram Modified KN smoothing
13
Ekaterina Vylomova, evylomova@gmail.com
This SIMULATE package is based on Simula 's object oriented features ... -47.9
This SIMULATES package is based on Simula 's object oriented features ... -50.0
This SIMULATED package is based on Simula 's object oriented features ... -49.0
This SIMULATING package is based on Simula 's object oriented features ... -49.5
This SIMULATION package is based on Simula 's object oriented features ... -46.1
This SIMULATOR package is based on Simula 's object oriented features ... -48.9
This SIMULATORS package is based on Simula 's object oriented features ... -50.7
log p
BASELINE : 3-gram Modified KN smoothing
14
Ekaterina Vylomova, evylomova@gmail.com
This SIMULATE package is based on Simula 's object oriented features ... -47.9
This SIMULATES package is based on Simula 's object oriented features ... -50.0
This SIMULATED package is based on Simula 's object oriented features ... -49.0
This SIMULATING package is based on Simula 's object oriented features ... -49.5
This SIMULATION package is based on Simula 's object oriented features ... -46.1
This SIMULATOR package is based on Simula 's object oriented features ... -48.9
This SIMULATORS package is based on Simula 's object oriented features ... -50.7
log p
DATASET
➔ English Verb Nominalizations only
➔ CELEX: <accusation, accuse+ation>
24 suffix classes / 1,456 base lemmas / 3,079 unique lemma
pairs
➔ Contexts: 107,041 contextual instances from English Wikipedia
➔ Pre-trained word embeddings: word2vec trained on Google News
15
Ekaterina Vylomova, evylomova@gmail.com
ARCHITECTURE OF THE MODEL
16
Ekaterina Vylomova, evylomova@gmail.com
ARCHITECTURE OF THE MODEL
17
Ekaterina Vylomova, evylomova@gmail.com
ARCHITECTURE OF THE MODEL
18
Ekaterina Vylomova, evylomova@gmail.com
ARCHITECTURE OF THE MODEL
19
Ekaterina Vylomova, evylomova@gmail.com
ARCHITECTURE OF THE MODEL
20
Ekaterina Vylomova, evylomova@gmail.com
ARCHITECTURE OF THE MODEL
21
Ekaterina Vylomova, evylomova@gmail.com
ARCHITECTURE OF THE MODEL
22
Ekaterina Vylomova, evylomova@gmail.com
ARCHITECTURE OF THE MODEL
23
Ekaterina Vylomova, evylomova@gmail.com
Accuracy For Predicted Lemmas (inc. bases)
24
Ekaterina Vylomova, evylomova@gmail.com
BASELINE
Base Only
CTX Only
Base+CTX
Base+CTX
+POS
Base+CTX
+POS+SD
ERROR ANALYSIS: ambiguity and vagueness
25
Ekaterina Vylomova, evylomova@gmail.com
ERROR ANALYSIS
26
Ekaterina Vylomova, evylomova@gmail.com
Correct (Context of ...) Predicted
Extra Variety of Forms
student studint, studion, studyant,
student
Especially in Split Lexicon Setting
trailer trailer, trailation, trailment
ERROR ANALYSIS
27
Ekaterina Vylomova, evylomova@gmail.com
Correct (Context of ...) Predicted
Lack of Forms
government and governance government
Bias Towards Productive Suffixes
stoppage stoption
NONCE WORDS
28
Ekaterina Vylomova, evylomova@gmail.com
transcribe laptify fape crimmle beteive
transcription laptification fape crimmle beterve
transcription laptification fapery crimmler betention
transcription laptification fapication crimmler beteption
transcription laptification fapionment crimmler betention
transcription laptification fapist crimmler betention
transcription laptification fapist crimmler beteption
transcript laptification fapery crimmler betention
transcript laptification fapist crimmler beteption
CONCLUSION
29
Ekaterina Vylomova, evylomova@gmail.com
There is regularity in derivational processes!
CONCLUSION
30
Ekaterina Vylomova, evylomova@gmail.com
There is regularity in derivational processes!
Code and data available at https://github.com/ivri/dmorph
Thank you for your { A T T E N D } !

More Related Content

PDF
Documenting and modeling inflectional paradigms in under-resourced languages
PDF
The UniMorph Project and Morphological Reinflection Task: Past, Present, and ...
PDF
Sigmorphon 2021. Keynote. UniMorph, Morphological inflection
PDF
The Secret Life of Words: Exploring Regularity and Systematicity (joint talk ...
PDF
SIGMORPHON 2020 Shared Task 0: Typologically Diverse Morphological Inflection
PDF
Ekaterina vylomova-what-do-neural models-know-about-language-p2
PDF
Ekaterina vylomova-what-do-neural models-know-about-language-p1
PDF
Evaluation of Semantic Change of Harm-Related Concepts in Psychology
Documenting and modeling inflectional paradigms in under-resourced languages
The UniMorph Project and Morphological Reinflection Task: Past, Present, and ...
Sigmorphon 2021. Keynote. UniMorph, Morphological inflection
The Secret Life of Words: Exploring Regularity and Systematicity (joint talk ...
SIGMORPHON 2020 Shared Task 0: Typologically Diverse Morphological Inflection
Ekaterina vylomova-what-do-neural models-know-about-language-p2
Ekaterina vylomova-what-do-neural models-know-about-language-p1
Evaluation of Semantic Change of Harm-Related Concepts in Psychology

More from Katerina Vylomova (9)

PDF
Contextualization of Morphological Inflection
PDF
Paradigm Completion for Derivational Morphology
PDF
Contemporary Models of Natural Language Processing
PDF
Men Are from Mars, Women Are from Venus: Evaluation and Modelling of Verbal A...
PDF
Take and Took, Gaggle and Goose, Book and Read: Evaluating the Utility of Vec...
PDF
Working with text data
PDF
Neural models for recognition of basic units of semiographic chants
PDF
Russia, Russians and Russian language
PDF
Ekaterina Vylomova/Brown Bag seminar presentation
Contextualization of Morphological Inflection
Paradigm Completion for Derivational Morphology
Contemporary Models of Natural Language Processing
Men Are from Mars, Women Are from Venus: Evaluation and Modelling of Verbal A...
Take and Took, Gaggle and Goose, Book and Read: Evaluating the Utility of Vec...
Working with text data
Neural models for recognition of basic units of semiographic chants
Russia, Russians and Russian language
Ekaterina Vylomova/Brown Bag seminar presentation
Ad

Recently uploaded (20)

PDF
2021 HotChips TSMC Packaging Technologies for Chiplets and 3D_0819 publish_pu...
PDF
Hybrid model detection and classification of lung cancer
PDF
1 - Historical Antecedents, Social Consideration.pdf
PPTX
The various Industrial Revolutions .pptx
PPTX
MicrosoftCybserSecurityReferenceArchitecture-April-2025.pptx
PPTX
Final SEM Unit 1 for mit wpu at pune .pptx
PDF
Web App vs Mobile App What Should You Build First.pdf
PDF
STKI Israel Market Study 2025 version august
PDF
ENT215_Completing-a-large-scale-migration-and-modernization-with-AWS.pdf
PPTX
Tartificialntelligence_presentation.pptx
PPTX
TLE Review Electricity (Electricity).pptx
PDF
Developing a website for English-speaking practice to English as a foreign la...
PDF
DASA ADMISSION 2024_FirstRound_FirstRank_LastRank.pdf
PDF
August Patch Tuesday
PDF
WOOl fibre morphology and structure.pdf for textiles
PDF
Univ-Connecticut-ChatGPT-Presentaion.pdf
PPT
Module 1.ppt Iot fundamentals and Architecture
PDF
Enhancing emotion recognition model for a student engagement use case through...
PDF
A comparative study of natural language inference in Swahili using monolingua...
PDF
Getting started with AI Agents and Multi-Agent Systems
2021 HotChips TSMC Packaging Technologies for Chiplets and 3D_0819 publish_pu...
Hybrid model detection and classification of lung cancer
1 - Historical Antecedents, Social Consideration.pdf
The various Industrial Revolutions .pptx
MicrosoftCybserSecurityReferenceArchitecture-April-2025.pptx
Final SEM Unit 1 for mit wpu at pune .pptx
Web App vs Mobile App What Should You Build First.pdf
STKI Israel Market Study 2025 version august
ENT215_Completing-a-large-scale-migration-and-modernization-with-AWS.pdf
Tartificialntelligence_presentation.pptx
TLE Review Electricity (Electricity).pptx
Developing a website for English-speaking practice to English as a foreign la...
DASA ADMISSION 2024_FirstRound_FirstRank_LastRank.pdf
August Patch Tuesday
WOOl fibre morphology and structure.pdf for textiles
Univ-Connecticut-ChatGPT-Presentaion.pdf
Module 1.ppt Iot fundamentals and Architecture
Enhancing emotion recognition model for a student engagement use case through...
A comparative study of natural language inference in Swahili using monolingua...
Getting started with AI Agents and Multi-Agent Systems
Ad

Context-Aware Derivation Prediction // EACL 2017

  • 1. Context-Aware Derivation Prediction Ekaterina Vylomova, Ryan Cotterell, Tim Baldwin, Trevor Cohn 1
  • 2. SIGMORPHON SHARED TASK ON INFLECTION run + PRESENT PARTICIPLE running 2 Ekaterina Vylomova, evylomova@gmail.com
  • 3. SIGMORPHON SHARED TASK ON INFLECTION run + PRESENT PARTICIPLE running ran + PAST + PRESENT PARTICIPLE running 3 Ekaterina Vylomova, evylomova@gmail.com
  • 4. SIGMORPHON SHARED TASK ON INFLECTION run + PRESENT PARTICIPLE running ran + PAST + PRESENT PARTICIPLE running 4 Ekaterina Vylomova, evylomova@gmail.com GRAMMAR TAGS
  • 5. SIGMORPHON SHARED TASK ON INFLECTION run + PRESENT PARTICIPLE running ran + PAST + PRESENT PARTICIPLE running 5 Ekaterina Vylomova, evylomova@gmail.com GRAMMAR TAGS ❏ Well-studied ❏ Regular ❏ High Accuracy
  • 6. WHAT ABOUT DERIVATION? 6 Ekaterina Vylomova, evylomova@gmail.com STEM AGENT PATIENT RESULT ABILITY bring bringer stand standee simulate simulator simulation simulatable employ employer employee employment employable
  • 7. WHAT ABOUT DERIVATION? 7 Ekaterina Vylomova, evylomova@gmail.com STEM AGENT PATIENT RESULT ABILITY bring bringer stand standee simulate simulator simulation simulatable employ employer employee employment employable Derivational slots? Empty cells
  • 8. DISTRIBUTIONAL SEMANTICS 8 Ekaterina Vylomova, evylomova@gmail.com J.R.FIRTH Z. HARRISL. WITTGENSTEIN
  • 9. MOTIVATION How well we can predict derivations directly from the context? 9 Ekaterina Vylomova, evylomova@gmail.com . . the ergometer ’s inability to properly SIMULATE the larger rowers drag on a boat . . . . . . this SIMULATE package is based on Simula 's object oriented features and its coroutine concept . . . . . . Bay pilots trained for the visit on a SIMULATE at the California Maritime Academy . . .
  • 10. MOTIVATION How well we can predict derivations directly from the context? 10 Ekaterina Vylomova, evylomova@gmail.com . . the ergometer ’s inability to properly SIMULATE the larger rowers drag on a boat . . . . . . this SIMULATE package is based on Simula 's object oriented features and its coroutine concept . . . . . . Bay pilots trained for the visit on a SIMULATE at the California Maritime Academy . . .
  • 11. MOTIVATION How well we can predict derivations directly from the context? 11 Ekaterina Vylomova, evylomova@gmail.com . . the ergometer ’s inability to properly SIMULATE the larger rowers drag on a boat . . . . . . this SIMULATION package is based on Simula 's object oriented features and its coroutine concept . . . . . . Bay pilots trained for the visit on a SIMULATE at the California Maritime Academy . . .
  • 12. MOTIVATION How well we can predict derivations directly from the context? 12 Ekaterina Vylomova, evylomova@gmail.com . . the ergometer ’s inability to properly SIMULATE the larger rowers drag on a boat . . . . . . this SIMULATION package is based on Simula 's object oriented features and its coroutine concept . . . . . . Bay pilots trained for the visit on a SIMULATOR at the California Maritime Academy . . .
  • 13. BASELINE : 3-gram Modified KN smoothing 13 Ekaterina Vylomova, evylomova@gmail.com This SIMULATE package is based on Simula 's object oriented features ... -47.9 This SIMULATES package is based on Simula 's object oriented features ... -50.0 This SIMULATED package is based on Simula 's object oriented features ... -49.0 This SIMULATING package is based on Simula 's object oriented features ... -49.5 This SIMULATION package is based on Simula 's object oriented features ... -46.1 This SIMULATOR package is based on Simula 's object oriented features ... -48.9 This SIMULATORS package is based on Simula 's object oriented features ... -50.7 log p
  • 14. BASELINE : 3-gram Modified KN smoothing 14 Ekaterina Vylomova, evylomova@gmail.com This SIMULATE package is based on Simula 's object oriented features ... -47.9 This SIMULATES package is based on Simula 's object oriented features ... -50.0 This SIMULATED package is based on Simula 's object oriented features ... -49.0 This SIMULATING package is based on Simula 's object oriented features ... -49.5 This SIMULATION package is based on Simula 's object oriented features ... -46.1 This SIMULATOR package is based on Simula 's object oriented features ... -48.9 This SIMULATORS package is based on Simula 's object oriented features ... -50.7 log p
  • 15. DATASET ➔ English Verb Nominalizations only ➔ CELEX: <accusation, accuse+ation> 24 suffix classes / 1,456 base lemmas / 3,079 unique lemma pairs ➔ Contexts: 107,041 contextual instances from English Wikipedia ➔ Pre-trained word embeddings: word2vec trained on Google News 15 Ekaterina Vylomova, evylomova@gmail.com
  • 16. ARCHITECTURE OF THE MODEL 16 Ekaterina Vylomova, evylomova@gmail.com
  • 17. ARCHITECTURE OF THE MODEL 17 Ekaterina Vylomova, evylomova@gmail.com
  • 18. ARCHITECTURE OF THE MODEL 18 Ekaterina Vylomova, evylomova@gmail.com
  • 19. ARCHITECTURE OF THE MODEL 19 Ekaterina Vylomova, evylomova@gmail.com
  • 20. ARCHITECTURE OF THE MODEL 20 Ekaterina Vylomova, evylomova@gmail.com
  • 21. ARCHITECTURE OF THE MODEL 21 Ekaterina Vylomova, evylomova@gmail.com
  • 22. ARCHITECTURE OF THE MODEL 22 Ekaterina Vylomova, evylomova@gmail.com
  • 23. ARCHITECTURE OF THE MODEL 23 Ekaterina Vylomova, evylomova@gmail.com
  • 24. Accuracy For Predicted Lemmas (inc. bases) 24 Ekaterina Vylomova, evylomova@gmail.com BASELINE Base Only CTX Only Base+CTX Base+CTX +POS Base+CTX +POS+SD
  • 25. ERROR ANALYSIS: ambiguity and vagueness 25 Ekaterina Vylomova, evylomova@gmail.com
  • 26. ERROR ANALYSIS 26 Ekaterina Vylomova, evylomova@gmail.com Correct (Context of ...) Predicted Extra Variety of Forms student studint, studion, studyant, student Especially in Split Lexicon Setting trailer trailer, trailation, trailment
  • 27. ERROR ANALYSIS 27 Ekaterina Vylomova, evylomova@gmail.com Correct (Context of ...) Predicted Lack of Forms government and governance government Bias Towards Productive Suffixes stoppage stoption
  • 28. NONCE WORDS 28 Ekaterina Vylomova, evylomova@gmail.com transcribe laptify fape crimmle beteive transcription laptification fape crimmle beterve transcription laptification fapery crimmler betention transcription laptification fapication crimmler beteption transcription laptification fapionment crimmler betention transcription laptification fapist crimmler betention transcription laptification fapist crimmler beteption transcript laptification fapery crimmler betention transcript laptification fapist crimmler beteption
  • 29. CONCLUSION 29 Ekaterina Vylomova, evylomova@gmail.com There is regularity in derivational processes!
  • 30. CONCLUSION 30 Ekaterina Vylomova, evylomova@gmail.com There is regularity in derivational processes! Code and data available at https://github.com/ivri/dmorph Thank you for your { A T T E N D } !