SlideShare a Scribd company logo
Computationally Tracing Concepts
Through Time and Space
Marieke van Erp

merpeltje
D I G I TA L H U M A N I T I E S L A B
D I G I TA L H U M A N I T I E S L A B
Overview of this talk
• Big (text) Data & Humanities
• Tracing concepts
• Entity spaces
• New horizons
• Wrapping up
D I G I TA L H U M A N I T I E S L A B
Big Data & Humanities
• Digitised archives are enabling new types
of research
• Dutch National Library: 100+ million
newspaper, book & magazine pages
• Chronicling America: 100,000
newspaper pages
• Amsterdam City Archives: 160,000
notary deeds
• Bibliothèque Nationale de Luxembourg:
800,000 pages
• & many more sources
D I G I TA L H U M A N I T I E S L A B
Zooming in & Zooming out
• Qualitative methods often filter down
to individual records or pages
• Quantitative methods started
scratching the surface
• KNAW HuC focuses on bridging the
gap between quantitative &
qualitative analyses through
advancing natural language
processing and semantic web
methods
Image source: https://upload.wikimedia.org/wikipedia/commons/b/b5/MediaWiki_flame_graph_screenshot_2014-12-15_22.png
D I G I TA L H U M A N I T I E S L A B
Digital Humanities
• Involves the understanding of
these cultural heritage data.
• Methods involving Natural
Language Processing supported
by Knowledge Graphs have
entered the humanities research
community (Meroño-Peñuela et
al.)
Tabea Tietz et al. Challenges of Knowledge Graph Evolution from an NLP Perspective. WHiSe Workshop @ ESWC 2020
D I G I TA L H U M A N I T I E S L A B
Who has the biggest sweet tooth?
• Sugar consumption patterns
are difficult to trace
• Historical apple pie recipes can
serve as a proxy
• Apple pastries are common in
many cultures
Marieke van Erp & Ulbe Bosma: Divergent patterns of sugar consumption in the wake of the Industrial Revolution: an analysis on the basis of
apple pie recipes. Forthcoming
D I G I TA L H U M A N I T I E S L A B
Analysing historical recipes
• Differences in availability of
digitised sources
• Digitisation artefacts hamper
automatic analysis
• Normalisation of quantities is
needed
• Combine quantitative &
qualitative methods
Marieke van Erp & Ulbe Bosma: Divergent patterns of sugar consumption in the wake of the Industrial Revolution: an analysis on the basis of
apple pie recipes. (Forthcoming)
Image source: https://en.wikipedia.org/wiki/Apple_pie#/media/File:For_to_Make_Tartys_in_Applis_(1381).gif
D I G I TA L H U M A N I T I E S L A B
Comparing Ingredients in Dutch and American Apple Pie Recipes
Tabea Tietz et al. Challenges of Knowledge Graph Evolution from an NLP Perspective. WHiSe Workshop @ ESWC 2020
Computationally Tracing Concepts Through Time and Space
D I G I TA L H U M A N I T I E S L A B
Comparing sugar quantities in Dutch, American, French and German apple pie recipes
Marieke van Erp & Ulbe Bosma: Divergent patterns of sugar consumption in the wake of the Industrial Revolution: an analysis on the basis of apple pie recipes. (Forthcoming)
D I G I TA L H U M A N I T I E S L A B
What is an apple pie?
• The real world is constantly
changing
• Knowledge that was considered
true at one point in time in a
specific cultural and spa7al
setting may not be true in
another context
• Concepts evolve
Tabea Tietz et al. Challenges of Knowledge Graph Evolution from an NLP Perspective. WHiSe Workshop @ ESWC 2020
D I G I TA L H U M A N I T I E S L A B
Cultural Context
● What is considered as true in
one cultural setting may not be
in another.
● Apfelstrudel == apple pie?
Tabea Tietz et al. Challenges of Knowledge Graph Evolution from an NLP Perspective. WHiSe Workshop @ ESWC 2020
How can we store this type of information at scale?
D I G I TA L H U M A N I T I E S L A B
Concept modelling
• Computer Science: Knowledge
Representation/Semantic Web
• Long history: at least since
Aristotle
• Machine readable knowledge was
Sir Tim Berners-Lee’s intent when
he developed the World Wide Web
• To date, we have several large
scale knowledge graphs such as
DBpedia and Wikidata
Image source: https://upload.wikimedia.org/wikipedia/commons/c/c6/Complexity_vs._orderliness.png
D I G I TA L H U M A N I T I E S L A B
Knowledge Graphs
• Represent what we consider
true about parts of the world
• Are created and maintained to
continuously compose
knowledge (Bonatti et al.).
Tabea Tietz et al. Challenges of Knowledge Graph Evolution from an NLP Perspective. WHiSe Workshop @ ESWC 2020
D I G I TA L H U M A N I T I E S L A B
But:
• Knowledge Graphs are often
static and only reflect one
snippet of reality
• This static representation of the
real world is a problem when
attempting to understand
historical descriptions of
concepts (Bonatti et al., Tasnim
et al.)
Tabea Tietz et al. Challenges of Knowledge Graph Evolution from an NLP Perspective. WHiSe Workshop @ ESWC 2020
D I G I TA L H U M A N I T I E S L A B
Concepts
• Are manifested in our cultures’
norms and values
• Are documented through
photographs, newspapers,
books, music, film,
advertisements.
Tabea Tietz et al. Challenges of Knowledge Graph Evolution from an NLP Perspective. WHiSe Workshop @ ESWC 2020
D I G I TA L H U M A N I T I E S L A B
Spatio-temporal context
● Distinguish the spatio-
temporal metadata of the
concept itself and the
metadata of its source
● Trace the evolution of the
concept over time and
geographic regions
Tabea Tietz et al. Challenges of Knowledge Graph Evolution from an NLP Perspective. WHiSe Workshop @ ESWC 2020
D I G I TA L H U M A N I T I E S L A B
Units
● Modern units
○ imperial vs. metric system (lbs,
kg)
● Historical units
○ ell, zentner
● Natural language description of
measurements
○ “a load of butter”, “a plate of
apples”
Tabea Tietz et al. Challenges of Knowledge Graph Evolution from an NLP Perspective. WHiSe Workshop @ ESWC 2020
D I G I TA L H U M A N I T I E S L A B
Concept modelling
● How broad or narrow should
the ontology be modeled to fit
the concept but also capture
its changes over time?
● What are the properties that
define a concept across the
spatio-temporal and cultural
context?
Tabea Tietz et al. Challenges of Knowledge Graph Evolution from an NLP Perspective. WHiSe Workshop @ ESWC 2020
Entity spaces
D I G I TA L H U M A N I T I E S L A B
Language & Meaning
• Human language is incredibly flexible and
efficient
• We can use the term ‘sugar’ to refer to
• the sugar industry (a sour day for sugar)
• to particular instances of sugar (shall I
put some sugar in?)
• nutritional information (sugar and fiber
intake)
• commodities (grain and sugar are
produced)
• How can computers make sense of this?
Marieke van Erp & Paul Groth (2020) Towards Entity Spaces. In: Proceedings of The 12th Language Resources and Evaluation
Conference (LREC’2020)
D I G I TA L H U M A N I T I E S L A B
Proxy for Entity Spaces
Marieke van Erp & Paul Groth (2020) Towards Entity Spaces. In: Proceedings of The 12th Language Resources and Evaluation Conference (LREC’2020)
D I G I TA L H U M A N I T I E S L A B
Tolerant Entity Linking
• Not every meaning of an entity
or concept is represented in a
knowledge base
• We argue that a link to an entity
space is better than no link
• ‘good enough
interpretation’ (Poesio et al.)
• Proof of concept shows increase
in recall for 8 out of 13 datasets
Marieke van Erp & Paul Groth (2020) Towards Entity Spaces. In: Proceedings of The 12th Language Resources and Evaluation
Conference (LREC’2020)
D I G I TA L H U M A N I T I E S L A B
Next steps
• Extending entity spaces beyond
Wikipedia
• Structuring concepts within
entity spaces
• Add temporal dimension
• Intangible concepts
• Scale up
Marieke van Erp & Paul Groth (2020) Towards Entity Spaces. In: Proceedings of The 12th Language Resources and Evaluation
Conference (LREC’2020)
D I G I TA L H U M A N I T I E S L A B
New Horizons
• Complex concepts have
multiple dimensions
• Dimensions may go beyond a
single discipline
• Recognising, modelling & using
concepts and knowledge
graphs require team work
D I G I TA L H U M A N I T I E S L A B
Unexpected Crews
• Within the KNAW Humanities
Cluster, we harbour
(computational) linguists,
historians, literature scientists,
ethnologists, developers, network
specialists, digital humanists…
• Different disciplines find each
other on intersection of topics/
data/methods
• Use your network!
D I G I TA L H U M A N I T I E S L A B
Wrapping Up
• Text analysis and knowledge
representation are becoming
more important to humanities
research
• Big challenges for complex
information extraction and
modelling
• Interdisciplinary collaboration is
needed
http://dhlab.nl
Acknowledgments:
Adina Nerghes, Eleonora Marzi,
Fabio Mariani, Harald Sack,
ISWS Summer School, Lientje
Maas, Mehwish Alam, Melvin
Wevers, Mortaza Alinam, Paul
Groth, Tabea Tietz, Ulbe Bosma
& Wouter van den Berg
References
• Tabea Tietz, Mehwish Alam, Harald Sack and Marieke van Erp (2020) Challenges of Knowledge
Graph Evolution from an NLP Perspective. WHiSe Workshop @ ESWC 2020
• Marieke van Erp & Paul Groth (2020) Towards Entity Spaces. In: Proceedings of The 12th Language
Resources and Evaluation Conference (LREC’2020)
• Marieke van Erp & Ulbe Bosma: Divergent patterns of sugar consumption in the wake of the
Industrial Revolution: an analysis on the basis of apple pie recipes. (Forthcoming)
• Piero Andrea Bonatti, Stefan Decker, Axel Polleres and Valentina Presutti (2019) Knowledge Graphs:
New Directions for Knowledge Representation on the Semantic Web. Dagstuhl Seminar 18371).
Dagstuhl Reports 8(9), 29–111 (2019). https://doi.org/10.4230/DagRep.8.9.29
• Albert Meroño-Peñuela, Ashkan Ashkpour, Marieke van Erp, Kees Mandemakers, Leen Breure,
Andrea Scharnhorst, Stefan Schlobach, Frank van Harmelen (2015) Semantic technologies for
historical research: A survey. In: Semantic Web Journal
• Mayesha Tasnim, Diego Collarana, Damien Graux, Fabrizio Orlandi and Maria-Esther Vidal (2019)
Summarizing Entity Temporal Evolution in Knowledge Graphs. In: Companion Proceedings of The
2019 World Wide Web Conference
•

More Related Content

PDF
Towards Culturally Aware AI Systems - TSDH Symposium
PDF
The Hitchhiker's Guide to the Future of Digital Humanities
PDF
A Polyvocal and Contextualised Semantic Web
PPTX
Visualizing the Past for the Present: A Summation of Interdisciplinary Digita...
PPT
From Catalogue 2.0 to the digital humanities: exploring the future of librari...
PPTX
PDF
Introduction to Digital humanities
ODP
The World of Digital Humanities : Digital Humanities in the World
Towards Culturally Aware AI Systems - TSDH Symposium
The Hitchhiker's Guide to the Future of Digital Humanities
A Polyvocal and Contextualised Semantic Web
Visualizing the Past for the Present: A Summation of Interdisciplinary Digita...
From Catalogue 2.0 to the digital humanities: exploring the future of librari...
Introduction to Digital humanities
The World of Digital Humanities : Digital Humanities in the World

What's hot (20)

PPTX
Digital Humanities, Big Data, and New Research Methods
PPT
Project "The Digital City Revives, A Case Study of Web Archaeology"
PDF
Digital Cultural Heritage and the new EU Framework Programme
PDF
Digicraft and 'Systemic' Thinking in Digital Humanities Reasoning on the Per...
PPTX
2015 MCN The Constant Transformation and Evolution of Information Management ...
PDF
Europeana 2019 - Connect Communities
PDF
What is Digital Public History? Teaching and Practice

PPTX
The Continuted Evolution of DAMs in the Nonprofit Sector
PDF
Digicraft and 'Systemic' Thinking in Digital Humanities
PDF
Digital Humanities, Digital Libraries and Information Science: what relation?
PDF
A1 hazan winer_conferenceoverview_2014
PPTX
Croatian Archival Project (OVIH)C
PDF
Forever In Between : similarities and differences, opportunities and responsa...
PDF
DIVE+ @PATCH2015 Workshop @IUI2015
PDF
Digital Tools in Media History
PPT
1.10 tjarda de haan
PDF
Agora User Committee Meeting 2013
PPT
The Social Digitization Workshop of the Silesian Digital Library at the Siles...
PPTX
Long-Term Engagement.
PDF
DH Benelux 2017 Panel: A Pragmatic Approach to Understanding and Utilising Ev...
Digital Humanities, Big Data, and New Research Methods
Project "The Digital City Revives, A Case Study of Web Archaeology"
Digital Cultural Heritage and the new EU Framework Programme
Digicraft and 'Systemic' Thinking in Digital Humanities Reasoning on the Per...
2015 MCN The Constant Transformation and Evolution of Information Management ...
Europeana 2019 - Connect Communities
What is Digital Public History? Teaching and Practice

The Continuted Evolution of DAMs in the Nonprofit Sector
Digicraft and 'Systemic' Thinking in Digital Humanities
Digital Humanities, Digital Libraries and Information Science: what relation?
A1 hazan winer_conferenceoverview_2014
Croatian Archival Project (OVIH)C
Forever In Between : similarities and differences, opportunities and responsa...
DIVE+ @PATCH2015 Workshop @IUI2015
Digital Tools in Media History
1.10 tjarda de haan
Agora User Committee Meeting 2013
The Social Digitization Workshop of the Silesian Digital Library at the Siles...
Long-Term Engagement.
DH Benelux 2017 Panel: A Pragmatic Approach to Understanding and Utilising Ev...
Ad

Similar to Computationally Tracing Concepts Through Time and Space (20)

PPTX
Open education projects - through the lens of innovation
PPTX
V Rolfe - open education and innovation
PDF
Semantic Archive Integration for Holocaust Research: the EHRI Research Infras...
PPTX
Adaptation or Shaping the Field: the Next Phase of Digital History
PPT
Wikidata Introductory Workshop
PDF
NORFest 2023 Lightning Talks Session One
PPTX
INFORMATION, KNOWLEDGE AND ATTENTION.pptx
PDF
Bridging Digital Humanities Research and Big Data Repositories of Digital Text
PDF
APLIC 2012: Discovering & Dealing with Data
PPTX
Fairfield pl oct2014
PDF
Words and More Words: Challenges of Big Data by Prof. Edie Rasmussen
PPT
AHRC Digital Transformations theme: the Story So Far
PPTX
Open educators as social entrepreneurs
PPT
The Evolving Library
PDF
Cvan bochove basic res and growth, rome
PDF
Plain2013 peter levesque ikmb
PDF
Plain2013 PL and Knowledge Mobilization P Levesque
PPTX
Erau webinar 3 9-17 project mangagment slides
PPT
Powerpoint 1
Open education projects - through the lens of innovation
V Rolfe - open education and innovation
Semantic Archive Integration for Holocaust Research: the EHRI Research Infras...
Adaptation or Shaping the Field: the Next Phase of Digital History
Wikidata Introductory Workshop
NORFest 2023 Lightning Talks Session One
INFORMATION, KNOWLEDGE AND ATTENTION.pptx
Bridging Digital Humanities Research and Big Data Repositories of Digital Text
APLIC 2012: Discovering & Dealing with Data
Fairfield pl oct2014
Words and More Words: Challenges of Big Data by Prof. Edie Rasmussen
AHRC Digital Transformations theme: the Story So Far
Open educators as social entrepreneurs
The Evolving Library
Cvan bochove basic res and growth, rome
Plain2013 peter levesque ikmb
Plain2013 PL and Knowledge Mobilization P Levesque
Erau webinar 3 9-17 project mangagment slides
Powerpoint 1
Ad

More from Marieke van Erp (20)

PDF
AI x Digital Humanities = > Inclusiviteit
PDF
Why language technology can’t handle Game of Thrones (yet)
PDF
(Beyond) Combining Text and Tables for qualitative and quantitative research
PDF
Finding common ground between text, maps, and tables for quantitative and qua...
PDF
Slicing and Dicing a Newspaper Corpus for Historical Ecology Research
PDF
Lessons Learnt from the Named Entity rEcognition and Linking (NEEL) Challenge...
PDF
Good Lynx, bad Lynx: Document enrichment for historical ecologists
PDF
Towards Semantic Enrichment of Newspapers: a historical ecology use case
PDF
Natural Language Processing en Named Entity Recognition
PDF
HuC lecture - Digital and Humanities: Continuing the Conversation
PDF
Multilingual Fine-grained Entity Typing
PDF
Entity Typing Using Distributional Semantics and DBpedia
PDF
Entity Typing and Event Extraction
PDF
The domain as unifier, how focusing on social history can bring technical fie...
PDF
Evaluating entity linking an analysis of current benchmark datasets and a ro...
PDF
Finding Stories in 1,784,532 Events: Scaling up computational models of narr...
PDF
Evaluating Named Entity Recognition and Disambiguation in News and Tweets
PDF
Orientation EBC 2013: Digitising Natural History
PDF
Offspring from Reproduction Problems: what replication failure teaches us
PDF
From Events to Stories: Different ways of structuring the same bag of events ...
AI x Digital Humanities = > Inclusiviteit
Why language technology can’t handle Game of Thrones (yet)
(Beyond) Combining Text and Tables for qualitative and quantitative research
Finding common ground between text, maps, and tables for quantitative and qua...
Slicing and Dicing a Newspaper Corpus for Historical Ecology Research
Lessons Learnt from the Named Entity rEcognition and Linking (NEEL) Challenge...
Good Lynx, bad Lynx: Document enrichment for historical ecologists
Towards Semantic Enrichment of Newspapers: a historical ecology use case
Natural Language Processing en Named Entity Recognition
HuC lecture - Digital and Humanities: Continuing the Conversation
Multilingual Fine-grained Entity Typing
Entity Typing Using Distributional Semantics and DBpedia
Entity Typing and Event Extraction
The domain as unifier, how focusing on social history can bring technical fie...
Evaluating entity linking an analysis of current benchmark datasets and a ro...
Finding Stories in 1,784,532 Events: Scaling up computational models of narr...
Evaluating Named Entity Recognition and Disambiguation in News and Tweets
Orientation EBC 2013: Digitising Natural History
Offspring from Reproduction Problems: what replication failure teaches us
From Events to Stories: Different ways of structuring the same bag of events ...

Recently uploaded (20)

PDF
Agricultural_Statistics_at_a_Glance_2022_0.pdf
PDF
Unlocking AI with Model Context Protocol (MCP)
PDF
Approach and Philosophy of On baking technology
PDF
Review of recent advances in non-invasive hemoglobin estimation
PDF
cuic standard and advanced reporting.pdf
PPTX
sap open course for s4hana steps from ECC to s4
PDF
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
PPTX
20250228 LYD VKU AI Blended-Learning.pptx
PDF
Spectral efficient network and resource selection model in 5G networks
PDF
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
PDF
Building Integrated photovoltaic BIPV_UPV.pdf
PDF
Encapsulation_ Review paper, used for researhc scholars
PDF
Reach Out and Touch Someone: Haptics and Empathic Computing
PDF
KodekX | Application Modernization Development
PDF
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
PDF
Chapter 3 Spatial Domain Image Processing.pdf
PDF
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
PPT
“AI and Expert System Decision Support & Business Intelligence Systems”
PDF
Encapsulation theory and applications.pdf
PDF
Advanced methodologies resolving dimensionality complications for autism neur...
Agricultural_Statistics_at_a_Glance_2022_0.pdf
Unlocking AI with Model Context Protocol (MCP)
Approach and Philosophy of On baking technology
Review of recent advances in non-invasive hemoglobin estimation
cuic standard and advanced reporting.pdf
sap open course for s4hana steps from ECC to s4
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
20250228 LYD VKU AI Blended-Learning.pptx
Spectral efficient network and resource selection model in 5G networks
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
Building Integrated photovoltaic BIPV_UPV.pdf
Encapsulation_ Review paper, used for researhc scholars
Reach Out and Touch Someone: Haptics and Empathic Computing
KodekX | Application Modernization Development
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
Chapter 3 Spatial Domain Image Processing.pdf
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
“AI and Expert System Decision Support & Business Intelligence Systems”
Encapsulation theory and applications.pdf
Advanced methodologies resolving dimensionality complications for autism neur...

Computationally Tracing Concepts Through Time and Space

  • 1. Computationally Tracing Concepts Through Time and Space Marieke van Erp merpeltje D I G I TA L H U M A N I T I E S L A B
  • 2. D I G I TA L H U M A N I T I E S L A B Overview of this talk • Big (text) Data & Humanities • Tracing concepts • Entity spaces • New horizons • Wrapping up
  • 3. D I G I TA L H U M A N I T I E S L A B Big Data & Humanities • Digitised archives are enabling new types of research • Dutch National Library: 100+ million newspaper, book & magazine pages • Chronicling America: 100,000 newspaper pages • Amsterdam City Archives: 160,000 notary deeds • Bibliothèque Nationale de Luxembourg: 800,000 pages • & many more sources
  • 4. D I G I TA L H U M A N I T I E S L A B Zooming in & Zooming out • Qualitative methods often filter down to individual records or pages • Quantitative methods started scratching the surface • KNAW HuC focuses on bridging the gap between quantitative & qualitative analyses through advancing natural language processing and semantic web methods Image source: https://upload.wikimedia.org/wikipedia/commons/b/b5/MediaWiki_flame_graph_screenshot_2014-12-15_22.png
  • 5. D I G I TA L H U M A N I T I E S L A B Digital Humanities • Involves the understanding of these cultural heritage data. • Methods involving Natural Language Processing supported by Knowledge Graphs have entered the humanities research community (Meroño-Peñuela et al.) Tabea Tietz et al. Challenges of Knowledge Graph Evolution from an NLP Perspective. WHiSe Workshop @ ESWC 2020
  • 6. D I G I TA L H U M A N I T I E S L A B Who has the biggest sweet tooth? • Sugar consumption patterns are difficult to trace • Historical apple pie recipes can serve as a proxy • Apple pastries are common in many cultures Marieke van Erp & Ulbe Bosma: Divergent patterns of sugar consumption in the wake of the Industrial Revolution: an analysis on the basis of apple pie recipes. Forthcoming
  • 7. D I G I TA L H U M A N I T I E S L A B Analysing historical recipes • Differences in availability of digitised sources • Digitisation artefacts hamper automatic analysis • Normalisation of quantities is needed • Combine quantitative & qualitative methods Marieke van Erp & Ulbe Bosma: Divergent patterns of sugar consumption in the wake of the Industrial Revolution: an analysis on the basis of apple pie recipes. (Forthcoming) Image source: https://en.wikipedia.org/wiki/Apple_pie#/media/File:For_to_Make_Tartys_in_Applis_(1381).gif
  • 8. D I G I TA L H U M A N I T I E S L A B Comparing Ingredients in Dutch and American Apple Pie Recipes Tabea Tietz et al. Challenges of Knowledge Graph Evolution from an NLP Perspective. WHiSe Workshop @ ESWC 2020
  • 10. D I G I TA L H U M A N I T I E S L A B Comparing sugar quantities in Dutch, American, French and German apple pie recipes Marieke van Erp & Ulbe Bosma: Divergent patterns of sugar consumption in the wake of the Industrial Revolution: an analysis on the basis of apple pie recipes. (Forthcoming)
  • 11. D I G I TA L H U M A N I T I E S L A B What is an apple pie? • The real world is constantly changing • Knowledge that was considered true at one point in time in a specific cultural and spa7al setting may not be true in another context • Concepts evolve Tabea Tietz et al. Challenges of Knowledge Graph Evolution from an NLP Perspective. WHiSe Workshop @ ESWC 2020
  • 12. D I G I TA L H U M A N I T I E S L A B Cultural Context ● What is considered as true in one cultural setting may not be in another. ● Apfelstrudel == apple pie? Tabea Tietz et al. Challenges of Knowledge Graph Evolution from an NLP Perspective. WHiSe Workshop @ ESWC 2020
  • 13. How can we store this type of information at scale?
  • 14. D I G I TA L H U M A N I T I E S L A B Concept modelling • Computer Science: Knowledge Representation/Semantic Web • Long history: at least since Aristotle • Machine readable knowledge was Sir Tim Berners-Lee’s intent when he developed the World Wide Web • To date, we have several large scale knowledge graphs such as DBpedia and Wikidata Image source: https://upload.wikimedia.org/wikipedia/commons/c/c6/Complexity_vs._orderliness.png
  • 15. D I G I TA L H U M A N I T I E S L A B Knowledge Graphs • Represent what we consider true about parts of the world • Are created and maintained to continuously compose knowledge (Bonatti et al.). Tabea Tietz et al. Challenges of Knowledge Graph Evolution from an NLP Perspective. WHiSe Workshop @ ESWC 2020
  • 16. D I G I TA L H U M A N I T I E S L A B But: • Knowledge Graphs are often static and only reflect one snippet of reality • This static representation of the real world is a problem when attempting to understand historical descriptions of concepts (Bonatti et al., Tasnim et al.) Tabea Tietz et al. Challenges of Knowledge Graph Evolution from an NLP Perspective. WHiSe Workshop @ ESWC 2020
  • 17. D I G I TA L H U M A N I T I E S L A B Concepts • Are manifested in our cultures’ norms and values • Are documented through photographs, newspapers, books, music, film, advertisements. Tabea Tietz et al. Challenges of Knowledge Graph Evolution from an NLP Perspective. WHiSe Workshop @ ESWC 2020
  • 18. D I G I TA L H U M A N I T I E S L A B Spatio-temporal context ● Distinguish the spatio- temporal metadata of the concept itself and the metadata of its source ● Trace the evolution of the concept over time and geographic regions Tabea Tietz et al. Challenges of Knowledge Graph Evolution from an NLP Perspective. WHiSe Workshop @ ESWC 2020
  • 19. D I G I TA L H U M A N I T I E S L A B Units ● Modern units ○ imperial vs. metric system (lbs, kg) ● Historical units ○ ell, zentner ● Natural language description of measurements ○ “a load of butter”, “a plate of apples” Tabea Tietz et al. Challenges of Knowledge Graph Evolution from an NLP Perspective. WHiSe Workshop @ ESWC 2020
  • 20. D I G I TA L H U M A N I T I E S L A B Concept modelling ● How broad or narrow should the ontology be modeled to fit the concept but also capture its changes over time? ● What are the properties that define a concept across the spatio-temporal and cultural context? Tabea Tietz et al. Challenges of Knowledge Graph Evolution from an NLP Perspective. WHiSe Workshop @ ESWC 2020
  • 22. D I G I TA L H U M A N I T I E S L A B Language & Meaning • Human language is incredibly flexible and efficient • We can use the term ‘sugar’ to refer to • the sugar industry (a sour day for sugar) • to particular instances of sugar (shall I put some sugar in?) • nutritional information (sugar and fiber intake) • commodities (grain and sugar are produced) • How can computers make sense of this? Marieke van Erp & Paul Groth (2020) Towards Entity Spaces. In: Proceedings of The 12th Language Resources and Evaluation Conference (LREC’2020)
  • 23. D I G I TA L H U M A N I T I E S L A B Proxy for Entity Spaces Marieke van Erp & Paul Groth (2020) Towards Entity Spaces. In: Proceedings of The 12th Language Resources and Evaluation Conference (LREC’2020)
  • 24. D I G I TA L H U M A N I T I E S L A B Tolerant Entity Linking • Not every meaning of an entity or concept is represented in a knowledge base • We argue that a link to an entity space is better than no link • ‘good enough interpretation’ (Poesio et al.) • Proof of concept shows increase in recall for 8 out of 13 datasets Marieke van Erp & Paul Groth (2020) Towards Entity Spaces. In: Proceedings of The 12th Language Resources and Evaluation Conference (LREC’2020)
  • 25. D I G I TA L H U M A N I T I E S L A B Next steps • Extending entity spaces beyond Wikipedia • Structuring concepts within entity spaces • Add temporal dimension • Intangible concepts • Scale up Marieke van Erp & Paul Groth (2020) Towards Entity Spaces. In: Proceedings of The 12th Language Resources and Evaluation Conference (LREC’2020)
  • 26. D I G I TA L H U M A N I T I E S L A B New Horizons • Complex concepts have multiple dimensions • Dimensions may go beyond a single discipline • Recognising, modelling & using concepts and knowledge graphs require team work
  • 27. D I G I TA L H U M A N I T I E S L A B Unexpected Crews • Within the KNAW Humanities Cluster, we harbour (computational) linguists, historians, literature scientists, ethnologists, developers, network specialists, digital humanists… • Different disciplines find each other on intersection of topics/ data/methods • Use your network!
  • 28. D I G I TA L H U M A N I T I E S L A B Wrapping Up • Text analysis and knowledge representation are becoming more important to humanities research • Big challenges for complex information extraction and modelling • Interdisciplinary collaboration is needed
  • 29. http://dhlab.nl Acknowledgments: Adina Nerghes, Eleonora Marzi, Fabio Mariani, Harald Sack, ISWS Summer School, Lientje Maas, Mehwish Alam, Melvin Wevers, Mortaza Alinam, Paul Groth, Tabea Tietz, Ulbe Bosma & Wouter van den Berg
  • 30. References • Tabea Tietz, Mehwish Alam, Harald Sack and Marieke van Erp (2020) Challenges of Knowledge Graph Evolution from an NLP Perspective. WHiSe Workshop @ ESWC 2020 • Marieke van Erp & Paul Groth (2020) Towards Entity Spaces. In: Proceedings of The 12th Language Resources and Evaluation Conference (LREC’2020) • Marieke van Erp & Ulbe Bosma: Divergent patterns of sugar consumption in the wake of the Industrial Revolution: an analysis on the basis of apple pie recipes. (Forthcoming) • Piero Andrea Bonatti, Stefan Decker, Axel Polleres and Valentina Presutti (2019) Knowledge Graphs: New Directions for Knowledge Representation on the Semantic Web. Dagstuhl Seminar 18371). Dagstuhl Reports 8(9), 29–111 (2019). https://doi.org/10.4230/DagRep.8.9.29 • Albert Meroño-Peñuela, Ashkan Ashkpour, Marieke van Erp, Kees Mandemakers, Leen Breure, Andrea Scharnhorst, Stefan Schlobach, Frank van Harmelen (2015) Semantic technologies for historical research: A survey. In: Semantic Web Journal • Mayesha Tasnim, Diego Collarana, Damien Graux, Fabrizio Orlandi and Maria-Esther Vidal (2019) Summarizing Entity Temporal Evolution in Knowledge Graphs. In: Companion Proceedings of The 2019 World Wide Web Conference •