SlideShare a Scribd company logo
Cultural Linked Open Data
2014-02-06
Lars Marius Garshol, larsga@bouvet.no, http://twitter.com/larsga
1
The importance of data
• Most web sites are data-driven
– if you have the data, you can add functionality
– if you don’t have the data, you’re stuck

• Example: Google Maps
– imagine you have the application, the server
farm, the scaling and monitoring, etc
– but you don’t have the actual map data
– not only are you stuck, but creating the data is
much harder than making the service

2
3
4

Research project by SINTEF and Computas
Data sources

Research project by
SINTEF and Computas
5
Must be at meeting at 1345. Three transport alternatives.

6

Research project by SINTEF and Computas
Data is raw material
for building services!

7
Possible users of cultural data
• Any kind of web store
– publishers
– streaming services
– ...

• Travel businesses

– public sector, hotels, tour organizers, event
organizers, ...

• Media

– newspapers, broadcasting, ...

• Lots of public sector uses
– education, ...

• Many things none of us can’t imagine now
8
9
Only linked data is usable
NRK/Skole

Cappelen Damm

10
Linked Open Data
• Movement to publish open data online
– in machine-readable form
– linked to other data sets

• Based on some key technologies
– URLs for identifiers
– RDF for data

• Gaining a lot of traction in the cultural sector
–
–
–
–

11

BBC
Europeana
Smithsonian Institution
...
The technology
• Provides simple data representation
–
–
–
–

graph model (RDF)
has ready-made formats (XML, text, JSON, ...)
standard query language (SPARQL)
lots of RDF databases available

• Allows anyone to refer to anything

– a museum can say explicitly that one object in
their collection has a specific relation to an object
in another collection
– liberation from the ID scheme confusion

• Can reuse terminology from other
authorities
– can also easily extend that terminology

12
13

http://lod-cloud.net/
14
http://dbpedia.org/resource/Knut_Faldbakken
• Globally unique
– across all systems and organizations

• Distributed
– if you have a domain, you can make URIs

• Self-documenting
– just follow the link to find documentation

• Can be used anywhere
– anyone can point at anything
Today
•
•
•
•

Flat, unlinked data
No navigation
No connections
Poor characterization
– doesn’t say what it is

16
Europeana Data Model

As linked data

edm:ProvidedCHO

nv:Photograph

rdf:type
dc:title
dc:date

“Bergliot Ibsen”
1903

dc:subject

foaf:Person
rdfs:label

“Bergliot Ibsen”
dbp:died

1953-02-02
dbp:born

1869-06-10
nv:provider

http://dbpedia.org/resource/Bergliot_Ibsen

rdfs:label
grs:point
17

“Aulestad”
61.2173 10.265952
http://dbpedia.org/resource/Aulestad
Choice of tools
Modelling

pellet

Reasoners

Redland RDF Libraries
APIs

Triple stores
Great, but how can we actually
link the data?

19
20
“Do they have Knut Faldbakken in here?”

21

http://data.deichman.no/sparql
Yes, but not connected to anything ...

...can we do anything about that?

22
Record linkage to the rescue
• Active research field

– dating back to the 1940s

• Can connect data
without common IDs

– measure similarity instead

• Tools exist, with
–
–
–
–

value cleaning
statistical analysis
sophisticated comparators
fast search backends

• One example is Duke

– http://code.google.com/p/duke/
– Java and open source

23
Connect to DBpedia
http://data.deichman.no/...dbakken_Knut_1941-

http://dbpedia.org/resource/Knut_Faldbakken

NAME:
LIFESPAN:
NATIONALITY: n

NAME:
BIRTHDATE:

Faldbakken, Knut
1941-

Knut Faldbakken
1941-08-31

Complete recipe here

24

http://code.google.com/p/duke/wiki/DeichmanLink
Training with genetic algorithm

25

http://www.garshol.priv.no/blog/262.html
Conclusion
• Linked Open Data has tremendous
potential
– vastly easier reuse of data
– hugely empowering for consumers
– also opens new possibilities for data owners

• Growing use in cultural sector
– both internationally and in Norway

• To learn more
– http://www.slideshare.net/larsga/linked-opendata-14964163
– http://data.norge.no/veiledning
– http://linkeddatabook.com/editions/1.0/
26
Hafslund SESAM

27

More Related Content

PPTX
Estermann Panel on Authority Files, 3 June 2020
PPT
VIII Encuentros de Centros de Documentación de Arte Contemporáneo en Artium -...
PPTX
Intro to IIIF and IIIF @NLW
PPT
VIII Encuentros de Centros de Documentación de Arte Contemporáneo en Artium -...
PDF
Butigan vucaj dh_ilde
PPTX
Austrian Experience in Building Data Value Chain
PPT
British Library Labs Presentation at the Accelerating Human Imagination Workshop
PPT
Keynote: Unexpected repurposing
Estermann Panel on Authority Files, 3 June 2020
VIII Encuentros de Centros de Documentación de Arte Contemporáneo en Artium -...
Intro to IIIF and IIIF @NLW
VIII Encuentros de Centros de Documentación de Arte Contemporáneo en Artium -...
Butigan vucaj dh_ilde
Austrian Experience in Building Data Value Chain
British Library Labs Presentation at the Accelerating Human Imagination Workshop
Keynote: Unexpected repurposing

What's hot (20)

PPTX
Widening the limits of cognitive reception with online digital library graph ...
PPT
The Great Twentieth-Century Hole Or, what the Digital Humanities Miss
PPTX
data - driven journalism 1
PDF
UI design for open data V02 nov 2014
PDF
Reusing historical newspapers of KB in e-humanities - Case studies and exampl...
PPT
Wikidata Introductory Workshop
PPTX
Online Marketing with Schema.org and Multi-channel Communication
PPT
Wikidata Introduction, Linked Digital Future Initiative, August 2019
PPT
British Library Labs Presentation at the Accelerating Human Imagination Workshop
PDF
creating a trading zone around twitter srchives. case study: paris attacks
PPTX
Cross-sector collaboration for digital museum and library projects
PDF
Europeana Research Panel DH Benelux 2017
PPTX
Zeng marcia ifla-subjectaccesssmartdatadh
PDF
HRI presentation for Umeå delegation 1.12.2015
PDF
GLAMorous LOD
PDF
Advanced web searching
PDF
UI design for open data
PPTX
eluxemburgensia: the portal for Luxembourg's historic newspapers
PDF
SSHA 2019: Reconstructring a country
PDF
How do you know what you are looking for?
Widening the limits of cognitive reception with online digital library graph ...
The Great Twentieth-Century Hole Or, what the Digital Humanities Miss
data - driven journalism 1
UI design for open data V02 nov 2014
Reusing historical newspapers of KB in e-humanities - Case studies and exampl...
Wikidata Introductory Workshop
Online Marketing with Schema.org and Multi-channel Communication
Wikidata Introduction, Linked Digital Future Initiative, August 2019
British Library Labs Presentation at the Accelerating Human Imagination Workshop
creating a trading zone around twitter srchives. case study: paris attacks
Cross-sector collaboration for digital museum and library projects
Europeana Research Panel DH Benelux 2017
Zeng marcia ifla-subjectaccesssmartdatadh
HRI presentation for Umeå delegation 1.12.2015
GLAMorous LOD
Advanced web searching
UI design for open data
eluxemburgensia: the portal for Luxembourg's historic newspapers
SSHA 2019: Reconstructring a country
How do you know what you are looking for?
Ad

Similar to Linked Open Data for the Cultural Sector (20)

PPTX
Linked Open Data Utrecht University Library
PDF
Linked Open Data for Digital Humanities
PPTX
Omitola birmingham cityuniv
PDF
Linked Data Management
PDF
Linking knowledge spaces
PDF
Open data and linked data
PPTX
The Semantic Web Exists. What Next?
PPT
Open Data Masterclass - Europeana and LOD
PPTX
Linked Open Data
PPTX
It19 20140721 linked data personal perspective
PDF
Implementing Linked Data in Low-Resource Conditions
PDF
Maintaining scholarly standards in the digital age: Publishing historical gaz...
PDF
Linked Data on a Budget
PPTX
What Are Links in Linked Open Data? A Characterization and Evaluation of Link...
PPTX
Connecting Heterogeneous Collections using Linked Data
PDF
Jabes 2011 - Conférence inaugurale "Linked Open Data : opportunités et défis"
PPTX
Linked open data project
PDF
Europeana and linked cultural heritage data
PDF
Methodological Guidelines for Publishing Linked Data
Linked Open Data Utrecht University Library
Linked Open Data for Digital Humanities
Omitola birmingham cityuniv
Linked Data Management
Linking knowledge spaces
Open data and linked data
The Semantic Web Exists. What Next?
Open Data Masterclass - Europeana and LOD
Linked Open Data
It19 20140721 linked data personal perspective
Implementing Linked Data in Low-Resource Conditions
Maintaining scholarly standards in the digital age: Publishing historical gaz...
Linked Data on a Budget
What Are Links in Linked Open Data? A Characterization and Evaluation of Link...
Connecting Heterogeneous Collections using Linked Data
Jabes 2011 - Conférence inaugurale "Linked Open Data : opportunités et défis"
Linked open data project
Europeana and linked cultural heritage data
Methodological Guidelines for Publishing Linked Data
Ad

More from Lars Marius Garshol (20)

PDF
JSLT: JSON querying and transformation
PDF
Data collection in AWS at Schibsted
PPTX
Kveik - what is it?
PDF
Nature-inspired algorithms
PDF
Collecting 600M events/day
PDF
History of writing
PDF
NoSQL and Einstein's theory of relativity
PPTX
Norwegian farmhouse ale
PPTX
Archive integration with RDF
PPTX
The Euro crisis in 10 minutes
PPTX
Using the search engine as recommendation engine
PPTX
NoSQL databases, the CAP theorem, and the theory of relativity
PPTX
Bitcoin - digital gold
PPTX
Introduction to Big Data/Machine Learning
PPTX
Hops - the green gold
PPTX
Big data 101
PPTX
Hafslund SESAM - Semantic integration in practice
PPTX
Approximate string comparators
PPTX
Experiments in genetic programming
PPTX
Semantisk integrasjon
JSLT: JSON querying and transformation
Data collection in AWS at Schibsted
Kveik - what is it?
Nature-inspired algorithms
Collecting 600M events/day
History of writing
NoSQL and Einstein's theory of relativity
Norwegian farmhouse ale
Archive integration with RDF
The Euro crisis in 10 minutes
Using the search engine as recommendation engine
NoSQL databases, the CAP theorem, and the theory of relativity
Bitcoin - digital gold
Introduction to Big Data/Machine Learning
Hops - the green gold
Big data 101
Hafslund SESAM - Semantic integration in practice
Approximate string comparators
Experiments in genetic programming
Semantisk integrasjon

Recently uploaded (20)

PPTX
Big Data Technologies - Introduction.pptx
PDF
Agricultural_Statistics_at_a_Glance_2022_0.pdf
PDF
Review of recent advances in non-invasive hemoglobin estimation
PPTX
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
PDF
Building Integrated photovoltaic BIPV_UPV.pdf
PPT
Teaching material agriculture food technology
PDF
Diabetes mellitus diagnosis method based random forest with bat algorithm
PDF
Approach and Philosophy of On baking technology
PDF
Encapsulation theory and applications.pdf
PDF
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
PDF
MIND Revenue Release Quarter 2 2025 Press Release
DOCX
The AUB Centre for AI in Media Proposal.docx
PDF
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
PPTX
Understanding_Digital_Forensics_Presentation.pptx
PPTX
sap open course for s4hana steps from ECC to s4
PDF
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
PDF
NewMind AI Weekly Chronicles - August'25 Week I
PDF
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
PPTX
20250228 LYD VKU AI Blended-Learning.pptx
PDF
Chapter 3 Spatial Domain Image Processing.pdf
Big Data Technologies - Introduction.pptx
Agricultural_Statistics_at_a_Glance_2022_0.pdf
Review of recent advances in non-invasive hemoglobin estimation
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
Building Integrated photovoltaic BIPV_UPV.pdf
Teaching material agriculture food technology
Diabetes mellitus diagnosis method based random forest with bat algorithm
Approach and Philosophy of On baking technology
Encapsulation theory and applications.pdf
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
MIND Revenue Release Quarter 2 2025 Press Release
The AUB Centre for AI in Media Proposal.docx
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
Understanding_Digital_Forensics_Presentation.pptx
sap open course for s4hana steps from ECC to s4
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
NewMind AI Weekly Chronicles - August'25 Week I
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
20250228 LYD VKU AI Blended-Learning.pptx
Chapter 3 Spatial Domain Image Processing.pdf

Linked Open Data for the Cultural Sector

  • 1. Cultural Linked Open Data 2014-02-06 Lars Marius Garshol, larsga@bouvet.no, http://twitter.com/larsga 1
  • 2. The importance of data • Most web sites are data-driven – if you have the data, you can add functionality – if you don’t have the data, you’re stuck • Example: Google Maps – imagine you have the application, the server farm, the scaling and monitoring, etc – but you don’t have the actual map data – not only are you stuck, but creating the data is much harder than making the service 2
  • 3. 3
  • 4. 4 Research project by SINTEF and Computas
  • 5. Data sources Research project by SINTEF and Computas 5
  • 6. Must be at meeting at 1345. Three transport alternatives. 6 Research project by SINTEF and Computas
  • 7. Data is raw material for building services! 7
  • 8. Possible users of cultural data • Any kind of web store – publishers – streaming services – ... • Travel businesses – public sector, hotels, tour organizers, event organizers, ... • Media – newspapers, broadcasting, ... • Lots of public sector uses – education, ... • Many things none of us can’t imagine now 8
  • 9. 9
  • 10. Only linked data is usable NRK/Skole Cappelen Damm 10
  • 11. Linked Open Data • Movement to publish open data online – in machine-readable form – linked to other data sets • Based on some key technologies – URLs for identifiers – RDF for data • Gaining a lot of traction in the cultural sector – – – – 11 BBC Europeana Smithsonian Institution ...
  • 12. The technology • Provides simple data representation – – – – graph model (RDF) has ready-made formats (XML, text, JSON, ...) standard query language (SPARQL) lots of RDF databases available • Allows anyone to refer to anything – a museum can say explicitly that one object in their collection has a specific relation to an object in another collection – liberation from the ID scheme confusion • Can reuse terminology from other authorities – can also easily extend that terminology 12
  • 14. 14
  • 15. http://dbpedia.org/resource/Knut_Faldbakken • Globally unique – across all systems and organizations • Distributed – if you have a domain, you can make URIs • Self-documenting – just follow the link to find documentation • Can be used anywhere – anyone can point at anything
  • 16. Today • • • • Flat, unlinked data No navigation No connections Poor characterization – doesn’t say what it is 16
  • 17. Europeana Data Model As linked data edm:ProvidedCHO nv:Photograph rdf:type dc:title dc:date “Bergliot Ibsen” 1903 dc:subject foaf:Person rdfs:label “Bergliot Ibsen” dbp:died 1953-02-02 dbp:born 1869-06-10 nv:provider http://dbpedia.org/resource/Bergliot_Ibsen rdfs:label grs:point 17 “Aulestad” 61.2173 10.265952 http://dbpedia.org/resource/Aulestad
  • 18. Choice of tools Modelling pellet Reasoners Redland RDF Libraries APIs Triple stores
  • 19. Great, but how can we actually link the data? 19
  • 20. 20
  • 21. “Do they have Knut Faldbakken in here?” 21 http://data.deichman.no/sparql
  • 22. Yes, but not connected to anything ... ...can we do anything about that? 22
  • 23. Record linkage to the rescue • Active research field – dating back to the 1940s • Can connect data without common IDs – measure similarity instead • Tools exist, with – – – – value cleaning statistical analysis sophisticated comparators fast search backends • One example is Duke – http://code.google.com/p/duke/ – Java and open source 23
  • 24. Connect to DBpedia http://data.deichman.no/...dbakken_Knut_1941- http://dbpedia.org/resource/Knut_Faldbakken NAME: LIFESPAN: NATIONALITY: n NAME: BIRTHDATE: Faldbakken, Knut 1941- Knut Faldbakken 1941-08-31 Complete recipe here 24 http://code.google.com/p/duke/wiki/DeichmanLink
  • 25. Training with genetic algorithm 25 http://www.garshol.priv.no/blog/262.html
  • 26. Conclusion • Linked Open Data has tremendous potential – vastly easier reuse of data – hugely empowering for consumers – also opens new possibilities for data owners • Growing use in cultural sector – both internationally and in Norway • To learn more – http://www.slideshare.net/larsga/linked-opendata-14964163 – http://data.norge.no/veiledning – http://linkeddatabook.com/editions/1.0/ 26