IRL: Irish Record Linkage, 1864 - 1913
Creating and Consuming Metadata from Transcribed Historical Vital Records for Ingestion in a Long-term Digital Preservation Platform

Dolores Grant (a), Christophe Debruyne (b),
Rebecca Grant (a), and Sandra Collins (a)

(a)  Digital Repository of Ireland, Royal Irish Academy, Dublin, Ireland
(b)  ADAPT @ Trinity College Dublin, Dublin, Ireland

October 27, 2015 @ META4eS
Developing a platform applying semantic technologies to historical birth, death and marriage certificates.

Answering questions such as: "How accurate are historic maternal mortality rates (MMR) and infant mortality rates (IMR) for Dublin?"

Team consists of researchers (historians), digital archivists, and knowledge engineers.
[Figure: the project team, comprising Knowledge and Linked Data Engineers, Historians, and Digital Archivists]
General Register Office (GRO)
•  Vital registration data: birth certificates, death certificates and marriage records.
•  Digitised TIFF images of
hardcopy indexes and registers.
•  2 TB of data
•  Database describing the
digitised records allowing
searches on some fields.
©General Records Office of Ireland 2014
In prior work (see [1]), we created a Linked Data platform that allowed Digital Archivists to transcribe register pages, which were then transformed into RDF. That RDF was then used to populate other triplestores to analyze that data.

Part of the project, however, was also to investigate the digital long-term preservation of the digitized register pages, and the corresponding RDF.
[Figure: Creation of IRL Knowledge Base. Labels: Relational Database, Transformation, Vital Records Ontology, GRO Triplestore, Separation of Concerns, Historical Events Ontology, IRL Triplestore, Data Analytics, Digital Archivist, Historian, LOD Cloud]
Related work

•  Related work on the preservation of harvested metadata exists, e.g., in the context of GLAMs.
•  Little work was found in the context of historical (vital) records. It was limited to integration problems and to the problem of record linkage in databases.
•  We also wanted to focus on research-project-agnostic transcription of historical vital records (separation of concerns).
Method: Creating RDF Documents

•  Register pages are identified by a stamp number (e.g. "4646439"). We collect the triples around a page and its related records with the following query to create an RDF document:

    PREFIX rec: <http://purl.org/net/irish-record-linkage/records#>
    DESCRIBE * { ?page rec:stampNumber "4646439" ;
                       rec:withRecord ?record . }

•  We also add a foaf:primaryTopic statement to the document.
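As a rough illustration of the step above, the query can be parameterised per stamp number, and the extra foaf:primaryTopic statement written as an N-Triples line. This is only a sketch: the helper names `describe_query` and `primary_topic_triple`, the numeric check on stamp numbers, and the example.org IRIs are assumptions for illustration and are not taken from the project.

```python
REC_NS = "http://purl.org/net/irish-record-linkage/records#"
FOAF_PRIMARY_TOPIC = "http://xmlns.com/foaf/0.1/primaryTopic"


def describe_query(stamp_number: str) -> str:
    """Return the DESCRIBE query from the slide for one register page."""
    if not stamp_number.isdigit():
        # Assumption: stamp numbers are purely numeric, as in the slide's example.
        raise ValueError(f"unexpected stamp number: {stamp_number!r}")
    return (
        f"PREFIX rec: <{REC_NS}>\n"
        "DESCRIBE * {\n"
        f'  ?page rec:stampNumber "{stamp_number}" ;\n'
        "        rec:withRecord ?record .\n"
        "}"
    )


def primary_topic_triple(document_iri: str, page_iri: str) -> str:
    """Return an N-Triples line stating that the document is about the page."""
    return f"<{document_iri}> <{FOAF_PRIMARY_TOPIC}> <{page_iri}> ."


if __name__ == "__main__":
    print(describe_query("4646439"))
    # Hypothetical IRIs; the real document and page IRIs are not given in the slides.
    print(primary_topic_triple("http://example.org/doc/4646439",
                               "http://example.org/page/4646439"))
```

The query string would be sent to whichever SPARQL endpoint holds the transcriptions; rejecting non-numeric input keeps the literal from being used to inject extra query text.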
Method: Creating Qualified Dublin Core Metadata

•  Following the guidelines formulated in [2], we used XSPARQL [3] to transform the RDF documents into Qualified Dublin Core (QDC) metadata documents. We thus have an RDF file and a QDC file for each register page.
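To give a feel for the shape of the output, the sketch below writes a small QDC-style XML document with Python's standard library. It is a stand-in and not the XSPARQL transformation used in the project: the function name `build_qdc`, the root element name `qualifieddc`, the title wording and the year value are all assumptions; only the two Dublin Core namespace IRIs are standard.

```python
import xml.etree.ElementTree as ET

DC = "http://purl.org/dc/elements/1.1/"
DCTERMS = "http://purl.org/dc/terms/"
ET.register_namespace("dc", DC)
ET.register_namespace("dcterms", DCTERMS)


def build_qdc(fields):
    """Serialise (namespace, term, value) tuples as one QDC-style XML string."""
    root = ET.Element("qualifieddc")  # assumed root element name
    for namespace, term, value in fields:
        child = ET.SubElement(root, f"{{{namespace}}}{term}")
        child.text = value
    return ET.tostring(root, encoding="unicode")


example = build_qdc([
    (DC, "title", "Register page 4646439"),   # assumed title wording
    (DC, "identifier", "4646439"),            # stamp number from the slides
    (DCTERMS, "temporal", "1900"),            # made-up year, for illustration only
])

if __name__ == "__main__":
    print(example)
```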
[Figure: transcription model, with the Dublin Core element each field is mapped to in brackets. Cardinalities shown between Register Page and Record: 1 and 0..10]

Register Page
•  District/Union/County [SPATIAL COVERAGE]
•  Superintendent registrar's district
•  Date certified as true copy by superintendent registrar [ISSUED]
•  Date certified by registrar [CREATED]
•  Forename/surname registrar on page
•  Forename/surname superintendent registrar [CREATOR]
•  Page number/Volume/Quarter
•  Stamp number [IDENTIFIER / used in TITLE]
•  Year registered [TEMPORAL COVERAGE]

Record
•  Date of registration
•  Title/forename/surname registrar
•  Amendments
•  Number in register

Certificate
•  Forename/surname (of subject) [PART OF DESCRIPTION]
•  Address (of subject)
•  Sex (of subject) [PART OF DESCRIPTION]
•  Forename/surname informant
•  Qualification of informant
•  Relationship of informant
•  Residence of informant

Death Record
•  Forename/surname of registrar
•  Date of death [PART OF DESCRIPTION]
•  Cause of death and duration of illness
•  Condition
•  Age last birthday
•  Place of residence
•  Rank, profession or occupation
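The bracketed annotations above amount to a lookup from transcribed fields to Dublin Core terms. A minimal sketch of that lookup follows; the dictionary keys, the function names `map_page` and `describe_subject`, the title wording and the way the description parts are joined are all hypothetical, chosen only to make the mapping concrete.

```python
# Page-level fields (hypothetical key names) and the terms shown in brackets.
PAGE_FIELD_TO_TERM = {
    "district_union_county": "spatial",
    "date_certified_by_superintendent_registrar": "issued",
    "date_certified_by_registrar": "created",
    "superintendent_registrar_name": "creator",
    "stamp_number": "identifier",
    "year_registered": "temporal",
}


def map_page(page):
    """Return (term, value) pairs for the page fields that are present."""
    pairs = [(term, page[field])
             for field, term in PAGE_FIELD_TO_TERM.items()
             if field in page]
    if "stamp_number" in page:
        # The stamp number is also used in the title; this wording is assumed.
        pairs.append(("title", f"Register page {page['stamp_number']}"))
    return pairs


def describe_subject(record):
    """Join the fields marked [PART OF DESCRIPTION]; the format is assumed."""
    parts = [record.get(key) for key in ("subject_name", "sex", "date_of_death")]
    return ", ".join(part for part in parts if part)


if __name__ == "__main__":
    print(map_page({"stamp_number": "4646439", "year_registered": "1900"}))
    print(describe_subject({"subject_name": "Jane Doe", "sex": "F"}))
```

Fields without a bracketed term, such as the page number or the informant details, are simply left out of the metadata in this sketch and stay available only in the RDF transcription.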
[Figure: part of the IRL platform. Labels: Relational Database, Transformation, Vital Records Ontology, GRO Triplestore, Digital Archivist; RDF File 1, 2, …, n, each transformed into Qualified Dublin Core XML 1, 2, …, n; Register Page 1, 2, …, n; ingestion into the digital long-term preservation platform]
Method: Bulk Ingestion into a Digital Long-term Repository

•  We adopted the Digital Repository of Ireland (http://repository.dri.ie/).
•  It provides item-by-item ingestion, or bulk ingestion via command-line tools.
•  Files (digitized register pages, RDF and QDC) are named in a certain way to relate the QDC with the digitized asset and RDF transcription.
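The slides do not spell out the naming convention. One plausible reading, shown here purely as an assumption, is that the three files for a page share a base name (for example the stamp number) and differ only in extension. The extensions, the role names and both function names below are hypothetical.

```python
from collections import defaultdict
from pathlib import PurePath

# Assumed extension per role; the real convention is not stated in the slides.
ROLE_BY_SUFFIX = {".tif": "asset", ".rdf": "rdf", ".xml": "qdc"}


def group_by_stem(filenames):
    """Group file names that share a base name, e.g. a page's stamp number."""
    groups = defaultdict(dict)
    for name in filenames:
        path = PurePath(name)
        role = ROLE_BY_SUFFIX.get(path.suffix.lower())
        if role is not None:  # ignore files with an unknown extension
            groups[path.stem][role] = name
    return dict(groups)


def incomplete(groups):
    """Return the base names that lack an asset, an RDF file or a QDC file."""
    required = set(ROLE_BY_SUFFIX.values())
    return sorted(stem for stem, files in groups.items() if set(files) != required)


if __name__ == "__main__":
    found = group_by_stem(["4646439.tif", "4646439.rdf", "4646439.xml", "4646440.tif"])
    print(found["4646439"])
    print(incomplete(found))
```

A check like `incomplete` could be run before bulk ingestion so that pages missing a transcription or metadata file are caught early.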
Conclusions and Future Work

•  We created an automated process for creating assets, RDF transcriptions and associated metadata and uploading them to a long-term preservation platform.
•  Evaluation is limited due to the data sharing agreements, both in terms of discoverability on the repository via faceted search and in terms of suitability of the metadata via expert feedback.
•  A comparison of Qualified Dublin Core with Encoded Archival Description (EAD) is to be conducted as well.
References
1.  Debruyne, C., Beyan, O.D., Grant, R., Collins, S., Decker, S.: On a Linked Data Platform for Irish Historical Vital Records. TPDL 2015, pp. 99–110 (2015)
2.  Bustillo, M., Collins, S., Gallagher, D., Grant, R., Harrower, N., Kenny, S., Ní Cholla, R., O'Carroll, A., Redmond, S., Webb, S.: Qualified Dublin Core and the Digital Repository of Ireland (Grant, R. ed.). Tech. rep., Maynooth: Maynooth University; Dublin: Trinity College Dublin; Dublin: Royal Irish Academy; Galway: National University of Ireland, Galway (2015)
3.  Dell'Aglio, D., Polleres, A., Lopes, N., Bischof, S.: Querying the Web of Data with XSPARQL 1.1. In: Verborgh, R., Mannens, E. (eds.) Proceedings of the ISWC Developers Workshop 2014, co-located with the 13th International Semantic Web Conference (ISWC 2014), Riva del Garda, Italy, October 19, 2014. CEUR Workshop Proceedings, vol. 1268, pp. 113–118. CEUR-WS.org (2014)
Questions?
More information
•  Twitter: @IRL_Project
•  Project website: http://irishrecordlinkage.wordpress.com/
