Employing Google Refine to Publish Linked Data
Fadi Maali and Richard Cyganiak
Road map
  • Self-Service Linked Government Data
  • Google Refine
  • RDF Export Extension
  • RDF Reconciliation Extension
We need government data as Linked Data, not just raw data
  • … aha, and of good quality!
Raw data catalogue, data.gov
We want governments to provide Linked Data, not just raw data … and of good quality
  • Time
  • Money
  • Skills
From raw data to Linked Data
  • DIY
DIY Recipe
  • Publishers provide RDF representation of their catalogues
  • Tool support to select datasets of interest and put them into RDF
  • User shares the RDF data
DIY Recipe
  • Publishers provide RDF representation of their catalogues (dcat)
  • Tool support to select datasets of interest and put them into RDF
  • User shares the RDF data
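As a rough illustration of the first step of the recipe, the sketch below writes one catalogue entry as N-Triples using dcat terms. It assumes the current W3C namespace for dcat (the slides do not say which namespace was used at the time), and every example.org URI, the dataset title, and the helper names are invented for the example.

```python
# Namespaces: DCAT is assumed to be the W3C one; the slides do not specify it.
DCAT = "http://www.w3.org/ns/dcat#"
DCT = "http://purl.org/dc/terms/"
RDF_TYPE = "http://www.w3.org/1999/02/22-rdf-syntax-ns#type"


def iri(value):
    """Wrap a URI in angle brackets, as N-Triples requires."""
    return f"<{value}>"


def literal(value):
    """Quote a plain string literal, escaping backslashes and double quotes."""
    escaped = value.replace("\\", "\\\\").replace('"', '\\"')
    return f'"{escaped}"'


def describe_dataset(catalog_uri, dataset_uri, title, download_url):
    """Return N-Triples describing one dataset and its CSV distribution."""
    distribution_uri = dataset_uri + "/csv"  # hypothetical URI pattern
    triples = [
        (iri(catalog_uri), iri(RDF_TYPE), iri(DCAT + "Catalog")),
        (iri(catalog_uri), iri(DCAT + "dataset"), iri(dataset_uri)),
        (iri(dataset_uri), iri(RDF_TYPE), iri(DCAT + "Dataset")),
        (iri(dataset_uri), iri(DCT + "title"), literal(title)),
        (iri(dataset_uri), iri(DCAT + "distribution"), iri(distribution_uri)),
        (iri(distribution_uri), iri(RDF_TYPE), iri(DCAT + "Distribution")),
        (iri(distribution_uri), iri(DCAT + "downloadURL"), iri(download_url)),
    ]
    return "\n".join(f"{s} {p} {o} ." for s, p, o in triples)


if __name__ == "__main__":
    print(describe_dataset(
        "http://example.org/catalog",
        "http://example.org/dataset/1",
        "Example dataset",
        "http://example.org/files/1.csv",
    ))
```

A tool could read such a description to find the raw file behind each dataset of interest before converting it.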
DIY Recipe
  • Publishers provide RDF representation of their catalogues (dcat)
  • Tool support to select datasets of interest and put them into RDF (Google Refine + RDF export extension + RDF reconciliation extension)
  • User shares the RDF data
DIY Recipe
  • Publishers provide RDF representation of their catalogues (dcat)
  • Tool support to select datasets of interest and put them into RDF (Google Refine + RDF export extension + RDF reconciliation extension)
  • User shares the RDF data: share RDF data along with the conversion process description (Provenance & Reproducibility)
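The idea of sharing the RDF together with a description of how it was produced can be sketched as follows. This is not the RDF export extension's actual format or API: the mapping structure, the column names, the example.org URIs, and the function names are all assumptions made for illustration. The point is only that a declarative mapping can be published next to the triples, so that someone else can rerun the same conversion.

```python
import csv
import io
import json
from urllib.parse import quote

# Hypothetical, declarative description of the conversion process.
MAPPING = {
    "base_uri": "http://example.org/university/",
    "key_column": "Institution",
    "columns": {
        "Institution": "http://www.w3.org/2000/01/rdf-schema#label",
        "Score": "http://example.org/vocab/score",
    },
}


def row_to_triples(row, mapping):
    """Turn one table row into N-Triples lines according to the mapping."""
    subject = f"<{mapping['base_uri']}{quote(row[mapping['key_column']])}>"
    triples = []
    for column, predicate in mapping["columns"].items():
        value = row[column].replace("\\", "\\\\").replace('"', '\\"')
        triples.append(f'{subject} <{predicate}> "{value}" .')
    return triples


def convert(csv_text, mapping):
    """Return (rdf, process): the triples and the mapping serialized as JSON."""
    reader = csv.DictReader(io.StringIO(csv_text))
    lines = [t for row in reader for t in row_to_triples(row, mapping)]
    return "\n".join(lines), json.dumps(mapping, indent=2, sort_keys=True)


if __name__ == "__main__":
    # Made-up input row, not taken from the demo dataset.
    rdf, process = convert("Institution,Score\nExample University,1\n", MAPPING)
    print(rdf)
    print(process)
```

Publishing both return values together gives a reader the data and enough information to reproduce it from the raw table.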
Road map
  • Self-Service Linked Government Data
  • Google Refine
  • RDF Export Extension
  • RDF Reconciliation Extension
Google Refine
  • "Google Refine is a power tool for working with messy data, cleaning it up, transforming it from one format into another, extending it with web services, and linking it to databases like Freebase" *
  • Desktop application that a user interacts with using a web browser
  • Open source (New BSD License)
  • Extensible
  * http://code.google.com/p/google-refine/
Demo
Demo
  • Top 100 IT universities in the UK (Guardian data blog, http://www.guardian.co.uk/news/datablog/2009/jun/02/universityguide-choosingadegree)
Demo
Demo
Demo
Demo
Demo
  • Top 100 Electronic Engineering universities in the UK (Guardian data blog, http://www.guardian.co.uk/news/datablog/2009/jun/02/universityguide-choosingadegree)
Demo
RDF Reconcile Extension (architecture diagram labels)
  • Google Refine
  • RDF Reconcile Extension
  • Sindice search API
  • Silk Server
  • Silk LSL
  • Crafted RDF
  • SPARQL: SPARQL endpoint
  • Hybrid SPARQL: SPARQL endpoint with fulltext extension
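To make the plain SPARQL route more concrete, here is a minimal sketch of reconciling a cell value against a SPARQL endpoint: build a query that fetches candidate entities by label, then rank the returned labels by string similarity. The slides do not describe how the extension actually queries or scores, so the query shape, the use of `rdfs:label`, the `difflib` similarity measure, and both function names are assumptions. The sketch stops short of sending the query over the network.

```python
from difflib import SequenceMatcher


def build_candidate_query(cell_value, limit=10):
    """Build a SPARQL 1.1 query for entities whose label contains the value."""
    escaped = cell_value.replace("\\", "\\\\").replace('"', '\\"')
    return (
        "PREFIX rdfs: <http://www.w3.org/2000/01/rdf-schema#>\n"
        "SELECT DISTINCT ?entity ?label WHERE {\n"
        "  ?entity rdfs:label ?label .\n"
        f'  FILTER(CONTAINS(LCASE(STR(?label)), LCASE("{escaped}")))\n'
        "}\n"
        f"LIMIT {int(limit)}"
    )


def rank_candidates(cell_value, candidates):
    """Sort (uri, label) pairs by similarity to the cell value, best first."""
    scored = [
        (uri, label,
         SequenceMatcher(None, cell_value.lower(), label.lower()).ratio())
        for uri, label in candidates
    ]
    return sorted(scored, key=lambda item: item[2], reverse=True)


if __name__ == "__main__":
    print(build_candidate_query("Aspirin"))
    # Made-up candidates standing in for endpoint results.
    for uri, label, score in rank_candidates("Aspirin", [
        ("http://example.org/a", "Aspirin tablets"),
        ("http://example.org/b", "aspirin"),
    ]):
        print(f"{score:.2f} {label} {uri}")
```

A substring `FILTER` like this forces the endpoint to scan labels, which is one plausible reason for offering a hybrid mode: an endpoint with a full-text extension can answer the candidate lookup from an index, using vendor-specific syntax in place of the `FILTER`.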
Benchmarking…
  • Reconciling DailyMed against DBpedia (SPARQL endpoint)
Benchmarking…
  • Reconciling DailyMed against Sider RDF dump
Conclusion
  • Publishers provide RDF representation of their catalogues (dcat)
  • Tool support to select datasets of interest and put them into RDF (Google Refine + RDF export extension + RDF reconciliation extension)
  • User shares the RDF data (??)
Links
  • Google Refine: http://code.google.com/p/google-refine/
  • RDF Export Extension: http://lab.linkeddata.deri.ie/2010/grefine-rdf-extension/
  • RDF Reconciliation Extension: will be released soon…
