SlideShare a Scribd company logo
Exploring Web Data & Knowledge through
the Semantic Web
Dr. Stefan Dietze
L3S Research Center

Stefan Dietze

27/11/13

1
Pluto & the seven Dwarfs?

pluto the dwarf planet ?

„…solar
system…
#pluto“

Stefan Dietze

27/11/13
“A little semantics goes a long way” (J.

1)
Hendler

yago:AstronomicalObjects

Semantic Web
 Adding meaning through
shared vocabularies and
schemas (eg DBpedia)
 W3C standards RDF &
SPARQL for data &
knowledge representation
and querying
 Persistent URIs to reference
& interlink data on the Web

dbp:CelestialBody

typeOf

typeOf

dbp:Pluto
dwarfPlanetOf

redirectOf

dbp:SolarSystem

namedAfter

dbp:Pluto(mythology)
dbp:DwarfPlanetPluto

„…solar
system…
#pluto“

1 Hendler,

J., The Dark Side of the Semantic Web, IEEE Intelligent Systems, Jan/Feb 2007
Semantic Web / Linked Data
 De-facto standard for sharing data on the Web
 Vision: well connected graph of open Web data
 350+ datasets and 32 billion triples in LOD Cloud alone
 Other „incarnations“:
 Google
 „HTTP-accessibility“
(SPARQL, URI-dereferencing)

Knowledge Graph

 Facebook Open Graph

 „Structure“ & „Semantics“
(=> shared/linked vocabularies)

 http://schema.org
BBC
Program
mes

 „Interlinked“
 „Persistent“
FOAF

DBpedia
Ontology
Geo
Ontology

Gene
Ontology

Stefan Dietze

Dublin
Core

BIBO
That’s awesome, but...
Hm,
really?

…why are there so few datasets actually used?
 Date reuse and in-links focused on trusted „reference
graphs“ such as DBpedia (i.e. Wikipedia)
 Long tail of LD datasets which are neither reused nor linked
to (LOD Cloud alone consists of 300+ datasets)

 „HTTP-accessibility“
(SPARQL, URI-dereferencing)

 Explanations?

 „Structure“ & „Semantics“
(=> shared/linked vocabularies)
 „Interlinked“
 „Persistent“

Stefan Dietze

27/11/13
Open data is more diverse than we think
SPARQL Web-Querying Infrastructure: Ready for Action?,
Carlos Buil-Aranda, Aidan Hogan, Jürgen Umbrich Pierre-Yves
Vandenbussch, International Semantic Web Conference 2013,
(ISWC2013).

Accessibility of datasets?
 Less than 50% of all SPARQL endpoints actually responsive
at given point of time
 “THE” SPARQL protocol? No, but many variants & subsets
 …

SPARQL endpoint availability over time [Buil-Aranda et al 2013]

Shared vocabularies & schemas, but:

 …still very heterogeneous [d’Aquin, WebSci13]
 …data partially messy an not conformant
(RDFS, schemas) [HoganJWS2012]

Co-occurence graph of data
types in 146 datasets: 144
Vocabularies, 588 highly
overlapping types, 719
Properties

 …even widely used reference datasets such as
DBpedia noisy [Paulheim2013]
Assessing the Educational Linked Data Landscape, D’Aquin, M.,
Adamou, A., Dietze, S., ACM Web Science 2013 (WebSci2013), Paris,
France, May 2013.
Type Inference on Noisy RDF Data, Paulheim H., Bizer, C. Semantic
Web – ISWC 2013, Lecture Notes in Computer Science Volume 8218,
2013, pp 510-525

Stefan Dietze

An empirical survey of Linked Data conformance. Hogan, A., Umbrich,
J., Harth, A., Cyganiak, R., Polleres, A., Decker., S., In the Journal of Web
Semantics 14: pp. 14–44, 2012
Too many/diverse datasets, too little information
 Which datasets are useful & trustworthy for case
XY (eg „learning about the solar system“) ?
 Which topics (eg „Astronomy“) are covered by
dataset X?
 Which datasets describe/offer videos (slides,
publications, statistics etc)?

?

?
?

Stefan Dietze

27/11/13
Data curation and dataset profiling
 Which datasets are useful & trustworthy for case
XY (eg „learning about the solar system“) ?
 Which topics (eg „Astronomy“) are covered by
dataset X?
 Which datasets describe/offer videos (slides,
publications, statistics etc)?

 Catalog of data (LinkedUp
Catalog): classification of datasets
according to resource types,
disciplines/topics, data quality,
accessability, etc

 Infrastructure for
distributed/federated querying

describes

Stefan Dietze

LinkedUp
Dataset Catalog

27/11/13
Dataset profiling: what’s all the data about
po:Programme

AAISO

BBC Programme

bibo:Fi
bibo:Film
bibo:Fil BIBO FOAF

<po:Programme …>
<po:Series>Wonders of the Solar System</.>
<po:Actor>Brian Cox</…>
</po:Programme…>

Schema mappings

yov:Video
contains

Yovisto Video
<yo:Video …>
<dc:title>Pluto & the
Dwarf Planets</dc:title>
…
</yo:Video…>

Entity disambiguation
db:Astro. Objects
db:Astro. Objects
db:Astronomy

Topic profile extraction

Dataset
Metadata

Stefan Dietze

LinkedUp
Dataset Catalog

27/11/13
LinkedUp Data Catalog
inExplore & query for datasets/types & topics
 a nutshell

http://data.linkededucation.org/linkedup/categories-explorer
http://data.linkededucation.org/linkedup/catalog/

 Federated queries using type mappings

Stefan Dietze

27/11/13
LinkedUp Challenge: using open data for learning
http://linkedup-challenge.org

 Open Data Competition to promote tools and applications that analyse / integrate (Linked)
Web data
 Organised by LinkedUp project over 2 years (“Veni”, “Vidi”, “Vici”) with 40.000 EUR awards
 Veni Competition - 22 submissions, 8 shortlisted for presentation at Open Knowledge
Conference (17 September, Geneva Switzerland)

Stefan Dietze

27/11/13
st
1

Place: PoliMedia
Exploring political debates & events
http://www.polimedia.nl/

 Cross-media exploration & analysis of political
events
(parliament debates and media coverage)
 Automatically generated links between transcripts
debates, newspaper articles, and radio bulletins.
 (Linked) Data available at http://data.polimedia.nl

 Data sources: 1) newspapers of the historical
newspaper archive, 2) radio bulletins of the Dutch
National Press Agency (ANP)
 9000+ debates (1945 – 1995)
 Over 3000 media links

Martijn Kleppe, Max Kemman, Henri Beunders (Erasmus
Universiteit Rotterdam), Laura Hollink Damir Juric (Vrije
Universiteit Amsterdam), Johan Oomen Jaap Blom
(Nederlands Instituut voor Beeld en Geluid)
Stefan Dietze

27/11/13
Outlook: more “focused” data reuse challenges
http://linkedup-challenge.org/

Open Track

Focused Track

 Scalable tools and applications
using (Linked) open data for
educational purposes

 LinkedUp data catalog
 Promotion of selected Veni
submissions

 Simplifying complex
information to make it
accessible (example:
publications from Elsevier)

 Recommender system for
educational resources (courses,
MOOCs) relevant to user
interests

 Approx. 20.000 EUR awards budget
 Final events at 11th Extended Semantic Web Conference (ESWC2014)
 Submission: 14 February 2014

Stefan Dietze

27/11/13

13
Thank you!

REFERENCES

WWW

Assessing the Educational Linked Data Landscape, D’Aquin, M., Adamou,
A., Dietze, S., ACM Web Science 2013 (WebSci2013), Paris, France, May
2013.

See also (data)

Generating structured Profiles of Linked Data Graphs, Fetahu, B; Dietze,
S., d’Aquin, M., Nunes, B.P., ISWC2013 – 12th International Semantic Web
Conference;

 http://datahub.io/group/linked-education
 http://data.linkededucation.org

 http://data.linkededucation.org/linkedup/catalog/
 http://lak.linkededucation.org

Combining a co-occurrence-based and a semantic measure for entity
linking, B. P. Nunes, S. Dietze, M.A. Casanova, R. Kawase, B. Fetahu, and
W. Nejdl., ESWC 2013 - 10th Extended Semantic Web Conference, (May
2013).

See also (general)

Type Inference on Noisy RDF Data, Paulheim H., Bizer, C. Semantic Web –
ISWC 2013, Lecture Notes in Computer Science Volume 8218, 2013, pp
510-525
An empirical survey of Linked Data conformance. Hogan, A., Umbrich, J.,
Harth, A., Cyganiak, R., Polleres, A., Decker., S., In the Journal of Web
Semantics 14: pp. 14–44, 2012

 http://linkedup-project.eu
 http://linkedup-challenge.org
 http://linkededucation.org
 http://linkeduniversities.org

SPARQL Web-Querying Infrastructure: Ready for Action?, Carlos BuilAranda, Aidan Hogan, Jürgen Umbrich Pierre-Yves Vandenbussch,
International Semantic Web Conference 2013, (ISWC2013).

Stefan Dietze

27/11/13

14

More Related Content

PDF
Demo: Profiling & Exploration of Linked Open Data
PDF
A structured catalog of open educational datasets
PDF
WWW2013 Tutorial: Linked Data & Education
PDF
Turning Data into Knowledge (KESW2014 Keynote)
PDF
Retrieval, Crawling and Fusion of Entity-centric Data on the Web
PDF
KnowEscape workshop, OKCon 2013
PDF
Linked Data for Federation of OER Data &amp; Repositories
PPTX
Online Learning and Linked Data: An Introduction
Demo: Profiling & Exploration of Linked Open Data
A structured catalog of open educational datasets
WWW2013 Tutorial: Linked Data & Education
Turning Data into Knowledge (KESW2014 Keynote)
Retrieval, Crawling and Fusion of Entity-centric Data on the Web
KnowEscape workshop, OKCon 2013
Linked Data for Federation of OER Data &amp; Repositories
Online Learning and Linked Data: An Introduction

What's hot (20)

PDF
Mining and Understanding Activities and Resources on the Web
PDF
DBpedia InsideOut
ZIP
Intro to Linked Open Data in Libraries, Archives & Museums
PPTX
What is #LODLAM?! Understanding linked open data in libraries, archives [and ...
PDF
The Europeana Datamodel: A semantic layer on top of Cultural Heritage Objects
PDF
Web Data Management with RDF
PPTX
Data Science Curriculum for Professionals
PPTX
Semantic Web, Linked Data and Education: A Perfect Fit?
PDF
Linked Open Data for Digital Humanities
ZIP
Intro to Linked Open Data in Libraries Archives & Museums.
PDF
Open Data Dialog 2013 - Linked Data in Education
PDF
Web Data Management in the RDF Age
PPTX
Creating knowledge out of interlinked data
PDF
Linked open data and libraries
PPTX
What is #LODLAM?! (revised January 2015)
PPTX
Linked dataworkshopintro14aug2014
PDF
Towards a Machine-Actionable Scholarly Communication System
PPT
Riding the wave - Paradigm shifts in information access
PDF
From Open Linked Data towards an Ecosystem of Interlinked Knowledge
ODP
FirstWorkshopOnWikipediaResearch
Mining and Understanding Activities and Resources on the Web
DBpedia InsideOut
Intro to Linked Open Data in Libraries, Archives & Museums
What is #LODLAM?! Understanding linked open data in libraries, archives [and ...
The Europeana Datamodel: A semantic layer on top of Cultural Heritage Objects
Web Data Management with RDF
Data Science Curriculum for Professionals
Semantic Web, Linked Data and Education: A Perfect Fit?
Linked Open Data for Digital Humanities
Intro to Linked Open Data in Libraries Archives & Museums.
Open Data Dialog 2013 - Linked Data in Education
Web Data Management in the RDF Age
Creating knowledge out of interlinked data
Linked open data and libraries
What is #LODLAM?! (revised January 2015)
Linked dataworkshopintro14aug2014
Towards a Machine-Actionable Scholarly Communication System
Riding the wave - Paradigm shifts in information access
From Open Linked Data towards an Ecosystem of Interlinked Knowledge
FirstWorkshopOnWikipediaResearch
Ad

Viewers also liked (6)

PDF
Agile patterns in the real world
PDF
Towards embedded Markup of Learning Resources on the Web
PPTX
Agile Innovation - Product Management in Turbulent times
PDF
LKNL12: Kanban for the whole value stream
PPTX
Story points considered harmful - or why the future of estimation is really i...
PPTX
From an Idea to a Vision you can implement - Vision workshop
Agile patterns in the real world
Towards embedded Markup of Learning Resources on the Web
Agile Innovation - Product Management in Turbulent times
LKNL12: Kanban for the whole value stream
Story points considered harmful - or why the future of estimation is really i...
From an Idea to a Vision you can implement - Vision workshop
Ad

Similar to Web Science Synergies: Exploring Web Knowledge through the Semantic Web (20)

PDF
What's all the data about? - Linking and Profiling of Linked Datasets
PDF
Beyond Linked Data - Exploiting Entity-Centric Knowledge on the Web
PDF
From Data to Knowledge - Profiling & Interlinking Web Datasets
PDF
Open Education Challenge 2014: exploiting Linked Data in Educational Applicat...
PDF
WWW2014 Tutorial: Online Learning & Linked Data - Lessons Learned
PDF
Hide the Stack: Toward Usable Linked Data
PDF
Semantic Linking & Retrieval for Digital Libraries
PPT
euclid_linkedup WWW tutorial (Besnik Fetahu)
PDF
Interlinking educational data to Web of Data (Thesis presentation)
PDF
Linked Data Challenge and Opportunity
PDF
Open Data & Education Seminar, ITMO, St Petersburg, March 2014
PPTX
The Future of LOD
PPSX
Linked Data to Improve the OER Experience
PDF
2014_WWW_BTOR
PDF
Intertwingularity, Semantic Web and linked Geo data
PDF
The state of the art in Linked Data
PDF
LinkedUp - Linked Data Europe Workshop 2014
ODP
State of the Semantic Web
PDF
Linked Data for Architecture, Engineering and Construction (AEC)
PPTX
Omitola birmingham cityuniv
What's all the data about? - Linking and Profiling of Linked Datasets
Beyond Linked Data - Exploiting Entity-Centric Knowledge on the Web
From Data to Knowledge - Profiling & Interlinking Web Datasets
Open Education Challenge 2014: exploiting Linked Data in Educational Applicat...
WWW2014 Tutorial: Online Learning & Linked Data - Lessons Learned
Hide the Stack: Toward Usable Linked Data
Semantic Linking & Retrieval for Digital Libraries
euclid_linkedup WWW tutorial (Besnik Fetahu)
Interlinking educational data to Web of Data (Thesis presentation)
Linked Data Challenge and Opportunity
Open Data & Education Seminar, ITMO, St Petersburg, March 2014
The Future of LOD
Linked Data to Improve the OER Experience
2014_WWW_BTOR
Intertwingularity, Semantic Web and linked Geo data
The state of the art in Linked Data
LinkedUp - Linked Data Europe Workshop 2014
State of the Semantic Web
Linked Data for Architecture, Engineering and Construction (AEC)
Omitola birmingham cityuniv

More from Stefan Dietze (17)

PDF
Understanding Scientific and Societal Adoption and Impact of Science Through ...
PDF
NEWORDER Project - Science in the online knowledge order
PDF
Collecting & Temporal Analysis of Behavioral Web Data - Tales From The Inside
PDF
AI in between online and offline discourse - and what has ChatGPT to do with ...
PDF
An interdisciplinary journey with the SAL spaceship – results and challenges ...
PDF
Research Knowledge Graphs at NFDI4DS & GESIS
PDF
Research Knowledge Graphs at GESIS & NFDI4DataScience
PDF
Human-in-the-loop: the Web as Foundation for interdisciplinary Data Science M...
PDF
Human-in-the-Loop: das Web als Grundlage interdisziplinärer Data Science Meth...
PDF
Towards research data knowledge graphs
PDF
Beyond research data infrastructures: exploiting artificial & crowd intellige...
PDF
From Web Data to Knowledge: on the Complementarity of Human and Artificial In...
PDF
Using AI to understand everyday learning on the Web
PDF
Analysing User Knowledge, Competence and Learning during Online Activities
PDF
Analysing & Improving Learning Resources Markup on the Web
PDF
Big Data in Learning Analytics - Analytics for Everyday Learning
PDF
Dietze linked data-vr-es
Understanding Scientific and Societal Adoption and Impact of Science Through ...
NEWORDER Project - Science in the online knowledge order
Collecting & Temporal Analysis of Behavioral Web Data - Tales From The Inside
AI in between online and offline discourse - and what has ChatGPT to do with ...
An interdisciplinary journey with the SAL spaceship – results and challenges ...
Research Knowledge Graphs at NFDI4DS & GESIS
Research Knowledge Graphs at GESIS & NFDI4DataScience
Human-in-the-loop: the Web as Foundation for interdisciplinary Data Science M...
Human-in-the-Loop: das Web als Grundlage interdisziplinärer Data Science Meth...
Towards research data knowledge graphs
Beyond research data infrastructures: exploiting artificial & crowd intellige...
From Web Data to Knowledge: on the Complementarity of Human and Artificial In...
Using AI to understand everyday learning on the Web
Analysing User Knowledge, Competence and Learning during Online Activities
Analysing & Improving Learning Resources Markup on the Web
Big Data in Learning Analytics - Analytics for Everyday Learning
Dietze linked data-vr-es

Recently uploaded (20)

PDF
01-Introduction-to-Information-Management.pdf
PDF
Physiotherapy_for_Respiratory_and_Cardiac_Problems WEBBER.pdf
PDF
Classroom Observation Tools for Teachers
PDF
Black Hat USA 2025 - Micro ICS Summit - ICS/OT Threat Landscape
PDF
RMMM.pdf make it easy to upload and study
PDF
Chapter 2 Heredity, Prenatal Development, and Birth.pdf
PPTX
Introduction_to_Human_Anatomy_and_Physiology_for_B.Pharm.pptx
PPTX
1st Inaugural Professorial Lecture held on 19th February 2020 (Governance and...
PPTX
Cell Types and Its function , kingdom of life
PDF
Complications of Minimal Access Surgery at WLH
PPTX
IMMUNITY IMMUNITY refers to protection against infection, and the immune syst...
PDF
Computing-Curriculum for Schools in Ghana
PDF
Abdominal Access Techniques with Prof. Dr. R K Mishra
PPTX
GDM (1) (1).pptx small presentation for students
PPTX
Lesson notes of climatology university.
PDF
TR - Agricultural Crops Production NC III.pdf
PDF
VCE English Exam - Section C Student Revision Booklet
PDF
Pre independence Education in Inndia.pdf
PDF
102 student loan defaulters named and shamed – Is someone you know on the list?
PDF
Basic Mud Logging Guide for educational purpose
01-Introduction-to-Information-Management.pdf
Physiotherapy_for_Respiratory_and_Cardiac_Problems WEBBER.pdf
Classroom Observation Tools for Teachers
Black Hat USA 2025 - Micro ICS Summit - ICS/OT Threat Landscape
RMMM.pdf make it easy to upload and study
Chapter 2 Heredity, Prenatal Development, and Birth.pdf
Introduction_to_Human_Anatomy_and_Physiology_for_B.Pharm.pptx
1st Inaugural Professorial Lecture held on 19th February 2020 (Governance and...
Cell Types and Its function , kingdom of life
Complications of Minimal Access Surgery at WLH
IMMUNITY IMMUNITY refers to protection against infection, and the immune syst...
Computing-Curriculum for Schools in Ghana
Abdominal Access Techniques with Prof. Dr. R K Mishra
GDM (1) (1).pptx small presentation for students
Lesson notes of climatology university.
TR - Agricultural Crops Production NC III.pdf
VCE English Exam - Section C Student Revision Booklet
Pre independence Education in Inndia.pdf
102 student loan defaulters named and shamed – Is someone you know on the list?
Basic Mud Logging Guide for educational purpose

Web Science Synergies: Exploring Web Knowledge through the Semantic Web

  • 1. Exploring Web Data & Knowledge through the Semantic Web Dr. Stefan Dietze L3S Research Center Stefan Dietze 27/11/13 1
  • 2. Pluto & the seven Dwarfs? pluto the dwarf planet ? „…solar system… #pluto“ Stefan Dietze 27/11/13
  • 3. “A little semantics goes a long way” (J. 1) Hendler yago:AstronomicalObjects Semantic Web  Adding meaning through shared vocabularies and schemas (eg DBpedia)  W3C standards RDF & SPARQL for data & knowledge representation and querying  Persistent URIs to reference & interlink data on the Web dbp:CelestialBody typeOf typeOf dbp:Pluto dwarfPlanetOf redirectOf dbp:SolarSystem namedAfter dbp:Pluto(mythology) dbp:DwarfPlanetPluto „…solar system… #pluto“ 1 Hendler, J., The Dark Side of the Semantic Web, IEEE Intelligent Systems, Jan/Feb 2007
  • 4. Semantic Web / Linked Data  De-facto standard for sharing data on the Web  Vision: well connected graph of open Web data  350+ datasets and 32 billion triples in LOD Cloud alone  Other „incarnations“:  Google  „HTTP-accessibility“ (SPARQL, URI-dereferencing) Knowledge Graph  Facebook Open Graph  „Structure“ & „Semantics“ (=> shared/linked vocabularies)  http://schema.org BBC Program mes  „Interlinked“  „Persistent“ FOAF DBpedia Ontology Geo Ontology Gene Ontology Stefan Dietze Dublin Core BIBO
  • 5. That’s awesome, but... Hm, really? …why are there so few datasets actually used?  Date reuse and in-links focused on trusted „reference graphs“ such as DBpedia (i.e. Wikipedia)  Long tail of LD datasets which are neither reused nor linked to (LOD Cloud alone consists of 300+ datasets)  „HTTP-accessibility“ (SPARQL, URI-dereferencing)  Explanations?  „Structure“ & „Semantics“ (=> shared/linked vocabularies)  „Interlinked“  „Persistent“ Stefan Dietze 27/11/13
  • 6. Open data is more diverse than we think SPARQL Web-Querying Infrastructure: Ready for Action?, Carlos Buil-Aranda, Aidan Hogan, Jürgen Umbrich Pierre-Yves Vandenbussch, International Semantic Web Conference 2013, (ISWC2013). Accessibility of datasets?  Less than 50% of all SPARQL endpoints actually responsive at given point of time  “THE” SPARQL protocol? No, but many variants & subsets  … SPARQL endpoint availability over time [Buil-Aranda et al 2013] Shared vocabularies & schemas, but:  …still very heterogeneous [d’Aquin, WebSci13]  …data partially messy an not conformant (RDFS, schemas) [HoganJWS2012] Co-occurence graph of data types in 146 datasets: 144 Vocabularies, 588 highly overlapping types, 719 Properties  …even widely used reference datasets such as DBpedia noisy [Paulheim2013] Assessing the Educational Linked Data Landscape, D’Aquin, M., Adamou, A., Dietze, S., ACM Web Science 2013 (WebSci2013), Paris, France, May 2013. Type Inference on Noisy RDF Data, Paulheim H., Bizer, C. Semantic Web – ISWC 2013, Lecture Notes in Computer Science Volume 8218, 2013, pp 510-525 Stefan Dietze An empirical survey of Linked Data conformance. Hogan, A., Umbrich, J., Harth, A., Cyganiak, R., Polleres, A., Decker., S., In the Journal of Web Semantics 14: pp. 14–44, 2012
  • 7. Too many/diverse datasets, too little information  Which datasets are useful & trustworthy for case XY (eg „learning about the solar system“) ?  Which topics (eg „Astronomy“) are covered by dataset X?  Which datasets describe/offer videos (slides, publications, statistics etc)? ? ? ? Stefan Dietze 27/11/13
  • 8. Data curation and dataset profiling  Which datasets are useful & trustworthy for case XY (eg „learning about the solar system“) ?  Which topics (eg „Astronomy“) are covered by dataset X?  Which datasets describe/offer videos (slides, publications, statistics etc)?  Catalog of data (LinkedUp Catalog): classification of datasets according to resource types, disciplines/topics, data quality, accessability, etc  Infrastructure for distributed/federated querying describes Stefan Dietze LinkedUp Dataset Catalog 27/11/13
  • 9. Dataset profiling: what’s all the data about po:Programme AAISO BBC Programme bibo:Fi bibo:Film bibo:Fil BIBO FOAF <po:Programme …> <po:Series>Wonders of the Solar System</.> <po:Actor>Brian Cox</…> </po:Programme…> Schema mappings yov:Video contains Yovisto Video <yo:Video …> <dc:title>Pluto & the Dwarf Planets</dc:title> … </yo:Video…> Entity disambiguation db:Astro. Objects db:Astro. Objects db:Astronomy Topic profile extraction Dataset Metadata Stefan Dietze LinkedUp Dataset Catalog 27/11/13
  • 10. LinkedUp Data Catalog inExplore & query for datasets/types & topics  a nutshell http://data.linkededucation.org/linkedup/categories-explorer http://data.linkededucation.org/linkedup/catalog/  Federated queries using type mappings Stefan Dietze 27/11/13
  • 11. LinkedUp Challenge: using open data for learning http://linkedup-challenge.org  Open Data Competition to promote tools and applications that analyse / integrate (Linked) Web data  Organised by LinkedUp project over 2 years (“Veni”, “Vidi”, “Vici”) with 40.000 EUR awards  Veni Competition - 22 submissions, 8 shortlisted for presentation at Open Knowledge Conference (17 September, Geneva Switzerland) Stefan Dietze 27/11/13
  • 12. st 1 Place: PoliMedia Exploring political debates & events http://www.polimedia.nl/  Cross-media exploration & analysis of political events (parliament debates and media coverage)  Automatically generated links between transcripts debates, newspaper articles, and radio bulletins.  (Linked) Data available at http://data.polimedia.nl  Data sources: 1) newspapers of the historical newspaper archive, 2) radio bulletins of the Dutch National Press Agency (ANP)  9000+ debates (1945 – 1995)  Over 3000 media links Martijn Kleppe, Max Kemman, Henri Beunders (Erasmus Universiteit Rotterdam), Laura Hollink Damir Juric (Vrije Universiteit Amsterdam), Johan Oomen Jaap Blom (Nederlands Instituut voor Beeld en Geluid) Stefan Dietze 27/11/13
  • 13. Outlook: more “focused” data reuse challenges http://linkedup-challenge.org/ Open Track Focused Track  Scalable tools and applications using (Linked) open data for educational purposes  LinkedUp data catalog  Promotion of selected Veni submissions  Simplifying complex information to make it accessible (example: publications from Elsevier)  Recommender system for educational resources (courses, MOOCs) relevant to user interests  Approx. 20.000 EUR awards budget  Final events at 11th Extended Semantic Web Conference (ESWC2014)  Submission: 14 February 2014 Stefan Dietze 27/11/13 13
  • 14. Thank you! REFERENCES WWW Assessing the Educational Linked Data Landscape, D’Aquin, M., Adamou, A., Dietze, S., ACM Web Science 2013 (WebSci2013), Paris, France, May 2013. See also (data) Generating structured Profiles of Linked Data Graphs, Fetahu, B; Dietze, S., d’Aquin, M., Nunes, B.P., ISWC2013 – 12th International Semantic Web Conference;  http://datahub.io/group/linked-education  http://data.linkededucation.org  http://data.linkededucation.org/linkedup/catalog/  http://lak.linkededucation.org Combining a co-occurrence-based and a semantic measure for entity linking, B. P. Nunes, S. Dietze, M.A. Casanova, R. Kawase, B. Fetahu, and W. Nejdl., ESWC 2013 - 10th Extended Semantic Web Conference, (May 2013). See also (general) Type Inference on Noisy RDF Data, Paulheim H., Bizer, C. Semantic Web – ISWC 2013, Lecture Notes in Computer Science Volume 8218, 2013, pp 510-525 An empirical survey of Linked Data conformance. Hogan, A., Umbrich, J., Harth, A., Cyganiak, R., Polleres, A., Decker., S., In the Journal of Web Semantics 14: pp. 14–44, 2012  http://linkedup-project.eu  http://linkedup-challenge.org  http://linkededucation.org  http://linkeduniversities.org SPARQL Web-Querying Infrastructure: Ready for Action?, Carlos BuilAranda, Aidan Hogan, Jürgen Umbrich Pierre-Yves Vandenbussch, International Semantic Web Conference 2013, (ISWC2013). Stefan Dietze 27/11/13 14