SlideShare a Scribd company logo
Exploiting (Linked) Web Data in Educational Applications 
Stefan Dietze L3S Research Center http://purl.org/dietze @stefandietze - Open Education Challenge, Berlin, 2014 - 
28/10/14 
1 
Stefan Dietze
Linked Data for education 
 Data sharing: TED, Open Courseware, mEducator, LinkedUp, 
LAK…. 
 Tutorials & workshops (eg „Linked Learning“ series) 
 LinkedUniversities.org and LinkedEducation.org 
 W3C Linked Open Education community group 
Research areas 
 Web & data science, information retrieval, semantic web & 
Linked Data, data & knowledge integration 
 Application domains: education/TEL, Web archiving, … 
Some projects 
Introduction 
http://www.l3s.de/ 
28/10/14 2 
 See also: http://purl.org/dietze 
Stefan Dietze
Social 
Media 
Exploiting Open Data for Education?nutshell 
(Open) Educational Resources 
World Wide Web 
Distance Universities 
MOOCs 
Linked Open Data 
28/10/14 
3 
Stefan Dietze
How Open is Open Data? 
Open Data (as in “open licensing”) 
Open licensing (ODL, CC etc) 
Yet: variety of approaches 
APIs/feeds: SOAP, REST, etc 
Diverse schemas & vocabularies 
(lack of) controlled vocabularies 
Reuse & interoperability? 
Linked Data (technology) (as in “interoperability”) 
Defacto Standard for Open Data on the Web 
W3C standards: 
Common HTTP interface: SPARQL 
Common representation: RDF 
Dereferencable URIs 
Shared/linked vocabularies 
Linked Open Data 
5-star scheme by Sir Tim Berners Lee 
28/10/14 
4 
Stefan Dietze
Semantic Web 
Example: Google Knowledge Graph (DBpedia, Freebase, Yago etc) 
W3C standards (RDF & SPARQL) for knowledge representation and querying 
URIs to identify/link data 
“A little semantics goes a long way” (J. Hendler1) 
dbp:United_States 
http://dbpedia.org/resource/Cambridge_MA 
dbp:W3C 
country 
cityOf 
1 Hendler, J., The Dark Side of the Semantic Web, IEEE Intelligent Systems, Jan/Feb 2007 
schema:City 
typeOf 
dbp:MIT 
ru.dbp:Кембридж_(Массачусетс) 
sameAs 
headquarterOf
HTTP accessibility: persistent URIs, SPARQL 
FOAF 
Gene Ontology 
BIBO 
Geo Ontology 
DBpedia Ontology 
Dublin Core 
BBC Programmes 
Connected graph of open Web data (500+ datasets and 100 billion triples) 
Persistent, dereferencable URIs & content negotiation, shared/linked vocabularies 
SPARQL to query via HTTP 
Other „incarnations“: 
Google Knowledge Graph 
Facebook Open Graph 
http://schema.org 
http://dbpedia.org/resource/Cambridge_MA 
28/10/14 
6 
Stefan Dietze
LD to ensure discoverability of content/Websites (eg schema.org/microdata/RDFa) 
Annotating HTML documents about (educational) material with schema.org (eg LRMI, Learning Resource Metadata Initiative) 
Adopted by major sites (YouTube, LinkedIn etc) & tool support (DRUPAL, WordPress) 
LD is not just for your data Schema.org for discovery of content/websites 
http://schema.org 
© Ramanathan V. Guha, Google, SemTech2014 
28/10/14 
7 
Stefan Dietze
Other learning-relevant data & resources 
Publications & literature 
(Social) media resource metadata 
Domain-specific knowledge: Bioportal, Europeana, Geonames, … 
Cross-domain factual knowledge: DBpedia, Freebase, … 
LD as body of knowledge for education 
http://linkededucation.org 
http://linkeduniversities.org 
28/10/14 
8 
Stefan Dietze 
Educational datasets and vocabularies 
University Linked Data: The Open University UK, http://data.open.ac.uk, Southampton University, http://education.data.gov.uk, … 
Open Educational Resources metadata: mEducator, Open Learn, Open Courseware, … 
Schemas: Learning Resource Metadata Initiative (LRMI, mEducator Educational Resources schema, BIBO, AAISO, …
LD as background knowledge for educational apps? 
http://metamorphosis.med.duth.gr/ 
Title: ECG Patient case 1001 chest and limb leads 
28/10/14 
9 
Stefan Dietze
Title: ECG Patient case 1001 chest and limb leads 
„ECG“ dismabiguation on Wikipedia: 9 meanings 
LD as background knowledge for educational apps? 
28/10/14 
10 
Stefan Dietze
dbpedia.org/resource/Electrocardiagraphy 
1. Understanding data: contextual disambiguation through NLP tools 
2. Enrichment with factual knowledge 
dbpedia:Электрокардиография 
category:Cardiac_procedures 
dbpedia:Willem_Einthoven 
3. interlinking with related resources 
bbc:ProgrammeXY 
slideshare:SlidesetXY 
yovisto:VideolectureXY 
Title: ECG Patient case 1001 chest and limb leads 
Understanding, enriching, linking data 
28/10/14 
11 
Stefan Dietze
„Success models“: data & applications 
Supporting innovative tools & applications 
Evaluation methods 
LinkedUp – Linking Web Data for Education 
Technology transfer & community-building 
Involving educators, developers, computer scientists, data engineers… 
http://www.linkedup-challenge.org/ 
Data curation & profiling 
Collecting & exposing open data for education 
Profiling of Web Data 
http://data.linkededucation.org 
EC-funded project aimed at advancing take-up of open data and related technologies 
http://www.linkedup-project.eu/events 
28/10/14 
Stefan Dietze 
12 
http://www.linkedup-project.eu/
Community-building and collaboration Joint work on tangible outcomes (datasets, applications....) 
Associated Partners 
Initiatives 
EC Projects 
Stefan Dietze
Collected & curated datasets of educational relevance 
Beyond collecting: published over 50 datasets as LD together with most important content providers e.g. TED, OCW, SoLAR etc 
LinkedUp catalog: most comprehensive collection of LD/Open Data for education 
RDF dataset metadata 
Federated queries across datasets using type mappings 
Publishing & curating educational data 
http://data.linkededucation.org/linkedup/catalog/ 
28/10/14 
Stefan Dietze 
14
http://data-observatory.org/lod-explorer 
Supporting developers and data consumers 
Devtalk blog: developer resource & community to aid developers 
Webinars and tutorials 
http://data.linkededucation.org/linkedup/devtalk/ 
Topic-based annotation and discovery of data 
Data exploration & visualisation features 
28/10/14 
Stefan Dietze 
16
LinkedUp events, training & technology transfer Bringing stakeholders together 
Data Providers & Data Scientists 
Developers 
Community-building through events & communication channels/social media (cross-disciplinary, industry & academia) 
Exploitation of project outcomes across communities: technology transfer 
(Co-)organised approx. 20 events (tutorials, workshops, booths etc) 
More than 30 invited talks/lectures 
…. 
Users (Learners, Tutors, Teachers) 
28/10/14 
Stefan Dietze 
17
May –September 2013 
October 2013 – May 2014 
May 2014 – October 2014 
Series of Open Data Competitions to promote applications which exploit Linked Open Data 
http://www.linkedup-challenge.org/ 
LinkedUp Challenge
23 
14 
13 
8 
9 
10 
0 
5 
10 
15 
20 
25 
Veni Vidi Vici 
submissions 
shortlist 
LinkedUp Challenge results 
 50 submissions of which 27 were shortlisted 
and supported (through travel grants, 
participation in events and rewards) 
 13 Veni, Vidi, Vici winners 
(grants: 1000 – 3000 €) 
 Authors from 23 distinct, mostly European 
countries 
LinkedUp submissions & shortlist 
Coatia; 4 
Greece; 4 
Belgium; 5 
Italy; 7 
Germany; 11 
Spain; 
13 
France; 14 
Netherlands; 15 
United States; 
15 
United 
Kingdom; 21 
authors 
Top-10 
author‘s 
origins 
Stefan Dietze 28/10/14 21
Issues (1/3) - open data is messier than we think 
SPARQL endpoint availability over time [Buil-Aranda et al 2013] 
Accessibility of datasets? 
Less than 50% of all SPARQL endpoints actually responsive at given point of time [Buil-Aranda2013] 
“THE” SPARQL protocol? No, but many variants & subsets 
Data “quality”? 
…data accuracy (eg DBpedia)? [Paulheim2013] 
…vocabulary reuse/links? [D’AquinWebSci13] 
…schema compliance (RDFS, schemas) [HoganJWS2012] 
Stefan Dietze 
SPARQL Web-Querying Infrastructure: Ready for Action?, Carlos Buil-Aranda, Aidan Hogan, Jürgen Umbrich Pierre-Yves Vandenbussch, International Semantic Web Conference 2013, (ISWC2013). 
Assessing the Educational Linked Data Landscape, D’Aquin, M., Adamou, A., Dietze, S., ACM Web Science 2013 (WebSci2013), Paris, France, May 2013. 
Type Inference on Noisy RDF Data, Paulheim H., Bizer, C. Semantic Web – ISWC 2013, Lecture Notes in Computer Science Volume 8218, 2013, pp 510-525 
An empirical survey of Linked Data conformance. Hogan, A., Umbrich, J., Harth, A., Cyganiak, R., Polleres, A., Decker., S., Journal of Web Semantics 14, 2012 
28/10/14 
22
Issues (2/3) – accepting inconsistency 
Analyzing Relative Incompleteness of Movie Descriptions in the Web of Data: A Case Study, Yuan, W., Demidova, E., Dietze, S., Zhu, X., International Semantic Web Conference 2014 (ISWC2014) 
28/10/14 
Stefan Dietze 
23
Issues (3/3) – licensing/legal aspects 
Dataset 
Words 
Pages 
DBpedia 
7163 
16 
Flickr 
10367 
23 
ConceptNet 
7163 
16 
World Bank 
7056 
16 
Nature 
7024 
16 
LinkedIn 
6104 
14 
Google+ 
5740 
13 
Tumblr 
5362 
12 
Twitter 
4247 
9 
Facebook 
4179 
9 
Mashing up data: legal and licensing related issues under-estimated 
What license do you get when mashing up: 
Attribution: copyright violation from missing (86%) or incorrect attribution (14%) information 
Terms & conditions: complexity and conflicts when merging data from different sources 
Potential non-compliance from evolution of (a) LOD applications and (b) underlying datasets (and their licenses) 
T&C of established datasets 
28/10/14 
Stefan Dietze 
24 
Nature (CC0) + DBpedia (CC-ShareAlike) + FAO (Proprietary non-commercial) => ?
Get involved! 
http://www.w3.org/community/opened 
http://data.linkededucation.org/linkedup/catalog/ 
http://data.linkededucation.org/linkedup/devtalk/
Thank you! 
28/10/14 
Stefan Dietze 
26

More Related Content

PDF
Lessons Learnt from LinkedUp
PDF
LinkedUp Open Education Panel session
PDF
Open Data & Education Seminar, ITMO, St Petersburg, March 2014
PDF
Open Education and Open Development – working together
PDF
The Open Education Working Group: Bringing people and projects together
PPT
B2: Open Up: Open Data in the Public Sector
PDF
Open data in Education
PDF
LinkedUp at Mozilla Festival Science Fair
Lessons Learnt from LinkedUp
LinkedUp Open Education Panel session
Open Data & Education Seminar, ITMO, St Petersburg, March 2014
Open Education and Open Development – working together
The Open Education Working Group: Bringing people and projects together
B2: Open Up: Open Data in the Public Sector
Open data in Education
LinkedUp at Mozilla Festival Science Fair

What's hot (20)

PDF
LinkedUp Project
PPTX
Online Learning and Linked Data: An Introduction
PDF
LAK Dataset and Challenge (April 2013)
PPT
Open Educational Data - Datasets and APIs (Athens Green Hackathon 2012)
PPTX
Data Science Curriculum for Professionals
PPT
Learning Analytics & Linked Data – Opportunities, Challenges, Examples
PDF
WWW2013 Tutorial: Linked Data & Education
PPTX
Development of a Linked Data curriculum
PDF
Demo: Profiling & Exploration of Linked Open Data
PDF
Turning Data into Knowledge (KESW2014 Keynote)
PDF
Are we failing users? Can open approaches meet their needs? - Maura Marx
PPT
Open Science at the European Commission
PDF
Delivering Linked Data Training to Data Science Practitioners
PDF
Introduction Presentation for LinkedUp kickoff meeting
PPTX
November 18, 2015 NISO Webinar: Text Mining: Digging Deep for Knowledge
PDF
Linked Open Data for Digital Humanities
PDF
LAK14 Data Challenge
PPTX
Connecting the Dots: Linking Digitized Collections Across Metadata Silos
PPTX
IIIF as an Enabler to Interoperability within a Single Institution
PDF
LACE Project Overview and Exploitation
LinkedUp Project
Online Learning and Linked Data: An Introduction
LAK Dataset and Challenge (April 2013)
Open Educational Data - Datasets and APIs (Athens Green Hackathon 2012)
Data Science Curriculum for Professionals
Learning Analytics & Linked Data – Opportunities, Challenges, Examples
WWW2013 Tutorial: Linked Data & Education
Development of a Linked Data curriculum
Demo: Profiling & Exploration of Linked Open Data
Turning Data into Knowledge (KESW2014 Keynote)
Are we failing users? Can open approaches meet their needs? - Maura Marx
Open Science at the European Commission
Delivering Linked Data Training to Data Science Practitioners
Introduction Presentation for LinkedUp kickoff meeting
November 18, 2015 NISO Webinar: Text Mining: Digging Deep for Knowledge
Linked Open Data for Digital Humanities
LAK14 Data Challenge
Connecting the Dots: Linking Digitized Collections Across Metadata Silos
IIIF as an Enabler to Interoperability within a Single Institution
LACE Project Overview and Exploitation
Ad

Viewers also liked (20)

PDF
Evaluating Named Entity Recognition and Disambiguation in News and Tweets
PPTX
NLP todo
PPTX
Introduction to the Data Web, DBpedia and the Life-cycle of Linked Data
PPT
Gathering Alternative Surface Forms for DBpedia Entities
PPTX
Federated SPARQL query processing over the Web of Data
PDF
Linked Data Fragments
PDF
DBpedia InsideOut
ODP
DBpedia: A Public Data Infrastructure for the Web of Data
PDF
LDQL: A Query Language for the Web of Linked Data
ODP
Fast Approximate A-box Consistency Checking using Machine Learning
PDF
Applying Linked Open Data to Public Procurement
PDF
Exploiting the query structure for efficient join ordering in SPARQL queries
ODP
Data Mining with Background Knowledge from the Web - Introducing the RapidMin...
PPTX
A Provenance assisted Roadmap for Life Sciences Linked Open Data Cloud
PDF
Unsupervised Extraction of Attributes and Their Values from Product Description
PPTX
FedViz: A Visual Interface for SPARQL Queries Formulation and Execution
PDF
Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 3 (...
PDF
RDF Tutorial - SPARQL 20091031
PDF
Querying Linked Data with SPARQL
PDF
The Future is Federated
Evaluating Named Entity Recognition and Disambiguation in News and Tweets
NLP todo
Introduction to the Data Web, DBpedia and the Life-cycle of Linked Data
Gathering Alternative Surface Forms for DBpedia Entities
Federated SPARQL query processing over the Web of Data
Linked Data Fragments
DBpedia InsideOut
DBpedia: A Public Data Infrastructure for the Web of Data
LDQL: A Query Language for the Web of Linked Data
Fast Approximate A-box Consistency Checking using Machine Learning
Applying Linked Open Data to Public Procurement
Exploiting the query structure for efficient join ordering in SPARQL queries
Data Mining with Background Knowledge from the Web - Introducing the RapidMin...
A Provenance assisted Roadmap for Life Sciences Linked Open Data Cloud
Unsupervised Extraction of Attributes and Their Values from Product Description
FedViz: A Visual Interface for SPARQL Queries Formulation and Execution
Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 3 (...
RDF Tutorial - SPARQL 20091031
Querying Linked Data with SPARQL
The Future is Federated
Ad

Similar to Open Education Challenge 2014: exploiting Linked Data in Educational Applications (20)

PDF
LinkedUp - Linked Data Europe Workshop 2014
PDF
WWW2014 Tutorial: Online Learning & Linked Data - Lessons Learned
PDF
OKCon 2013, LinkedUp & Open Education
PDF
Open Data Dialog 2013 - Linked Data in Education
PDF
Web Science Synergies: Exploring Web Knowledge through the Semantic Web
PDF
Semantic Web / Linked Data Technologies
PDF
What's all the data about? - Linking and Profiling of Linked Datasets
PPTX
Experience from 10 months of University Linked Data
PDF
Retrieval, Crawling and Fusion of Entity-centric Data on the Web
PPT
Seminario Sobre Datasets Consorcio Madrono
PDF
Big Data in Learning Analytics - Analytics for Everyday Learning
PPTX
A Framework Concept for Profiling Researchers on Twitter using the Web of Data
PDF
Mining and Understanding Activities and Resources on the Web
PDF
KnowEscape workshop, OKCon 2013
PDF
Semantic Linking & Retrieval for Digital Libraries
PPTX
Linked Data Tutorial (Florianópolis)
PPTX
Building the Open University's Web of Linked Data
PPTX
Exposing Humanities Data for Reuse and Linking - RED, linked data and the sem...
PPTX
Information is beautiful
PDF
I Linked Open Data nei Beni Culturali, alcuni progetti e casi di studio
LinkedUp - Linked Data Europe Workshop 2014
WWW2014 Tutorial: Online Learning & Linked Data - Lessons Learned
OKCon 2013, LinkedUp & Open Education
Open Data Dialog 2013 - Linked Data in Education
Web Science Synergies: Exploring Web Knowledge through the Semantic Web
Semantic Web / Linked Data Technologies
What's all the data about? - Linking and Profiling of Linked Datasets
Experience from 10 months of University Linked Data
Retrieval, Crawling and Fusion of Entity-centric Data on the Web
Seminario Sobre Datasets Consorcio Madrono
Big Data in Learning Analytics - Analytics for Everyday Learning
A Framework Concept for Profiling Researchers on Twitter using the Web of Data
Mining and Understanding Activities and Resources on the Web
KnowEscape workshop, OKCon 2013
Semantic Linking & Retrieval for Digital Libraries
Linked Data Tutorial (Florianópolis)
Building the Open University's Web of Linked Data
Exposing Humanities Data for Reuse and Linking - RED, linked data and the sem...
Information is beautiful
I Linked Open Data nei Beni Culturali, alcuni progetti e casi di studio

More from Stefan Dietze (20)

PDF
Understanding Scientific and Societal Adoption and Impact of Science Through ...
PDF
NEWORDER Project - Science in the online knowledge order
PDF
Collecting & Temporal Analysis of Behavioral Web Data - Tales From The Inside
PDF
AI in between online and offline discourse - and what has ChatGPT to do with ...
PDF
An interdisciplinary journey with the SAL spaceship – results and challenges ...
PDF
Research Knowledge Graphs at NFDI4DS & GESIS
PDF
Research Knowledge Graphs at GESIS & NFDI4DataScience
PDF
Human-in-the-loop: the Web as Foundation for interdisciplinary Data Science M...
PDF
Human-in-the-Loop: das Web als Grundlage interdisziplinärer Data Science Meth...
PDF
Towards research data knowledge graphs
PDF
Beyond research data infrastructures: exploiting artificial & crowd intellige...
PDF
From Web Data to Knowledge: on the Complementarity of Human and Artificial In...
PDF
Using AI to understand everyday learning on the Web
PDF
Analysing User Knowledge, Competence and Learning during Online Activities
PDF
Analysing & Improving Learning Resources Markup on the Web
PDF
Beyond Linked Data - Exploiting Entity-Centric Knowledge on the Web
PDF
Towards embedded Markup of Learning Resources on the Web
PDF
Linked Data for Architecture, Engineering and Construction (AEC)
PDF
Dietze linked data-vr-es
PDF
From Data to Knowledge - Profiling & Interlinking Web Datasets
Understanding Scientific and Societal Adoption and Impact of Science Through ...
NEWORDER Project - Science in the online knowledge order
Collecting & Temporal Analysis of Behavioral Web Data - Tales From The Inside
AI in between online and offline discourse - and what has ChatGPT to do with ...
An interdisciplinary journey with the SAL spaceship – results and challenges ...
Research Knowledge Graphs at NFDI4DS & GESIS
Research Knowledge Graphs at GESIS & NFDI4DataScience
Human-in-the-loop: the Web as Foundation for interdisciplinary Data Science M...
Human-in-the-Loop: das Web als Grundlage interdisziplinärer Data Science Meth...
Towards research data knowledge graphs
Beyond research data infrastructures: exploiting artificial & crowd intellige...
From Web Data to Knowledge: on the Complementarity of Human and Artificial In...
Using AI to understand everyday learning on the Web
Analysing User Knowledge, Competence and Learning during Online Activities
Analysing & Improving Learning Resources Markup on the Web
Beyond Linked Data - Exploiting Entity-Centric Knowledge on the Web
Towards embedded Markup of Learning Resources on the Web
Linked Data for Architecture, Engineering and Construction (AEC)
Dietze linked data-vr-es
From Data to Knowledge - Profiling & Interlinking Web Datasets

Recently uploaded (20)

PDF
Approach and Philosophy of On baking technology
PPTX
ACSFv1EN-58255 AWS Academy Cloud Security Foundations.pptx
PDF
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
PPTX
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
PDF
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
PPTX
Cloud computing and distributed systems.
PDF
Empathic Computing: Creating Shared Understanding
PDF
Chapter 3 Spatial Domain Image Processing.pdf
PDF
Advanced methodologies resolving dimensionality complications for autism neur...
PDF
Review of recent advances in non-invasive hemoglobin estimation
PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
PDF
Building Integrated photovoltaic BIPV_UPV.pdf
PPTX
Digital-Transformation-Roadmap-for-Companies.pptx
PDF
Electronic commerce courselecture one. Pdf
PDF
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
PPT
“AI and Expert System Decision Support & Business Intelligence Systems”
PDF
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
PDF
Spectral efficient network and resource selection model in 5G networks
PDF
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
PDF
Per capita expenditure prediction using model stacking based on satellite ima...
Approach and Philosophy of On baking technology
ACSFv1EN-58255 AWS Academy Cloud Security Foundations.pptx
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
Cloud computing and distributed systems.
Empathic Computing: Creating Shared Understanding
Chapter 3 Spatial Domain Image Processing.pdf
Advanced methodologies resolving dimensionality complications for autism neur...
Review of recent advances in non-invasive hemoglobin estimation
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
Building Integrated photovoltaic BIPV_UPV.pdf
Digital-Transformation-Roadmap-for-Companies.pptx
Electronic commerce courselecture one. Pdf
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
“AI and Expert System Decision Support & Business Intelligence Systems”
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
Spectral efficient network and resource selection model in 5G networks
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
Per capita expenditure prediction using model stacking based on satellite ima...

Open Education Challenge 2014: exploiting Linked Data in Educational Applications

  • 1. Exploiting (Linked) Web Data in Educational Applications Stefan Dietze L3S Research Center http://purl.org/dietze @stefandietze - Open Education Challenge, Berlin, 2014 - 28/10/14 1 Stefan Dietze
  • 2. Linked Data for education  Data sharing: TED, Open Courseware, mEducator, LinkedUp, LAK….  Tutorials & workshops (eg „Linked Learning“ series)  LinkedUniversities.org and LinkedEducation.org  W3C Linked Open Education community group Research areas  Web & data science, information retrieval, semantic web & Linked Data, data & knowledge integration  Application domains: education/TEL, Web archiving, … Some projects Introduction http://www.l3s.de/ 28/10/14 2  See also: http://purl.org/dietze Stefan Dietze
  • 3. Social Media Exploiting Open Data for Education?nutshell (Open) Educational Resources World Wide Web Distance Universities MOOCs Linked Open Data 28/10/14 3 Stefan Dietze
  • 4. How Open is Open Data? Open Data (as in “open licensing”) Open licensing (ODL, CC etc) Yet: variety of approaches APIs/feeds: SOAP, REST, etc Diverse schemas & vocabularies (lack of) controlled vocabularies Reuse & interoperability? Linked Data (technology) (as in “interoperability”) Defacto Standard for Open Data on the Web W3C standards: Common HTTP interface: SPARQL Common representation: RDF Dereferencable URIs Shared/linked vocabularies Linked Open Data 5-star scheme by Sir Tim Berners Lee 28/10/14 4 Stefan Dietze
  • 5. Semantic Web Example: Google Knowledge Graph (DBpedia, Freebase, Yago etc) W3C standards (RDF & SPARQL) for knowledge representation and querying URIs to identify/link data “A little semantics goes a long way” (J. Hendler1) dbp:United_States http://dbpedia.org/resource/Cambridge_MA dbp:W3C country cityOf 1 Hendler, J., The Dark Side of the Semantic Web, IEEE Intelligent Systems, Jan/Feb 2007 schema:City typeOf dbp:MIT ru.dbp:Кембридж_(Массачусетс) sameAs headquarterOf
  • 6. HTTP accessibility: persistent URIs, SPARQL FOAF Gene Ontology BIBO Geo Ontology DBpedia Ontology Dublin Core BBC Programmes Connected graph of open Web data (500+ datasets and 100 billion triples) Persistent, dereferencable URIs & content negotiation, shared/linked vocabularies SPARQL to query via HTTP Other „incarnations“: Google Knowledge Graph Facebook Open Graph http://schema.org http://dbpedia.org/resource/Cambridge_MA 28/10/14 6 Stefan Dietze
  • 7. LD to ensure discoverability of content/Websites (eg schema.org/microdata/RDFa) Annotating HTML documents about (educational) material with schema.org (eg LRMI, Learning Resource Metadata Initiative) Adopted by major sites (YouTube, LinkedIn etc) & tool support (DRUPAL, WordPress) LD is not just for your data Schema.org for discovery of content/websites http://schema.org © Ramanathan V. Guha, Google, SemTech2014 28/10/14 7 Stefan Dietze
  • 8. Other learning-relevant data & resources Publications & literature (Social) media resource metadata Domain-specific knowledge: Bioportal, Europeana, Geonames, … Cross-domain factual knowledge: DBpedia, Freebase, … LD as body of knowledge for education http://linkededucation.org http://linkeduniversities.org 28/10/14 8 Stefan Dietze Educational datasets and vocabularies University Linked Data: The Open University UK, http://data.open.ac.uk, Southampton University, http://education.data.gov.uk, … Open Educational Resources metadata: mEducator, Open Learn, Open Courseware, … Schemas: Learning Resource Metadata Initiative (LRMI, mEducator Educational Resources schema, BIBO, AAISO, …
  • 9. LD as background knowledge for educational apps? http://metamorphosis.med.duth.gr/ Title: ECG Patient case 1001 chest and limb leads 28/10/14 9 Stefan Dietze
  • 10. Title: ECG Patient case 1001 chest and limb leads „ECG“ dismabiguation on Wikipedia: 9 meanings LD as background knowledge for educational apps? 28/10/14 10 Stefan Dietze
  • 11. dbpedia.org/resource/Electrocardiagraphy 1. Understanding data: contextual disambiguation through NLP tools 2. Enrichment with factual knowledge dbpedia:Электрокардиография category:Cardiac_procedures dbpedia:Willem_Einthoven 3. interlinking with related resources bbc:ProgrammeXY slideshare:SlidesetXY yovisto:VideolectureXY Title: ECG Patient case 1001 chest and limb leads Understanding, enriching, linking data 28/10/14 11 Stefan Dietze
  • 12. „Success models“: data & applications Supporting innovative tools & applications Evaluation methods LinkedUp – Linking Web Data for Education Technology transfer & community-building Involving educators, developers, computer scientists, data engineers… http://www.linkedup-challenge.org/ Data curation & profiling Collecting & exposing open data for education Profiling of Web Data http://data.linkededucation.org EC-funded project aimed at advancing take-up of open data and related technologies http://www.linkedup-project.eu/events 28/10/14 Stefan Dietze 12 http://www.linkedup-project.eu/
  • 13. Community-building and collaboration Joint work on tangible outcomes (datasets, applications....) Associated Partners Initiatives EC Projects Stefan Dietze
  • 14. Collected & curated datasets of educational relevance Beyond collecting: published over 50 datasets as LD together with most important content providers e.g. TED, OCW, SoLAR etc LinkedUp catalog: most comprehensive collection of LD/Open Data for education RDF dataset metadata Federated queries across datasets using type mappings Publishing & curating educational data http://data.linkededucation.org/linkedup/catalog/ 28/10/14 Stefan Dietze 14
  • 15. http://data-observatory.org/lod-explorer Supporting developers and data consumers Devtalk blog: developer resource & community to aid developers Webinars and tutorials http://data.linkededucation.org/linkedup/devtalk/ Topic-based annotation and discovery of data Data exploration & visualisation features 28/10/14 Stefan Dietze 16
  • 16. LinkedUp events, training & technology transfer Bringing stakeholders together Data Providers & Data Scientists Developers Community-building through events & communication channels/social media (cross-disciplinary, industry & academia) Exploitation of project outcomes across communities: technology transfer (Co-)organised approx. 20 events (tutorials, workshops, booths etc) More than 30 invited talks/lectures …. Users (Learners, Tutors, Teachers) 28/10/14 Stefan Dietze 17
  • 17. May –September 2013 October 2013 – May 2014 May 2014 – October 2014 Series of Open Data Competitions to promote applications which exploit Linked Open Data http://www.linkedup-challenge.org/ LinkedUp Challenge
  • 18. 23 14 13 8 9 10 0 5 10 15 20 25 Veni Vidi Vici submissions shortlist LinkedUp Challenge results  50 submissions of which 27 were shortlisted and supported (through travel grants, participation in events and rewards)  13 Veni, Vidi, Vici winners (grants: 1000 – 3000 €)  Authors from 23 distinct, mostly European countries LinkedUp submissions & shortlist Coatia; 4 Greece; 4 Belgium; 5 Italy; 7 Germany; 11 Spain; 13 France; 14 Netherlands; 15 United States; 15 United Kingdom; 21 authors Top-10 author‘s origins Stefan Dietze 28/10/14 21
  • 19. Issues (1/3) - open data is messier than we think SPARQL endpoint availability over time [Buil-Aranda et al 2013] Accessibility of datasets? Less than 50% of all SPARQL endpoints actually responsive at given point of time [Buil-Aranda2013] “THE” SPARQL protocol? No, but many variants & subsets Data “quality”? …data accuracy (eg DBpedia)? [Paulheim2013] …vocabulary reuse/links? [D’AquinWebSci13] …schema compliance (RDFS, schemas) [HoganJWS2012] Stefan Dietze SPARQL Web-Querying Infrastructure: Ready for Action?, Carlos Buil-Aranda, Aidan Hogan, Jürgen Umbrich Pierre-Yves Vandenbussch, International Semantic Web Conference 2013, (ISWC2013). Assessing the Educational Linked Data Landscape, D’Aquin, M., Adamou, A., Dietze, S., ACM Web Science 2013 (WebSci2013), Paris, France, May 2013. Type Inference on Noisy RDF Data, Paulheim H., Bizer, C. Semantic Web – ISWC 2013, Lecture Notes in Computer Science Volume 8218, 2013, pp 510-525 An empirical survey of Linked Data conformance. Hogan, A., Umbrich, J., Harth, A., Cyganiak, R., Polleres, A., Decker., S., Journal of Web Semantics 14, 2012 28/10/14 22
  • 20. Issues (2/3) – accepting inconsistency Analyzing Relative Incompleteness of Movie Descriptions in the Web of Data: A Case Study, Yuan, W., Demidova, E., Dietze, S., Zhu, X., International Semantic Web Conference 2014 (ISWC2014) 28/10/14 Stefan Dietze 23
  • 21. Issues (3/3) – licensing/legal aspects Dataset Words Pages DBpedia 7163 16 Flickr 10367 23 ConceptNet 7163 16 World Bank 7056 16 Nature 7024 16 LinkedIn 6104 14 Google+ 5740 13 Tumblr 5362 12 Twitter 4247 9 Facebook 4179 9 Mashing up data: legal and licensing related issues under-estimated What license do you get when mashing up: Attribution: copyright violation from missing (86%) or incorrect attribution (14%) information Terms & conditions: complexity and conflicts when merging data from different sources Potential non-compliance from evolution of (a) LOD applications and (b) underlying datasets (and their licenses) T&C of established datasets 28/10/14 Stefan Dietze 24 Nature (CC0) + DBpedia (CC-ShareAlike) + FAO (Proprietary non-commercial) => ?
  • 22. Get involved! http://www.w3.org/community/opened http://data.linkededucation.org/linkedup/catalog/ http://data.linkededucation.org/linkedup/devtalk/
  • 23. Thank you! 28/10/14 Stefan Dietze 26