SlideShare a Scribd company logo
Data Archiving and Networked Services



          Downscaling
      information systems
          for education
          Christophe Guéret (@cgueret)




DANS is een instituut van KNAW en NWO
What do you mean "Downscaling" ?
● Alternative to up/out scaling platforms when
  the cost of doing it becomes too high

● "Cost" in the wide sense
  ○   Loss of expressivity via harmonization
  ○   Loss of privacy
  ○   Loss of consistency / incompatible semantics
  ○   Hardware costs
  ○   Infrastructural costs
  ○   Cultural incompatibility
  ○   ...
Up / out scaling
● Get more and more data into one system
● Scale vertically (up) or horizontally (out)
Down scaling
● Instead of one big (cluster) system, use a
  swarm of smaller systems
● Aim at highest meaningful granularity
2 downscaled systems for education
● Information system for researchers willing to
  study Worldwide academic activity




● ~2M young learners willing to go social with
  digital media, but without Internet
Diversity aware publication of
activity of research institutions
Context
● Millions of researchers active Worldwide

● Represents lot of information about
   ○   Positions
   ○   Teaching activities
   ○   Equipment
   ○   Discoveries
   ○   ...


● Potential high value in sharing all that data
  and mining it
Problems
● Lots of name-centric, thus highly ambiguous,
  data sets

● Different conceptual spaces

● Different positions that not always
  correspond
   ○ "Maître de conférences" ~ "Universitair Docent" ~
     "Assistant professor" ?
Towards THE information system (?)
● Try to be the "Facebook for researchers"
● Eventual focus on sub-parts of the data
or THE ontology (?)
● Focus on the terminology, allow for different
  data stack (including non Web based)
Users end-up with a tough choice
● Do you prefer too specific or too generic ?




● Workaround: formats roundtripping
But data does not travel well...
● Publications from Frank van Harmelen
● Decreasing number from system to system




   148                38              13
Downscaling RIS
● Some of the harmonization high costs
  ○   Large ontologies are hard to design
  ○   Tradeoff coverage VS expressivity
  ○   Large amount of data
  ○   Lack of incentives to update one platform + branding
      and reporting issues playing against


● Alternative
  ○ Rely on a data ecosystem
  ○ Use several, layered ontologies
A research information ecosystem
Core ontology + national extensions
● Global scale insights and low level details




● Take advantage of reasoning
Cloud-less social interaction
Context
● XO laptop given to 2M
  kids aged 6-12
● Low-end hardware
  (~ old smartphone)
● Educational software
  based on
  constructivism
● Communication via
  Mesh-networking
The environment "Sugar"
Activities in Sugar
● A Sugar activity combines the concepts of
  “document” and “application” into a single
  object

● Activities can be easily shared between
  neighbouring computers

● Activity instances are associated with the
  document they let the user work on
Sharing activities
Journal of activity usage
Limitations of current data stack
● Data sharing limited to synchronous
  interaction

● Data isolated in silos

● No remote access to data created within a
  Sugar instance

● Social activity bounded by the classroom
Let's improve this, Web 2.0 way !
● Create a central server on the Cloud and
  define an API
● Create activities interacting via the API
● Add a Web frontend for authentication and
  adjust ACLs for the API


           +           +            =
Won't work because...
● Lack of stable, cheap, connection to Internet

● Lack of relevant content on the Web to
  justify getting a connection

● Issues with having kids on social networks

● (Besides, potentially hard to find a business
  model for sharing kids' work)
Downscaled alternative
● Turn every XO into a self-contained data
  publisher/consumer

● Apply Linked Data principles to achieve
  decentralised data integration
More information
● Collection of presentation about this and
  other topics
  ○ http://www.slideshare.net/cgueret


● Blog about making data sharing a reality for
  everyone
  ○ https://worldwidesemanticweb.wordpress.com/


● christophe.gueret@dans.knaw.nl

More Related Content

ODP
The Entity Registry System: Collaborative Editing of Entity Data in Poorly Co...
PDF
Digital archiving 3.0
PDF
The Entity Registry System (ERS)
ODP
Stop making tools! Nobody likes them anyway...
PDF
20170501 Distributed Network of Digital Heritage Information
PDF
Informal presentation about RES
PDF
Nanopublications and Decentralized Publishing
PPTX
Survey on NoSQL integration
The Entity Registry System: Collaborative Editing of Entity Data in Poorly Co...
Digital archiving 3.0
The Entity Registry System (ERS)
Stop making tools! Nobody likes them anyway...
20170501 Distributed Network of Digital Heritage Information
Informal presentation about RES
Nanopublications and Decentralized Publishing
Survey on NoSQL integration

What's hot (20)

PPTX
An Approach for RDF-based Semantic Access to NoSQL Repositories
PDF
Pundit at the Final DM2E event
PPTX
d:swarm - A Library Data Management Platform Based on a Linked Open Data Appr...
PDF
Wikidata
PPTX
Manchesterjan2015
PDF
Linked Open Data and American Art
PDF
Volum, Varietat, Velocitat... i Compartició
PDF
Introducing SURF
PPTX
Introduction to lod
PPT
Information Age Tools: Google Applications & More
PDF
Collaborative Creation of a Wikidata handbook
PPTX
WG5: A data wrangling experiment
PPTX
Sands Fish - Knowing in the Age of Networked Knowledge
PDF
Producing Linked Open Data with a Content Management System
PPTX
Data, part of my 2014-2015 lectures at the University of Bergamo.
PDF
What is Web-scraping?
PPTX
DataverseNL as structured data hub
 
PDF
Open content opens up new avenues of research
PPTX
Linked Open Data and DANS
 
PDF
Collecting and Making Sense of Diverse Data at WayUp
An Approach for RDF-based Semantic Access to NoSQL Repositories
Pundit at the Final DM2E event
d:swarm - A Library Data Management Platform Based on a Linked Open Data Appr...
Wikidata
Manchesterjan2015
Linked Open Data and American Art
Volum, Varietat, Velocitat... i Compartició
Introducing SURF
Introduction to lod
Information Age Tools: Google Applications & More
Collaborative Creation of a Wikidata handbook
WG5: A data wrangling experiment
Sands Fish - Knowing in the Age of Networked Knowledge
Producing Linked Open Data with a Content Management System
Data, part of my 2014-2015 lectures at the University of Bergamo.
What is Web-scraping?
DataverseNL as structured data hub
 
Open content opens up new avenues of research
Linked Open Data and DANS
 
Collecting and Making Sense of Diverse Data at WayUp
Ad

Viewers also liked (13)

PDF
Solution validation best practices
PDF
Novartis and ValiMation Present a SharePoint Solution for Risk Based Cleaning...
PDF
Google Solution Validation Process Certificate
PDF
Digital Library Home Access: User Validation, E- Resources Proxying and Feder...
PDF
Verification and Validation of Findings
PPTX
TestNG Data Binding
KEY
Data Journalism
PPT
Data validation in the Digital Age
PPTX
Solution Validation & Assessments - A practical Approach
PDF
Best Practice Solution Validation - Lean Startup Machine - Naples 2015
PPTX
Calibration and validation of analytical instruments
PDF
Developing a Roadmap for Digital Transformation
PPT
Digital Transformation: What it is and how to get there
Solution validation best practices
Novartis and ValiMation Present a SharePoint Solution for Risk Based Cleaning...
Google Solution Validation Process Certificate
Digital Library Home Access: User Validation, E- Resources Proxying and Feder...
Verification and Validation of Findings
TestNG Data Binding
Data Journalism
Data validation in the Digital Age
Solution Validation & Assessments - A practical Approach
Best Practice Solution Validation - Lean Startup Machine - Naples 2015
Calibration and validation of analytical instruments
Developing a Roadmap for Digital Transformation
Digital Transformation: What it is and how to get there
Ad

Similar to Downscaling information systems for education (20)

PDF
Next-Gen E-Learning Ideas
DOCX
Web 2 ingles
PPT
The Rationale for Semantic Technologies
PDF
Conole Ascilite Paper
PDF
Conole Lams
PPT
Conole Canada Keynote
PPT
Conole Canada Keynote
PDF
Let's downscale the semantic web !
PPTX
Using Semantic Analysis for Content Alignment and Gap Analysis
ODP
Embedding young learners into the information society
PPT
Conole Canada Keynote
PPT
Conole Japan
PDF
Corneli
PPTX
Open Data and Higher Education: future gains and current practice
PPSX
Semantic Analysis for Curricular Mapping, Gap Analysis & Remediation
PPT
Conole Prie Conference
PPT
Content Sharing: Whence and Whither?
PPT
Web2 Seminar
PDF
Preparing for the Impact of Web 3.0
PPTX
Conole openness
Next-Gen E-Learning Ideas
Web 2 ingles
The Rationale for Semantic Technologies
Conole Ascilite Paper
Conole Lams
Conole Canada Keynote
Conole Canada Keynote
Let's downscale the semantic web !
Using Semantic Analysis for Content Alignment and Gap Analysis
Embedding young learners into the information society
Conole Canada Keynote
Conole Japan
Corneli
Open Data and Higher Education: future gains and current practice
Semantic Analysis for Curricular Mapping, Gap Analysis & Remediation
Conole Prie Conference
Content Sharing: Whence and Whither?
Web2 Seminar
Preparing for the Impact of Web 3.0
Conole openness

More from Christophe Guéret (20)

PDF
HHAI June 2022 - KGs and Hybrid Intelligence
ODP
Introduction about WorldWideSemanticWeb.org for the workshop "Making it Matter"
PDF
Your next data viz gear should be a Wii-U
PDF
Linking knowledge spaces
ODP
The data behind the HuisKluis
PDF
The road towards a Web-based data ecosystem
PDF
Linked Open Data for Digital Humanities
PDF
ICT4D course 2013 - Low resources infrastructure
PDF
ICT4D course 2013 - OLPC deployments
PDF
ICT4D course 2013 - Sugar
PDF
Exposing the data from NARCIS with VIVO
PDF
Clarifier le sens de vos données publiques avec le Web de données
PDF
Is linked data something for me?
ODP
Decentralised entity registry “WikiReg”
ODP
Evolutionary and Swarm Computing for scaling up the Semantic Web
PDF
Decentralised Open Data for World Citizens
PDF
Assessing Linked Data Mappings using Network Measures
ODP
Finding and consuming (Linked) Open Data
PDF
Exploring Linked Data content through network analysis
PPTX
Is data sharing the privilege of a few? Bringing Linked Data to those without...
HHAI June 2022 - KGs and Hybrid Intelligence
Introduction about WorldWideSemanticWeb.org for the workshop "Making it Matter"
Your next data viz gear should be a Wii-U
Linking knowledge spaces
The data behind the HuisKluis
The road towards a Web-based data ecosystem
Linked Open Data for Digital Humanities
ICT4D course 2013 - Low resources infrastructure
ICT4D course 2013 - OLPC deployments
ICT4D course 2013 - Sugar
Exposing the data from NARCIS with VIVO
Clarifier le sens de vos données publiques avec le Web de données
Is linked data something for me?
Decentralised entity registry “WikiReg”
Evolutionary and Swarm Computing for scaling up the Semantic Web
Decentralised Open Data for World Citizens
Assessing Linked Data Mappings using Network Measures
Finding and consuming (Linked) Open Data
Exploring Linked Data content through network analysis
Is data sharing the privilege of a few? Bringing Linked Data to those without...

Recently uploaded (20)

PPTX
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
PDF
Agricultural_Statistics_at_a_Glance_2022_0.pdf
PDF
Encapsulation theory and applications.pdf
PDF
Mobile App Security Testing_ A Comprehensive Guide.pdf
PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
PDF
Chapter 3 Spatial Domain Image Processing.pdf
PDF
Bridging biosciences and deep learning for revolutionary discoveries: a compr...
PPTX
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
PPTX
Understanding_Digital_Forensics_Presentation.pptx
PDF
Diabetes mellitus diagnosis method based random forest with bat algorithm
PPTX
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
PDF
Unlocking AI with Model Context Protocol (MCP)
PPTX
20250228 LYD VKU AI Blended-Learning.pptx
PDF
Approach and Philosophy of On baking technology
PDF
Network Security Unit 5.pdf for BCA BBA.
PDF
KodekX | Application Modernization Development
PDF
Review of recent advances in non-invasive hemoglobin estimation
DOCX
The AUB Centre for AI in Media Proposal.docx
PDF
NewMind AI Weekly Chronicles - August'25 Week I
PDF
Advanced methodologies resolving dimensionality complications for autism neur...
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
Agricultural_Statistics_at_a_Glance_2022_0.pdf
Encapsulation theory and applications.pdf
Mobile App Security Testing_ A Comprehensive Guide.pdf
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
Chapter 3 Spatial Domain Image Processing.pdf
Bridging biosciences and deep learning for revolutionary discoveries: a compr...
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
Understanding_Digital_Forensics_Presentation.pptx
Diabetes mellitus diagnosis method based random forest with bat algorithm
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
Unlocking AI with Model Context Protocol (MCP)
20250228 LYD VKU AI Blended-Learning.pptx
Approach and Philosophy of On baking technology
Network Security Unit 5.pdf for BCA BBA.
KodekX | Application Modernization Development
Review of recent advances in non-invasive hemoglobin estimation
The AUB Centre for AI in Media Proposal.docx
NewMind AI Weekly Chronicles - August'25 Week I
Advanced methodologies resolving dimensionality complications for autism neur...

Downscaling information systems for education

  • 1. Data Archiving and Networked Services Downscaling information systems for education Christophe Guéret (@cgueret) DANS is een instituut van KNAW en NWO
  • 2. What do you mean "Downscaling" ? ● Alternative to up/out scaling platforms when the cost of doing it becomes too high ● "Cost" in the wide sense ○ Loss of expressivity via harmonization ○ Loss of privacy ○ Loss of consistency / incompatible semantics ○ Hardware costs ○ Infrastructural costs ○ Cultural incompatibility ○ ...
  • 3. Up / out scaling ● Get more and more data into one system ● Scale vertically (up) or horizontally (out)
  • 4. Down scaling ● Instead of one big (cluster) system, use a swarm of smaller systems ● Aim at highest meaningful granularity
  • 5. 2 downscaled systems for education ● Information system for researchers willing to study Worldwide academic activity ● ~2M young learners willing to go social with digital media, but without Internet
  • 6. Diversity aware publication of activity of research institutions
  • 7. Context ● Millions of researchers active Worldwide ● Represents lot of information about ○ Positions ○ Teaching activities ○ Equipment ○ Discoveries ○ ... ● Potential high value in sharing all that data and mining it
  • 8. Problems ● Lots of name-centric, thus highly ambiguous, data sets ● Different conceptual spaces ● Different positions that not always correspond ○ "Maître de conférences" ~ "Universitair Docent" ~ "Assistant professor" ?
  • 9. Towards THE information system (?) ● Try to be the "Facebook for researchers" ● Eventual focus on sub-parts of the data
  • 10. or THE ontology (?) ● Focus on the terminology, allow for different data stack (including non Web based)
  • 11. Users end-up with a tough choice ● Do you prefer too specific or too generic ? ● Workaround: formats roundtripping
  • 12. But data does not travel well... ● Publications from Frank van Harmelen ● Decreasing number from system to system 148 38 13
  • 13. Downscaling RIS ● Some of the harmonization high costs ○ Large ontologies are hard to design ○ Tradeoff coverage VS expressivity ○ Large amount of data ○ Lack of incentives to update one platform + branding and reporting issues playing against ● Alternative ○ Rely on a data ecosystem ○ Use several, layered ontologies
  • 15. Core ontology + national extensions ● Global scale insights and low level details ● Take advantage of reasoning
  • 17. Context ● XO laptop given to 2M kids aged 6-12 ● Low-end hardware (~ old smartphone) ● Educational software based on constructivism ● Communication via Mesh-networking
  • 19. Activities in Sugar ● A Sugar activity combines the concepts of “document” and “application” into a single object ● Activities can be easily shared between neighbouring computers ● Activity instances are associated with the document they let the user work on
  • 22. Limitations of current data stack ● Data sharing limited to synchronous interaction ● Data isolated in silos ● No remote access to data created within a Sugar instance ● Social activity bounded by the classroom
  • 23. Let's improve this, Web 2.0 way ! ● Create a central server on the Cloud and define an API ● Create activities interacting via the API ● Add a Web frontend for authentication and adjust ACLs for the API + + =
  • 24. Won't work because... ● Lack of stable, cheap, connection to Internet ● Lack of relevant content on the Web to justify getting a connection ● Issues with having kids on social networks ● (Besides, potentially hard to find a business model for sharing kids' work)
  • 25. Downscaled alternative ● Turn every XO into a self-contained data publisher/consumer ● Apply Linked Data principles to achieve decentralised data integration
  • 26. More information ● Collection of presentation about this and other topics ○ http://www.slideshare.net/cgueret ● Blog about making data sharing a reality for everyone ○ https://worldwidesemanticweb.wordpress.com/ ● christophe.gueret@dans.knaw.nl