SlideShare a Scribd company logo
Datastores

For everyone
Who am I?
            • Lex Slaghuis, CEO @Wikiwise
              – Computer Science and consulting 
                background
@ajslaghu
              – Engaged with open data quite the bit
            • Wikiwise
              – Wiki’s, open content, open data, 
                open collaboration and networking
A datastore is a system that
• Offers data on a central ‘place’
   – Historical
   – Current (although, technically that is also historical  )
   – (Near) Realtime
• Offers context
   – Descriptions, including updateness of the data
   – Contactinformation!!! 
• Prevents you from building an accesspoint for each
  production site that is opened up
A datastore is not:
• A register
  – A register only links to datastores or datasets on 
    websites
  – Although great for developers to find data
  – Developers probably want as few registers as 
    possible
• But a datastore should expose data by means 
  of (metadata) search, an indexable catalog  
  and unique links for each dataset
How to get data into a datastore
•   A working proces that allows data‐owners to publish their data by means 
    of:
     1. Sending a e‐mail with a datafile? Yes, please!
     2. Having a file dropbox, so computers (servers) can send datafiles 
         automatically
          • Requires a tunnel from a production site into the datastore
     3.   A data webproxy. This means a datastore can handle a request and 
          forward it to a server who knows the answer and then sends it back
          • Requires a secure tunnel from a datastore into a production site
          • Only option with realtime or Bigdata like geo‐info
     4.   A data replication site. A datastore synchronizes (part of a ) 
          database and offers it indepedently
          • Requires a secure tunnel, either direction can work
          • Bigdata and realtime data is though to replicate (duh!)
How to get a datastore?
• Buy / hire / build / outsource the datastore… I 
  don’t care.
• Think about trust relations
  – If external parties tap into your production 
    systems, better trust them
  – Your datastore should also be trusted so make 
    sure it is recognizable as yours (logo’s and a 
    weblocation like data.yourgov.gov)
Anything else?
• Challenges and opportunities ahead: 
   – Big goverments are building datawarehouses, which means just 
     opening up 1 system
   – Small governments also need datastores, but they do cost money
   – Semi public insitutions such as hospitals are not allowed in the formal 
     government data registers
   – Commercial and community data registers are abound, see: 
     http://thedatahub.org/ and http://opendatanederland.org/
   – Engaging a community around data results in more use of data and 
     less repeated Q&A with governments
       • But difficult. Community engagement is a lot of work.
Questions?

Wikiwise.nl (Dutch)

More Related Content

PPTX
Data warehousing
PPTX
Help your users to discover your content with OpenAthens and Link Resolvers
PDF
What’s in Your Workflow?
PPTX
Useful data presentation from DataShaka
PDF
Challenges in altmetric data collection
PDF
Exposing the data from NARCIS with VIVO
PPTX
Data Mining: Key definitions
PPTX
Tatyana Matvienko,Senior Java Developer, Big data storages
Data warehousing
Help your users to discover your content with OpenAthens and Link Resolvers
What’s in Your Workflow?
Useful data presentation from DataShaka
Challenges in altmetric data collection
Exposing the data from NARCIS with VIVO
Data Mining: Key definitions
Tatyana Matvienko,Senior Java Developer, Big data storages

What's hot (12)

PPTX
Big data storages
PDF
II-SDV 2015, 20 - 21 April, in Nice
PDF
Let's downscale the semantic web !
PDF
Getting the Most out of Your Translation Memories (TM-Town ProZ Webinar April...
PDF
Manage your Datasets
PDF
ComputableFacts: a Secure System to Store Documents and Graphs
PPTX
Life in a fast moving tech company
PPTX
Custom Data Search with Stormpath
PPTX
Wsillforwaal2013
PPT
Managing sensitive data at the University of Bristol
PPTX
Web programming lec#3
PPT
Node.js: its potential in healthcare
Big data storages
II-SDV 2015, 20 - 21 April, in Nice
Let's downscale the semantic web !
Getting the Most out of Your Translation Memories (TM-Town ProZ Webinar April...
Manage your Datasets
ComputableFacts: a Secure System to Store Documents and Graphs
Life in a fast moving tech company
Custom Data Search with Stormpath
Wsillforwaal2013
Managing sensitive data at the University of Bristol
Web programming lec#3
Node.js: its potential in healthcare
Ad

Viewers also liked (8)

PPT
Case Study XS4All For Wikiwednesday
PPT
Ogd camp 2011
PPT
Mindtouch In Datarijke Omgevingen
PDF
Over wikiwise
PDF
Hochschule esslingen hydrosmart
PPT
Open innovatie festival groningen
PPT
Do Right Thing
PPT
Introductie In Wiki Door Wikiwise
Case Study XS4All For Wikiwednesday
Ogd camp 2011
Mindtouch In Datarijke Omgevingen
Over wikiwise
Hochschule esslingen hydrosmart
Open innovatie festival groningen
Do Right Thing
Introductie In Wiki Door Wikiwise
Ad

Similar to Datastores for opendata (20)

PPTX
Data Lakehouse, Data Mesh, and Data Fabric (r1)
PPTX
Data Mart Lake Ware.pptx
PPTX
Microsoft Traditional & Modern DW solutions stack Presentation.pptx
PDF
Social Media, Cloud Computing, Machine Learning, Open Source, and Big Data An...
PDF
Open Data Summit Presentation by Joe Olsen
PPTX
New big data architecture in hadoop.pptx
PPTX
Data Lakehouse, Data Mesh, and Data Fabric (r2)
PPTX
Data Integration, Interoperability and Virtualization
PPTX
One Large Data Lake, Hold the Hype
PPTX
One Large Data Lake, Hold the Hype
PDF
Levelling up your data infrastructure
PPTX
Data modeling trends for analytics
PDF
ADV Slides: When and How Data Lakes Fit into a Modern Data Architecture
PPTX
OpenStack Swift In the Enterprise
PDF
BD_Architecture and Charateristics.pptx.pdf
PDF
Harness the power of Data in a Big Data Lake
PPTX
5 Things that Make Hadoop a Game Changer
PDF
ITI015En-The evolution of databases (I)
PPTX
Data lake-itweekend-sharif university-vahid amiry
PDF
Building Data Warehouse in SQL Server
Data Lakehouse, Data Mesh, and Data Fabric (r1)
Data Mart Lake Ware.pptx
Microsoft Traditional & Modern DW solutions stack Presentation.pptx
Social Media, Cloud Computing, Machine Learning, Open Source, and Big Data An...
Open Data Summit Presentation by Joe Olsen
New big data architecture in hadoop.pptx
Data Lakehouse, Data Mesh, and Data Fabric (r2)
Data Integration, Interoperability and Virtualization
One Large Data Lake, Hold the Hype
One Large Data Lake, Hold the Hype
Levelling up your data infrastructure
Data modeling trends for analytics
ADV Slides: When and How Data Lakes Fit into a Modern Data Architecture
OpenStack Swift In the Enterprise
BD_Architecture and Charateristics.pptx.pdf
Harness the power of Data in a Big Data Lake
5 Things that Make Hadoop a Game Changer
ITI015En-The evolution of databases (I)
Data lake-itweekend-sharif university-vahid amiry
Building Data Warehouse in SQL Server

Recently uploaded (20)

PDF
Laughter Yoga Basic Learning Workshop Manual
PDF
Charisse Litchman: A Maverick Making Neurological Care More Accessible
PDF
NewBase 12 August 2025 Energy News issue - 1812 by Khaled Al Awadi_compresse...
PDF
SBI Securities Weekly Wrap 08-08-2025_250808_205045.pdf
PDF
How to Get Business Funding for Small Business Fast
PDF
How to Get Funding for Your Trucking Business
PPT
Chapter four Project-Preparation material
PDF
TyAnn Osborn: A Visionary Leader Shaping Corporate Workforce Dynamics
PPTX
Belch_12e_PPT_Ch18_Accessible_university.pptx
PPTX
2025 Product Deck V1.0.pptxCATALOGTCLCIA
PDF
Tata consultancy services case study shri Sharda college, basrur
PDF
Digital Marketing & E-commerce Certificate Glossary.pdf.................
PDF
Cours de Système d'information about ERP.pdf
PPTX
Business Ethics - An introduction and its overview.pptx
PDF
Power and position in leadershipDOC-20250808-WA0011..pdf
PDF
Stem Cell Market Report | Trends, Growth & Forecast 2025-2034
PDF
IFRS Notes in your pocket for study all the time
PDF
Hindu Circuler Economy - Model (Concept)
PPTX
svnfcksanfskjcsnvvjknsnvsdscnsncxasxa saccacxsax
PPTX
Principles of Marketing, Industrial, Consumers,
Laughter Yoga Basic Learning Workshop Manual
Charisse Litchman: A Maverick Making Neurological Care More Accessible
NewBase 12 August 2025 Energy News issue - 1812 by Khaled Al Awadi_compresse...
SBI Securities Weekly Wrap 08-08-2025_250808_205045.pdf
How to Get Business Funding for Small Business Fast
How to Get Funding for Your Trucking Business
Chapter four Project-Preparation material
TyAnn Osborn: A Visionary Leader Shaping Corporate Workforce Dynamics
Belch_12e_PPT_Ch18_Accessible_university.pptx
2025 Product Deck V1.0.pptxCATALOGTCLCIA
Tata consultancy services case study shri Sharda college, basrur
Digital Marketing & E-commerce Certificate Glossary.pdf.................
Cours de Système d'information about ERP.pdf
Business Ethics - An introduction and its overview.pptx
Power and position in leadershipDOC-20250808-WA0011..pdf
Stem Cell Market Report | Trends, Growth & Forecast 2025-2034
IFRS Notes in your pocket for study all the time
Hindu Circuler Economy - Model (Concept)
svnfcksanfskjcsnvvjknsnvsdscnsncxasxa saccacxsax
Principles of Marketing, Industrial, Consumers,

Datastores for opendata

  • 2. Who am I? • Lex Slaghuis, CEO @Wikiwise – Computer Science and consulting  background @ajslaghu – Engaged with open data quite the bit • Wikiwise – Wiki’s, open content, open data,  open collaboration and networking
  • 3. A datastore is a system that • Offers data on a central ‘place’ – Historical – Current (although, technically that is also historical  ) – (Near) Realtime • Offers context – Descriptions, including updateness of the data – Contactinformation!!!  • Prevents you from building an accesspoint for each production site that is opened up
  • 4. A datastore is not: • A register – A register only links to datastores or datasets on  websites – Although great for developers to find data – Developers probably want as few registers as  possible • But a datastore should expose data by means  of (metadata) search, an indexable catalog   and unique links for each dataset
  • 5. How to get data into a datastore • A working proces that allows data‐owners to publish their data by means  of: 1. Sending a e‐mail with a datafile? Yes, please! 2. Having a file dropbox, so computers (servers) can send datafiles  automatically • Requires a tunnel from a production site into the datastore 3. A data webproxy. This means a datastore can handle a request and  forward it to a server who knows the answer and then sends it back • Requires a secure tunnel from a datastore into a production site • Only option with realtime or Bigdata like geo‐info 4. A data replication site. A datastore synchronizes (part of a )  database and offers it indepedently • Requires a secure tunnel, either direction can work • Bigdata and realtime data is though to replicate (duh!)
  • 6. How to get a datastore? • Buy / hire / build / outsource the datastore… I  don’t care. • Think about trust relations – If external parties tap into your production  systems, better trust them – Your datastore should also be trusted so make  sure it is recognizable as yours (logo’s and a  weblocation like data.yourgov.gov)
  • 7. Anything else? • Challenges and opportunities ahead:  – Big goverments are building datawarehouses, which means just  opening up 1 system – Small governments also need datastores, but they do cost money – Semi public insitutions such as hospitals are not allowed in the formal  government data registers – Commercial and community data registers are abound, see:  http://thedatahub.org/ and http://opendatanederland.org/ – Engaging a community around data results in more use of data and  less repeated Q&A with governments • But difficult. Community engagement is a lot of work.