SlideShare a Scribd company logo
THE ROAD TO OPEN DATA 
ENLIGHTENMENT 
IS PAVED WITH NICE EXCUSES 
3rd Dec 2014 Toon Vanagt CEO data.be @Toon
Official Belgian company info sources
Some data.be features 
 Autocomplete 
 Mashing up gov 
sources 
 Data 
enrichment 
 Financial ratios 
 OCR in PDFs 
 Entity 
recognition 
 Alerts
Police Force are top users of 
data.be 
On the 
internet you 
must always 
remember: 
If something 
of value is 
free, you’re 
the product!
Definitions 
‘Open knowledge’ is any 
content, information or 
data that people are 
free to use, re-use and 
redistribute — without 
any legal, technological 
or social restriction. 
(okfn.org) 
 ‘Open data’ and ‘open 
content’ 
mean anyone can freely 
access, use, modify, and 
share for any purpose — 
subject, at most, to 
requirements that preserve 
provenance and openness. 
(opendefinition.org)
Open Data Enlightenment vs 
Buzz 
The Age of Enlightenment is the era from the1650s to the 
1780s in which cultural and intellectual forces emphasized 
reason, analysis and individualism rather than traditional lines 
of authority…. 
The current open data philosophy redefines ‘authority’ too and 
appeals to analytical power of citizens, hackers, journalists and 
entrepreneurs to put data to good use. 
Open data: 
 fosters “bottom up”-approach 
 stimulates to get more out of the data sets 
 delivers unexpected results & insights 
Beware of fancy alchemy headlines: 
 Open Data Is The New Oil 
 Unlocking The Gold Mine 
 Turning Government Data Into Gold 
 €40 Billion boost to the EU's economy each 
year…
Excuse 1: But how will we make 
money? 
 Does your government 
(department) really have to make 
money with open data? 
 Open data quickly evolved into 
primary state infrastructure & 
service. 
 Open data benefits society as a 
whole, so why tax usage 
separately? 
 If you still want or have to charge 
users, limit the cost in PSI-spirit to 
your marginal data delivery 
expense (extra bandwidth).
Who pays for open data gov 
cost? 
1. Government subsidizes underlying open data department 
costs as a primary service. Government covers the open 
data related cost as part of tis general expenses. 
2. Government agencies charge each other for cost of data 
usage between federal, regional and city level departments 
3. 11 open data revenue models for government agencies as 
authentic sources 
 3 options at input side 
 8 options at output side
Charging the INPUT side 
Government makes the user pay for 
(legally required!) data mutations: 
1. Creation of data sets (company 
creation, alarm system 
registration, publication of annual 
accounts,…) 
2. Change of data: (address move, 
new stakeholder in company, 
name changes, corrections…) 
3. Deletion of a dataset (inactive 
company, bankruptcy,…)
Downsides of INPUT based revenue 
model 
Introduces financial hurdles 
Removes incentives to keep data up to 
date 
Results in lower data quality 
Requires higher ‘enforcement’ cost 
Requires cost to clean up outdated data 
sets
Charging the OUTPUT side 
1. User pays for individual consultation 
2. Basic data are free, but user has to pay to consult extended data or meta 
data 
3. User pays for use of structured data sets (csv, xml, batch, API,..) 
4. User pays for real-time data sets, which reflect current state in authentic 
data source (daily update versus monthly update) 
5. User pays for removed data (from archive) or for change log (historic 
overview) 
6. Users pays to Service Level Agreement (eg guaranteed bandwidth or 
outside business hours) 
7. User pays for monitoring keywords (or events) in (or about) certain data 
sets to receive alerts (push notifications, e-mails, SMS,…) 
8. User pays for custom bench marking, segmentations, ratios or advanced 
filtering options
Downsides of OUTPUT based revenue 
model 
 Financial hurdle for ‘newcomers’ 
 Reduces innovation and consolidates ‘status-quo’ 
 Inequality (more for those who can pay, higher 
service through faster access, better informed) 
 Results in limited usage and applications 
 Requires costs for billing & payment system 
with back office operations
Gazette / Belgisch Staatsblad / 
Moniteur 
Input based: 
1. Creation of data sets 
(company creation, 
publication of annual 
accounts,…) 
2. Change of data: 
(address move, name 
changes, capital 
changes, new 
stakeholders…)
Belgian example 2: 
National Bank Balance sheets 
Input 
 Pay for publication of annual accounts (274 EUR 
for BVBA/SPRL = limited liability company) 
Output 
 User pays for use of structured data sets via a 
webservice (roughly between 1.850 EUR and 15.000 EUR per 
year). 
 User pays for old archived data sets which are 
no longer shown on the National Bank’s website 
 User pays for custom industry bench marking 
and ratios of competitors, customers or 
prospects (but one self-owned company 
benchmarking remains free)
Belgian example 3: 
Crossroads bank for enterprises 
Input 
 Creation of data sets 
 Change of data, such as address move or registering 
extra business entity,… 
Output 
 User pays for use of structured data sets (copy of public 
part of database with names of company stakeholders 
and self employed persons at 75.000 EUR/year 
 User pays for real-time data sets, which reflect current 
state in authentic data source (daily update versus 
monthly update) via API (2.000 API request for 50 EUR in 
prepaid balance) 
 User pays for removed data for change log (historic 
overview) 
 Users pays to Service Level Agreement (eg guaranteed 
bandwidth or outside business hours)
Avoid conflict of interest for gov 
agencies 
 Battle for budget: creates 
competition between government 
agencies 
 Inequality in support services and 
quality between paying and non-paying 
customers or agencies 
 Battle to secure authentic source as 
single gatekeeper and extend reach 
 Creates competition with private 
sector. Due to government agencies 
acting as commercial data brokers 
selling whole sale personal contact 
details to intermediates
Excuse 2: 
Our data quality is too low to release 
 Open Data is not your real challenge, you have much bigger data 
quality issues… 
 Accuracy: is the data correctly representing the real-world entity or event? 
 Completeness: Does the data include all data items representing the entity 
or event? 
 Conformance: Is the data following accepted standards? 
 Consistency: Is the data not containing contradictions? 
 Credibility: Is the data based on trustworthy sources? 
 Processability: Is the data machine-readable? 
 Relevance : Does the data include an appropriate amount of data? 
 Timeliness: Is the data representing the actual situation and is it published 
soon enough?
the process and partner chain is 
not… 
 Document data process 
partners 
 Describe steps in 
information chain upward 
of your authentic source 
(data.be had to reverse 
engineer processes)
some privacy sensitive data 
elements… 
 Keep the lawyers out of your open data project if you want to make a fast 
start  
 It’s complicated 
 It’s Personal 
 Privacy concept evolves over time and is culturally defined 
 Many grey zones 
 Don’t forget to try to anonymise your unstructured data too… accidents will 
happen 
 We can technologically do much more than we are permitted to culturally, 
morally or legally… 
 Beware that very few data points are needed to identify a person in this big 
data era. Eloquently phrased by Jonathan Mayer: “The idea of personally 
identifiable information not being identifiable is completely laughable in 
computer-science circles”.
Excuse 5: On second thought, we’re 
not that open… 
Availability: Can the data be 
accessed now and over time? 
Be consistent and offer long 
term commitments and stable 
data set formats (integration 
mapping) 
Data.be received a ‘Cease & 
Desist’ after a government 
hackathon: “Our government 
website is the only authentic 
source for air quality 
measurement. Stop using our 
data immediately or …”
Excuse 6: We opened the data in a 
layer on our WMS… 
 Web Map Service (WMS) is a standard 
protocol for serving geo-referenced map 
images over the Internet that are generated by 
a map server using data from a GIS database. 
 It is very hard to share the layer data…in other 
applications
Next frontiers for Open Data 
 Linked & graph data 
 Metadata 
 Unstructured data 
 Structured feedback loops
Barriers to open data reuse 
© 2013 European Commission training manual
Gatekeepers to the rescue 
 Don’t just ‘input’ the data which are presented 
 Inform general public on long term use of their 
‘public’ data. 
 Once online, always online… 
 Evangelise the use of open data inside and 
outside your organisation
Open up your organisation 
 Invite a data scientist to work. 
Share insights internally, learn, 
optimize quality of data sets 
 Be open about quality and refresh 
rates 
 Specify the license under which the 
data may be re-used. 
 Provide a feedback loop (now 
data.be often is feedback for 
outdated company data…) 
 Maintenance of metadata and data 
is critical!
Toon Vanagt CEO toon@data.be 
@Toon 
THANK YOU 
3rd Dec 2014 #OUP14 
Opening up conference in Brussels
The road to open data enlightenment is paved with nice excuses by Toon Vanagt
Picture copyright & attribution 
 The brick laying machine pictures 
can be found at Tiger 
Stone:http://www.tiger-stone. 
nl/index.php?option=com_co 
ntent&view=article&id=47&Itemid=5 
5 
 Keep calm cup: 
http://www.keepcalm-o-matic. 
co.uk/product/mug/keep-calm- 
and-open-up-67/ 
 Storify with pictures of opening-up. 
eu event: 
https://storify.com/openingup_eu/op 
ening-up-final-conference-1

More Related Content

PPTX
The Road to Open Data Enlightenment Is Paved With Nice Excuses
PDF
The PSI Directive and Open Government Data
PDF
Intro to big data and applications - day 1
PDF
Big data survey
PDF
The linked open government data and metadata lifecycle
PDF
Idc big data whitepaper_final
PDF
Information economics and big data
PPTX
Introduction to Big Data
The Road to Open Data Enlightenment Is Paved With Nice Excuses
The PSI Directive and Open Government Data
Intro to big data and applications - day 1
Big data survey
The linked open government data and metadata lifecycle
Idc big data whitepaper_final
Information economics and big data
Introduction to Big Data

What's hot (20)

PDF
Promoting the re use of open data through ODIP
PDF
Use of big data technologies in capital markets
PPTX
Fraud and Risk in Big Data
PDF
Open Data Support - bridging open data supply and demand
PDF
Digital cultural heritage spring 2015 day 2
PDF
PDF
Efficient Data Filtering Algorithm for Big Data Technology in Telecommunicati...
PDF
Efficient Data Filtering Algorithm for Big Data Technology in Telecommunicati...
PDF
A Survey on Big Data Mining Challenges
DOCX
Supply chain management
PDF
Modern data integration | Diyotta
PDF
Fast Data and Architecting the Digital Enterprise Fast Data drivers, componen...
PPTX
10 things that make companies unproductive
PPTX
Big data
PDF
wireless sensor network
PDF
Disruptive technologies - Session 2 - Blockchain smart_contracts
PDF
Big data assignment
PPTX
Main Street, Meet Mr Watson - Matt Coatney
PPT
Big data introduction - Big Data from a Consulting perspective - Sogeti
DOCX
NIC Linked Data: the OHIO project
Promoting the re use of open data through ODIP
Use of big data technologies in capital markets
Fraud and Risk in Big Data
Open Data Support - bridging open data supply and demand
Digital cultural heritage spring 2015 day 2
Efficient Data Filtering Algorithm for Big Data Technology in Telecommunicati...
Efficient Data Filtering Algorithm for Big Data Technology in Telecommunicati...
A Survey on Big Data Mining Challenges
Supply chain management
Modern data integration | Diyotta
Fast Data and Architecting the Digital Enterprise Fast Data drivers, componen...
10 things that make companies unproductive
Big data
wireless sensor network
Disruptive technologies - Session 2 - Blockchain smart_contracts
Big data assignment
Main Street, Meet Mr Watson - Matt Coatney
Big data introduction - Big Data from a Consulting perspective - Sogeti
NIC Linked Data: the OHIO project
Ad

Viewers also liked (6)

PPTX
Open Data in your organization: tactics & tricks part 1 by Job Wiegant, Koen ...
PPT
social media conference - Bart Rosseau Otto Thors
PPT
Social media conference -Otto Thors
PPTX
OUP14 Social Media Maturity Measure
PDF
Social media game by Otto Thors
PPTX
Social Media Monitoring by Henk van der Leest
Open Data in your organization: tactics & tricks part 1 by Job Wiegant, Koen ...
social media conference - Bart Rosseau Otto Thors
Social media conference -Otto Thors
OUP14 Social Media Maturity Measure
Social media game by Otto Thors
Social Media Monitoring by Henk van der Leest
Ad

Similar to The road to open data enlightenment is paved with nice excuses by Toon Vanagt (20)

PDF
Volum, Varietat, Velocitat... i Compartició
PPTX
4Growth high level objective and focus change
PPTX
#opendata Back to the future
DOCX
Project 3 – Hollywood and IT· Find 10 incidents of Hollywood p.docx
PPTX
DS103 - Unit03DS103 - Unit03DS103 - Unit03.pptx
PPTX
Age Friendly Economy - Improving your business with external data
PPTX
Mis case study , Chapter 5, Chapter 6
PDF
Recent Advances In Data And Algorithms For Egovernment Christophe Gaie
PDF
What does “BIG DATA” mean for official statistics?
PPT
How open is public data agi 2011-13
PPTX
14.02.2017 Business model innovation slides
PPTX
From Silos to Synergy_ Real-time data sharing across eco-systems in real time...
PDF
Big Data, Analytics and Data Science
PDF
To Become a Data-Driven Enterprise, Data Democratization is Essential
PDF
From Silos to Synergy_ Real-time data sharing across eco-systems in real time...
PDF
Data Refinery Is Fuelling Next Generation Big Data Analytics
PDF
Opportunities and methodological challenges of Big Data for official statist...
PPTX
Open data for smart cities
PDF
Stuart Harrison Open data - Under the hood
PPT
Adventures with Open Data in a Government World
Volum, Varietat, Velocitat... i Compartició
4Growth high level objective and focus change
#opendata Back to the future
Project 3 – Hollywood and IT· Find 10 incidents of Hollywood p.docx
DS103 - Unit03DS103 - Unit03DS103 - Unit03.pptx
Age Friendly Economy - Improving your business with external data
Mis case study , Chapter 5, Chapter 6
Recent Advances In Data And Algorithms For Egovernment Christophe Gaie
What does “BIG DATA” mean for official statistics?
How open is public data agi 2011-13
14.02.2017 Business model innovation slides
From Silos to Synergy_ Real-time data sharing across eco-systems in real time...
Big Data, Analytics and Data Science
To Become a Data-Driven Enterprise, Data Democratization is Essential
From Silos to Synergy_ Real-time data sharing across eco-systems in real time...
Data Refinery Is Fuelling Next Generation Big Data Analytics
Opportunities and methodological challenges of Big Data for official statist...
Open data for smart cities
Stuart Harrison Open data - Under the hood
Adventures with Open Data in a Government World

More from Opening-up.eu (20)

PPTX
Local Open Data: A perspective from local government in England by Gesche Schmid
PPTX
Generating rendement with open data by Arjan El Fassed
PDF
Open Knowledge Foundation Business Lounge by Pieter-Jan Pauwels
PPTX
Basics of Open Data: what you need to know by Wouter Degadt & Pieter Colpaert
PPTX
Combining data through standards and metrics by Mike Thacker
PPTX
Content strategy for a municipal Facebook by Jenny Broden
PPTX
Social Media and local participation by Birgitte Städe
PDF
Your citizen is king, but your employee is superheroe by Sofie Verhalle
PPTX
Improving decisionmaking with GIS by Bjorgulf Torjussen
PDF
Digital transformation by Jo Caudron
PDF
Can you build a business with local open data? By Jan Liefers
PDF
Building Social Networks for a safer society By Elle De Jonge & Renske Stumpel
PPTX
Open Data for local governments - De Samenkomst, Aelbeke, Kortrijk
PDF
GIS DAY Belgium mike tacker_porism
PDF
GIS DAY Belgium inge wydhooge gis en beleid
PDF
GIS DAY Belgium diedrik gaus_elsverhasselt_gent_onderwijs
PPS
GIS DAY Belgium Anton driesmans_blankenberge_speelpleinen
PDF
GIS DAY Belgium Stephanie soete_wvl
PDF
GIS DAY Belgium bart verlinde knokke_heist_gratispleinen
PDF
GIS DAY Belgium pieter beck_20131120_analyse_buurt_wijk_stadsdeelgroen
Local Open Data: A perspective from local government in England by Gesche Schmid
Generating rendement with open data by Arjan El Fassed
Open Knowledge Foundation Business Lounge by Pieter-Jan Pauwels
Basics of Open Data: what you need to know by Wouter Degadt & Pieter Colpaert
Combining data through standards and metrics by Mike Thacker
Content strategy for a municipal Facebook by Jenny Broden
Social Media and local participation by Birgitte Städe
Your citizen is king, but your employee is superheroe by Sofie Verhalle
Improving decisionmaking with GIS by Bjorgulf Torjussen
Digital transformation by Jo Caudron
Can you build a business with local open data? By Jan Liefers
Building Social Networks for a safer society By Elle De Jonge & Renske Stumpel
Open Data for local governments - De Samenkomst, Aelbeke, Kortrijk
GIS DAY Belgium mike tacker_porism
GIS DAY Belgium inge wydhooge gis en beleid
GIS DAY Belgium diedrik gaus_elsverhasselt_gent_onderwijs
GIS DAY Belgium Anton driesmans_blankenberge_speelpleinen
GIS DAY Belgium Stephanie soete_wvl
GIS DAY Belgium bart verlinde knokke_heist_gratispleinen
GIS DAY Belgium pieter beck_20131120_analyse_buurt_wijk_stadsdeelgroen

Recently uploaded (20)

PPTX
STUDY DESIGN details- Lt Col Maksud (21).pptx
PPTX
Supervised vs unsupervised machine learning algorithms
PDF
168300704-gasification-ppt.pdfhghhhsjsjhsuxush
PPTX
Introduction to machine learning and Linear Models
PPTX
Introduction to Knowledge Engineering Part 1
PDF
Lecture1 pattern recognition............
PPTX
Introduction-to-Cloud-ComputingFinal.pptx
PPTX
Acceptance and paychological effects of mandatory extra coach I classes.pptx
PDF
“Getting Started with Data Analytics Using R – Concepts, Tools & Case Studies”
PDF
Foundation of Data Science unit number two notes
PPTX
Microsoft-Fabric-Unifying-Analytics-for-the-Modern-Enterprise Solution.pptx
PPTX
IBA_Chapter_11_Slides_Final_Accessible.pptx
PPTX
climate analysis of Dhaka ,Banglades.pptx
PPTX
oil_refinery_comprehensive_20250804084928 (1).pptx
PDF
TRAFFIC-MANAGEMENT-AND-ACCIDENT-INVESTIGATION-WITH-DRIVING-PDF-FILE.pdf
PPTX
Introduction to Basics of Ethical Hacking and Penetration Testing -Unit No. 1...
PDF
Clinical guidelines as a resource for EBP(1).pdf
PPTX
The THESIS FINAL-DEFENSE-PRESENTATION.pptx
PDF
BF and FI - Blockchain, fintech and Financial Innovation Lesson 2.pdf
PDF
Mega Projects Data Mega Projects Data
STUDY DESIGN details- Lt Col Maksud (21).pptx
Supervised vs unsupervised machine learning algorithms
168300704-gasification-ppt.pdfhghhhsjsjhsuxush
Introduction to machine learning and Linear Models
Introduction to Knowledge Engineering Part 1
Lecture1 pattern recognition............
Introduction-to-Cloud-ComputingFinal.pptx
Acceptance and paychological effects of mandatory extra coach I classes.pptx
“Getting Started with Data Analytics Using R – Concepts, Tools & Case Studies”
Foundation of Data Science unit number two notes
Microsoft-Fabric-Unifying-Analytics-for-the-Modern-Enterprise Solution.pptx
IBA_Chapter_11_Slides_Final_Accessible.pptx
climate analysis of Dhaka ,Banglades.pptx
oil_refinery_comprehensive_20250804084928 (1).pptx
TRAFFIC-MANAGEMENT-AND-ACCIDENT-INVESTIGATION-WITH-DRIVING-PDF-FILE.pdf
Introduction to Basics of Ethical Hacking and Penetration Testing -Unit No. 1...
Clinical guidelines as a resource for EBP(1).pdf
The THESIS FINAL-DEFENSE-PRESENTATION.pptx
BF and FI - Blockchain, fintech and Financial Innovation Lesson 2.pdf
Mega Projects Data Mega Projects Data

The road to open data enlightenment is paved with nice excuses by Toon Vanagt

  • 1. THE ROAD TO OPEN DATA ENLIGHTENMENT IS PAVED WITH NICE EXCUSES 3rd Dec 2014 Toon Vanagt CEO data.be @Toon
  • 3. Some data.be features  Autocomplete  Mashing up gov sources  Data enrichment  Financial ratios  OCR in PDFs  Entity recognition  Alerts
  • 4. Police Force are top users of data.be On the internet you must always remember: If something of value is free, you’re the product!
  • 5. Definitions ‘Open knowledge’ is any content, information or data that people are free to use, re-use and redistribute — without any legal, technological or social restriction. (okfn.org)  ‘Open data’ and ‘open content’ mean anyone can freely access, use, modify, and share for any purpose — subject, at most, to requirements that preserve provenance and openness. (opendefinition.org)
  • 6. Open Data Enlightenment vs Buzz The Age of Enlightenment is the era from the1650s to the 1780s in which cultural and intellectual forces emphasized reason, analysis and individualism rather than traditional lines of authority…. The current open data philosophy redefines ‘authority’ too and appeals to analytical power of citizens, hackers, journalists and entrepreneurs to put data to good use. Open data:  fosters “bottom up”-approach  stimulates to get more out of the data sets  delivers unexpected results & insights Beware of fancy alchemy headlines:  Open Data Is The New Oil  Unlocking The Gold Mine  Turning Government Data Into Gold  €40 Billion boost to the EU's economy each year…
  • 7. Excuse 1: But how will we make money?  Does your government (department) really have to make money with open data?  Open data quickly evolved into primary state infrastructure & service.  Open data benefits society as a whole, so why tax usage separately?  If you still want or have to charge users, limit the cost in PSI-spirit to your marginal data delivery expense (extra bandwidth).
  • 8. Who pays for open data gov cost? 1. Government subsidizes underlying open data department costs as a primary service. Government covers the open data related cost as part of tis general expenses. 2. Government agencies charge each other for cost of data usage between federal, regional and city level departments 3. 11 open data revenue models for government agencies as authentic sources  3 options at input side  8 options at output side
  • 9. Charging the INPUT side Government makes the user pay for (legally required!) data mutations: 1. Creation of data sets (company creation, alarm system registration, publication of annual accounts,…) 2. Change of data: (address move, new stakeholder in company, name changes, corrections…) 3. Deletion of a dataset (inactive company, bankruptcy,…)
  • 10. Downsides of INPUT based revenue model Introduces financial hurdles Removes incentives to keep data up to date Results in lower data quality Requires higher ‘enforcement’ cost Requires cost to clean up outdated data sets
  • 11. Charging the OUTPUT side 1. User pays for individual consultation 2. Basic data are free, but user has to pay to consult extended data or meta data 3. User pays for use of structured data sets (csv, xml, batch, API,..) 4. User pays for real-time data sets, which reflect current state in authentic data source (daily update versus monthly update) 5. User pays for removed data (from archive) or for change log (historic overview) 6. Users pays to Service Level Agreement (eg guaranteed bandwidth or outside business hours) 7. User pays for monitoring keywords (or events) in (or about) certain data sets to receive alerts (push notifications, e-mails, SMS,…) 8. User pays for custom bench marking, segmentations, ratios or advanced filtering options
  • 12. Downsides of OUTPUT based revenue model  Financial hurdle for ‘newcomers’  Reduces innovation and consolidates ‘status-quo’  Inequality (more for those who can pay, higher service through faster access, better informed)  Results in limited usage and applications  Requires costs for billing & payment system with back office operations
  • 13. Gazette / Belgisch Staatsblad / Moniteur Input based: 1. Creation of data sets (company creation, publication of annual accounts,…) 2. Change of data: (address move, name changes, capital changes, new stakeholders…)
  • 14. Belgian example 2: National Bank Balance sheets Input  Pay for publication of annual accounts (274 EUR for BVBA/SPRL = limited liability company) Output  User pays for use of structured data sets via a webservice (roughly between 1.850 EUR and 15.000 EUR per year).  User pays for old archived data sets which are no longer shown on the National Bank’s website  User pays for custom industry bench marking and ratios of competitors, customers or prospects (but one self-owned company benchmarking remains free)
  • 15. Belgian example 3: Crossroads bank for enterprises Input  Creation of data sets  Change of data, such as address move or registering extra business entity,… Output  User pays for use of structured data sets (copy of public part of database with names of company stakeholders and self employed persons at 75.000 EUR/year  User pays for real-time data sets, which reflect current state in authentic data source (daily update versus monthly update) via API (2.000 API request for 50 EUR in prepaid balance)  User pays for removed data for change log (historic overview)  Users pays to Service Level Agreement (eg guaranteed bandwidth or outside business hours)
  • 16. Avoid conflict of interest for gov agencies  Battle for budget: creates competition between government agencies  Inequality in support services and quality between paying and non-paying customers or agencies  Battle to secure authentic source as single gatekeeper and extend reach  Creates competition with private sector. Due to government agencies acting as commercial data brokers selling whole sale personal contact details to intermediates
  • 17. Excuse 2: Our data quality is too low to release  Open Data is not your real challenge, you have much bigger data quality issues…  Accuracy: is the data correctly representing the real-world entity or event?  Completeness: Does the data include all data items representing the entity or event?  Conformance: Is the data following accepted standards?  Consistency: Is the data not containing contradictions?  Credibility: Is the data based on trustworthy sources?  Processability: Is the data machine-readable?  Relevance : Does the data include an appropriate amount of data?  Timeliness: Is the data representing the actual situation and is it published soon enough?
  • 18. the process and partner chain is not…  Document data process partners  Describe steps in information chain upward of your authentic source (data.be had to reverse engineer processes)
  • 19. some privacy sensitive data elements…  Keep the lawyers out of your open data project if you want to make a fast start   It’s complicated  It’s Personal  Privacy concept evolves over time and is culturally defined  Many grey zones  Don’t forget to try to anonymise your unstructured data too… accidents will happen  We can technologically do much more than we are permitted to culturally, morally or legally…  Beware that very few data points are needed to identify a person in this big data era. Eloquently phrased by Jonathan Mayer: “The idea of personally identifiable information not being identifiable is completely laughable in computer-science circles”.
  • 20. Excuse 5: On second thought, we’re not that open… Availability: Can the data be accessed now and over time? Be consistent and offer long term commitments and stable data set formats (integration mapping) Data.be received a ‘Cease & Desist’ after a government hackathon: “Our government website is the only authentic source for air quality measurement. Stop using our data immediately or …”
  • 21. Excuse 6: We opened the data in a layer on our WMS…  Web Map Service (WMS) is a standard protocol for serving geo-referenced map images over the Internet that are generated by a map server using data from a GIS database.  It is very hard to share the layer data…in other applications
  • 22. Next frontiers for Open Data  Linked & graph data  Metadata  Unstructured data  Structured feedback loops
  • 23. Barriers to open data reuse © 2013 European Commission training manual
  • 24. Gatekeepers to the rescue  Don’t just ‘input’ the data which are presented  Inform general public on long term use of their ‘public’ data.  Once online, always online…  Evangelise the use of open data inside and outside your organisation
  • 25. Open up your organisation  Invite a data scientist to work. Share insights internally, learn, optimize quality of data sets  Be open about quality and refresh rates  Specify the license under which the data may be re-used.  Provide a feedback loop (now data.be often is feedback for outdated company data…)  Maintenance of metadata and data is critical!
  • 26. Toon Vanagt CEO toon@data.be @Toon THANK YOU 3rd Dec 2014 #OUP14 Opening up conference in Brussels
  • 28. Picture copyright & attribution  The brick laying machine pictures can be found at Tiger Stone:http://www.tiger-stone. nl/index.php?option=com_co ntent&view=article&id=47&Itemid=5 5  Keep calm cup: http://www.keepcalm-o-matic. co.uk/product/mug/keep-calm- and-open-up-67/  Storify with pictures of opening-up. eu event: https://storify.com/openingup_eu/op ening-up-final-conference-1

Editor's Notes

  • #7: “Data is the new oil" Turns out it meant: Cost for storage & compute cycles will go down faster than you can imagine! This feels like preaching to the Open Data Choir / So I’m keeping this short
  • #8: Austerity. More efficient gov. Do more with less money Good intentions on gov site… Basic infrastructure & service: like roads & parks were in the past Bandwidth & storage are cheap nowadays
  • #9: 1. = Ideal 2. & 3. are pragmatic
  • #13: Hurdle and limit
  • #15: Xbrl + pdf
  • #16: Free monthly dataset
  • #19: Loopholes and unofficial or undocumented access & backdoors…
  • #20: The P word is out…Privacy
  • #27: Q&A