Connected Development Data
Self-aware Data Objects
Vision
Planning and performance data from
development activities is connected
Vision: planning
• Who is planning to work in district X next
year?
• Which communities, facilities or partners are
others planning to work with?
• How can we identify and avoid potential
duplicate activities?
• How can we identify opportunities for
collaboration?
Vision: reporting
• Define what data you want to share and when
• Select who you want to share it with
• Creates a feed with stream of relevant data
• No more reports…
Vision: evaluation
• Joint evaluations focused on specific sectors
or approaches
• Draw on data from multiple implementers
• Drill down to examine source data and
evidence
• Identify implementers for interviews
Challenges
• These are not new ideas
• Many previous attempts highlight significant
challenges:
• Developing data standards
• Creating mechanisms to link systems
• Data quality problems
• Complex data governance issues
Data standards
• Data standards ensure that data from different
sources is based on same definition
• Necessary for data to be comparable, but can
be extremely time-consuming to develop
• Some success stories:
– International Aid Transparency Initiative
– HIV and AIDS indicator registry
– Humanitarian response indicator registry
Data standards
• Focus is typically on indicators and higher level
data
• Less effort to create standards for activity level
data
Mechanisms to link systems
• Migrating data from one system to another is
complex, time-consuming and expensive
• If one system changes then the link often
breaks
• Many different ways of linking systems means
work is often duplicated
• Only worth-while if working with large data-
sets
Data Quality
Can’t see the trees for the woods
• Focus on defining indicator level standards
• Therefore data often shared at this level too
• Connections and definitions that help
understand and audit the data quality often
missing
– How was data collected?
– What are the definitions inherent in the data?
Data Governance
Data Governance
• Connecting data makes it more useful but also
increases the risk of malicious attacks
• Data protection issues
• Cross-border issues (health data?)
• Security risks with vulnerable populations
Time for a fresh approach?
• Seems like these problems are un-solvable
• Final slides show-case work that we have been
doing over last five years
• Shows promising new approach to tackle
these challenges
Principles for a new approach
Emergent (bottom-up) standards development
• Support the development of standards where
there is interest and value to gain
• Ensure that each standard follows the same
‘design rules’
• Ensure that standards can be curated, shared
and – where possible – merged over time
Principles for a new approach
De-couple data from applications
• Context of the data is tightly linked to the
application in which it is created
• Ability to view and edit the data is also tightly
linked to the application
• Data must be able to exist as a micro-
application, aware of it’s context and able to
function independently
Principles for a new approach
Focus on operational data
• Current standards tend to focus on indicators,
but don’t include linkages to how the data was
collected
• If standard can include the full context, better
to start with operational data and aggregate
up
Principles for a new approach
Strong data governance
• Need strong mechanisms to manage privacy
and security
• Share data only as required for a specific
purposes
What are we trying?
• Kwantu has been working in this area for
many years
• Some promising approaches to help tackle
these problems
(1) Domain Specific Language
• Domain Specific Language (DSL) is a computer
language designed to be used by technical
experts, not programmers
• Using a DSL provides a standard and
comparable way of creating data standards
• Kwantu have developed and tested an open
source DSL in many contexts
(1) Domain Specific Language
• DSL used to create ‘Self-aware Data Objects’
(SDO) that define the standard for any
development data
• Doesn’t matter who creates each SDO
definition. They can be linked and queried
jointly
(2) Data context
• SDOs can define:
– Field names in any language
– Validations
– Help text
– Calculations
– Evidence
– Data taxonomies
– Hierarchies in and linkages to other data
• Enables us to embed the full context in the data
definition
(3) Application independent
• SDOs offers a more efficient and decentralised
application architecture
• SDO data includes it’s own view and edit
model
• Means you can interact with it in a browser or
other standard application
• SDO data is effectively a micro-application
(3) Application independent
• Legacy apps can transfer their data to and
from the relevant SDO definition
• New apps (including BetterData) can use the
view and edit models natively
• Simplifies the development of new
applications
Business
context Data envelope
Micro application
Self-aware Data Objects - Definition
Data
SDOs use a domain
specific language to
define M&E or
planning data and it’s
business context.
This can be
transformed into a
micro-application
that allows the data
to be edited and
viewed easily
Business
context
Self-aware Data Objects - Definition
Business context includes:
Data model that specifies:
- Fields
- Labels
- Help text
- Validations
- Languages
- Evidence (files or photos)
- Taxonomies
- Links to other SDO data
- Data can be expressed hierarchically
- Data can be contained in sets
It also includes a schema that validates the
data saved in the data model
Micro application
The micro application contains
view and edit models needed
to view or edit the data in a
standard web browser.
Self-aware Data Objects - Definition
Data envelope
Data
Existing applications can
access the data directly via
the Gatekeeper API
Transformer engines can be
used to transform the data
into the view and edit models
used by the application
Self-aware Data Objects - Definition
Data envelope
Data
Self-aware Data Objects - Definition
Data envelope
Every SDO includes a data envelope. This contains data on:
Who created the data and when
Who last updated the data and when
GIS coordinates
Globally unique ID for the data
Tags to code the data
Flags to indicate if the data is periodic or ad-hoc
Flags to indicate if it forms part of a series of data
Linkages to other data
Business
context
Data
Self-aware Data Objects - Data
Data
Data
Data
Data
Data
Each SDO will have multiple data instances in the Collector
from different data producers
Data envelope
Data
Query data
Import data
Collector
Data
registry
API
Gatekeeper
Prototype
collector system
environment
M&E or planning
system environment
Existing
system
API
BetterData
(4) Data Registry
• Library of shared data definitions
• Data governance team manage:
– Who can share new SDO definitions
– Who can use SDO definitions
– Curate and review SDO definitions
– Identify opportunities to link or merge
• Provide advice on privacy
• Responsible for data security
(4) Data Registry
• Option for multiple registries
• Scope set by the group that manages it
• Provides for a more organic and incremental
approach to developing standards
• While still allowing for separate data registries
to coordinate and share
(5) Collector system
• Distributed database that is linked to the Data
Registry
• Accessible only via an API that can:
– Validate SDO data against the schema held in the
Data Registry
– Publish SDO data into the collector system
– Query data held in the collector system
(6) Existing systems
• Simplify the process of integrating existing
systems
• Single standard API to validate, publish and
query data
• Systems must transform data into SDO
standard before publishing it
• Or can use SDO view and edit model to store
data natively as an SDO
(6) Existing systems
• Over time can create libraries to help speed
up integration with API
• No other changes needed
(7) BetterData
• Open source M&E system
• Integrated with Collector API
• Integrated with Data Registry
• Browse Data Registry and download relevant
SDOs
• Link SDOs into a workflow that incorporates
business logic
• Store locally or publish to Collector system
Where are we now?
• DSL – completed
• SDO examples – many in active usage
• BetterData M&E – completed
• Data Registry – early 2016
• Collector System and API – early 2016
• Data Governance guidelines - consultation
What next?
• GIZ funded pilot with South African
government
• Demonstrate working prototype in 2016
• Link and aggregate data from Municipal,
Provincial and National levels
What next?
• Canvass interest in applying to other contexts?
– Who is interested?
– What new issues does this raise?
• Establish advisory group
– Assist with refinement of DSL and SDO
specifications
– Assist with development of data governance
guidelines
Thank you!
• Rob Worthington
• rob@kwantu.net
• www.kwantu.net
• @kwantu

More Related Content

PPTX
Enhancing educational data quality in heterogeneous learning contexts using p...
PPT
Data Quality Asia Pacific Award_v1.1_20100520
PDF
Data Profiling, Data Catalogs and Metadata Harmonisation
PPTX
Data driven decision making through analytics and IoT
PPTX
MongoDB at Agilysys: A Case Study
PDF
MongoDB Case Study in Healthcare
PPTX
FAIRDOM data management support for ERACoBioTech Proposals
PPTX
Global IT Outsourcing case study
Enhancing educational data quality in heterogeneous learning contexts using p...
Data Quality Asia Pacific Award_v1.1_20100520
Data Profiling, Data Catalogs and Metadata Harmonisation
Data driven decision making through analytics and IoT
MongoDB at Agilysys: A Case Study
MongoDB Case Study in Healthcare
FAIRDOM data management support for ERACoBioTech Proposals
Global IT Outsourcing case study

What's hot (20)

PPTX
Centralize Security and Governance with Data Virtualization
PDF
Machine Data - How to Realize and Amplify its Value
PDF
Big Data Fabric Capability Maturity Model
PDF
Foundation of data quality
PPTX
Migrating Clinical Data in Various Formats to a Clinical Data Management System
PPTX
Intro to Demand Driven Open Data for Data Users
PPTX
3 Ways Tableau Improves Predictive Analytics
PDF
Data Catalog in Denodo Platform 7.0: Creating a Data Marketplace with Data Vi...
PDF
Data Marketplace - Rethink the Data
PDF
Ensuring data quality
PDF
Semantic E-Commerce - Use Cases in Enterprise Web Applications
PPTX
Introduction to BIG DATA
PDF
Data Quality
PPTX
Data Quality - Standards and Application to Open Data
PDF
II-SDV 2015, 20 - 21 April, in Nice
PPTX
Digital intelligence satish bhatia
PPT
Health Information Exchange Workgroup - November 15, 2010
PPTX
Are you ready for the transformation
PPTX
Enterprise data architecture of complex distributed applications & services
PPTX
API economy
 
Centralize Security and Governance with Data Virtualization
Machine Data - How to Realize and Amplify its Value
Big Data Fabric Capability Maturity Model
Foundation of data quality
Migrating Clinical Data in Various Formats to a Clinical Data Management System
Intro to Demand Driven Open Data for Data Users
3 Ways Tableau Improves Predictive Analytics
Data Catalog in Denodo Platform 7.0: Creating a Data Marketplace with Data Vi...
Data Marketplace - Rethink the Data
Ensuring data quality
Semantic E-Commerce - Use Cases in Enterprise Web Applications
Introduction to BIG DATA
Data Quality
Data Quality - Standards and Application to Open Data
II-SDV 2015, 20 - 21 April, in Nice
Digital intelligence satish bhatia
Health Information Exchange Workgroup - November 15, 2010
Are you ready for the transformation
Enterprise data architecture of complex distributed applications & services
API economy
 
Ad

Similar to Connected development data (20)

PPTX
Debbie Wilson: Deliver More Efficient, Joined-Up Services through Improved Ma...
PDF
Enabling a Data Mesh Architecture with Data Virtualization
PDF
PDF
Scaling Your Data: Data Democratisation and DataOps
PDF
Intro to big data and applications - day 2
PPTX
Mobile Offline First for inclusive data that spans the data divide
PDF
Why Data Mesh Needs Data Virtualization (ASEAN)
PDF
Future of Data Strategy (ASEAN)
PDF
Belgium & Luxembourg dedicated online Data Virtualization discovery workshop
PPTX
Building the enterprise data architecture
PDF
Data Services and the Modern Data Ecosystem (ASEAN)
PDF
Lunch and Learn: You have the data, now what?
PPTX
Data Governance_Notes.pptx
PDF
FIWARE Training: Introduction to Smart Data Models
PDF
Ch_2.pdf
PPT
PPTX
Evolution of the DBA to Data Platform Administrator/Specialist
PPT
software_engg-chap-03.ppt
PDF
Introduction Big Data
Debbie Wilson: Deliver More Efficient, Joined-Up Services through Improved Ma...
Enabling a Data Mesh Architecture with Data Virtualization
Scaling Your Data: Data Democratisation and DataOps
Intro to big data and applications - day 2
Mobile Offline First for inclusive data that spans the data divide
Why Data Mesh Needs Data Virtualization (ASEAN)
Future of Data Strategy (ASEAN)
Belgium & Luxembourg dedicated online Data Virtualization discovery workshop
Building the enterprise data architecture
Data Services and the Modern Data Ecosystem (ASEAN)
Lunch and Learn: You have the data, now what?
Data Governance_Notes.pptx
FIWARE Training: Introduction to Smart Data Models
Ch_2.pdf
Evolution of the DBA to Data Platform Administrator/Specialist
software_engg-chap-03.ppt
Introduction Big Data
Ad

Recently uploaded (20)

PDF
UNEP/ UNEA Plastic Treaty Negotiations Report of Inc 5.2 Geneva
PDF
Item # 10 -- Set Proposed 2025 Tax Rate
PPTX
LUNG CANCER PREDICTION MODELING USING ARTIFICIAL NEURAL NETWORK.pptx
PDF
PPT Item # 10 -- Proposed 2025 Tax Rate
PDF
Introducrion of creative nonfiction lesson 1
PPTX
cpgram enivaran cpgram enivaran cpgram enivaran
PPT
Republic Act 9729 Climate Change Adaptation
PPTX
CHS rollout Presentation by Abraham Lebeza.pptx
PPT
The Central Civil Services (Leave Travel Concession) Rules, 1988, govern the ...
PDF
AAAAAAAAAAAAAAAAAaaaaaaaAAAAAAAt - ĐV.pdf
PDF
Building Bridges (of Hope) over Our Troubled Waters_PART 1
PDF
The GDP double bind- Anders Wijkman Honorary President Club of Rome
PPT
The Central Civil Services (Leave Travel Concession) Rules, 1988, govern the ...
PPTX
Parliamentary procedure in meeting that can be use
PDF
Abhay Bhutada Foundation’s ESG Compliant Initiatives
PPTX
Workshop introduction and objectives. SK.pptx
PDF
CXPA Finland Webinar - Modern Components of Service Quality - Alec Dalton - ...
PDF
Dean, Jodi: Concept Paper - Multi Family Lot
PDF
2024-Need-Assessment-Report-March-2025.pdf
PPTX
BHARATIYA NAGARIKA SURAKSHA SAHMITA^J2023 (1).pptx
UNEP/ UNEA Plastic Treaty Negotiations Report of Inc 5.2 Geneva
Item # 10 -- Set Proposed 2025 Tax Rate
LUNG CANCER PREDICTION MODELING USING ARTIFICIAL NEURAL NETWORK.pptx
PPT Item # 10 -- Proposed 2025 Tax Rate
Introducrion of creative nonfiction lesson 1
cpgram enivaran cpgram enivaran cpgram enivaran
Republic Act 9729 Climate Change Adaptation
CHS rollout Presentation by Abraham Lebeza.pptx
The Central Civil Services (Leave Travel Concession) Rules, 1988, govern the ...
AAAAAAAAAAAAAAAAAaaaaaaaAAAAAAAt - ĐV.pdf
Building Bridges (of Hope) over Our Troubled Waters_PART 1
The GDP double bind- Anders Wijkman Honorary President Club of Rome
The Central Civil Services (Leave Travel Concession) Rules, 1988, govern the ...
Parliamentary procedure in meeting that can be use
Abhay Bhutada Foundation’s ESG Compliant Initiatives
Workshop introduction and objectives. SK.pptx
CXPA Finland Webinar - Modern Components of Service Quality - Alec Dalton - ...
Dean, Jodi: Concept Paper - Multi Family Lot
2024-Need-Assessment-Report-March-2025.pdf
BHARATIYA NAGARIKA SURAKSHA SAHMITA^J2023 (1).pptx

Connected development data

  • 2. Vision Planning and performance data from development activities is connected
  • 3. Vision: planning • Who is planning to work in district X next year? • Which communities, facilities or partners are others planning to work with? • How can we identify and avoid potential duplicate activities? • How can we identify opportunities for collaboration?
  • 4. Vision: reporting • Define what data you want to share and when • Select who you want to share it with • Creates a feed with stream of relevant data • No more reports…
  • 5. Vision: evaluation • Joint evaluations focused on specific sectors or approaches • Draw on data from multiple implementers • Drill down to examine source data and evidence • Identify implementers for interviews
  • 6. Challenges • These are not new ideas • Many previous attempts highlight significant challenges: • Developing data standards • Creating mechanisms to link systems • Data quality problems • Complex data governance issues
  • 7. Data standards • Data standards ensure that data from different sources is based on same definition • Necessary for data to be comparable, but can be extremely time-consuming to develop • Some success stories: – International Aid Transparency Initiative – HIV and AIDS indicator registry – Humanitarian response indicator registry
  • 8. Data standards • Focus is typically on indicators and higher level data • Less effort to create standards for activity level data
  • 9. Mechanisms to link systems • Migrating data from one system to another is complex, time-consuming and expensive • If one system changes then the link often breaks • Many different ways of linking systems means work is often duplicated • Only worth-while if working with large data- sets
  • 10. Data Quality Can’t see the trees for the woods • Focus on defining indicator level standards • Therefore data often shared at this level too • Connections and definitions that help understand and audit the data quality often missing – How was data collected? – What are the definitions inherent in the data?
  • 12. Data Governance • Connecting data makes it more useful but also increases the risk of malicious attacks • Data protection issues • Cross-border issues (health data?) • Security risks with vulnerable populations
  • 13. Time for a fresh approach? • Seems like these problems are un-solvable • Final slides show-case work that we have been doing over last five years • Shows promising new approach to tackle these challenges
  • 14. Principles for a new approach Emergent (bottom-up) standards development • Support the development of standards where there is interest and value to gain • Ensure that each standard follows the same ‘design rules’ • Ensure that standards can be curated, shared and – where possible – merged over time
  • 15. Principles for a new approach De-couple data from applications • Context of the data is tightly linked to the application in which it is created • Ability to view and edit the data is also tightly linked to the application • Data must be able to exist as a micro- application, aware of it’s context and able to function independently
  • 16. Principles for a new approach Focus on operational data • Current standards tend to focus on indicators, but don’t include linkages to how the data was collected • If standard can include the full context, better to start with operational data and aggregate up
  • 17. Principles for a new approach Strong data governance • Need strong mechanisms to manage privacy and security • Share data only as required for a specific purposes
  • 18. What are we trying? • Kwantu has been working in this area for many years • Some promising approaches to help tackle these problems
  • 19. (1) Domain Specific Language • Domain Specific Language (DSL) is a computer language designed to be used by technical experts, not programmers • Using a DSL provides a standard and comparable way of creating data standards • Kwantu have developed and tested an open source DSL in many contexts
  • 20. (1) Domain Specific Language • DSL used to create ‘Self-aware Data Objects’ (SDO) that define the standard for any development data • Doesn’t matter who creates each SDO definition. They can be linked and queried jointly
  • 21. (2) Data context • SDOs can define: – Field names in any language – Validations – Help text – Calculations – Evidence – Data taxonomies – Hierarchies in and linkages to other data • Enables us to embed the full context in the data definition
  • 22. (3) Application independent • SDOs offers a more efficient and decentralised application architecture • SDO data includes it’s own view and edit model • Means you can interact with it in a browser or other standard application • SDO data is effectively a micro-application
  • 23. (3) Application independent • Legacy apps can transfer their data to and from the relevant SDO definition • New apps (including BetterData) can use the view and edit models natively • Simplifies the development of new applications
  • 24. Business context Data envelope Micro application Self-aware Data Objects - Definition Data SDOs use a domain specific language to define M&E or planning data and it’s business context. This can be transformed into a micro-application that allows the data to be edited and viewed easily
  • 25. Business context Self-aware Data Objects - Definition Business context includes: Data model that specifies: - Fields - Labels - Help text - Validations - Languages - Evidence (files or photos) - Taxonomies - Links to other SDO data - Data can be expressed hierarchically - Data can be contained in sets It also includes a schema that validates the data saved in the data model
  • 26. Micro application The micro application contains view and edit models needed to view or edit the data in a standard web browser. Self-aware Data Objects - Definition Data envelope Data
  • 27. Existing applications can access the data directly via the Gatekeeper API Transformer engines can be used to transform the data into the view and edit models used by the application Self-aware Data Objects - Definition Data envelope Data
  • 28. Self-aware Data Objects - Definition Data envelope Every SDO includes a data envelope. This contains data on: Who created the data and when Who last updated the data and when GIS coordinates Globally unique ID for the data Tags to code the data Flags to indicate if the data is periodic or ad-hoc Flags to indicate if it forms part of a series of data Linkages to other data
  • 29. Business context Data Self-aware Data Objects - Data Data Data Data Data Data Each SDO will have multiple data instances in the Collector from different data producers Data envelope Data
  • 30. Query data Import data Collector Data registry API Gatekeeper Prototype collector system environment M&E or planning system environment Existing system API BetterData
  • 31. (4) Data Registry • Library of shared data definitions • Data governance team manage: – Who can share new SDO definitions – Who can use SDO definitions – Curate and review SDO definitions – Identify opportunities to link or merge • Provide advice on privacy • Responsible for data security
  • 32. (4) Data Registry • Option for multiple registries • Scope set by the group that manages it • Provides for a more organic and incremental approach to developing standards • While still allowing for separate data registries to coordinate and share
  • 33. (5) Collector system • Distributed database that is linked to the Data Registry • Accessible only via an API that can: – Validate SDO data against the schema held in the Data Registry – Publish SDO data into the collector system – Query data held in the collector system
  • 34. (6) Existing systems • Simplify the process of integrating existing systems • Single standard API to validate, publish and query data • Systems must transform data into SDO standard before publishing it • Or can use SDO view and edit model to store data natively as an SDO
  • 35. (6) Existing systems • Over time can create libraries to help speed up integration with API • No other changes needed
  • 36. (7) BetterData • Open source M&E system • Integrated with Collector API • Integrated with Data Registry • Browse Data Registry and download relevant SDOs • Link SDOs into a workflow that incorporates business logic • Store locally or publish to Collector system
  • 37. Where are we now? • DSL – completed • SDO examples – many in active usage • BetterData M&E – completed • Data Registry – early 2016 • Collector System and API – early 2016 • Data Governance guidelines - consultation
  • 38. What next? • GIZ funded pilot with South African government • Demonstrate working prototype in 2016 • Link and aggregate data from Municipal, Provincial and National levels
  • 39. What next? • Canvass interest in applying to other contexts? – Who is interested? – What new issues does this raise? • Establish advisory group – Assist with refinement of DSL and SDO specifications – Assist with development of data governance guidelines
  • 40. Thank you! • Rob Worthington • rob@kwantu.net • www.kwantu.net • @kwantu