SlideShare a Scribd company logo
A Broader Data Management Strategy
with DKAN
Data Management Strategy
Data Management Strategy
What is Data Management Strategy?
● Data management strategy is the process of planning or creating
strategies/plans for handling the data created, stored, managed and
processed by an organization.
● A data management strategy is the foundation of any data management
program. The strategy provides both the framework and the architecture
that will last throughout the life of your program.
● When you are choosing a framework ensure the foundation is built on solid
rock.
Baseline of a Healthy Data Management
Program Framework
● Current-state and future-state reference architectures
● Integration strategies
● Delivery model
● Adoption strategy
● Communication plan
● Data management maturity model and road map
Data Management Strategy
with DKAN
Data Management Strategy with DKAN
● Dkan is one of the most popular Open Source Open Data Platform.
● DKAN supports broader Data Management Strategies with tools that
simplify and streamline data migration, storage, and usability.
● The tools are designed to be straightforward so that you don’t need to be a
technical expert to use them.
● Migrate open data efficiently and effectively, store your data, and improve
the overall usability of your data.
What is DKAN?
DKAN is Open Data Platform
Then What is Open Data Platform?
Hey! wait... What is Open Data?
Open Data Platform
Relationship between Open Data Platform and Big Data
● The Open Data Platform (ODP) is primarily designed to advance the
collaboration and innovation of big data technologies
● ODP is help to big data developers in creating big data applications
on a common platform.
Open Data Platform
What is Open Data?
● Open data is the idea that some data should be freely available to everyone
to use and republish as they wish, without restrictions from copyright,
patents or other mechanisms of control.
● It is similar to concepts such as Open Source, Open Hardware, Open
Government, Open knowledge.
Open Data Platform
How it became more popular?
● It came more popular with the launch of Open Government Data (“Open
Data”) initiatives adopted by some of the world superpowers.
○ Example : USA  —  data.gov
UK — data.gov.uk
India — data.gov.in
Market Leaders of Open Data Platforms
Most of these Open Data implementations were based on popular Open Data
Platforms / frameworks already available in the market.
● CKAN
● DKAN
● Socrata
● Junar
DKAN
Open Data Platform
DKAN
● DKAN is a open data platform with a full suite of cataloging, publishing and
visualization features that allows governments, nonprofits and universities
to easily publish data to the public. DKAN is maintained by CivicActions.
● DKAN is a Drupal-based open data portal based on CKAN, the first widely
adopted open source open data portal software. CKAN stands for
Comprehensive Knowledge Archive Network.
Benefits of DKAN
DKAN is licensed under the GNU General Public License, version 2 (or later):
● Most popular free and open source license (GPLv2)
● Anyone can use it for without paying a licensing fee
● Can be used for any purpose (government and commercial)
● Prevents vendor lock-in, encourages competition among support suppliers
● Encourages cooperative community to share improvements without
partnership or purchase contracts
● Future development is responsive to active community members
Features of DKAN
● Community driven feature
development
● Manage diverse data sets
● Advanced visualization tools
● Mature user interface and workflow
● Metadata, tags, categorization
● Topics, taxonomy
● Search
● User permissions/controls
● Data stories
● Data harvesting
● Data uploader/store
● Engagement, social sharing
● Charts, graphs
● GIS, maps
● Integrated CMS, blogs
● Multi-lingual translation
● Open source code base
● Cloud-ready
● and more
DKAN sites
A partial list of DKAN sites around the world
● Multinational
○ United Nations (Open Data System Inventory) http://data.un.org/
○ The World Bank http://climatesmartplanning.org
● United States of America
○ National Democratic Institute https://nditech.org/project/dkan
○ California https://data.ca.gov
● Europe
○ Cambridgeshire, UK http://opendata.cambridgeshireinsight.org.uk
● Asia and Oceania
○ Urban Data Challenge, Japan http://udct-data.aigid.jp
● Africa
○ South Africa http://data.gov.za
DKAN Installation
There two blogs written by Mr. Supun Bandara
(Associate Tech Lead | Cloud Developer):
● How to Deploy the Dkan Product on an AWS EC2 -
https://medium.com/@supunbandara06/how-to-
deploy-the-dkan-product-on-an-aws-ec2-
ccacc667f065
● Implementing an Open Data Platform using DKan
on AWS -
http://auxenta.com/blog_implementing_an_open_da
ta_platform_using_DKan_on_AWS.html
A Broader Data Management Strategy with DKAN
The Open Data
publishing
process from
start-to-finish
Flowchart of the Open
Data publishing process.
Data Organized by DKAN
All open data catalogs built on DKAN are organized by Datasets,
Resources and Groups.
● Datasets are collections of resources, with some descriptive metadata
● Resources are just files. They can be any kind of file,
but often they are CSV files, spreadsheets or some
other kind of tabular data file.
● Organizations create datasets and upload resources.
● Data consumers can browse datasets and sometimes
see visualizations of resources.
Create Dataset in DKAN
A Broader Data Management Strategy with DKAN
A Broader Data Management Strategy with DKAN
A Broader Data Management Strategy with DKAN
A Broader Data Management Strategy with DKAN
Visualization of DKAN Dataset
● Add Chart
A Broader Data Management Strategy with DKAN
A Broader Data Management Strategy with DKAN
A Broader Data Management Strategy with DKAN
A Broader Data Management Strategy with DKAN
A Broader Data Management Strategy with DKAN
DKAN API
DKAN API
● Login
● Get Session
● Create Datasets
● Delete Datasets
● Update Datasets
● Add a file to Resource
● Retrieve the resource to check the file field
References
● Dkan Web Site - https://getdkan.org/
● How to Deploy the Dkan Product on an AWS EC2 -
https://medium.com/@supunbandara06/how-to-deploy-the-dkan-product-
on-an-aws-ec2-ccacc667f065
● DKAN Documentation - https://dkan.readthedocs.io/en/latest/
● Building Your Data Management Strategy -
https://tdwi.org/articles/2016/10/27/building-your-data-management-
strategy.aspx
Thanks!
By: Dinothan Muthulingam
Software Engineer

More Related Content

PDF
Identifying Your Audience
PPTX
IBM BP Session - Multiple CLoud Paks and Cloud Paks Foundational Services.pptx
PPT
Identification of Target Groups and Library Services
PDF
Data Lake Architecture – Modern Strategies & Approaches
PPTX
Disaster preservation in libraries
PDF
National information policy
PDF
The Value of the Modern Data Architecture with Apache Hadoop and Teradata
Identifying Your Audience
IBM BP Session - Multiple CLoud Paks and Cloud Paks Foundational Services.pptx
Identification of Target Groups and Library Services
Data Lake Architecture – Modern Strategies & Approaches
Disaster preservation in libraries
National information policy
The Value of the Modern Data Architecture with Apache Hadoop and Teradata

What's hot (20)

PDF
Smart Data Strategy EN (1).pdf
PDF
Azure Synapse Analytics
PDF
Technology Trend Roadmap.pdf
PPT
Business intelligence
PDF
Enterprise Knowledge Graph
PPTX
Information seeking
PDF
Composable data for the composable enterprise
PPTX
Developing a Data Strategy
PPTX
Data saturday Oslo Azure Purview Erwin de Kreuk
PPTX
Introduction to Information Policy
PDF
Current and emerging trends in library services
PPTX
Communication networks
PDF
COUNTER Usage Statistics
PPTX
Data Lakehouse, Data Mesh, and Data Fabric (r1)
PDF
Time to Talk about Data Mesh
PPTX
marketing concepts in libraries
PDF
Introduction to DSpace
PPTX
Interlibrary loan, walker and eisele
PDF
Library networks and consortium
PPTX
Z39.50: Information Retrieval protocol ppt
Smart Data Strategy EN (1).pdf
Azure Synapse Analytics
Technology Trend Roadmap.pdf
Business intelligence
Enterprise Knowledge Graph
Information seeking
Composable data for the composable enterprise
Developing a Data Strategy
Data saturday Oslo Azure Purview Erwin de Kreuk
Introduction to Information Policy
Current and emerging trends in library services
Communication networks
COUNTER Usage Statistics
Data Lakehouse, Data Mesh, and Data Fabric (r1)
Time to Talk about Data Mesh
marketing concepts in libraries
Introduction to DSpace
Interlibrary loan, walker and eisele
Library networks and consortium
Z39.50: Information Retrieval protocol ppt
Ad

Similar to A Broader Data Management Strategy with DKAN (20)

PDF
DKAN: The Drupal Open Data Distribution (presented at SANDCamp San Diego Drup...
PDF
Open Data Portals: 9 Solutions and How they Compare
PDF
Drupal for Enterprises
PDF
DrupalCampSFL OpenPublic Overview
PDF
Docs-as-Code: Evolving the API Documentation Experience
PDF
Big Data Governance in Hadoop Environments with Cloudera Navigatorfeb2017meetu
PPTX
Data Management using CKAN | Internship Report
PPTX
Data Isn't Just Datasets: The Role of Communications, Content & Community in ...
PDF
DKAN Drupal Distribution Presentation at Drupal Gov Days 2013
PDF
PDF
Making DMPs actionable and public
PDF
Open Data Inside - Why Internal Data Portals are Key to Successful Data Gover...
PPTX
Linked Open Data Principles, benefits of LOD for sustainable development
PDF
COMSODE networking session at ICT Lisbon 2015
PPTX
Cloudera Cares + DataKind | 7 May 2015 | London, UK
PDF
How to build and run a big data platform in the 21st century
PPTX
EUDAT & OpenAIRE Webinar: How to write a Data Management Plan - July 14, 2016...
PDF
DOC ROI Presentation 2pm NZ3 - Duane Wilkins
PPTX
Project Management Tech Tools
PPTX
EUDAT & OpenAIRE Webinar: How to write a Data Management Plan - July 7, 2016|...
DKAN: The Drupal Open Data Distribution (presented at SANDCamp San Diego Drup...
Open Data Portals: 9 Solutions and How they Compare
Drupal for Enterprises
DrupalCampSFL OpenPublic Overview
Docs-as-Code: Evolving the API Documentation Experience
Big Data Governance in Hadoop Environments with Cloudera Navigatorfeb2017meetu
Data Management using CKAN | Internship Report
Data Isn't Just Datasets: The Role of Communications, Content & Community in ...
DKAN Drupal Distribution Presentation at Drupal Gov Days 2013
Making DMPs actionable and public
Open Data Inside - Why Internal Data Portals are Key to Successful Data Gover...
Linked Open Data Principles, benefits of LOD for sustainable development
COMSODE networking session at ICT Lisbon 2015
Cloudera Cares + DataKind | 7 May 2015 | London, UK
How to build and run a big data platform in the 21st century
EUDAT & OpenAIRE Webinar: How to write a Data Management Plan - July 14, 2016...
DOC ROI Presentation 2pm NZ3 - Duane Wilkins
Project Management Tech Tools
EUDAT & OpenAIRE Webinar: How to write a Data Management Plan - July 7, 2016|...
Ad

Recently uploaded (20)

PPT
Reliability_Chapter_ presentation 1221.5784
PDF
Business Analytics and business intelligence.pdf
PDF
Galatica Smart Energy Infrastructure Startup Pitch Deck
PDF
22.Patil - Early prediction of Alzheimer’s disease using convolutional neural...
PDF
[EN] Industrial Machine Downtime Prediction
PPT
Quality review (1)_presentation of this 21
PPTX
Market Analysis -202507- Wind-Solar+Hybrid+Street+Lights+for+the+North+Amer...
PDF
annual-report-2024-2025 original latest.
PPTX
Microsoft-Fabric-Unifying-Analytics-for-the-Modern-Enterprise Solution.pptx
PDF
Clinical guidelines as a resource for EBP(1).pdf
PDF
Lecture1 pattern recognition............
PPTX
1_Introduction to advance data techniques.pptx
PPTX
Computer network topology notes for revision
PPTX
Supervised vs unsupervised machine learning algorithms
PPTX
STUDY DESIGN details- Lt Col Maksud (21).pptx
PPTX
ALIMENTARY AND BILIARY CONDITIONS 3-1.pptx
PPTX
Database Infoormation System (DBIS).pptx
PPTX
Introduction to Firewall Analytics - Interfirewall and Transfirewall.pptx
PPTX
01_intro xxxxxxxxxxfffffffffffaaaaaaaaaaafg
PPTX
AI Strategy room jwfjksfksfjsjsjsjsjfsjfsj
Reliability_Chapter_ presentation 1221.5784
Business Analytics and business intelligence.pdf
Galatica Smart Energy Infrastructure Startup Pitch Deck
22.Patil - Early prediction of Alzheimer’s disease using convolutional neural...
[EN] Industrial Machine Downtime Prediction
Quality review (1)_presentation of this 21
Market Analysis -202507- Wind-Solar+Hybrid+Street+Lights+for+the+North+Amer...
annual-report-2024-2025 original latest.
Microsoft-Fabric-Unifying-Analytics-for-the-Modern-Enterprise Solution.pptx
Clinical guidelines as a resource for EBP(1).pdf
Lecture1 pattern recognition............
1_Introduction to advance data techniques.pptx
Computer network topology notes for revision
Supervised vs unsupervised machine learning algorithms
STUDY DESIGN details- Lt Col Maksud (21).pptx
ALIMENTARY AND BILIARY CONDITIONS 3-1.pptx
Database Infoormation System (DBIS).pptx
Introduction to Firewall Analytics - Interfirewall and Transfirewall.pptx
01_intro xxxxxxxxxxfffffffffffaaaaaaaaaaafg
AI Strategy room jwfjksfksfjsjsjsjsjfsjfsj

A Broader Data Management Strategy with DKAN

  • 1. A Broader Data Management Strategy with DKAN
  • 3. Data Management Strategy What is Data Management Strategy? ● Data management strategy is the process of planning or creating strategies/plans for handling the data created, stored, managed and processed by an organization. ● A data management strategy is the foundation of any data management program. The strategy provides both the framework and the architecture that will last throughout the life of your program. ● When you are choosing a framework ensure the foundation is built on solid rock.
  • 4. Baseline of a Healthy Data Management Program Framework ● Current-state and future-state reference architectures ● Integration strategies ● Delivery model ● Adoption strategy ● Communication plan ● Data management maturity model and road map
  • 6. Data Management Strategy with DKAN ● Dkan is one of the most popular Open Source Open Data Platform. ● DKAN supports broader Data Management Strategies with tools that simplify and streamline data migration, storage, and usability. ● The tools are designed to be straightforward so that you don’t need to be a technical expert to use them. ● Migrate open data efficiently and effectively, store your data, and improve the overall usability of your data.
  • 7. What is DKAN? DKAN is Open Data Platform Then What is Open Data Platform? Hey! wait... What is Open Data?
  • 8. Open Data Platform Relationship between Open Data Platform and Big Data ● The Open Data Platform (ODP) is primarily designed to advance the collaboration and innovation of big data technologies ● ODP is help to big data developers in creating big data applications on a common platform.
  • 9. Open Data Platform What is Open Data? ● Open data is the idea that some data should be freely available to everyone to use and republish as they wish, without restrictions from copyright, patents or other mechanisms of control. ● It is similar to concepts such as Open Source, Open Hardware, Open Government, Open knowledge.
  • 10. Open Data Platform How it became more popular? ● It came more popular with the launch of Open Government Data (“Open Data”) initiatives adopted by some of the world superpowers. ○ Example : USA  —  data.gov UK — data.gov.uk India — data.gov.in
  • 11. Market Leaders of Open Data Platforms Most of these Open Data implementations were based on popular Open Data Platforms / frameworks already available in the market. ● CKAN ● DKAN ● Socrata ● Junar
  • 13. DKAN ● DKAN is a open data platform with a full suite of cataloging, publishing and visualization features that allows governments, nonprofits and universities to easily publish data to the public. DKAN is maintained by CivicActions. ● DKAN is a Drupal-based open data portal based on CKAN, the first widely adopted open source open data portal software. CKAN stands for Comprehensive Knowledge Archive Network.
  • 14. Benefits of DKAN DKAN is licensed under the GNU General Public License, version 2 (or later): ● Most popular free and open source license (GPLv2) ● Anyone can use it for without paying a licensing fee ● Can be used for any purpose (government and commercial) ● Prevents vendor lock-in, encourages competition among support suppliers ● Encourages cooperative community to share improvements without partnership or purchase contracts ● Future development is responsive to active community members
  • 15. Features of DKAN ● Community driven feature development ● Manage diverse data sets ● Advanced visualization tools ● Mature user interface and workflow ● Metadata, tags, categorization ● Topics, taxonomy ● Search ● User permissions/controls ● Data stories ● Data harvesting ● Data uploader/store ● Engagement, social sharing ● Charts, graphs ● GIS, maps ● Integrated CMS, blogs ● Multi-lingual translation ● Open source code base ● Cloud-ready ● and more
  • 16. DKAN sites A partial list of DKAN sites around the world ● Multinational ○ United Nations (Open Data System Inventory) http://data.un.org/ ○ The World Bank http://climatesmartplanning.org ● United States of America ○ National Democratic Institute https://nditech.org/project/dkan ○ California https://data.ca.gov ● Europe ○ Cambridgeshire, UK http://opendata.cambridgeshireinsight.org.uk ● Asia and Oceania ○ Urban Data Challenge, Japan http://udct-data.aigid.jp ● Africa ○ South Africa http://data.gov.za
  • 17. DKAN Installation There two blogs written by Mr. Supun Bandara (Associate Tech Lead | Cloud Developer): ● How to Deploy the Dkan Product on an AWS EC2 - https://medium.com/@supunbandara06/how-to- deploy-the-dkan-product-on-an-aws-ec2- ccacc667f065 ● Implementing an Open Data Platform using DKan on AWS - http://auxenta.com/blog_implementing_an_open_da ta_platform_using_DKan_on_AWS.html
  • 19. The Open Data publishing process from start-to-finish Flowchart of the Open Data publishing process.
  • 20. Data Organized by DKAN All open data catalogs built on DKAN are organized by Datasets, Resources and Groups. ● Datasets are collections of resources, with some descriptive metadata ● Resources are just files. They can be any kind of file, but often they are CSV files, spreadsheets or some other kind of tabular data file. ● Organizations create datasets and upload resources. ● Data consumers can browse datasets and sometimes see visualizations of resources.
  • 26. Visualization of DKAN Dataset ● Add Chart
  • 33. DKAN API ● Login ● Get Session ● Create Datasets ● Delete Datasets ● Update Datasets ● Add a file to Resource ● Retrieve the resource to check the file field
  • 34. References ● Dkan Web Site - https://getdkan.org/ ● How to Deploy the Dkan Product on an AWS EC2 - https://medium.com/@supunbandara06/how-to-deploy-the-dkan-product- on-an-aws-ec2-ccacc667f065 ● DKAN Documentation - https://dkan.readthedocs.io/en/latest/ ● Building Your Data Management Strategy - https://tdwi.org/articles/2016/10/27/building-your-data-management- strategy.aspx

Editor's Notes

  • #4: Definition of Data Management Strategy
  • #5: Thing to consider when we are choosing a Data Management Program framework.
  • #7: Why Data Management Strategy with DKAN
  • #9: Why Open Data Platform with Big Data
  • #10: Definition of Open Data
  • #11: How it became more popular?
  • #12: Some Market leader of Open data platform
  • #14: Definition of DKAN, Drupal is a CMS framwork like Wordpress
  • #15: Why DKAN?
  • #16: DKAN features
  • #17: DKAN sites
  • #18: DKAN Installation
  • #19: If you installed, You will get this page in your browser.
  • #20: Common Open Data Publishing Process
  • #21: How to store files in DKAN
  • #23: Create Dataset in DKAN
  • #24: Add Resources to Dataset
  • #25: Dataset preview with resources
  • #26: List of Datasets
  • #28: Add Chart to visualize your Dataset
  • #30: Select your Chart suitable to your resources
  • #31: Final customization of Chart
  • #32: Chart that created
  • #34: DKAN has a REST API, You can give those requests to access DKAN site