DRUPAL, CKAN & PUBLIC DATA
steven.decosta@linkdigital.com.au
@starl3n
Introduction
Link Digital
Executive Director
CKAN Association
Steering Group Member
Open Knowledge Australia
Co-Secretary & Treasurer
AWS User Group
CBR Organiser
Some key Drupal and CKAN points
• DKAN is not CKAN
• CKAN is the dominant data portal platform across Australian Government
• Data.Vic, Data.NSW, Data.SA and Data.Brisbane use Drupal and CKAN together
• Single sign-on: https://github.com/ckan/ckanext-drupal7
• Taxonomies and CKAN - pulling from CKAN into Drupal to enhance content for
Government websites.
• Webforms to CKAN - for an 'open data' form collection process.
• Resource Views for Drupal - configured for a CKAN portal and organisation.
• Telling stories with data... Curation.
Drupal and CKAN
Where is there work to do with Data?
Data portal software:
1. Open Source
2. Large and expanding installation base
within Government worldwide
3. Expanding use cases in the wider data
ecosystem
4. Python web app, PostgreSQL DB
5. Built for machines,
custodians and end users
WHAT IS CKAN?
LEARN MORE AT CKAN.org
DOWNLOAD FROM github.com/ckan/ckan
1. >> Organisations (optionally with sub-organisations)
2. >> >> Datasets
3. >> >> >> Resources
4. >> Platform Custodian
5. >> >> Organisation Custodian, Editor or Member
6. >> Published or Private datasets
CKAN STRUCTURE
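The hierarchy above can be sketched as a small data model. This is an illustration only, not CKAN's internal schema: the class and field names (`Organisation`, `Dataset`, `Resource`, `private`) and the sample values are assumptions chosen to mirror the slide.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Resource:
    """A single file or link attached to a dataset."""
    name: str
    url: str


@dataclass
class Dataset:
    """A dataset groups resources and is either published or private."""
    title: str
    private: bool = True
    resources: List[Resource] = field(default_factory=list)


@dataclass
class Organisation:
    """An organisation owns datasets and may contain sub-organisations."""
    name: str
    datasets: List[Dataset] = field(default_factory=list)
    sub_organisations: List["Organisation"] = field(default_factory=list)

    def published_datasets(self) -> List[Dataset]:
        """Collect public datasets from this organisation and everything below it."""
        found = [d for d in self.datasets if not d.private]
        for child in self.sub_organisations:
            found.extend(child.published_datasets())
        return found


# Hypothetical sample data
program = Organisation(
    "Example Program",
    datasets=[
        Dataset(
            "Road traffic counts",
            private=False,
            resources=[Resource("2015 CSV", "https://example.org/roads.csv")],
        )
    ],
)
ministry = Organisation(
    "Example Ministry",
    datasets=[Dataset("Draft budget")],  # private by default, so not listed
    sub_organisations=[program],
)

published_titles = [d.title for d in ministry.published_datasets()]
print(published_titles)
```

Only the published dataset from the sub-organisation is listed; the private draft stays hidden.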
1. >> Constitution
2. >> >> Parliamentary Legislation and Acts (Jurisdiction = Platform)
3. >> >> >> Ministries (Organisation)
4. >> >> >> >> Programs (Sub-Organisations)
5. >> >> >> >> >> Projects (Datasets)
6. >> >> >> >> >> >> Outcomes (Resources)
CKAN USE CASE PARADIGM
1. User registration
2. User management
3. Custodian workflows (manage datasets and data resources)
4. Directory Browse by organisation or group
5. Faceted search for multiple fields (supporting end user discovery)
6. Resource views to preview data (a recently improved feature)
7. Metadata view
CKAN UI
1. Create an organisation (usually done by platform owner)
2. Login as member of organisation
3. Click ‘add dataset’
4. Step 1: Add a title, description and other metadata
5. Step 2: Add resources (links to data or upload data files for hosting)
6. Step 3: Add any additional info
CKAN CUSTODIAN WORKFLOW
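The same three steps can be mirrored when a dataset is prepared programmatically. The sketch below only builds the JSON body; the keys follow the field names used by CKAN's `package_create` action, while the helper functions and all sample values are hypothetical. Sending the body (a POST to the portal's `package_create` action with an API key) is deliberately left out.

```python
import json


def build_dataset_payload(name, title, description, owner_org):
    """Step 1: title, description and other core metadata."""
    return {
        "name": name,          # URL-safe identifier for the dataset
        "title": title,
        "notes": description,  # CKAN stores the description in "notes"
        "owner_org": owner_org,
        "private": True,       # stay private until the custodian publishes
        "resources": [],
        "extras": [],
    }


def add_resource(payload, name, url, fmt):
    """Step 2: attach a resource (a link to data or a hosted file)."""
    payload["resources"].append({"name": name, "url": url, "format": fmt})
    return payload


def add_extra(payload, key, value):
    """Step 3: attach additional free-form information."""
    payload["extras"].append({"key": key, "value": value})
    return payload


# Hypothetical sample values
payload = build_dataset_payload(
    "road-traffic-counts",
    "Road traffic counts",
    "Hourly counts by site.",
    "example-organisation",
)
add_resource(payload, "2015 counts", "https://example.org/roads-2015.csv", "CSV")
add_extra(payload, "collection_method", "roadside sensors")

body = json.dumps(payload, indent=2)
print(body)
```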
1. Title
2. Description: Using markdown as required
3. Keywords: some keywords (or tags) that describe your data.
4. License (required): a dropdown of available licenses for data.gov.au (the default is Creative
Commons Attribution 3.0 Australia)
5. Organisation: a dropdown of organisations you can publish to. Most users can only publish to a
single organisation. This will be automatically filled in.
6. Visibility: whether the dataset will be viewable to all users once complete. The default is private.
7. Geospatial Coverage (required): the area which the data covers, inherited from organisation
metadata. It can be a point/polygon (Well-known text), an administrative boundary API, or a
reference URL (website address) from the National Gazetteer. Gazetteer reference URLs can be
found by searching for a place at http://www.ga.gov.au/place-names/, then clicking through to the
most appropriate location "Reference ID", and then copying and pasting the URL from the page into
the Geospatial field in data.gov.au.
CKAN METADATA FOR DATA.GOV.AU
8. Temporal Coverage From / To (required): the span of time from/to which the data is applicable. If
the data applies only to a single point in time you should only fill in the Temporal Coverage From
field.
9. Language: the language in which the dataset is published. The default is English.
10. Data Status (required): the status of the data with regard to whether it is kept updated (active, yes)
or historic (inactive, no).
11. Update Frequency (required): how often the dataset is updated, e.g. Daily, Weekly, Never. (For
remote machine-readable files, this field is used to fetch new versions of the data.)
12. Expose User Contact Information: display additional contact information for the dataset.
13. AGIFT Function/Theme: the AGIFT top level government function to which the dataset relates.
14. Publisher: name of Agency/publishing organisation. The default is set to the organisation’s name.
15. Jurisdiction: name of the jurisdiction in which the dataset belongs. The default is set to the
organisation’s jurisdiction.
CKAN METADATA FOR DATA.GOV.AU
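A custodian tool could check a draft record against the required fields and defaults listed above before submitting it. This is a minimal sketch: the dictionary keys (`geospatial_coverage`, `data_status` and so on) are hypothetical names invented for illustration, not the actual data.gov.au schema keys. Only "Temporal Coverage From" is treated as mandatory, since the slides say a single point in time fills in that field alone.

```python
# Fields the slides mark as "(required)"; key names are hypothetical.
REQUIRED_FIELDS = (
    "license",
    "geospatial_coverage",
    "temporal_coverage_from",
    "data_status",
    "update_frequency",
)

# Defaults stated on the slides.
DEFAULTS = {
    "license": "Creative Commons Attribution 3.0 Australia",
    "visibility": "private",
    "language": "English",
}


def apply_defaults(metadata):
    """Return a copy of the metadata with the documented defaults filled in."""
    merged = dict(DEFAULTS)
    merged.update(metadata)
    return merged


def missing_required(metadata):
    """List required fields that are absent or blank, in checklist order."""
    return [
        name
        for name in REQUIRED_FIELDS
        if not str(metadata.get(name, "")).strip()
    ]


# Hypothetical draft record
draft = apply_defaults(
    {
        "title": "Road traffic counts",
        "temporal_coverage_from": "2015-01-01",
        "update_frequency": "Weekly",
    }
)
missing = missing_required(draft)
print(missing)
```

The license does not appear in the result because the default already satisfies it.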
1. Get JSON-formatted lists of a site’s datasets, groups or other CKAN objects
2. Get a full JSON representation of a dataset, resource or other object
3. Search for packages (datasets) or resources matching a query
4. Create, update and delete datasets, resources and other objects
5. Get an activity stream of recently changed datasets on a site
CKAN API
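The read operations above map onto CKAN's Action API, where each action is exposed under `/api/3/action/`. The sketch below builds the request URLs and includes a small fetch helper; the portal address `https://data.example.org` and the dataset name are placeholders, and `call_action` is defined but not invoked, so nothing is requested over the network.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

SITE = "https://data.example.org"  # placeholder portal address (assumption)


def action_url(site, action, **params):
    """Build the URL for a CKAN Action API call with optional query parameters."""
    url = f"{site.rstrip('/')}/api/3/action/{action}"
    if params:
        url += "?" + urlencode(params)
    return url


def call_action(site, action, **params):
    """Fetch an action and return its result, raising if CKAN reports failure."""
    with urlopen(action_url(site, action, **params)) as response:
        payload = json.load(response)
    if not payload.get("success"):
        raise RuntimeError(f"CKAN action {action} failed: {payload.get('error')}")
    return payload["result"]


list_url = action_url(SITE, "package_list")                      # list of datasets
show_url = action_url(SITE, "package_show", id="road-traffic-counts")  # one dataset
search_url = action_url(SITE, "package_search", q="traffic", rows=5)   # search
activity_url = action_url(SITE, "recently_changed_packages_activity_list")

for url in (list_url, show_url, search_url, activity_url):
    print(url)
```

Write operations (create, update, delete) use the same URL pattern but are sent as authenticated POST requests.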
1. CKAN as an Information Asset Register
2. FileStore - for hosting of data and resources
3. DataStore - provides a database for structured storage of data together with a powerful
Web-accessible Data API
4. License Selection (machine ready?)
5. Harvesting
A FEW MORE POINTS
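As an illustration of the DataStore's Data API, rows of a structured resource can be queried through the `datastore_search` action. The sketch below only constructs the query URL; the portal address, the resource id and the `suburb` column are made-up placeholders.

```python
import json
from urllib.parse import parse_qs, urlencode, urlparse


def datastore_search_url(site, resource_id, filters=None, limit=100):
    """Build a datastore_search URL; filters are sent as a JSON object."""
    params = {"resource_id": resource_id, "limit": limit}
    if filters:
        params["filters"] = json.dumps(filters, sort_keys=True)
    return f"{site.rstrip('/')}/api/3/action/datastore_search?{urlencode(params)}"


# Hypothetical portal, resource id and column name
url = datastore_search_url(
    "https://data.example.org",
    "00000000-0000-0000-0000-000000000000",
    filters={"suburb": "Civic"},
    limit=5,
)
query = parse_qs(urlparse(url).query)
print(url)
print(query["filters"][0])
```

Parsing the URL back shows that the filter survives URL encoding as a JSON object, which is the form the action expects for `filters`.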
The perfect storm
Drupal interface
CKAN interface
What the DFMP does
Data.vic.gov.au data classification and network security
Infrastructure as Software
Drupal, CKAN and Public Data. DrupalGov, 8 February 2016
Take Note: What is NOT good
Take Note: What is best…
