Adding Value to Data 
A Rake’s Progress
Contents 
• Anecdotes from A Rake’s Progress 
• The TERN/OzFlux facility 
• The OzFlux data path 
• Non-generic things about OzFlux 
• Generic things about OzFlux 
• Conclusions
TOGA-COARE
OASIS ‘94 and ‘95
The Savanna Project
The TERN/OzFlux Facility
Surface fluxes
Radiation
Meteorology
Soil properties
AusPlots, Supersites
Intensive field campaigns
Site characteristics
Knowledge of ecosystem exchange of carbon, water & energy
Ecosystem dynamics
Spatial and temporal dynamics
Continental & global budgets
Vegetation type
Leaf area index
Gross primary product
Soil moisture
Hyperspectral
OzFlux (flux tower network)
AusCover (remote sensing)
eMAST (land surface models)
Biomass
Soil carbon & nutrients
Leaf-level photosynthesis
The OzFlux Site Data Path 
u’, v’, w’, T’, q’, c’
WS, WD, T, RH
Fsd, Fsu, Fld, Flu
Fg, Sws, Tsoil
Rain
10 Hz
0.1 Hz
Fast (10 Hz raw)
Slow (30 minute average)
CR3000 data logger
Slow (30 minute average)
Fast (10 Hz) (optional with ethernet modem)
Modmax modem
CF card
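The relationship between the fast (10 Hz) channels and the slow (30 minute average) channels can be illustrated with a minimal sketch. It assumes the primed quantities (w’, T’ and so on) are deviations from a block mean, as is usual in eddy covariance work, and that a flux-like quantity is the block mean of the product of two such deviations. The function names are hypothetical, and this is not the CR3000 logger program or the OzFlux processing code.

```python
from statistics import fmean

SAMPLE_RATE_HZ = 10        # fast channel rate shown on the slide
BLOCK_SECONDS = 30 * 60    # slow channel averaging period shown on the slide


def block_covariance(w, t):
    """Return mean(w' * t') for one block, where x' = x - mean(x)."""
    if len(w) != len(t) or not w:
        raise ValueError("w and t must be non-empty and the same length")
    w_mean = fmean(w)
    t_mean = fmean(t)
    return fmean((wi - w_mean) * (ti - t_mean) for wi, ti in zip(w, t))


def half_hourly_covariances(w, t, sample_rate_hz=SAMPLE_RATE_HZ,
                            block_seconds=BLOCK_SECONDS):
    """Split two fast series into complete blocks; return one covariance per block.

    Samples left over after the last complete block are ignored.
    """
    n = sample_rate_hz * block_seconds
    full_blocks = min(len(w), len(t)) // n
    return [
        block_covariance(w[i * n:(i + 1) * n], t[i * n:(i + 1) * n])
        for i in range(full_blocks)
    ]
```

At 10 Hz a 30 minute block holds 18 000 samples per channel, which is why only the averaged (slow) data is sent routinely over the modem while the raw (fast) data is optional.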
OzFlux Data Path 
Tower data
OzFluxQC
L3 data
netCDF files
L4 data
netCDF files
L5 data
L6 data
OAI-PMH server
OzFlux portal
THREDDS server
Community-developed Python scripts
WWW
netCDF files
PI institution
NeCTAR/RDSI
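The OAI-PMH server in this data path is what lets harvesters collect metadata records from the portal. As a rough sketch of what a harvester sends, the snippet below builds two standard OAI-PMH request URLs. The base URL is a placeholder, not the real OzFlux endpoint, and the `rif` metadata prefix is an assumption; a harvester would normally discover the supported prefixes with `ListMetadataFormats` first.

```python
from urllib.parse import urlencode

# Hypothetical endpoint, used for illustration only.
BASE_URL = "https://example.org/oai"

# The six request verbs defined by OAI-PMH 2.0.
OAI_VERBS = {
    "Identify",
    "ListMetadataFormats",
    "ListSets",
    "ListIdentifiers",
    "ListRecords",
    "GetRecord",
}


def oai_request_url(base_url, verb, **arguments):
    """Build an OAI-PMH GET request URL for the given verb and arguments."""
    if verb not in OAI_VERBS:
        raise ValueError(f"unknown OAI-PMH verb: {verb}")
    query = {"verb": verb, **arguments}
    return f"{base_url}?{urlencode(query)}"


# Ask the server which metadata formats it can supply.
formats_url = oai_request_url(BASE_URL, "ListMetadataFormats")

# Harvest all records in one format; "rif" is an assumed prefix.
records_url = oai_request_url(BASE_URL, "ListRecords", metadataPrefix="rif")
```

The response to each request is an XML document; parsing it and following resumption tokens is left out of this sketch.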
OzFlux Data Portal 
• Publicly accessible archive of data 
• Ability to audit data provided by TERN-funded sites
• Publication of data via RDA and TDDP 
– Make the data easy to find 
• Repository of self-documenting data files 
– netCDF files conforming to CF Metadata 
Conventions
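What "self-documenting" means in practice can be sketched as a check that each file carries the metadata a reader needs. The example below works on plain dictionaries standing in for the attributes read from a netCDF file, and tests only a small subset of what the CF Metadata Conventions describe: a global `Conventions` attribute, plus `units` and a `long_name` or `standard_name` on each variable. The function name and the example attribute values are illustrative assumptions, not taken from a real OzFlux file.

```python
def missing_cf_attributes(global_attrs, variables):
    """List the basic CF attributes that are absent from a file's metadata."""
    problems = []
    if "Conventions" not in global_attrs:
        problems.append("global: Conventions")
    for name, attrs in variables.items():
        if "units" not in attrs:
            problems.append(f"{name}: units")
        if "long_name" not in attrs and "standard_name" not in attrs:
            problems.append(f"{name}: long_name or standard_name")
    return problems


# Illustrative metadata only; not copied from a real OzFlux file.
example_globals = {"Conventions": "CF-1.6"}
example_variables = {
    "Fsd": {"units": "W m-2", "long_name": "Down-welling shortwave radiation"},
    "Tsoil": {"long_name": "Soil temperature"},  # units deliberately left out
}

print(missing_cf_attributes(example_globals, example_variables))
```

With the example metadata above, the only reported gap is the missing `units` attribute on `Tsoil`.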
Non-generic things about OzFlux 
• Data is very homogeneous 
– Standard suite of instruments 
– Standard data collection and processing 
– High level of automation 
• Existed prior to TERN with a long history of 
collaboration 
• Lucky breaks 
– Standard data processing and portal developed before 
TERN 
– Choice of data format compatible with public access 
to data
Generic things about OzFlux 
• Prior to TERN 
– Individual solutions to data curation 
– No license applied to data 
– Data difficult to discover 
• Constituency 
– Community of rugged individualists
Messages for Others 
• Clear vision of what data curation is 
achievable by your community 
– Accept outside help 
• Solutions need to fit the community 
– Voluntary compliance is better than enforcement 
– Guiding is better than directing 
• Staged solutions to help community accept 
change 
– Herding cats
Communication 
• Email lists 
• ASN/OzFlux newsletter 
• Use of collaborative resources 
– Trello for scheduling workflow 
– GitHub for software updates 
– CloudStor+ for providing data to site PIs 
– ResearchGate project
What happens when communication fails? 
• Establishment of 2 data paths: 
– Arose through misunderstanding and a lack of 
formality in process 
– Compromise allowed co-existence as “research” 
and “operational” systems 
– OzFlux will need to make sure boundaries 
between the 2 systems remain clear inside and 
outside OzFlux 
– OzFlux will need to make sure maintaining 2 
systems does not adversely affect resources
Conclusions 
• Scientists can be led to water and they will 
drink if: 
– They can see there is something in it for them. 
– Change is managed, staged and suits the 
community.

Editor's Notes

  • #4: Picture of the S/V Malaita. Map of TOGA-COARE research area. Describe project and data collection and curation. Hold up box of 3.5” diskettes. Make the point that the data was used once and would be difficult to use again (media may not be readable).
  • #5: Photo of HNK. Map of OASIS experiment area. Description of experiment and use of data. 2 papers from aircraft data. Data collection and curation. Hold up CD-ROM of HNK data.
  • #6: Not sure about this … Picture of towers. Map of NATT tower area. Description of data collection and curation. Screen shot of data available on OzFlux Data Portal. At least part of the data from this experiment, the data from the network of towers, is now available as self-documenting netCDF files for public download from the OzFlux Data Portal.
  • #7: Network of 28 sites across Australia, 13 funded by TERN. Measurements of the exchange of water and carbon between the Australian biosphere and the atmosphere. 80 site-years of data available from the portal. 64 site-years contributed to the global FluxNet initiative. Characterise the climate and biotic drivers of carbon and water exchange. Parameterise and validate land-surface models.
  • #8: OzFlux is one of many facilities within TERN. The complexity of the ecosystem science problems being addressed now requires contributions from multiple facilities across TERN. OzFlux needs to work with other facilities. Establishing a framework for collaboration is difficult and often ad hoc. Allocating resources to implement the frameworks is also problematic. A clear view of facility and TERN strategic goals is required.
  • #9: High level diagram of OzFlux data path from towers
  • #10: High level diagram of the OzFlux data quality control and post-processing system. QC and post-processing occur at the site PI institution. PIs upload processed data to the OzFlux data portal. Public access to data via the portal. RIF-CS files available via the OAI-PMH server for RDA, TDDP and anyone else. netCDF files available for download from the OzFlux data portal. netCDF files available for remote processing via the THREDDS server.
  • #11: Need to increase return on taxpayers’ investment by making data available for multiple uses. Many PIs see the data as their own possession and are reluctant to make it easily available to anonymous users. Many PIs are not convinced of the need for public availability of their data. Many PIs are unaware of the implications of data licensing.
  • #14: TERN’s role in facilitating this process in OzFlux has been extremely important. TERN have provided a knowledge and experience bank that facilities can draw on. In the main, TERN have sought to help and guide facilities rather than force them to adopt solutions. Data curation solutions need to be driven by research needs; the research community will then see the benefit of the overhead required, which increases the chances of voluntary compliance. This means the organisation will not have to dedicate resources to enforcing compliance. Allow people within the community to change at their own pace. Introduction of a common license across OzFlux was staged and was done only after consultation with the community. The community asked for a non-commercial license, a fair use data policy and a restricted access provision. All of these were implemented about 18 months ago. It is likely the community will accept a less restrictive license model in the near future.