SlideShare a Scribd company logo
DM_PPT_NP_v01SESIP_0715_AJ
HDF Product Designer
Aleksandar Jelenak, H. Joe Lee, Ted Habermann
Gerd Heber, John Readey, Joel Plutchak
The HDF Group
HDF Workshop @ 2015 ESIP Summer Meeting
DM_PPT_NP_v01SESIP_0715_AJ
2
Data Producer’s Conundrum
HDF Features Interoperability
Project Requirements
HDF Product Designer
• Datatypes
• Groups
• Attributes
• Dimension scales
• Compression
• Chunking
• Scale/offset
• Etc.
• Conventions
• Metadata
• Software
• netCDF
• Science objectives
• Data processing, discovery &
distribution
• Data documentation
• User engagement,
preparedness, feedback
HDF Workshop @ 2015 ESIP Summer Meeting
DM_PPT_NP_v01SESIP_0715_AJ
Brief History
• Original idea from Jeffrey Lee, who developed HDF5
Earth Science Builder/Creator toolset for the ICESat-2
mission
• A similar tool was independently developed for the
SMAP mission
• The HDF Group was asked to generalize the concept
• The outcome: HDF Product Designer
HDF Workshop @ 2015 ESIP Summer Meeting
DM_PPT_NP_v01SESIP_0715_AJ
Key Goals
• Facilitate creation of interoperable and standards-compliant
data products in HDF5 as early as possible in the project
development process
• Support multiple computing platforms without requiring the full
software stack of development tools and libraries
• Easy and intuitive editing (create, update, move, copy, delete)
of HDF5 objects
• Collaborative approach to product design (project, team,
organization)
• Incorporation of best practices and standards from targeted
data user communities
• Integration of compliance and interoperability tests into the
design workflow
• Content import from existing files
• Content export as HDF5 files, HDF5/JSON, or as source code
in several programming languages
HDF Workshop @ 2015 ESIP Summer Meeting
DM_PPT_NP_v01SESIP_0715_AJ
5
System Architecture
Data StoreDesktop
Client
HDF5
JSON
Fortran IDL MATLAB Python
RESTful
Server
CSV
(Excel)
HDF5
JSON
HDF4 MAP
XML
NcML HDF5
Server
HDF5 File
Template
Flexible Output
HDF Workshop @ 2015 ESIP Summer Meeting
DM_PPT_NP_v01SESIP_0715_AJ
6
Software Stack
• Desktop Client
• WxPython
• CLIPS (C Language Integrated Production
System) expert system
• PyCLIPS
• RESTful Server
• Python/Tornado
• h5py
• Data Store
• PostgreSQL relational database
HDF Workshop @ 2015 ESIP Summer Meeting
DM_PPT_NP_v01SESIP_0715_AJ
Features
• Projects
• Designs
• CRUD operations on HDF5 objects
• Conventions support
• Validation services
• Collaborative workflow
HDF Workshop @ 2015 ESIP Summer Meeting
DM_PPT_NP_v01SESIP_0715_AJ
Project
• Organizational and collaborative space
• One or more users
• Zero or more designs
• Every user must belong to at least one project
• All members of a project has access to its
designs
• User project roles:
– Manager (not used yet)
– Designer
– Value Editor (not used yet)
– Viewer
HDF Workshop @ 2015 ESIP Summer Meeting
DM_PPT_NP_v01SESIP_0715_AJ
Design
• Represents content to be stored in one HDF5
file
• Not actual HDF5 file
• Versioned
– Simple timeline of checkpoints (saved versions)
– Each version must have unique label
– Only the current working version (label: HEAD) can
be edited
• Import from: NcML (netCDF XML), HDF4 file
content map (XML), HDF5/JSON
• Export as: HDF5 template, HDF5/JSON;
source code: Python, MATLAB, IDL,
FORTRAN
HDF Workshop @ 2015 ESIP Summer Meeting
DM_PPT_NP_v01SESIP_0715_AJ
CRUD Operations
• CRUD = create, read, update, delete, copy,
move
• Available on designs and HDF5 objects
• Support for HDF5 dimension scales
continuously improves
• Properties available to edit:
– Datatype
– Rank, shape, max/unlimited dimension sizes
– Storage (compact, contiguous, chunked)
– Fill value
– Compression
– Attribute value
HDF Workshop @ 2015 ESIP Summer Meeting
DM_PPT_NP_v01SESIP_0715_AJ
Conventions
• Supported:
– NetCDF User Guide Attribute Conventions
(NUG)
– Attribute Convention for Data Discovery
(ACDD)
– Climate and Forecast convention (CF)
– HDF-EOS (partial)
• Implemented using the CLIPS expert
system
HDF Workshop @ 2015 ESIP Summer Meeting
DM_PPT_NP_v01SESIP_0715_AJ
Validation Services
• A set of online services for interoperability
testing
• The level of support for conventions varies
between different software tools so it is
important to verify using actual file
• Input is HDF5 template file
• Output is typically displayed in a web
browser
HDF Workshop @ 2015 ESIP Summer Meeting
DM_PPT_NP_v01SESIP_0715_AJ
Validation Services
• Currently available:
– netCDF CDL
– Get as netCDF3 file
– CF (NCO’s ncdismember)
– ACDD (THREDDS UDDC service)
– ISO metadata (THREDDS ISO service)
– OPeNDAP Data Access Form
– THREDDS Dataset Access Page
HDF Workshop @ 2015 ESIP Summer Meeting
DM_PPT_NP_v01SESIP_0715_AJ
14
Collaboration
Individuals
Teams
Projects
Programs
HDF Workshop @ 2015 ESIP Summer Meeting
DM_PPT_NP_v01SESIP_0715_AJ
15
Collaboration
Design
(Desktop
)
Publish
(Online)
Share
(Server
)
HDF Workshop @ 2015 ESIP Summer Meeting
DM_PPT_NP_v01SESIP_0715_AJ
User Resources
• User Guide
• Code is hosted in the NASA Earthdata
Code Collaborative
• Mailing list
• Regular monthly meetings
• Us!
HDF Workshop @ 2015 ESIP Summer Meeting
DM_PPT_NP_v01SESIP_0715_AJ
DEMONSTRATION
DM_PPT_NP_v01SESIP_0715_AJ
Future Work?
• Continue improving user interface, source
code generators, …
• Adding data to HDF5 templates for further
validation tests
• Generate Word-friendly product description
to help with preparing required project
documentation
• Whole file convention compliance checks
• User feedback always welcome and can
influence planning!
HDF Workshop @ 2015 ESIP Summer Meeting
DM_PPT_NP_v01SESIP_0715_AJ
19
This work was supported under the NASA Earth Observing System Data and Information Systems
(EOSDIS) Evolution and Development (EED) Program under prime contract number NNG10HP02C. Any
opinions, findings, or conclusions expressed in this material are those of the author and do not necessarily
reflect the views of NASA.
Thank you!
Questions?
Contact: ajelenak@hdfgroup.org
HDF Workshop @ 2015 ESIP Summer Meeting
DM_PPT_NP_v01SESIP_0715_AJ
20
This work was supported by
NASA/GSFC under Raytheon Co.
contract number NNG10HP02C
HDF Workshop @ 2015 ESIP Summer Meeting

More Related Content

PPTX
Geospatial Data Abstraction Library (GDAL) Enhancement for ESDIS (GEE)
PPTX
MATLAB and Scientific Data: New Features and Capabilities
PPTX
HDF Update for DAAC Managers (2017-02-27)
PPTX
Scientific Computing and Visualization using HDF
PPTX
Putting some Spark into HDF5
Geospatial Data Abstraction Library (GDAL) Enhancement for ESDIS (GEE)
MATLAB and Scientific Data: New Features and Capabilities
HDF Update for DAAC Managers (2017-02-27)
Scientific Computing and Visualization using HDF
Putting some Spark into HDF5

What's hot (20)

PPTX
Hierarchical Data Formats (HDF) Update
PPTX
PPTX
Incorporating ISO Metadata Using HDF Product Designer
PPTX
Open-source Scientific Computing and Data Analytics using HDF
PPT
PPTX
America Runs on Excel and HDF5 - Glued together by Python
PPTX
HDF Product Designer: Using Templates to Achieve Interoperability
PDF
HDFCloud Workshop: HDF5 in the Cloud
PPTX
Utilizing HDF4 File Content Maps for the Cloud Computing
PPTX
HDF5 and Ecosystem: What Is New?
PPTX
Parallel Computing with HDF Server
PPTX
HDF - Current status and Future Directions
PPTX
Improved Methods for Accessing Scientific Data for the Masses
PPSX
HDFEOS.org User Analsys, Updates, and Future
PPT
HDF-EOS 2/5 to netCDF Converter
PPSX
Adding new servicees for HDF in THREDDS Data Server (TDS)
PPT
Caching and Buffering in HDF5
Hierarchical Data Formats (HDF) Update
Incorporating ISO Metadata Using HDF Product Designer
Open-source Scientific Computing and Data Analytics using HDF
America Runs on Excel and HDF5 - Glued together by Python
HDF Product Designer: Using Templates to Achieve Interoperability
HDFCloud Workshop: HDF5 in the Cloud
Utilizing HDF4 File Content Maps for the Cloud Computing
HDF5 and Ecosystem: What Is New?
Parallel Computing with HDF Server
HDF - Current status and Future Directions
Improved Methods for Accessing Scientific Data for the Masses
HDFEOS.org User Analsys, Updates, and Future
HDF-EOS 2/5 to netCDF Converter
Adding new servicees for HDF in THREDDS Data Server (TDS)
Caching and Buffering in HDF5
Ad

Similar to HDF Product Designer (20)

PPTX
Hdf5 current future
PPTX
Hdf5 parallel
PPTX
HDF-EOS Data Product Developer's Guide
PPTX
Introduction to HDF5 Data and Programming Models
PPTX
HDF5 OPeNDAP project update and demo
PPTX
HDF Project Status and Plans
PPTX
The State of HDF5 / Dana Robinson / The HDF Group
PPT
HDF Status and Development
PPTX
HDF4 Mapping Project Update
PPTX
HDF Tools Updates and Discussions
Ad

More from The HDF-EOS Tools and Information Center (20)

PDF
HDF5 2.0: Cloud Optimized from the Start
PDF
Using a Hierarchical Data Format v5 file as Zarr v3 Shard
PDF
Cloud-Optimized HDF5 Files - Current Status
PDF
Cloud Optimized HDF5 for the ICESat-2 mission
PPTX
Access HDF Data in the Cloud via OPeNDAP Web Service
PPTX
Upcoming New HDF5 Features: Multi-threading, sparse data storage, and encrypt...
PDF
Cloud-Optimized HDF5 Files
PDF
Accessing HDF5 data in the cloud with HSDS
PPTX
Highly Scalable Data Service (HSDS) Performance Features
PDF
Creating Cloud-Optimized HDF5 Files
PPTX
HDF5 OPeNDAP Handler Updates, and Performance Discussion
PPTX
Hyrax: Serving Data from S3
PPSX
Accessing Cloud Data and Services Using EDL, Pydap, MATLAB
PDF
HDF - Current status and Future Directions
PDF
H5Coro: The Cloud-Optimized Read-Only Library
PPTX
MATLAB Modernization on HDF5 1.10
PPTX
HDF for the Cloud - Serverless HDF
PPTX
HDF for the Cloud - New HDF Server Features
HDF5 2.0: Cloud Optimized from the Start
Using a Hierarchical Data Format v5 file as Zarr v3 Shard
Cloud-Optimized HDF5 Files - Current Status
Cloud Optimized HDF5 for the ICESat-2 mission
Access HDF Data in the Cloud via OPeNDAP Web Service
Upcoming New HDF5 Features: Multi-threading, sparse data storage, and encrypt...
Cloud-Optimized HDF5 Files
Accessing HDF5 data in the cloud with HSDS
Highly Scalable Data Service (HSDS) Performance Features
Creating Cloud-Optimized HDF5 Files
HDF5 OPeNDAP Handler Updates, and Performance Discussion
Hyrax: Serving Data from S3
Accessing Cloud Data and Services Using EDL, Pydap, MATLAB
HDF - Current status and Future Directions
H5Coro: The Cloud-Optimized Read-Only Library
MATLAB Modernization on HDF5 1.10
HDF for the Cloud - Serverless HDF
HDF for the Cloud - New HDF Server Features

Recently uploaded (20)

PDF
Building Integrated photovoltaic BIPV_UPV.pdf
PPTX
20250228 LYD VKU AI Blended-Learning.pptx
PDF
Network Security Unit 5.pdf for BCA BBA.
PPTX
Big Data Technologies - Introduction.pptx
PPTX
MYSQL Presentation for SQL database connectivity
PPTX
Understanding_Digital_Forensics_Presentation.pptx
DOCX
The AUB Centre for AI in Media Proposal.docx
PDF
cuic standard and advanced reporting.pdf
PPT
“AI and Expert System Decision Support & Business Intelligence Systems”
PDF
Mobile App Security Testing_ A Comprehensive Guide.pdf
PPTX
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
PDF
Approach and Philosophy of On baking technology
PDF
Modernizing your data center with Dell and AMD
PDF
Chapter 3 Spatial Domain Image Processing.pdf
PDF
Per capita expenditure prediction using model stacking based on satellite ima...
PDF
Machine learning based COVID-19 study performance prediction
PDF
Review of recent advances in non-invasive hemoglobin estimation
PPTX
PA Analog/Digital System: The Backbone of Modern Surveillance and Communication
PDF
Shreyas Phanse Resume: Experienced Backend Engineer | Java • Spring Boot • Ka...
PDF
Spectral efficient network and resource selection model in 5G networks
Building Integrated photovoltaic BIPV_UPV.pdf
20250228 LYD VKU AI Blended-Learning.pptx
Network Security Unit 5.pdf for BCA BBA.
Big Data Technologies - Introduction.pptx
MYSQL Presentation for SQL database connectivity
Understanding_Digital_Forensics_Presentation.pptx
The AUB Centre for AI in Media Proposal.docx
cuic standard and advanced reporting.pdf
“AI and Expert System Decision Support & Business Intelligence Systems”
Mobile App Security Testing_ A Comprehensive Guide.pdf
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
Approach and Philosophy of On baking technology
Modernizing your data center with Dell and AMD
Chapter 3 Spatial Domain Image Processing.pdf
Per capita expenditure prediction using model stacking based on satellite ima...
Machine learning based COVID-19 study performance prediction
Review of recent advances in non-invasive hemoglobin estimation
PA Analog/Digital System: The Backbone of Modern Surveillance and Communication
Shreyas Phanse Resume: Experienced Backend Engineer | Java • Spring Boot • Ka...
Spectral efficient network and resource selection model in 5G networks

HDF Product Designer

  • 1. DM_PPT_NP_v01SESIP_0715_AJ HDF Product Designer Aleksandar Jelenak, H. Joe Lee, Ted Habermann Gerd Heber, John Readey, Joel Plutchak The HDF Group HDF Workshop @ 2015 ESIP Summer Meeting
  • 2. DM_PPT_NP_v01SESIP_0715_AJ 2 Data Producer’s Conundrum HDF Features Interoperability Project Requirements HDF Product Designer • Datatypes • Groups • Attributes • Dimension scales • Compression • Chunking • Scale/offset • Etc. • Conventions • Metadata • Software • netCDF • Science objectives • Data processing, discovery & distribution • Data documentation • User engagement, preparedness, feedback HDF Workshop @ 2015 ESIP Summer Meeting
  • 3. DM_PPT_NP_v01SESIP_0715_AJ Brief History • Original idea from Jeffrey Lee, who developed HDF5 Earth Science Builder/Creator toolset for the ICESat-2 mission • A similar tool was independently developed for the SMAP mission • The HDF Group was asked to generalize the concept • The outcome: HDF Product Designer HDF Workshop @ 2015 ESIP Summer Meeting
  • 4. DM_PPT_NP_v01SESIP_0715_AJ Key Goals • Facilitate creation of interoperable and standards-compliant data products in HDF5 as early as possible in the project development process • Support multiple computing platforms without requiring the full software stack of development tools and libraries • Easy and intuitive editing (create, update, move, copy, delete) of HDF5 objects • Collaborative approach to product design (project, team, organization) • Incorporation of best practices and standards from targeted data user communities • Integration of compliance and interoperability tests into the design workflow • Content import from existing files • Content export as HDF5 files, HDF5/JSON, or as source code in several programming languages HDF Workshop @ 2015 ESIP Summer Meeting
  • 5. DM_PPT_NP_v01SESIP_0715_AJ 5 System Architecture Data StoreDesktop Client HDF5 JSON Fortran IDL MATLAB Python RESTful Server CSV (Excel) HDF5 JSON HDF4 MAP XML NcML HDF5 Server HDF5 File Template Flexible Output HDF Workshop @ 2015 ESIP Summer Meeting
  • 6. DM_PPT_NP_v01SESIP_0715_AJ 6 Software Stack • Desktop Client • WxPython • CLIPS (C Language Integrated Production System) expert system • PyCLIPS • RESTful Server • Python/Tornado • h5py • Data Store • PostgreSQL relational database HDF Workshop @ 2015 ESIP Summer Meeting
  • 7. DM_PPT_NP_v01SESIP_0715_AJ Features • Projects • Designs • CRUD operations on HDF5 objects • Conventions support • Validation services • Collaborative workflow HDF Workshop @ 2015 ESIP Summer Meeting
  • 8. DM_PPT_NP_v01SESIP_0715_AJ Project • Organizational and collaborative space • One or more users • Zero or more designs • Every user must belong to at least one project • All members of a project has access to its designs • User project roles: – Manager (not used yet) – Designer – Value Editor (not used yet) – Viewer HDF Workshop @ 2015 ESIP Summer Meeting
  • 9. DM_PPT_NP_v01SESIP_0715_AJ Design • Represents content to be stored in one HDF5 file • Not actual HDF5 file • Versioned – Simple timeline of checkpoints (saved versions) – Each version must have unique label – Only the current working version (label: HEAD) can be edited • Import from: NcML (netCDF XML), HDF4 file content map (XML), HDF5/JSON • Export as: HDF5 template, HDF5/JSON; source code: Python, MATLAB, IDL, FORTRAN HDF Workshop @ 2015 ESIP Summer Meeting
  • 10. DM_PPT_NP_v01SESIP_0715_AJ CRUD Operations • CRUD = create, read, update, delete, copy, move • Available on designs and HDF5 objects • Support for HDF5 dimension scales continuously improves • Properties available to edit: – Datatype – Rank, shape, max/unlimited dimension sizes – Storage (compact, contiguous, chunked) – Fill value – Compression – Attribute value HDF Workshop @ 2015 ESIP Summer Meeting
  • 11. DM_PPT_NP_v01SESIP_0715_AJ Conventions • Supported: – NetCDF User Guide Attribute Conventions (NUG) – Attribute Convention for Data Discovery (ACDD) – Climate and Forecast convention (CF) – HDF-EOS (partial) • Implemented using the CLIPS expert system HDF Workshop @ 2015 ESIP Summer Meeting
  • 12. DM_PPT_NP_v01SESIP_0715_AJ Validation Services • A set of online services for interoperability testing • The level of support for conventions varies between different software tools so it is important to verify using actual file • Input is HDF5 template file • Output is typically displayed in a web browser HDF Workshop @ 2015 ESIP Summer Meeting
  • 13. DM_PPT_NP_v01SESIP_0715_AJ Validation Services • Currently available: – netCDF CDL – Get as netCDF3 file – CF (NCO’s ncdismember) – ACDD (THREDDS UDDC service) – ISO metadata (THREDDS ISO service) – OPeNDAP Data Access Form – THREDDS Dataset Access Page HDF Workshop @ 2015 ESIP Summer Meeting
  • 16. DM_PPT_NP_v01SESIP_0715_AJ User Resources • User Guide • Code is hosted in the NASA Earthdata Code Collaborative • Mailing list • Regular monthly meetings • Us! HDF Workshop @ 2015 ESIP Summer Meeting
  • 18. DM_PPT_NP_v01SESIP_0715_AJ Future Work? • Continue improving user interface, source code generators, … • Adding data to HDF5 templates for further validation tests • Generate Word-friendly product description to help with preparing required project documentation • Whole file convention compliance checks • User feedback always welcome and can influence planning! HDF Workshop @ 2015 ESIP Summer Meeting
  • 19. DM_PPT_NP_v01SESIP_0715_AJ 19 This work was supported under the NASA Earth Observing System Data and Information Systems (EOSDIS) Evolution and Development (EED) Program under prime contract number NNG10HP02C. Any opinions, findings, or conclusions expressed in this material are those of the author and do not necessarily reflect the views of NASA. Thank you! Questions? Contact: ajelenak@hdfgroup.org HDF Workshop @ 2015 ESIP Summer Meeting
  • 20. DM_PPT_NP_v01SESIP_0715_AJ 20 This work was supported by NASA/GSFC under Raytheon Co. contract number NNG10HP02C HDF Workshop @ 2015 ESIP Summer Meeting