SlideShare a Scribd company logo
The HDF Group

Ensuring Long Term Access to
Remotely Sensed HDF4 Data
with Layout Maps
Ruth Duerr, NSIDC
Christopher Lynnes, GES DISC
Mike Folk, Kent Yang, Peter Cao, The HDF Group
November 3-5,
2009

HDF/HDF-EOS Workshop XIII

1 www.hdfgroup.org
HDF4 files are complex

November 3-5,
2009

HDF/HDF-EOS Workshop XIII

2

www.hdfgroup.org
How do we save HDF users
from having to deal with all of
the complexity under the
hood?

November 3-5,
2009

HDF/HDF-EOS Workshop XIII

3

www.hdfgroup.org
Through the HDF software
libraries, either by using the
HDF APIs directly or by using
HDF tools that depend on the
HDF libraries.
But what about the future…
November 3-5,
2009

HDF/HDF-EOS Workshop XIII

4

www.hdfgroup.org
There is a risk in depending solely
on HDF libraries to access HDFformatted data over the long term.
It is possible, especially in the
distant future, that the libraries may
not be available.
November 3-5,
2009

HDF/HDF-EOS Workshop XIII

5

www.hdfgroup.org
“If only we could read HDF data with an
independent program that does not rely on
the HDF API…
A possible approach [would be to create] a
map of a data file, [and] utilities to find,
assemble and write out SDSes and vdatas.”
“Leveraging HDF Utilities”
Christopher Lynnes
HDF Workshop X.
November 3-5,
2009

HDF/HDF-EOS Workshop XIII

6

www.hdfgroup.org
User’s view of the HDF4 SD model

November 3-5,
2009

HDF/HDF-EOS Workshop XIII

7

www.hdfgroup.org
Mapping SDS to file offset/length

HDF4 file
layout
November 3-5,
2009

HDF/HDF-EOS Workshop XIII

8

www.hdfgroup.org
Mapping with chunks

HDF4 file
layout
November 3-5,
2009

HDF/HDF-EOS Workshop XIII

9

www.hdfgroup.org
Recap
• Problem
• The complex byte layout of HDF files makes
long-term readability of HDF data dependent
on long-term availability HDF software.

• Solution
• Create a map of the layout of data objects in
an HDF file, allowing a simple reader to be
written to access the data.

November 3-5,
2009

HDF/HDF-EOS Workshop XIII

10

www.hdfgroup.org
The HDF Group

The project – phase 1
2007-2008

November 3-5,
2009

HDF/HDF-EOS Workshop XIII

11 www.hdfgroup.org
HDF4 mapping project activities
1. Assess and categorize HDF4 data held by NASA
• Determine what types of objects to map.
• Get an idea of the magnitude of the project.

1. Develop prototype for proof of concept
• Develop markup-language based layout
specification.
• Develop tool to produce layout for an HDF4 file.
• Develop and test two independent tools to read
HDF4 data based solely on the map files
November 3-5,
2009

HDF/HDF-EOS Workshop XIII

12

www.hdfgroup.org
How many HDF4 products?
Data Center
ASF

HDF4 Products
0

GES-DISC
GHRC

54

ASDC

63

LP-DAAC

67

NSIDC

47

ORNL-DAAC

2

PO.DAAC

22

SDAC

0

MrDC

95

Total
November 3-5,
HDF/HDF-EOS
2009
Workshop XIII

236

586

13

www.hdfgroup.org
Data characteristics
Product Characteristics Examined
•

Product Identification
•
•
•
•

•
•

HDF-EOS version
For point data
•
•

•

•

•

Number of swaths
Maximum number of dimensions
Organized by time, space, both, or other
Whether dimension maps were used

For gridded data
•
•
•
•

Number of grids
Max number of dimensions in a grid
Number of projections used
Whether any grids were indexed

HDF Version

•

•

Number of SDSs
Maximum number of dimensions
Did any SDS have attributes
Was any SDS annotated
Were dimension scales used
Was compression used and if so what
kind
Was chunking used

For Vdata
•
•
•
•
•

November 3-5,
HDF/HDF-EOS
2009
Workshop XIII

Number of 8-bit rasters
Number of 24-bit rasters
Number of general rasters
Whether any rasters had attributes
Whether any rasters were compressed
Whether any rasters were chunked
Whether there were any palettes

For SDS data
•
•
•
•
•
•

Number of point data sets
Maximum number of levels

For swath data
•
•
•
•

For raster data
•
•
•
•
•
•
•

Product Name
Data Level
Archive Location
Product Version

Whether the product was multi-file
For HDF-EOS products
•
•

•

•

Number of Vdata structures
Did any Vdata have attributes
Did any Vdata fields have attributes
Was compression used and if so what
kind
Was chunking used

14

www.hdfgroup.org
HDF4 mapping prototype workflow

HDF4 File
HDF4 File
“H4.hdf”
“H4.hdf”

hmap
hmap
linked with
linked with
HDF4 library
HDF4 library

HDF4 Mapping File
HDF4 Mapping File
(XML document)
(XML document)
“H4.hdf.map.xml”
“H4.hdf.map.xml”

Groups, Data Objects,
Structural and Application
Metadata;
Locations of Object Data

Object Data

Reader 1
Reader 2
2
(C program)
(Perl Script)
(Perl Script)

November 3-5,
2009

HDF/HDF-EOS Workshop XIII

15

www.hdfgroup.org
The HDF Group

Phase 2: 2009-2011
Productizing HDF4
Mapping schema and
tools for deployment
November 3-5,
2009

HDF/HDF-EOS Workshop XIII

16

www.hdfgroup.org
Phase 2 tasks
• Revise schema
• Investigate integration of mapping schema with
existing standards
• Analyze what’s needed to include HDF-EOS 2
• Revise the XML schema

•
•
•
•

Implement production quality HDF4 map writer
Develop demo HDF4 map reader
Deploy
Optional tasks
• Implement general purpose reader
• Develop validation utilities

November 3-5,
2009

HDF/HDF-EOS Workshop XIII

17

www.hdfgroup.org
How you can help
• Project page at The HDF Group website:
• http://www.hdfgroup.org/projects/hdf4mapping/

• Consider what it might take to implement this
for your archive - contact us if you’d like
support
• Let us know if you are interested in
participating in any capacity.

November 3-5,
2009

HDF/HDF-EOS Workshop XIII

18

www.hdfgroup.org
The HDF Group

Thank You!

November 3-5, 2009

HDF/HDF-EOS Workshop XIII

19

www.hdfgroup.org
Acknowledgements
This work was supported by cooperative agreement
number NNX08AO77A from the National
Aeronautics and Space Administration (NASA).
Any opinions, findings, conclusions, or
recommendations expressed in this material are
those of the author[s] and do not necessarily reflect
the views of the National Aeronautics and Space
Administration.

November 3-5, 2009

HDF/HDF-EOS Workshop XIII

20

www.hdfgroup.org
The HDF Group

Questions/comments?

November 3-5, 2009

HDF/HDF-EOS Workshop XIII

21

www.hdfgroup.org

More Related Content

PPTX
Parallel HDF5 Developments
PPTX
Easy Access of NASA HDF data via OPeNDAP
PPTX
Easy Remote Access Via OPeNDAP
PPTX
Hierarchical Data Formats (HDF) Update
PPTX
Improving long-term preservation of EOS data by independently mapping HDF4 da...
PPT
HDF-EOS 2/5 to netCDF Converter
Parallel HDF5 Developments
Easy Access of NASA HDF data via OPeNDAP
Easy Remote Access Via OPeNDAP
Hierarchical Data Formats (HDF) Update
Improving long-term preservation of EOS data by independently mapping HDF4 da...
HDF-EOS 2/5 to netCDF Converter

What's hot (20)

PPTX
Interoperability with netCDF-4 - Experience with NPP and HDF-EOS5 products
PPTX
HDF Product Designer: Using Templates to Achieve Interoperability
PPTX
Exploiting HDF5 Technologies to Represent Geo-Information-An Example with Com...
PPTX
HDF Group Support for NPP/NPOESS/JPSS
PPTX
Open-source Scientific Computing and Data Analytics using HDF
PPTX
Product Designer Hub - Taking HPD to the Web
PPTX
HDF and netCDF Data Support in ArcGIS
PPTX
HDF & HDF-EOS Data & Support at NSIDC
PPT
PPTX
Efficiently serving HDF5 via OPeNDAP
PPTX
HDF Update for DAAC Managers (2017-02-27)
PPSX
NASA HDF/HDF-EOS Data for Dummies (and Developers)
PPTX
Support for NPP/NPOESS/JPSS by The HDF Group
PPT
HDF-EOS to GeoTIFF Conversion Tool & HDF-EOS Plug-in for HDFView
PPTX
Moving form HDF4 to HDF5/netCDF-4
PPT
The New HDF-EOS WebSite - How it can help you
Interoperability with netCDF-4 - Experience with NPP and HDF-EOS5 products
HDF Product Designer: Using Templates to Achieve Interoperability
Exploiting HDF5 Technologies to Represent Geo-Information-An Example with Com...
HDF Group Support for NPP/NPOESS/JPSS
Open-source Scientific Computing and Data Analytics using HDF
Product Designer Hub - Taking HPD to the Web
HDF and netCDF Data Support in ArcGIS
HDF & HDF-EOS Data & Support at NSIDC
Efficiently serving HDF5 via OPeNDAP
HDF Update for DAAC Managers (2017-02-27)
NASA HDF/HDF-EOS Data for Dummies (and Developers)
Support for NPP/NPOESS/JPSS by The HDF Group
HDF-EOS to GeoTIFF Conversion Tool & HDF-EOS Plug-in for HDFView
Moving form HDF4 to HDF5/netCDF-4
The New HDF-EOS WebSite - How it can help you
Ad

Similar to Ensuring Long Term Access to Remotely Sensed HDF4 Data with Layout Maps (20)

PPTX
HDF4 Mapping Project Update
PPT
HDF OPeNDAP project update and demo
PPT
HDF Status and Development
PPT
Transitions from HDF4 to HDF5: Issues
PPTX
Introduction to HDF5 Data and Programming Models
PPT
Support for NPP/NPOESS by The HDF Group
PPT
Hdf5 intro
PPSX
Adding new servicees for HDF in THREDDS Data Server (TDS)
PPT
Integrating HDF5 with SRB
PPTX
HDF5 OPeNDAP project update and demo
HDF4 Mapping Project Update
HDF OPeNDAP project update and demo
HDF Status and Development
Transitions from HDF4 to HDF5: Issues
Introduction to HDF5 Data and Programming Models
Support for NPP/NPOESS by The HDF Group
Hdf5 intro
Adding new servicees for HDF in THREDDS Data Server (TDS)
Integrating HDF5 with SRB
HDF5 OPeNDAP project update and demo
Ad

More from The HDF-EOS Tools and Information Center (20)

PDF
HDF5 2.0: Cloud Optimized from the Start
PDF
Using a Hierarchical Data Format v5 file as Zarr v3 Shard
PDF
Cloud-Optimized HDF5 Files - Current Status
PDF
Cloud Optimized HDF5 for the ICESat-2 mission
PPTX
Access HDF Data in the Cloud via OPeNDAP Web Service
PPTX
Upcoming New HDF5 Features: Multi-threading, sparse data storage, and encrypt...
PPTX
The State of HDF5 / Dana Robinson / The HDF Group
PDF
Cloud-Optimized HDF5 Files
PDF
Accessing HDF5 data in the cloud with HSDS
PPTX
Highly Scalable Data Service (HSDS) Performance Features
PDF
Creating Cloud-Optimized HDF5 Files
PPTX
HDF5 OPeNDAP Handler Updates, and Performance Discussion
PPTX
Hyrax: Serving Data from S3
PPSX
Accessing Cloud Data and Services Using EDL, Pydap, MATLAB
PDF
HDF - Current status and Future Directions
PPSX
HDFEOS.org User Analsys, Updates, and Future
PPTX
HDF - Current status and Future Directions
PDF
H5Coro: The Cloud-Optimized Read-Only Library
PPTX
MATLAB Modernization on HDF5 1.10
HDF5 2.0: Cloud Optimized from the Start
Using a Hierarchical Data Format v5 file as Zarr v3 Shard
Cloud-Optimized HDF5 Files - Current Status
Cloud Optimized HDF5 for the ICESat-2 mission
Access HDF Data in the Cloud via OPeNDAP Web Service
Upcoming New HDF5 Features: Multi-threading, sparse data storage, and encrypt...
The State of HDF5 / Dana Robinson / The HDF Group
Cloud-Optimized HDF5 Files
Accessing HDF5 data in the cloud with HSDS
Highly Scalable Data Service (HSDS) Performance Features
Creating Cloud-Optimized HDF5 Files
HDF5 OPeNDAP Handler Updates, and Performance Discussion
Hyrax: Serving Data from S3
Accessing Cloud Data and Services Using EDL, Pydap, MATLAB
HDF - Current status and Future Directions
HDFEOS.org User Analsys, Updates, and Future
HDF - Current status and Future Directions
H5Coro: The Cloud-Optimized Read-Only Library
MATLAB Modernization on HDF5 1.10

Recently uploaded (20)

PDF
Unlocking AI with Model Context Protocol (MCP)
PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
PPTX
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
PDF
Encapsulation theory and applications.pdf
PPTX
sap open course for s4hana steps from ECC to s4
PDF
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
PDF
Mobile App Security Testing_ A Comprehensive Guide.pdf
PDF
KodekX | Application Modernization Development
DOCX
The AUB Centre for AI in Media Proposal.docx
PPTX
Spectroscopy.pptx food analysis technology
PDF
Review of recent advances in non-invasive hemoglobin estimation
PDF
Agricultural_Statistics_at_a_Glance_2022_0.pdf
PDF
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
PPTX
Understanding_Digital_Forensics_Presentation.pptx
PDF
Empathic Computing: Creating Shared Understanding
PPT
Teaching material agriculture food technology
PPTX
ACSFv1EN-58255 AWS Academy Cloud Security Foundations.pptx
PDF
MIND Revenue Release Quarter 2 2025 Press Release
PPTX
Digital-Transformation-Roadmap-for-Companies.pptx
PDF
Reach Out and Touch Someone: Haptics and Empathic Computing
Unlocking AI with Model Context Protocol (MCP)
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
Encapsulation theory and applications.pdf
sap open course for s4hana steps from ECC to s4
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
Mobile App Security Testing_ A Comprehensive Guide.pdf
KodekX | Application Modernization Development
The AUB Centre for AI in Media Proposal.docx
Spectroscopy.pptx food analysis technology
Review of recent advances in non-invasive hemoglobin estimation
Agricultural_Statistics_at_a_Glance_2022_0.pdf
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
Understanding_Digital_Forensics_Presentation.pptx
Empathic Computing: Creating Shared Understanding
Teaching material agriculture food technology
ACSFv1EN-58255 AWS Academy Cloud Security Foundations.pptx
MIND Revenue Release Quarter 2 2025 Press Release
Digital-Transformation-Roadmap-for-Companies.pptx
Reach Out and Touch Someone: Haptics and Empathic Computing

Ensuring Long Term Access to Remotely Sensed HDF4 Data with Layout Maps

  • 1. The HDF Group Ensuring Long Term Access to Remotely Sensed HDF4 Data with Layout Maps Ruth Duerr, NSIDC Christopher Lynnes, GES DISC Mike Folk, Kent Yang, Peter Cao, The HDF Group November 3-5, 2009 HDF/HDF-EOS Workshop XIII 1 www.hdfgroup.org
  • 2. HDF4 files are complex November 3-5, 2009 HDF/HDF-EOS Workshop XIII 2 www.hdfgroup.org
  • 3. How do we save HDF users from having to deal with all of the complexity under the hood? November 3-5, 2009 HDF/HDF-EOS Workshop XIII 3 www.hdfgroup.org
  • 4. Through the HDF software libraries, either by using the HDF APIs directly or by using HDF tools that depend on the HDF libraries. But what about the future… November 3-5, 2009 HDF/HDF-EOS Workshop XIII 4 www.hdfgroup.org
  • 5. There is a risk in depending solely on HDF libraries to access HDFformatted data over the long term. It is possible, especially in the distant future, that the libraries may not be available. November 3-5, 2009 HDF/HDF-EOS Workshop XIII 5 www.hdfgroup.org
  • 6. “If only we could read HDF data with an independent program that does not rely on the HDF API… A possible approach [would be to create] a map of a data file, [and] utilities to find, assemble and write out SDSes and vdatas.” “Leveraging HDF Utilities” Christopher Lynnes HDF Workshop X. November 3-5, 2009 HDF/HDF-EOS Workshop XIII 6 www.hdfgroup.org
  • 7. User’s view of the HDF4 SD model November 3-5, 2009 HDF/HDF-EOS Workshop XIII 7 www.hdfgroup.org
  • 8. Mapping SDS to file offset/length HDF4 file layout November 3-5, 2009 HDF/HDF-EOS Workshop XIII 8 www.hdfgroup.org
  • 9. Mapping with chunks HDF4 file layout November 3-5, 2009 HDF/HDF-EOS Workshop XIII 9 www.hdfgroup.org
  • 10. Recap • Problem • The complex byte layout of HDF files makes long-term readability of HDF data dependent on long-term availability HDF software. • Solution • Create a map of the layout of data objects in an HDF file, allowing a simple reader to be written to access the data. November 3-5, 2009 HDF/HDF-EOS Workshop XIII 10 www.hdfgroup.org
  • 11. The HDF Group The project – phase 1 2007-2008 November 3-5, 2009 HDF/HDF-EOS Workshop XIII 11 www.hdfgroup.org
  • 12. HDF4 mapping project activities 1. Assess and categorize HDF4 data held by NASA • Determine what types of objects to map. • Get an idea of the magnitude of the project. 1. Develop prototype for proof of concept • Develop markup-language based layout specification. • Develop tool to produce layout for an HDF4 file. • Develop and test two independent tools to read HDF4 data based solely on the map files November 3-5, 2009 HDF/HDF-EOS Workshop XIII 12 www.hdfgroup.org
  • 13. How many HDF4 products? Data Center ASF HDF4 Products 0 GES-DISC GHRC 54 ASDC 63 LP-DAAC 67 NSIDC 47 ORNL-DAAC 2 PO.DAAC 22 SDAC 0 MrDC 95 Total November 3-5, HDF/HDF-EOS 2009 Workshop XIII 236 586 13 www.hdfgroup.org
  • 14. Data characteristics Product Characteristics Examined • Product Identification • • • • • • HDF-EOS version For point data • • • • • Number of swaths Maximum number of dimensions Organized by time, space, both, or other Whether dimension maps were used For gridded data • • • • Number of grids Max number of dimensions in a grid Number of projections used Whether any grids were indexed HDF Version • • Number of SDSs Maximum number of dimensions Did any SDS have attributes Was any SDS annotated Were dimension scales used Was compression used and if so what kind Was chunking used For Vdata • • • • • November 3-5, HDF/HDF-EOS 2009 Workshop XIII Number of 8-bit rasters Number of 24-bit rasters Number of general rasters Whether any rasters had attributes Whether any rasters were compressed Whether any rasters were chunked Whether there were any palettes For SDS data • • • • • • Number of point data sets Maximum number of levels For swath data • • • • For raster data • • • • • • • Product Name Data Level Archive Location Product Version Whether the product was multi-file For HDF-EOS products • • • • Number of Vdata structures Did any Vdata have attributes Did any Vdata fields have attributes Was compression used and if so what kind Was chunking used 14 www.hdfgroup.org
  • 15. HDF4 mapping prototype workflow HDF4 File HDF4 File “H4.hdf” “H4.hdf” hmap hmap linked with linked with HDF4 library HDF4 library HDF4 Mapping File HDF4 Mapping File (XML document) (XML document) “H4.hdf.map.xml” “H4.hdf.map.xml” Groups, Data Objects, Structural and Application Metadata; Locations of Object Data Object Data Reader 1 Reader 2 2 (C program) (Perl Script) (Perl Script) November 3-5, 2009 HDF/HDF-EOS Workshop XIII 15 www.hdfgroup.org
  • 16. The HDF Group Phase 2: 2009-2011 Productizing HDF4 Mapping schema and tools for deployment November 3-5, 2009 HDF/HDF-EOS Workshop XIII 16 www.hdfgroup.org
  • 17. Phase 2 tasks • Revise schema • Investigate integration of mapping schema with existing standards • Analyze what’s needed to include HDF-EOS 2 • Revise the XML schema • • • • Implement production quality HDF4 map writer Develop demo HDF4 map reader Deploy Optional tasks • Implement general purpose reader • Develop validation utilities November 3-5, 2009 HDF/HDF-EOS Workshop XIII 17 www.hdfgroup.org
  • 18. How you can help • Project page at The HDF Group website: • http://www.hdfgroup.org/projects/hdf4mapping/ • Consider what it might take to implement this for your archive - contact us if you’d like support • Let us know if you are interested in participating in any capacity. November 3-5, 2009 HDF/HDF-EOS Workshop XIII 18 www.hdfgroup.org
  • 19. The HDF Group Thank You! November 3-5, 2009 HDF/HDF-EOS Workshop XIII 19 www.hdfgroup.org
  • 20. Acknowledgements This work was supported by cooperative agreement number NNX08AO77A from the National Aeronautics and Space Administration (NASA). Any opinions, findings, conclusions, or recommendations expressed in this material are those of the author[s] and do not necessarily reflect the views of the National Aeronautics and Space Administration. November 3-5, 2009 HDF/HDF-EOS Workshop XIII 20 www.hdfgroup.org
  • 21. The HDF Group Questions/comments? November 3-5, 2009 HDF/HDF-EOS Workshop XIII 21 www.hdfgroup.org

Editor's Notes

  • #5: Full quote, from proposal: Through the HDF software libraries, either by using the HDF APIs directly or by using HDF tools that depend on the HDF libraries. However there is a risk in depending solely on the HDF libraries to access HDF-formatted data over the long term. It is possible, especially in the distant future, that the libraries may not be as readily available as they are today. To address this risk, it is desirable to have a way to retrieve the data independently. At the 10th HDF workshop, Christopher Lynnes of the Goddard Earth Sciences Data and Information Services Center(GES DISC) addressed this need: “If only we could read HDF data with an independent program that does not rely on the HDF API… A possible approach [would be to] extend” hdfls to print a hierarchical map of a data file, [and] write ncdump/hdp-like utilities to find, assemble and write out SDSes and vdatas.” “Leveraging HDF Utilities,” Christopher Lynnes, 10th HDF Workshop. http://www.hdfeos.org/workshops/ws10/presentations/day3/Leveraging_HDF_Utilities.ppt.
  • #13: An XML-based prototype schema for HDF4 mapping files (XML documents) was created. For a given binary HDF4 file, an associated mapping file contains structural and application metadata for the HDF4 file, as well as the locations of the object data (array element values) in the HDF4 file. A tool was written to generate mapping files. Other tools were developed that use the mapping files to read HDF4 files without calling the HDF4 library, confirming the approach is viable. While the focus of this effort was NASA EOSDIS data stored in HDF4 files, the general methodology is also relevant to other cases where the long-term accessibility of data stored in binary files is of concern. In addition, this work demonstrates how binary HDF files can be used to efficiently store large volumes of scientific data that is referenced by text-based XML documents (the mapping files).