SlideShare a Scribd company logo
Conf-DDDD-IN
The State of HDF
Summer ESIP 2023
This work was supported by NASA/GSFC under Raytheon Technologies contract number 80GSFC21CA001.
This document does not contain technology or Technical Data controlled under either the U.S. International Traffic
in Arms Regulations or the U.S. Export Administration Regulations.
Dana Robinson
Director of Software Engineering
NASA EED-3 / The HDF Group
derobins@hdfgroup.org
Conf-DDDD-IN
2
• About us
• Software status
• Current focus
• How we can help each other
Outline
Conf-DDDD-IN
3
About us
Conf-DDDD-IN
4
• Located in Champaign, IL
• Spun off from NCSA in 2006
• Non-profit 501(c)(3)
• ~25 employees
The HDF Group
Conf-DDDD-IN
5
• Mission-driven
– Sustainable development of HDF
technologies
– Guarantee continual accessibility of HDF
data
• Services
– Maintain and develop HDF products
– Consulting and support contracts
– Training
The HDF Group
Conf-DDDD-IN
6
• HDF5 1.0.0 was released in 1998
• What will the next 25 years bring?
https://forum.hdfgroup.org/t/what-do-you-want-to-see-in-hdf5-2-0/10003
25 years of HDF5!
🎂 🎉
Conf-DDDD-IN
7
Software Status
Conf-DDDD-IN
8
• HDF5
• HDF4
• HDFView
Release schedules
Conf-DDDD-IN
9
• Latest: 1.14.1(May 2023)
• 1.8 branch retired early this year
• 1.10 and 1.12 retiring this year
• Have not scheduled 2024 yet
HDF5 Schedule
Conf-DDDD-IN
10
• 1.14.0
– Multi-dataset I/O
– Selection I/O
– Subfiling
– Onion VFD
• 1.14.1
– Maintenance release
– bugfixes, minor features
HDF5 New Features
https://www.hdfgroup.org/2023/05/release-of-hdf5-1-14-1-newsletter-194/
Conf-DDDD-IN
11
• 1.14.2
– Read-only S3 VFD improvements
• Better logging
• Temporary security credential support
– CVE-free
HDF5 New Features
Conf-DDDD-IN
12
• Latest: 4.2.16-2 (June 2023)
– Patch release (fixes a Java issue w/ HDFView)
– Bugfixes
– Build system improvements
HDF4 Schedule
https://www.hdfgroup.org/2023/07/release-of-hdf-4-2-16-2-a-patch-release-newsletter-195/
Conf-DDDD-IN
13
HDF4 Changes
https://github.com/HDFGroup/hdf4/discussions
Conf-DDDD-IN
14
As always, we:
• Strive to maintain API compatibility
– HDF5's compatibility macro scheme, etc.
• Are committed to file format forward and
backward compatibility
– Should always be able to create files in
earlier formats
– Should always be able to read earlier file
formats
HDF Product Change Policy
Conf-DDDD-IN
15
Before implementing a breaking change, we
will:
• Create a discussion post on GitHub
• Create a post in the HDF Forum
• Make an announcement in the newsletter
• Attempt to contact people who we know
might be affected
• Allow at least 30 days for comments
HDF Product Change Policy
https://forum.hdfgroup.org/t/hdf4-change-procedure/11240
Conf-DDDD-IN
16
• Latest: HDFView 3.3.0 (April 2023)
• Based on:
– HDF5 1.14.1
– HDF4 4.2.16 (4.2.16-2 coming soon)
• Releases based on older versions of
HDF5 (e.g., 1.10) have been retired
• Looking to modernize or replace this!
HDFView
https://www.hdfgroup.org/2023/04/release-of-hdfview-3-3-0-newsletter-193/
Conf-DDDD-IN
17
Current Focus
Conf-DDDD-IN
18
• Improve software quality
• Increase transparency
• Strengthen our community
• Modernize HDF Products
What are we working on?
Conf-DDDD-IN
19
HDF Product Development
HDF5
HDF4
Conf-DDDD-IN
20
HDF Product Development
HDFView
HSDS
Conf-DDDD-IN
21
• All CVE issues fixed
– https://github.com/HDFGroup/cve_hdf5
– 3 currently unfixed
• Testing improvements
– Integration testing w/ key products (h5py, etc.)
– More transparency w/ CDash
• Resolve unfixed GitHub issues
• Code cleanup
Software quality
Conf-DDDD-IN
22
HDF5, although open source, grew up in a
walled garden and this mindset persists
I'm working to fix this!
The goal is to do all product-specific (e.g.,
HDF5) work with the community
Transparency
Conf-DDDD-IN
23
• HDF5 Working Group meeting
– Every Thursday at 10:05 central time
– Email me (derobins@hdfgroup.org) for an
invite
– Covers pull requests, issues, tech discussion
• Most project planning moving to GitHub
• More testing via GitHub actions
– Will use public CDash for non-GitHub tests
Transparency
Conf-DDDD-IN
24
• Aforementioned HDF5 WG meeting
• Everything in the transparency slides,
really
• Adding external people as code owners
• Spending more time connecting with our
users, both new and old
Community
Conf-DDDD-IN
25
• Some of our key software has been
around for a long time!
– HDF(4): 35 years (1988)
– HDF5: 25 years (1998)
– HDFView: 16 years (2007)
• Although we strive to keep our software
up to date, the code is still old and could
use an overhaul
Modernization
Conf-DDDD-IN
26
• Better support for new compilers
– Especially Intel's oneAPI
• Better support for AI/ML workflows
• Better cloud integration
• Better support for heterogeneous
computing
• Windows Unicode support
Modernization
Conf-DDDD-IN
27
• Better variable-length support
• Multithreading
• Support for sparse data
• Improve performance
• Internal cleanup
– Easier to modify & debug
– Refactor code that made sense in 1998 but
not so much now
Modernization
Conf-DDDD-IN
28
How can we help each other?
Conf-DDDD-IN
29
Support our non-profit mission
Contact: info@hdfgroup.org
https://www.hdfgroup.org/donate
Conf-DDDD-IN
30
• US HUG
– August 16-18, 2023
– The Ohio State University - Columbus, OH
– https://www.hdfgroup.org/hug/hug23/
• European HUG (focus on
compression)
– September 19-21, 2023
– DESY - Hamburg, Germany
– https://indico.desy.de/event/39343/
HDF User Group Meetings
Conf-DDDD-IN
31
Thanks for your time!
Conf-DDDD-IN
32
This work was supported by NASA/GSFC under
Raytheon Technologies contract number
80GSFC21CA001.

More Related Content

Similar to The State of HDF (20)

PPTX
Hierarchical Data Formats (HDF) Update
PPTX
PPT
HDF Status and Development
PPT
HDF Software Process - Lessons Learned & Success Factors
PPTX
HDF5 and Ecosystem: What Is New?
PPTX
PPTX
HDF Tools Updates and Discussions
PPT
HDF OPeNDAP project update and demo
PPTX
HDF - Current status and Future Directions
PDF
Transition from HDF4 to HDF5
Ad

More from The HDF-EOS Tools and Information Center (20)

PDF
HDF5 2.0: Cloud Optimized from the Start
PDF
Using a Hierarchical Data Format v5 file as Zarr v3 Shard
PDF
Cloud-Optimized HDF5 Files - Current Status
PDF
Cloud Optimized HDF5 for the ICESat-2 mission
PPTX
Access HDF Data in the Cloud via OPeNDAP Web Service
PPTX
Upcoming New HDF5 Features: Multi-threading, sparse data storage, and encrypt...
PDF
Cloud-Optimized HDF5 Files
PDF
Accessing HDF5 data in the cloud with HSDS
PPTX
Highly Scalable Data Service (HSDS) Performance Features
PDF
Creating Cloud-Optimized HDF5 Files
PPTX
HDF5 OPeNDAP Handler Updates, and Performance Discussion
PPTX
Hyrax: Serving Data from S3
PPSX
Accessing Cloud Data and Services Using EDL, Pydap, MATLAB
PDF
HDF - Current status and Future Directions
PPSX
HDFEOS.org User Analsys, Updates, and Future
PDF
H5Coro: The Cloud-Optimized Read-Only Library
PPTX
MATLAB Modernization on HDF5 1.10
PPTX
HDF for the Cloud - Serverless HDF
PPTX
HDF for the Cloud - New HDF Server Features
HDF5 2.0: Cloud Optimized from the Start
Using a Hierarchical Data Format v5 file as Zarr v3 Shard
Cloud-Optimized HDF5 Files - Current Status
Cloud Optimized HDF5 for the ICESat-2 mission
Access HDF Data in the Cloud via OPeNDAP Web Service
Upcoming New HDF5 Features: Multi-threading, sparse data storage, and encrypt...
Cloud-Optimized HDF5 Files
Accessing HDF5 data in the cloud with HSDS
Highly Scalable Data Service (HSDS) Performance Features
Creating Cloud-Optimized HDF5 Files
HDF5 OPeNDAP Handler Updates, and Performance Discussion
Hyrax: Serving Data from S3
Accessing Cloud Data and Services Using EDL, Pydap, MATLAB
HDF - Current status and Future Directions
HDFEOS.org User Analsys, Updates, and Future
H5Coro: The Cloud-Optimized Read-Only Library
MATLAB Modernization on HDF5 1.10
HDF for the Cloud - Serverless HDF
HDF for the Cloud - New HDF Server Features
Ad

Recently uploaded (20)

PDF
Getting started with AI Agents and Multi-Agent Systems
PDF
From MVP to Full-Scale Product A Startup’s Software Journey.pdf
PPTX
Group 1 Presentation -Planning and Decision Making .pptx
PDF
DP Operators-handbook-extract for the Mautical Institute
PPTX
observCloud-Native Containerability and monitoring.pptx
PDF
A comparative study of natural language inference in Swahili using monolingua...
PDF
Video forgery: An extensive analysis of inter-and intra-frame manipulation al...
PDF
Microsoft Solutions Partner Drive Digital Transformation with D365.pdf
PDF
Architecture types and enterprise applications.pdf
PDF
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
PDF
TrustArc Webinar - Click, Consent, Trust: Winning the Privacy Game
PDF
Hindi spoken digit analysis for native and non-native speakers
PDF
NewMind AI Weekly Chronicles – August ’25 Week III
PPTX
1. Introduction to Computer Programming.pptx
PDF
Web App vs Mobile App What Should You Build First.pdf
PPTX
OMC Textile Division Presentation 2021.pptx
PDF
Hybrid model detection and classification of lung cancer
PPTX
cloud_computing_Infrastucture_as_cloud_p
PDF
Univ-Connecticut-ChatGPT-Presentaion.pdf
PPTX
MicrosoftCybserSecurityReferenceArchitecture-April-2025.pptx
Getting started with AI Agents and Multi-Agent Systems
From MVP to Full-Scale Product A Startup’s Software Journey.pdf
Group 1 Presentation -Planning and Decision Making .pptx
DP Operators-handbook-extract for the Mautical Institute
observCloud-Native Containerability and monitoring.pptx
A comparative study of natural language inference in Swahili using monolingua...
Video forgery: An extensive analysis of inter-and intra-frame manipulation al...
Microsoft Solutions Partner Drive Digital Transformation with D365.pdf
Architecture types and enterprise applications.pdf
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
TrustArc Webinar - Click, Consent, Trust: Winning the Privacy Game
Hindi spoken digit analysis for native and non-native speakers
NewMind AI Weekly Chronicles – August ’25 Week III
1. Introduction to Computer Programming.pptx
Web App vs Mobile App What Should You Build First.pdf
OMC Textile Division Presentation 2021.pptx
Hybrid model detection and classification of lung cancer
cloud_computing_Infrastucture_as_cloud_p
Univ-Connecticut-ChatGPT-Presentaion.pdf
MicrosoftCybserSecurityReferenceArchitecture-April-2025.pptx

The State of HDF

  • 1. Conf-DDDD-IN The State of HDF Summer ESIP 2023 This work was supported by NASA/GSFC under Raytheon Technologies contract number 80GSFC21CA001. This document does not contain technology or Technical Data controlled under either the U.S. International Traffic in Arms Regulations or the U.S. Export Administration Regulations. Dana Robinson Director of Software Engineering NASA EED-3 / The HDF Group derobins@hdfgroup.org
  • 2. Conf-DDDD-IN 2 • About us • Software status • Current focus • How we can help each other Outline
  • 4. Conf-DDDD-IN 4 • Located in Champaign, IL • Spun off from NCSA in 2006 • Non-profit 501(c)(3) • ~25 employees The HDF Group
  • 5. Conf-DDDD-IN 5 • Mission-driven – Sustainable development of HDF technologies – Guarantee continual accessibility of HDF data • Services – Maintain and develop HDF products – Consulting and support contracts – Training The HDF Group
  • 6. Conf-DDDD-IN 6 • HDF5 1.0.0 was released in 1998 • What will the next 25 years bring? https://forum.hdfgroup.org/t/what-do-you-want-to-see-in-hdf5-2-0/10003 25 years of HDF5! 🎂 🎉
  • 8. Conf-DDDD-IN 8 • HDF5 • HDF4 • HDFView Release schedules
  • 9. Conf-DDDD-IN 9 • Latest: 1.14.1(May 2023) • 1.8 branch retired early this year • 1.10 and 1.12 retiring this year • Have not scheduled 2024 yet HDF5 Schedule
  • 10. Conf-DDDD-IN 10 • 1.14.0 – Multi-dataset I/O – Selection I/O – Subfiling – Onion VFD • 1.14.1 – Maintenance release – bugfixes, minor features HDF5 New Features https://www.hdfgroup.org/2023/05/release-of-hdf5-1-14-1-newsletter-194/
  • 11. Conf-DDDD-IN 11 • 1.14.2 – Read-only S3 VFD improvements • Better logging • Temporary security credential support – CVE-free HDF5 New Features
  • 12. Conf-DDDD-IN 12 • Latest: 4.2.16-2 (June 2023) – Patch release (fixes a Java issue w/ HDFView) – Bugfixes – Build system improvements HDF4 Schedule https://www.hdfgroup.org/2023/07/release-of-hdf-4-2-16-2-a-patch-release-newsletter-195/
  • 14. Conf-DDDD-IN 14 As always, we: • Strive to maintain API compatibility – HDF5's compatibility macro scheme, etc. • Are committed to file format forward and backward compatibility – Should always be able to create files in earlier formats – Should always be able to read earlier file formats HDF Product Change Policy
  • 15. Conf-DDDD-IN 15 Before implementing a breaking change, we will: • Create a discussion post on GitHub • Create a post in the HDF Forum • Make an announcement in the newsletter • Attempt to contact people who we know might be affected • Allow at least 30 days for comments HDF Product Change Policy https://forum.hdfgroup.org/t/hdf4-change-procedure/11240
  • 16. Conf-DDDD-IN 16 • Latest: HDFView 3.3.0 (April 2023) • Based on: – HDF5 1.14.1 – HDF4 4.2.16 (4.2.16-2 coming soon) • Releases based on older versions of HDF5 (e.g., 1.10) have been retired • Looking to modernize or replace this! HDFView https://www.hdfgroup.org/2023/04/release-of-hdfview-3-3-0-newsletter-193/
  • 18. Conf-DDDD-IN 18 • Improve software quality • Increase transparency • Strengthen our community • Modernize HDF Products What are we working on?
  • 21. Conf-DDDD-IN 21 • All CVE issues fixed – https://github.com/HDFGroup/cve_hdf5 – 3 currently unfixed • Testing improvements – Integration testing w/ key products (h5py, etc.) – More transparency w/ CDash • Resolve unfixed GitHub issues • Code cleanup Software quality
  • 22. Conf-DDDD-IN 22 HDF5, although open source, grew up in a walled garden and this mindset persists I'm working to fix this! The goal is to do all product-specific (e.g., HDF5) work with the community Transparency
  • 23. Conf-DDDD-IN 23 • HDF5 Working Group meeting – Every Thursday at 10:05 central time – Email me (derobins@hdfgroup.org) for an invite – Covers pull requests, issues, tech discussion • Most project planning moving to GitHub • More testing via GitHub actions – Will use public CDash for non-GitHub tests Transparency
  • 24. Conf-DDDD-IN 24 • Aforementioned HDF5 WG meeting • Everything in the transparency slides, really • Adding external people as code owners • Spending more time connecting with our users, both new and old Community
  • 25. Conf-DDDD-IN 25 • Some of our key software has been around for a long time! – HDF(4): 35 years (1988) – HDF5: 25 years (1998) – HDFView: 16 years (2007) • Although we strive to keep our software up to date, the code is still old and could use an overhaul Modernization
  • 26. Conf-DDDD-IN 26 • Better support for new compilers – Especially Intel's oneAPI • Better support for AI/ML workflows • Better cloud integration • Better support for heterogeneous computing • Windows Unicode support Modernization
  • 27. Conf-DDDD-IN 27 • Better variable-length support • Multithreading • Support for sparse data • Improve performance • Internal cleanup – Easier to modify & debug – Refactor code that made sense in 1998 but not so much now Modernization
  • 28. Conf-DDDD-IN 28 How can we help each other?
  • 29. Conf-DDDD-IN 29 Support our non-profit mission Contact: info@hdfgroup.org https://www.hdfgroup.org/donate
  • 30. Conf-DDDD-IN 30 • US HUG – August 16-18, 2023 – The Ohio State University - Columbus, OH – https://www.hdfgroup.org/hug/hug23/ • European HUG (focus on compression) – September 19-21, 2023 – DESY - Hamburg, Germany – https://indico.desy.de/event/39343/ HDF User Group Meetings
  • 32. Conf-DDDD-IN 32 This work was supported by NASA/GSFC under Raytheon Technologies contract number 80GSFC21CA001.