SlideShare a Scribd company logo
Conf-DDDD-IN
Leveraging the Cloud for HDF1 Software Testing
2019 ESIP Summer Meeting
This work was supported by NASA/GSFC under Raytheon Co. contract number NNG15HZ39C.
This document does not contain technology or Technical Data controlled under either the U.S. International Traffic
in Arms Regulations or the U.S. Export Administration Regulations.
Larry Knox
EED-2 Software Test Engineer
lrknox@hdfgroup.org
Conf-DDDD-IN
2
• IBM ppc64 and SunOS 5.11 sparc servers that
allow us to test HDF software on big-endian
machines. Big-endian format is used in storing
NPP2 HDF5 files, and in HDF4 and NetCDF-33
file formats.
• Several severs running a variety of Windows
and CentOS Linux VM4s.
• A collection of assorted re-purposed laptops
and desktops running MacOS, Windows or
CentOS Linux.
Current in-house HDF test platforms
Conf-DDDD-IN
3
• Test system consists of:
– Shell scripts on Unix (mostly Linux) systems
– BuildBot master and workers running on
Windows and Linux VMs
• We test 3 HDF5 versions, HDF4 and a dozen
applications that use these libraries, with more
than 750 test configurations.
• Each HDF5 version has about 200 test
configurations; running all configurations to test
a code revision may take 16 hours over 2 days.
Current in-house HDF Daily Testing
Conf-DDDD-IN
4
• Cloud resources are used in conjunction with in-
house testing platforms to provide more test
coverage and provide more testing time.
• Users report problems with systems and distros
we do not have installed in-house.
• Cost of testing in the cloud may be more
economical than procuring additional test
servers and maintaining them on premises.
Why leverage the Cloud for HDF
software testing?
Conf-DDDD-IN
5
• Testing for issues on multiple distros
• Immediate availability
• No Overhead – pay only when in use
• No system maintenance
• Flexibility
• Consistent environment (for performance
testing)
More reasons to test in the Cloud
Conf-DDDD-IN
6
• Amazon Web Services (AWS) Spot
instances are being used for testing HDF5
on these Linux Distributions: Amazon Linux,
Centos, Debian, Fedora, SUSE and Ubuntu
• BuildBot launches spot instances
• AWS spot instances to run HDF5 build
and regression tests in ~30 minutes are
available at ~$.02 per hour
Linux Distributions Tested on AWS5
Conf-DDDD-IN
7
• Spot instances
– 75% – 85% discount from On-Demand price
in us-east-2 (Ohio).
– Charges for Linux spot instances are based
on usage (number of seconds).
– Prefer instance type suitable for testing that
has price history both low and stable. Spot
instances are subject to termination; stability
will minimize chances of termination.
AWS Cost Management
Conf-DDDD-IN
8
• BuildBot test results:
https://cdash-internal.hdfgroup.org/
• External test results:
https://cdash.hdfgroup.org/
• We intend to move all HDF daily tests to
BuildBot and Cdash.
Test results available on CDash
Conf-DDDD-IN
9
• Purpose: Track and quantify performance
improvements due to code change
• Limit effects often seen on busy systems
• Flexible cluster size
• H5cluster tool for HDF5 installs OrangeFS on
NVMe6 or SSD7 and launches cluster with spot
instances
Customized for HDF5 by Steven Varga,
http://vargaconsulting.ca/
Parallel Performance Testing
Conf-DDDD-IN
10
1. HDF - Hierarchical Data Format
2. NPP - National Polar-orbiting Partnership
3. netCDF-3 – Network Common Data
Format version 3
4. VM - Virtual Machine
5. AWS – Amazon Web Services
6. NVMe – non-volatile memory express
7. SSD – solid-state drive
Acronyms
Conf-DDDD-IN
11
This work was supported by NASA/GSFC under
Raytheon Co. contract number NNG15HZ39C.
in partnership with

More Related Content

PPTX
PPTX
Google Colaboratory for HDF-EOS
PPTX
Parallel Computing with HDF Server
PPSX
HDFEOS.org User Analsys, Updates, and Future
PPTX
MATLAB Modernization on HDF5 1.10
PPTX
HDF5 and Ecosystem: What Is New?
PDF
H5Coro: The Cloud-Optimized Read-Only Library
Google Colaboratory for HDF-EOS
Parallel Computing with HDF Server
HDFEOS.org User Analsys, Updates, and Future
MATLAB Modernization on HDF5 1.10
HDF5 and Ecosystem: What Is New?
H5Coro: The Cloud-Optimized Read-Only Library

What's hot (20)

PPTX
Hierarchical Data Formats (HDF) Update
PPT
Caching and Buffering in HDF5
PPTX
HDF Kita Lab: JupyterLab + HDF Service
PPTX
Product Designer Hub - Taking HPD to the Web
PPTX
HDF Product Designer: Using Templates to Achieve Interoperability
PPT
Status of HDF-EOS, Related Software and Tools
PPTX
Open-source Scientific Computing and Data Analytics using HDF
PPT
Status of HDF-EOS, Related Software and Tools
PDF
HDFCloud Workshop: HDF5 in the Cloud
PPSX
Data Are from Mars, Tools Are from Venus
PPT
Ensuring Long Term Access to Remotely Sensed HDF4 Data with Layout Maps
PPT
Support for NPP/NPOESS by The HDF Group
PPTX
STARE-PODS: A Versatile Data Store Leveraging the HDF Virtual Object Layer fo...
PPSX
GDAL Enhancement for ESDIS Project
PPTX
HDF4 Mapping Project Update
PPTX
HDF Project Status and Plans
PPT
Access HDF5 Datasets via OPeNDAP's Data Access Protocol (DAP)
PPTX
Tools to improve the usability of NASA HDF Data
PPTX
Hdf5 parallel
Hierarchical Data Formats (HDF) Update
Caching and Buffering in HDF5
HDF Kita Lab: JupyterLab + HDF Service
Product Designer Hub - Taking HPD to the Web
HDF Product Designer: Using Templates to Achieve Interoperability
Status of HDF-EOS, Related Software and Tools
Open-source Scientific Computing and Data Analytics using HDF
Status of HDF-EOS, Related Software and Tools
HDFCloud Workshop: HDF5 in the Cloud
Data Are from Mars, Tools Are from Venus
Ensuring Long Term Access to Remotely Sensed HDF4 Data with Layout Maps
Support for NPP/NPOESS by The HDF Group
STARE-PODS: A Versatile Data Store Leveraging the HDF Virtual Object Layer fo...
GDAL Enhancement for ESDIS Project
HDF4 Mapping Project Update
HDF Project Status and Plans
Access HDF5 Datasets via OPeNDAP's Data Access Protocol (DAP)
Tools to improve the usability of NASA HDF Data
Hdf5 parallel
Ad

Similar to Leveraging the Cloud for HDF Software Testing (20)

PPT
HDF Software Process - Lessons Learned & Success Factors
PDF
Testing Applications—For the Cloud and in the Cloud
PPTX
Gerrit + Jenkins = Continuous Delivery For Big Data
PPTX
PPTX
PPSX
Adding new servicees for HDF in THREDDS Data Server (TDS)
PDF
Modernizing Testing as Apps Re-Architect
PPTX
HDF for the Cloud - New HDF Server Features
PPTX
HDF - Current status and Future Directions
PPTX
HDF for the Cloud - Serverless HDF
HDF Software Process - Lessons Learned & Success Factors
Testing Applications—For the Cloud and in the Cloud
Gerrit + Jenkins = Continuous Delivery For Big Data
Adding new servicees for HDF in THREDDS Data Server (TDS)
Modernizing Testing as Apps Re-Architect
HDF for the Cloud - New HDF Server Features
HDF - Current status and Future Directions
HDF for the Cloud - Serverless HDF
Ad

More from The HDF-EOS Tools and Information Center (17)

PDF
HDF5 2.0: Cloud Optimized from the Start
PDF
Using a Hierarchical Data Format v5 file as Zarr v3 Shard
PDF
Cloud-Optimized HDF5 Files - Current Status
PDF
Cloud Optimized HDF5 for the ICESat-2 mission
PPTX
Access HDF Data in the Cloud via OPeNDAP Web Service
PPTX
Upcoming New HDF5 Features: Multi-threading, sparse data storage, and encrypt...
PPTX
The State of HDF5 / Dana Robinson / The HDF Group
PDF
Cloud-Optimized HDF5 Files
PDF
Accessing HDF5 data in the cloud with HSDS
PPTX
Highly Scalable Data Service (HSDS) Performance Features
PDF
Creating Cloud-Optimized HDF5 Files
PPTX
HDF5 OPeNDAP Handler Updates, and Performance Discussion
PPTX
Hyrax: Serving Data from S3
PPSX
Accessing Cloud Data and Services Using EDL, Pydap, MATLAB
PDF
HDF - Current status and Future Directions
PPSX
Apache Drill and Unidata THREDDS Data Server for NASA HDF-EOS on S3
PPTX
HDF-EOS Data Product Developer's Guide
HDF5 2.0: Cloud Optimized from the Start
Using a Hierarchical Data Format v5 file as Zarr v3 Shard
Cloud-Optimized HDF5 Files - Current Status
Cloud Optimized HDF5 for the ICESat-2 mission
Access HDF Data in the Cloud via OPeNDAP Web Service
Upcoming New HDF5 Features: Multi-threading, sparse data storage, and encrypt...
The State of HDF5 / Dana Robinson / The HDF Group
Cloud-Optimized HDF5 Files
Accessing HDF5 data in the cloud with HSDS
Highly Scalable Data Service (HSDS) Performance Features
Creating Cloud-Optimized HDF5 Files
HDF5 OPeNDAP Handler Updates, and Performance Discussion
Hyrax: Serving Data from S3
Accessing Cloud Data and Services Using EDL, Pydap, MATLAB
HDF - Current status and Future Directions
Apache Drill and Unidata THREDDS Data Server for NASA HDF-EOS on S3
HDF-EOS Data Product Developer's Guide

Recently uploaded (20)

PPTX
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
PDF
Agricultural_Statistics_at_a_Glance_2022_0.pdf
PDF
Spectral efficient network and resource selection model in 5G networks
DOCX
The AUB Centre for AI in Media Proposal.docx
PPT
“AI and Expert System Decision Support & Business Intelligence Systems”
PDF
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
PDF
Unlocking AI with Model Context Protocol (MCP)
PDF
cuic standard and advanced reporting.pdf
PPTX
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
PDF
MIND Revenue Release Quarter 2 2025 Press Release
PDF
Per capita expenditure prediction using model stacking based on satellite ima...
PDF
Encapsulation_ Review paper, used for researhc scholars
PPTX
sap open course for s4hana steps from ECC to s4
PPT
Teaching material agriculture food technology
PDF
Diabetes mellitus diagnosis method based random forest with bat algorithm
PPTX
Understanding_Digital_Forensics_Presentation.pptx
PDF
Empathic Computing: Creating Shared Understanding
PDF
KodekX | Application Modernization Development
PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
PDF
Network Security Unit 5.pdf for BCA BBA.
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
Agricultural_Statistics_at_a_Glance_2022_0.pdf
Spectral efficient network and resource selection model in 5G networks
The AUB Centre for AI in Media Proposal.docx
“AI and Expert System Decision Support & Business Intelligence Systems”
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
Unlocking AI with Model Context Protocol (MCP)
cuic standard and advanced reporting.pdf
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
MIND Revenue Release Quarter 2 2025 Press Release
Per capita expenditure prediction using model stacking based on satellite ima...
Encapsulation_ Review paper, used for researhc scholars
sap open course for s4hana steps from ECC to s4
Teaching material agriculture food technology
Diabetes mellitus diagnosis method based random forest with bat algorithm
Understanding_Digital_Forensics_Presentation.pptx
Empathic Computing: Creating Shared Understanding
KodekX | Application Modernization Development
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
Network Security Unit 5.pdf for BCA BBA.

Leveraging the Cloud for HDF Software Testing

  • 1. Conf-DDDD-IN Leveraging the Cloud for HDF1 Software Testing 2019 ESIP Summer Meeting This work was supported by NASA/GSFC under Raytheon Co. contract number NNG15HZ39C. This document does not contain technology or Technical Data controlled under either the U.S. International Traffic in Arms Regulations or the U.S. Export Administration Regulations. Larry Knox EED-2 Software Test Engineer lrknox@hdfgroup.org
  • 2. Conf-DDDD-IN 2 • IBM ppc64 and SunOS 5.11 sparc servers that allow us to test HDF software on big-endian machines. Big-endian format is used in storing NPP2 HDF5 files, and in HDF4 and NetCDF-33 file formats. • Several severs running a variety of Windows and CentOS Linux VM4s. • A collection of assorted re-purposed laptops and desktops running MacOS, Windows or CentOS Linux. Current in-house HDF test platforms
  • 3. Conf-DDDD-IN 3 • Test system consists of: – Shell scripts on Unix (mostly Linux) systems – BuildBot master and workers running on Windows and Linux VMs • We test 3 HDF5 versions, HDF4 and a dozen applications that use these libraries, with more than 750 test configurations. • Each HDF5 version has about 200 test configurations; running all configurations to test a code revision may take 16 hours over 2 days. Current in-house HDF Daily Testing
  • 4. Conf-DDDD-IN 4 • Cloud resources are used in conjunction with in- house testing platforms to provide more test coverage and provide more testing time. • Users report problems with systems and distros we do not have installed in-house. • Cost of testing in the cloud may be more economical than procuring additional test servers and maintaining them on premises. Why leverage the Cloud for HDF software testing?
  • 5. Conf-DDDD-IN 5 • Testing for issues on multiple distros • Immediate availability • No Overhead – pay only when in use • No system maintenance • Flexibility • Consistent environment (for performance testing) More reasons to test in the Cloud
  • 6. Conf-DDDD-IN 6 • Amazon Web Services (AWS) Spot instances are being used for testing HDF5 on these Linux Distributions: Amazon Linux, Centos, Debian, Fedora, SUSE and Ubuntu • BuildBot launches spot instances • AWS spot instances to run HDF5 build and regression tests in ~30 minutes are available at ~$.02 per hour Linux Distributions Tested on AWS5
  • 7. Conf-DDDD-IN 7 • Spot instances – 75% – 85% discount from On-Demand price in us-east-2 (Ohio). – Charges for Linux spot instances are based on usage (number of seconds). – Prefer instance type suitable for testing that has price history both low and stable. Spot instances are subject to termination; stability will minimize chances of termination. AWS Cost Management
  • 8. Conf-DDDD-IN 8 • BuildBot test results: https://cdash-internal.hdfgroup.org/ • External test results: https://cdash.hdfgroup.org/ • We intend to move all HDF daily tests to BuildBot and Cdash. Test results available on CDash
  • 9. Conf-DDDD-IN 9 • Purpose: Track and quantify performance improvements due to code change • Limit effects often seen on busy systems • Flexible cluster size • H5cluster tool for HDF5 installs OrangeFS on NVMe6 or SSD7 and launches cluster with spot instances Customized for HDF5 by Steven Varga, http://vargaconsulting.ca/ Parallel Performance Testing
  • 10. Conf-DDDD-IN 10 1. HDF - Hierarchical Data Format 2. NPP - National Polar-orbiting Partnership 3. netCDF-3 – Network Common Data Format version 3 4. VM - Virtual Machine 5. AWS – Amazon Web Services 6. NVMe – non-volatile memory express 7. SSD – solid-state drive Acronyms
  • 11. Conf-DDDD-IN 11 This work was supported by NASA/GSFC under Raytheon Co. contract number NNG15HZ39C. in partnership with