This document discusses strategies for storing and accessing HDF5 data files in cloud object storage like Amazon S3. It describes an HDF5 Virtual File Driver (VFD) developed by The HDF Group that allows reading HDF5 files directly from S3 without downloading. For better performance, the document recommends optimizing HDF5 files stored in S3 by using chunking, compression, and aggregating smaller files. It also introduces the HDF Cloud Schema which maps HDF5 objects to individual object storage objects for parallel access.
Related topics: