SlideShare a Scribd company logo
Integrating HDF5 with SRB
Object-level Access to Remote Files
Peter Cao, NCSA
December 1, 2005

Sponsored by NLADR, NFS PACI Project
in Support of NCSA-SDSC Collaboration

December 1, 2005

HDF & HDF-EOS Workshop IX
Outline
Introduction to the HDF-SRB project
The HDF-SRB model
SRB Support in HDFView

December 1, 2005

HDF & HDF-EOS Workshop IX

2/22
Overview of SRB
What is
SRB
The SDSC Storage Resource Broker (SRB)
is client-server middleware that provides a
uniform interface for connecting to
distributed data storage in multiple types
of storage resources.

December 1, 2005

HDF & HDF-EOS Workshop IX

3/22
Overview of SRB
Architecture
SRB Client

MCAT

SRB Server

DB2

ObjStore

HDF5

HPSS

Unitree

FTP

Distributed Storage Resources: database system, archival storage system, file system, ftp

December 1, 2005

HDF & HDF-EOS Workshop IX

4/22
Project Description
Motivation
SRB
Indexing and searching
Distributed data system
Access control

HDF5
Large and diverse data
High performance access
Interactive and subsetting

High performance distributed data system

December 1, 2005

HDF & HDF-EOS Workshop IX

5/22
Remote Data Access on SRB
Methods
Normal ways to access SRB:
Get the whole file: large files (100TB SCEC)
Use POSIX low level calls: low performance

New way:
Implement proxy operations to access objects
or parts of objects in one request

December 1, 2005

HDF & HDF-EOS Workshop IX

6/22
Project Description
Goals
Working prototype of client/server system for
object-level access to HDF5 stored in the SRB
Use SRB as middleware to transfer data between
the server and client
Use Object-level access for interactive and
efficient access to part of the file

December 1, 2005

HDF & HDF-EOS Workshop IX

7/22
Normal SRB File Access

Architecture

client
HDF5

HDF5 File
(whole file or a
sequence of bytes)

SRB Server

MCAT

December 1, 2005

HDF & HDF-EOS Workshop IX

8/22
Object-level File Access

Architecture

client

server
HDF5 Library
HDF5 Library

HDF5
HDF5 Object
(File, Group, Dataset,
Subset, Attribute)

HDF5 Object
(File, Group, Dataset,
Subset, Attribute)

SRB Server

HDF5-SRB Module
(pack/unpack messages)

HDF5-SRB Module
(pack/unpack messages)
MCAT

December 1, 2005

HDF & HDF-EOS Workshop IX

9/22
Examples of File Access
I need to see the eye of
Hurricane Bob!

HDF5

December 1, 2005

HDF & HDF-EOS Workshop IX

10/22
Examples of File Access
Whole file transfer

Get t
he fil
e

HDF5

December 1, 2005

HDF & HDF-EOS Workshop IX

11/22
Examples of File Access
Whole file
transfer

HDF5

HDF5

HDF5

HDF5

Transfer large image – slow!
December 1, 2005

HDF & HDF-EOS Workshop IX

12/22
Examples of File Access
SRB POSIX
API
I need to see the eye of
Hurricane Bob!

HDF5

December 1, 2005

HDF & HDF-EOS Workshop IX

13/22
Examples of File Access
SRB POSIX
API

Open file

find image file’s open
open image image found
image open

HDF5

Many small messages – slow and complex!
December 1, 2005

HDF & HDF-EOS Workshop IX

14/22
Examples of File Access
Object level

Get m
e the

ey e o

f hurr

icane

Bob

HDF5

December 1, 2005

HDF & HDF-EOS Workshop IX

15/22
Examples of File Access
Object level

Get m
e the

ey e o

f hurr

icane

Bob

HDF5

One request & small image – fast & simple!
December 1, 2005

HDF & HDF-EOS Workshop IX

16/22
HDF5-SRB Model
New objects/APIs
A new set data objects
H5File, H5Group, H5Dataset, H5Datatype, etc
Encapsulated client requests and server results

Enhanced SRB APIs
Pack/Unpack routines (exchange data between
byte stream and structure) to handle complicated
struct – string, pointers, pointers to arrays, arrays
of pointers, etc
New srbGenProxyFunct (general Proxy Function)
handles other types of request besides HDF5

December 1, 2005

HDF & HDF-EOS Workshop IX

17/22
HDF5-SRB Model
Data Flow
Client API

Server API

srbObjRequest(void *obj, int objID)

srbObjProcess(void *obj, int objID)

g()

Ms
g()

kM
s

5O
bj

::o
p

H5
Ob

je
c

t

()

un
p

ack

pac

HDF5 Library

2.

()
sg

6.

kM
ac

()
sg
kM
ac

p
1.

p
un
7.

3.
H

5.

4. Access file

srbGenProxyFunct
HDF5 file

SRB Server

December 1, 2005

HDF & HDF-EOS Workshop IX

18/22
Runing Server/Client
A SRB server that supports HDF5
HDF5 library and other external libraries (SZIP, ZLIB)
A SRB version 3.4 or later from http://www.sdsc.edu/srb/
Follow instruction on how to run SRB server from UG packed
with SRB source release or online at
http://hdf.ncsa.uiuc.edu/hdf-srb-html/HDF-SRB-UG.html

Any client application that implements HDF5-SRB
Objects
No HDF5 library is required on the client
Example client application: HDFView 2.3 or above

December 1, 2005

HDF & HDF-EOS Workshop IX

19/22
Short Demo

HDFView

Support Windows and Linux
May support SGI, AIX if there is new funding

December 1, 2005

HDF & HDF-EOS Workshop IX

20/22
Future Work?
Writing capabilities: the current HDF-SRB implementation only
supports read-only operations. We propose to add write
functionality so that users will be able to create new files and data
objects, and to modify data content and attributes.
Better support for complex datatypes such as compound datatype
and variable length datatype
Support for HDF5 indexing and ingesting HDF5 metadata, which
will enable users to access HDF5 objects directly through MCAT
Support for files across different servers (or SRB federation)
More features for HDFView: compared to its local file-handling
capabilities, the current version of HDFView has very limited
features for remote file access on an SRB server.

December 1, 2005

HDF & HDF-EOS Workshop IX

21/22
Question / Comments?

December 1, 2005

HDF & HDF-EOS Workshop IX

22/22

More Related Content

PPTX
HDF5 OPeNDAP project update and demo
PPT
HDFView and HDF Java Products
PPTX
Hierarchical Data Formats (HDF) Update
PPTX
DalmatinerDB and cockroachDB monitoring plataform
PPTX
RedisGraph 2.0 by Pieter Cailliau - Redis Day Bangalore 2020
PDF
On demand file-caching_-_gustavo_brand
ODP
Kkeithley ufonfs-gluster summit
HDF5 OPeNDAP project update and demo
HDFView and HDF Java Products
Hierarchical Data Formats (HDF) Update
DalmatinerDB and cockroachDB monitoring plataform
RedisGraph 2.0 by Pieter Cailliau - Redis Day Bangalore 2020
On demand file-caching_-_gustavo_brand
Kkeithley ufonfs-gluster summit

What's hot (20)

PPTX
Real time Analytics Using Storm and Kafka
PDF
Unifying Frontend and Backend Development with Scala - ScalaCon 2021
PDF
Expert Roundtable: The Future of Metadata After Hive Metastore
ODP
Join the super_colony_-_feb2013
PPTX
Into the cold - Object Storage in SWITCHengines
ODP
Developing apps and_integrating_with_gluster_fs_-_libgfapi
PDF
Bitsy graph database
PDF
20160401 guster-roadmap
PPTX
HDF Update for DAAC Managers (2017-02-27)
PDF
lakeFS Community Call no. 2
PPT
PDF
CEPH DAY BERLIN - WHAT'S NEW IN CEPH
PPT
The New HDF-EOS WebSite - How it can help you
PDF
Building Distributed Systems With Riak and Riak Core
PDF
How To Use Scala At Work - Airframe In Action at Arm Treasure Data
PDF
InfluxDB 2.0: Dashboarding 101 by David G. Simmons
PDF
InfluxDB 2.0 Client Libraries by Noah Crowley
PDF
Gluster d2
PPTX
A First Look at HPCC Systems 7.0, Innovation in Action
PDF
Cassandra Summit 2015 - Building a multi-tenant API PaaS with DataStax Enterp...
Real time Analytics Using Storm and Kafka
Unifying Frontend and Backend Development with Scala - ScalaCon 2021
Expert Roundtable: The Future of Metadata After Hive Metastore
Join the super_colony_-_feb2013
Into the cold - Object Storage in SWITCHengines
Developing apps and_integrating_with_gluster_fs_-_libgfapi
Bitsy graph database
20160401 guster-roadmap
HDF Update for DAAC Managers (2017-02-27)
lakeFS Community Call no. 2
CEPH DAY BERLIN - WHAT'S NEW IN CEPH
The New HDF-EOS WebSite - How it can help you
Building Distributed Systems With Riak and Riak Core
How To Use Scala At Work - Airframe In Action at Arm Treasure Data
InfluxDB 2.0: Dashboarding 101 by David G. Simmons
InfluxDB 2.0 Client Libraries by Noah Crowley
Gluster d2
A First Look at HPCC Systems 7.0, Innovation in Action
Cassandra Summit 2015 - Building a multi-tenant API PaaS with DataStax Enterp...
Ad

Similar to HDF5 and Storage Resource Broker (20)

PPT
Integrating HDF5 with SRB
PPT
HDF OPeNDAP project update and demo
PPT
Ensuring Long Term Access to Remotely Sensed HDF4 Data with Layout Maps
PPTX
Introduction to HDF5 Data and Programming Models
PPT
Hdf5 intro
PPT
HDF Status and Development
PPTX
The State of HDF5 / Dana Robinson / The HDF Group
PDF
Parallel HDF5 Introductory Tutorial
PPT
Support for NPP/NPOESS by The HDF Group
PPTX
HDF5 and Ecosystem: What Is New?
PDF
HDF-EOS Subsetting: HEW and other tools
Integrating HDF5 with SRB
HDF OPeNDAP project update and demo
Ensuring Long Term Access to Remotely Sensed HDF4 Data with Layout Maps
Introduction to HDF5 Data and Programming Models
Hdf5 intro
HDF Status and Development
The State of HDF5 / Dana Robinson / The HDF Group
Parallel HDF5 Introductory Tutorial
Support for NPP/NPOESS by The HDF Group
HDF5 and Ecosystem: What Is New?
HDF-EOS Subsetting: HEW and other tools
Ad

More from The HDF-EOS Tools and Information Center (20)

PDF
HDF5 2.0: Cloud Optimized from the Start
PDF
Using a Hierarchical Data Format v5 file as Zarr v3 Shard
PDF
Cloud-Optimized HDF5 Files - Current Status
PDF
Cloud Optimized HDF5 for the ICESat-2 mission
PPTX
Access HDF Data in the Cloud via OPeNDAP Web Service
PPTX
Upcoming New HDF5 Features: Multi-threading, sparse data storage, and encrypt...
PDF
Cloud-Optimized HDF5 Files
PDF
Accessing HDF5 data in the cloud with HSDS
PPTX
Highly Scalable Data Service (HSDS) Performance Features
PDF
Creating Cloud-Optimized HDF5 Files
PPTX
HDF5 OPeNDAP Handler Updates, and Performance Discussion
PPTX
Hyrax: Serving Data from S3
PPSX
Accessing Cloud Data and Services Using EDL, Pydap, MATLAB
PDF
HDF - Current status and Future Directions
PPSX
HDFEOS.org User Analsys, Updates, and Future
PPTX
HDF - Current status and Future Directions
PDF
H5Coro: The Cloud-Optimized Read-Only Library
PPTX
MATLAB Modernization on HDF5 1.10
PPTX
HDF for the Cloud - Serverless HDF
HDF5 2.0: Cloud Optimized from the Start
Using a Hierarchical Data Format v5 file as Zarr v3 Shard
Cloud-Optimized HDF5 Files - Current Status
Cloud Optimized HDF5 for the ICESat-2 mission
Access HDF Data in the Cloud via OPeNDAP Web Service
Upcoming New HDF5 Features: Multi-threading, sparse data storage, and encrypt...
Cloud-Optimized HDF5 Files
Accessing HDF5 data in the cloud with HSDS
Highly Scalable Data Service (HSDS) Performance Features
Creating Cloud-Optimized HDF5 Files
HDF5 OPeNDAP Handler Updates, and Performance Discussion
Hyrax: Serving Data from S3
Accessing Cloud Data and Services Using EDL, Pydap, MATLAB
HDF - Current status and Future Directions
HDFEOS.org User Analsys, Updates, and Future
HDF - Current status and Future Directions
H5Coro: The Cloud-Optimized Read-Only Library
MATLAB Modernization on HDF5 1.10
HDF for the Cloud - Serverless HDF

Recently uploaded (20)

PDF
Diabetes mellitus diagnosis method based random forest with bat algorithm
PDF
NewMind AI Weekly Chronicles - August'25 Week I
PPTX
Digital-Transformation-Roadmap-for-Companies.pptx
PDF
Advanced methodologies resolving dimensionality complications for autism neur...
PDF
Approach and Philosophy of On baking technology
PPTX
Big Data Technologies - Introduction.pptx
PPTX
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
PDF
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
PDF
Bridging biosciences and deep learning for revolutionary discoveries: a compr...
PDF
Reach Out and Touch Someone: Haptics and Empathic Computing
PDF
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
PDF
Review of recent advances in non-invasive hemoglobin estimation
PDF
Network Security Unit 5.pdf for BCA BBA.
PDF
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
PDF
NewMind AI Monthly Chronicles - July 2025
PDF
Unlocking AI with Model Context Protocol (MCP)
PDF
Electronic commerce courselecture one. Pdf
PDF
Encapsulation theory and applications.pdf
PDF
Modernizing your data center with Dell and AMD
PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
Diabetes mellitus diagnosis method based random forest with bat algorithm
NewMind AI Weekly Chronicles - August'25 Week I
Digital-Transformation-Roadmap-for-Companies.pptx
Advanced methodologies resolving dimensionality complications for autism neur...
Approach and Philosophy of On baking technology
Big Data Technologies - Introduction.pptx
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
Bridging biosciences and deep learning for revolutionary discoveries: a compr...
Reach Out and Touch Someone: Haptics and Empathic Computing
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
Review of recent advances in non-invasive hemoglobin estimation
Network Security Unit 5.pdf for BCA BBA.
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
NewMind AI Monthly Chronicles - July 2025
Unlocking AI with Model Context Protocol (MCP)
Electronic commerce courselecture one. Pdf
Encapsulation theory and applications.pdf
Modernizing your data center with Dell and AMD
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf

HDF5 and Storage Resource Broker

  • 1. Integrating HDF5 with SRB Object-level Access to Remote Files Peter Cao, NCSA December 1, 2005 Sponsored by NLADR, NFS PACI Project in Support of NCSA-SDSC Collaboration December 1, 2005 HDF & HDF-EOS Workshop IX
  • 2. Outline Introduction to the HDF-SRB project The HDF-SRB model SRB Support in HDFView December 1, 2005 HDF & HDF-EOS Workshop IX 2/22
  • 3. Overview of SRB What is SRB The SDSC Storage Resource Broker (SRB) is client-server middleware that provides a uniform interface for connecting to distributed data storage in multiple types of storage resources. December 1, 2005 HDF & HDF-EOS Workshop IX 3/22
  • 4. Overview of SRB Architecture SRB Client MCAT SRB Server DB2 ObjStore HDF5 HPSS Unitree FTP Distributed Storage Resources: database system, archival storage system, file system, ftp December 1, 2005 HDF & HDF-EOS Workshop IX 4/22
  • 5. Project Description Motivation SRB Indexing and searching Distributed data system Access control HDF5 Large and diverse data High performance access Interactive and subsetting High performance distributed data system December 1, 2005 HDF & HDF-EOS Workshop IX 5/22
  • 6. Remote Data Access on SRB Methods Normal ways to access SRB: Get the whole file: large files (100TB SCEC) Use POSIX low level calls: low performance New way: Implement proxy operations to access objects or parts of objects in one request December 1, 2005 HDF & HDF-EOS Workshop IX 6/22
  • 7. Project Description Goals Working prototype of client/server system for object-level access to HDF5 stored in the SRB Use SRB as middleware to transfer data between the server and client Use Object-level access for interactive and efficient access to part of the file December 1, 2005 HDF & HDF-EOS Workshop IX 7/22
  • 8. Normal SRB File Access Architecture client HDF5 HDF5 File (whole file or a sequence of bytes) SRB Server MCAT December 1, 2005 HDF & HDF-EOS Workshop IX 8/22
  • 9. Object-level File Access Architecture client server HDF5 Library HDF5 Library HDF5 HDF5 Object (File, Group, Dataset, Subset, Attribute) HDF5 Object (File, Group, Dataset, Subset, Attribute) SRB Server HDF5-SRB Module (pack/unpack messages) HDF5-SRB Module (pack/unpack messages) MCAT December 1, 2005 HDF & HDF-EOS Workshop IX 9/22
  • 10. Examples of File Access I need to see the eye of Hurricane Bob! HDF5 December 1, 2005 HDF & HDF-EOS Workshop IX 10/22
  • 11. Examples of File Access Whole file transfer Get t he fil e HDF5 December 1, 2005 HDF & HDF-EOS Workshop IX 11/22
  • 12. Examples of File Access Whole file transfer HDF5 HDF5 HDF5 HDF5 Transfer large image – slow! December 1, 2005 HDF & HDF-EOS Workshop IX 12/22
  • 13. Examples of File Access SRB POSIX API I need to see the eye of Hurricane Bob! HDF5 December 1, 2005 HDF & HDF-EOS Workshop IX 13/22
  • 14. Examples of File Access SRB POSIX API Open file find image file’s open open image image found image open HDF5 Many small messages – slow and complex! December 1, 2005 HDF & HDF-EOS Workshop IX 14/22
  • 15. Examples of File Access Object level Get m e the ey e o f hurr icane Bob HDF5 December 1, 2005 HDF & HDF-EOS Workshop IX 15/22
  • 16. Examples of File Access Object level Get m e the ey e o f hurr icane Bob HDF5 One request & small image – fast & simple! December 1, 2005 HDF & HDF-EOS Workshop IX 16/22
  • 17. HDF5-SRB Model New objects/APIs A new set data objects H5File, H5Group, H5Dataset, H5Datatype, etc Encapsulated client requests and server results Enhanced SRB APIs Pack/Unpack routines (exchange data between byte stream and structure) to handle complicated struct – string, pointers, pointers to arrays, arrays of pointers, etc New srbGenProxyFunct (general Proxy Function) handles other types of request besides HDF5 December 1, 2005 HDF & HDF-EOS Workshop IX 17/22
  • 18. HDF5-SRB Model Data Flow Client API Server API srbObjRequest(void *obj, int objID) srbObjProcess(void *obj, int objID) g() Ms g() kM s 5O bj ::o p H5 Ob je c t () un p ack pac HDF5 Library 2. () sg 6. kM ac () sg kM ac p 1. p un 7. 3. H 5. 4. Access file srbGenProxyFunct HDF5 file SRB Server December 1, 2005 HDF & HDF-EOS Workshop IX 18/22
  • 19. Runing Server/Client A SRB server that supports HDF5 HDF5 library and other external libraries (SZIP, ZLIB) A SRB version 3.4 or later from http://www.sdsc.edu/srb/ Follow instruction on how to run SRB server from UG packed with SRB source release or online at http://hdf.ncsa.uiuc.edu/hdf-srb-html/HDF-SRB-UG.html Any client application that implements HDF5-SRB Objects No HDF5 library is required on the client Example client application: HDFView 2.3 or above December 1, 2005 HDF & HDF-EOS Workshop IX 19/22
  • 20. Short Demo HDFView Support Windows and Linux May support SGI, AIX if there is new funding December 1, 2005 HDF & HDF-EOS Workshop IX 20/22
  • 21. Future Work? Writing capabilities: the current HDF-SRB implementation only supports read-only operations. We propose to add write functionality so that users will be able to create new files and data objects, and to modify data content and attributes. Better support for complex datatypes such as compound datatype and variable length datatype Support for HDF5 indexing and ingesting HDF5 metadata, which will enable users to access HDF5 objects directly through MCAT Support for files across different servers (or SRB federation) More features for HDFView: compared to its local file-handling capabilities, the current version of HDFView has very limited features for remote file access on an SRB server. December 1, 2005 HDF & HDF-EOS Workshop IX 21/22
  • 22. Question / Comments? December 1, 2005 HDF & HDF-EOS Workshop IX 22/22