SlideShare a Scribd company logo
Proactive Data Containers (PDC): An Object-centric Data
Store for Large-scale Computing Systems
Suren Byna
Lawrence Berkeley National Lab (LBNL), Berkeley
Co-authors
Quincey Koziol (LBNL), Venkat Vishwanath (ANL), Jerome Soumagne (THG), Houjun Tang (LBNL),
Kimmy Mu (THG), Bin Dong (LBNL), Richard Warren (THG), François Tessier (ANL, now @ CSCS),
Teng Wang (LBNL), and Jialin Liu (LBNL)
▪  Extreme parallelism

▪  Massive Data

▪  Hierarchical storage

Scalable data management – Three disrupting trends
2
3
Extreme parallelism
Summit, ORNL Sierra, LLNL Sunway Taihulight
NSC Wuxi, China
Trinity, LANL Cori, LBNL
Summit
- ~2.4M cores
- ~143 PFlops
- 9.7 MW
Sierra
- ~1.5M cores
- ~94 PFlops
- 7.4 MW
Taihulight
- ~10.6M cores
- ~93 PFlops
- 15 MW
§ Simulations
–  Multi-physics (FLASH) – 10 PB
–  Cosmology (NyX) – 10 PB
–  Plasma physics (VPIC) – 1 PB
§ Experimental and observational
data (EOD)
–  LHC (100 PB),
–  LSST (60 PB),
–  Genomics (100 TB to 1 PB)
Massive scientific data
FLASH
NyX
VPIC
4
LHC
LSST
Genomics
Hierarchical and heterogeneous storage
5
IO Gap
Memory
Parallel file system
(Lustre, GPFS)
Archival Storage (HPSS
tape)
IO Gap
Shared burst buffer
Memory
Parallel file system
(Lustre, GPFS)
Archival Storage (HPSS
tape)
Memory
Parallel file system
Archival storage (HPSS
tape)
Shared burst buffer
Node-local storage
Conventional
Current
Eg. Cori @ NERSC Upcoming
Campaign storage
Reading and writing data on scalable systems
6
▪  Types of parallel I/O
•  1 writer/reader, 1 file
•  N writers/readers, N files (File-per-process)
•  N writers/readers, 1 file
•  M writers/readers, 1 file
–  Aggregators
–  Two-phase I/O
•  M aggregators, M files (file-per-aggregator)
–  Variations of this mode
P0 P1 Pn-1 Pn
…
file.0
1 Writer/Reader, 1 File
P0 P1 Pn-1 Pn
…
file.0
n Writers/Readers, n Files
file.1 file.n-1 file.n
P0 P1 Pn-1 Pn
…
n Writers/Readers, 1 File
File.1
P0 P1 Pn-1 Pn
…
file.0
M Writers/Readers, M Files
file.m
P0 P1 Pn-1 Pn
…
M Writers/Readers, 1 File
File.1
Scalable Storage Systems: Challenges
7
Memory
Disk-based storage
Archival storage (HPSS
tape)
Shared burst buffer
Hardware
Node-local storage
Campaign storage
Software
High-level I/O lib
(netCDF, HDF5, etc.)
IO middleware
(POSIX, MPI-IO)
IO forwarding
Parallel file
systems
Applications
Usage
… Data (in memory)
IO software
… Files in file system
•  Challenges
–  POSIX-IO semantics hinder scalability and performance of file systems and IO software
–  Multi-level hierarchy complicates data movement, especially if user has to be involved
Tune middleware
Tune file systems
Scalable data management requirements
Use case Domain Sim/EOD/
analysis
Data size I/O Requirements
FLASH High-energy density
physics
Simulation ~1PB Data transformations, scalable I/O
interfaces, correlation among simulation
and experimental data
CMB / Planck Cosmology Simulation, EOD/
Analysis
10PB Automatic data movement optimizations
DECam & LSST Cosmology EOD/Analysis ~10TB Easy interfaces, data transformations
E3SM Climate Simulation ~10PB Async I/O, derived variables, automatic
data movement
TECA Climate Analysis ~10PB Data organization and efficient data
movement
HipMer Genomics EOD/Analysis ~100TB Scalable I/O interfaces, efficient and
automatic data movement
8
Easy interfaces and superior performance
Transparent data management
Information capture and management
8
Next Gen Storage – Proactive Data Containers (PDC)
Memory
Disk-based storage
Archival storage (HPSS
tape)
Shared burst buffer
Hardware
Node-local storage
Campaign storage
Software
High-level API Applications
Usage
… Data (in memory)
9
▪  Object-centric data access interface
§  Simple put, get interface
§  Array-based variable access
▪  Transparent data management
§  Data placement in storage hierarchy
§  Automatic data movement
▪  Information capture and
management
§  Rich metadata
§  Connection of results and raw data with
relationships
Persistent Storage API
BB FS Lustre DAOS
…
PDC System – High-level Architecture
10
▪ Object-level interface
–  Create – containers and objects
–  Add attributes
–  Put object
–  Get object
–  Delete object
▪  Array-specific interface
–  Create regions
–  Map regions in PDC objects
–  Lock
–  Release
11
Object-centric PDC Interface
J. Mu, J. Soumagne, et al., “A Transparent Server-managed Object Storage
System for HPC”, IEEE Cluster 2018
Proactive Data Container
Container
Dataset
KV-Store
Group
<root>
A B C
D E F
PDC Locus
Dataset
KV-Store
Group
Container
Collection
Locus
Container: X
<root>
A B C
D E F
Container: W
<root>
A B C
D E F
Container: Z
<root>
A B C
D E F
Collection: P
Collection: Q
PDC Collection
Container: X
<root>
A B C
D E F
Container: W
<root>
A B C
D E F
Container: Z
<root>
A B C
D E F
Container: Y
<root>
A B C
D E F
Proactive Data Container
Container
<root>
A B C
D E F
Key
▪ Object-level interface
–  Create – containers and objects
–  Add attributes
–  Put object
–  Get object
–  Delete object
▪  Array-specific interface
–  Create regions
–  Map regions in PDC objects
–  Lock
–  Release
12
Object-centric PDC Interface
J. Mu, J. Soumagne, et al., “A Transparent Server-managed Object Storage
System for HPC”, IEEE Cluster 2018
▪ Usage of compute resources for I/O
–  Shared mode – Compute nodes are shared
between applications and I/O services
–  Dedicated mode – I/O services on separate
nodes
▪  Transparent data movement by PDC
servers
–  Apps map data buffers to objects and PDC
servers place and manage data
–  Apps query for data objects using attributes
▪  Superior I/O performance
13
Transparent data movement in storage hierarchy
H. Tang, S. Byna, et al., “Toward Scalable and Asynchronous Object-centric Data Management for HPC”,
IEEE/ACM CCGrid 2018
0
350
700
1050
124 248 496 992 1984 3968 7936 15872
Time	in	seconds
Number	of	processes
HDF5	read		(Lustre) PLFS	read		(Lustre)
PDC	read		(Lustre) HDF5	read		(BB)
PDC	read		(BB)
0
250
500
750
124 248 496 992 1984 3968 7936 15872
Time	in	seconds
Number	of	processes
HDF5	write		(Lustre) PLFS	write		(Lustre)
PDC	write		(Lustre) HDF5	write		(BB)
PDC	write		(BB)
▪ Flat name space
▪ Rich metadata
–  Pre-defined tags that includes
provenance
–  User-defined tags for capturing
relationships between data objects
▪  Distributed in memory metadata
management
–  Distributed hash table and bloom
filters used for faster access
14
Metadata management
H. Tang, S. Byna, et al., “SoMeta: Scalable Object-centric Metadata Management for High Performance
Computing”, to be presented at IEEE Cluster 2017
▪ Take home message
–  Scalable storage systems impacted by:
•  Extreme level of parallelism
•  Massive amounts of scientific data
•  Transforming storage architectures
–  Proactive data containers
•  Object-centric interfaces
•  Transparent data movement in storage hierarchies
•  Scalable management of extensive metadata
15
Conclusions
16
Thanks
https://sdm.lbl.gov/pdc
Contact: Suren Byna (SByna@lbl.gov)

More Related Content

PDF
07 data structures_and_representations
PDF
04 open source_tools
PPT
Data Grid Taxonomies
PDF
Resilient Distributed Datasets
PPTX
Pilot Project for HDF5 Metadata Structures for SWOT
PPTX
ICESat-2 Metadata and Status
PDF
Databases and how to choose them
PPT
Many Task Applications for Grids and Supercomputers
07 data structures_and_representations
04 open source_tools
Data Grid Taxonomies
Resilient Distributed Datasets
Pilot Project for HDF5 Metadata Structures for SWOT
ICESat-2 Metadata and Status
Databases and how to choose them
Many Task Applications for Grids and Supercomputers

What's hot (20)

PPTX
SPD and KEA: HDF5 based file formats for Earth Observation
PPTX
Transient and persistent RDF views over relational databases in the context o...
PPTX
Next Generation Grid: Integrating Parallel and Distributed Computing Runtimes...
PDF
E-ARK-iPRES2016-Bern-October-2016
PPTX
Geo data analytics
PDF
C0312023
PDF
Time series database by Harshil Ambagade
PDF
Big data distributed processing: Spark introduction
PDF
HPC-ABDS: The Case for an Integrating Apache Big Data Stack with HPC
ODP
Google's Dremel
PPTX
Working with Scientific Data in MATLAB
DOCX
OPTIMIZING END-TO-END BIG DATA TRANSFERS OVER TERABITS NETWORK INFRASTRUCTURE
PPTX
GraphQL & DGraph with Go
PDF
Dgraph: Graph database for production environment
PPTX
A 3 dimensional data model in hbase for large time-series dataset-20120915
PPTX
Materials Data Facility: Streamlined and automated data sharing, discovery, ...
PPTX
Introduction to DGraph - A Graph Database
PPTX
Classification of Big Data Use Cases by different Facets
PDF
Partitioning SKA Dataflows for Optimal Graph Execution
PPT
BDAS RDD study report v1.2
SPD and KEA: HDF5 based file formats for Earth Observation
Transient and persistent RDF views over relational databases in the context o...
Next Generation Grid: Integrating Parallel and Distributed Computing Runtimes...
E-ARK-iPRES2016-Bern-October-2016
Geo data analytics
C0312023
Time series database by Harshil Ambagade
Big data distributed processing: Spark introduction
HPC-ABDS: The Case for an Integrating Apache Big Data Stack with HPC
Google's Dremel
Working with Scientific Data in MATLAB
OPTIMIZING END-TO-END BIG DATA TRANSFERS OVER TERABITS NETWORK INFRASTRUCTURE
GraphQL & DGraph with Go
Dgraph: Graph database for production environment
A 3 dimensional data model in hbase for large time-series dataset-20120915
Materials Data Facility: Streamlined and automated data sharing, discovery, ...
Introduction to DGraph - A Graph Database
Classification of Big Data Use Cases by different Facets
Partitioning SKA Dataflows for Optimal Graph Execution
BDAS RDD study report v1.2
Ad

Similar to Proactive Data Containers (PDC): An Object-centric Data Store for Large-scale Computing Systems (20)

PPTX
Discovery Engines for Big Data: Accelerating Discovery in Basic Energy Sciences
PDF
Don't Be Scared. Data Don't Bite. Introduction to Big Data.
PPTX
Accelerating Data-driven Discovery in Energy Science
PPTX
Enabling efficient movement of data into & out of a high-performance analysis...
PDF
Network-aware Data Management for High Throughput Flows Akamai, Cambridge, ...
PDF
Deep Learning on Apache Spark at CERN’s Large Hadron Collider with Intel Tech...
PPTX
Big Process for Big Data @ NASA
PPTX
Taming Big Data!
PPTX
Big data at experimental facilities
PPTX
The Discovery Cloud: Accelerating Science via Outsourcing and Automation
PDF
The Interplay of Workflow Execution and Resource Provisioning
PPTX
Matching Data Intensive Applications and Hardware/Software Architectures
PPTX
Matching Data Intensive Applications and Hardware/Software Architectures
PDF
Kafka & Hadoop in Rakuten
PDF
Hopsworks in the cloud Berlin Buzzwords 2019
ODP
Next-generation sequencing: Data mangement
PDF
AI Super computer update
PDF
Scientific Application Development and Early results on Summit
PDF
ER 2016 Tutorial
PDF
Vargas polyglot-persistence-cloud-edbt
Discovery Engines for Big Data: Accelerating Discovery in Basic Energy Sciences
Don't Be Scared. Data Don't Bite. Introduction to Big Data.
Accelerating Data-driven Discovery in Energy Science
Enabling efficient movement of data into & out of a high-performance analysis...
Network-aware Data Management for High Throughput Flows Akamai, Cambridge, ...
Deep Learning on Apache Spark at CERN’s Large Hadron Collider with Intel Tech...
Big Process for Big Data @ NASA
Taming Big Data!
Big data at experimental facilities
The Discovery Cloud: Accelerating Science via Outsourcing and Automation
The Interplay of Workflow Execution and Resource Provisioning
Matching Data Intensive Applications and Hardware/Software Architectures
Matching Data Intensive Applications and Hardware/Software Architectures
Kafka & Hadoop in Rakuten
Hopsworks in the cloud Berlin Buzzwords 2019
Next-generation sequencing: Data mangement
AI Super computer update
Scientific Application Development and Early results on Summit
ER 2016 Tutorial
Vargas polyglot-persistence-cloud-edbt
Ad

More from Globus (20)

PDF
Globus Compute wth IRI Workflows - GlobusWorld 2024
PDF
Climate Science Flows: Enabling Petabyte-Scale Climate Analysis with the Eart...
PDF
Globus Compute Introduction - GlobusWorld 2024
PDF
Globus Connect Server Deep Dive - GlobusWorld 2024
PDF
Innovating Inference - Remote Triggering of Large Language Models on HPC Clus...
PDF
Providing Globus Services to Users of JASMIN for Environmental Data Analysis
PDF
First Steps with Globus Compute Multi-User Endpoints
PDF
Enhancing Research Orchestration Capabilities at ORNL.pdf
PDF
Understanding Globus Data Transfers with NetSage
PDF
How to Position Your Globus Data Portal for Success Ten Good Practices
PDF
Exploring Innovations in Data Repository Solutions - Insights from the U.S. G...
PDF
Developing Distributed High-performance Computing Capabilities of an Open Sci...
PDF
The Department of Energy's Integrated Research Infrastructure (IRI)
PDF
GlobusWorld 2024 Opening Keynote session
PDF
Enhancing Performance with Globus and the Science DMZ
PDF
Extending Globus into a Site-wide Automated Data Infrastructure.pdf
PDF
Globus at the United States Geological Survey
PDF
Providing Globus Services to Users of JASMIN for Environmental Data Analysis
PDF
Globus Compute with Integrated Research Infrastructure (IRI) workflows
PDF
Reactive Documents and Computational Pipelines - Bridging the Gap
Globus Compute wth IRI Workflows - GlobusWorld 2024
Climate Science Flows: Enabling Petabyte-Scale Climate Analysis with the Eart...
Globus Compute Introduction - GlobusWorld 2024
Globus Connect Server Deep Dive - GlobusWorld 2024
Innovating Inference - Remote Triggering of Large Language Models on HPC Clus...
Providing Globus Services to Users of JASMIN for Environmental Data Analysis
First Steps with Globus Compute Multi-User Endpoints
Enhancing Research Orchestration Capabilities at ORNL.pdf
Understanding Globus Data Transfers with NetSage
How to Position Your Globus Data Portal for Success Ten Good Practices
Exploring Innovations in Data Repository Solutions - Insights from the U.S. G...
Developing Distributed High-performance Computing Capabilities of an Open Sci...
The Department of Energy's Integrated Research Infrastructure (IRI)
GlobusWorld 2024 Opening Keynote session
Enhancing Performance with Globus and the Science DMZ
Extending Globus into a Site-wide Automated Data Infrastructure.pdf
Globus at the United States Geological Survey
Providing Globus Services to Users of JASMIN for Environmental Data Analysis
Globus Compute with Integrated Research Infrastructure (IRI) workflows
Reactive Documents and Computational Pipelines - Bridging the Gap

Recently uploaded (20)

PPTX
Cloud computing and distributed systems.
PPTX
Machine Learning_overview_presentation.pptx
PDF
Diabetes mellitus diagnosis method based random forest with bat algorithm
PDF
Encapsulation theory and applications.pdf
PDF
gpt5_lecture_notes_comprehensive_20250812015547.pdf
PPTX
Big Data Technologies - Introduction.pptx
PPTX
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
PDF
Building Integrated photovoltaic BIPV_UPV.pdf
PPTX
20250228 LYD VKU AI Blended-Learning.pptx
PDF
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
PDF
A comparative analysis of optical character recognition models for extracting...
DOCX
The AUB Centre for AI in Media Proposal.docx
PDF
MIND Revenue Release Quarter 2 2025 Press Release
PDF
Chapter 3 Spatial Domain Image Processing.pdf
PDF
NewMind AI Weekly Chronicles - August'25-Week II
PDF
Mobile App Security Testing_ A Comprehensive Guide.pdf
PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
PDF
Agricultural_Statistics_at_a_Glance_2022_0.pdf
PDF
Assigned Numbers - 2025 - Bluetooth® Document
PDF
Machine learning based COVID-19 study performance prediction
Cloud computing and distributed systems.
Machine Learning_overview_presentation.pptx
Diabetes mellitus diagnosis method based random forest with bat algorithm
Encapsulation theory and applications.pdf
gpt5_lecture_notes_comprehensive_20250812015547.pdf
Big Data Technologies - Introduction.pptx
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
Building Integrated photovoltaic BIPV_UPV.pdf
20250228 LYD VKU AI Blended-Learning.pptx
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
A comparative analysis of optical character recognition models for extracting...
The AUB Centre for AI in Media Proposal.docx
MIND Revenue Release Quarter 2 2025 Press Release
Chapter 3 Spatial Domain Image Processing.pdf
NewMind AI Weekly Chronicles - August'25-Week II
Mobile App Security Testing_ A Comprehensive Guide.pdf
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
Agricultural_Statistics_at_a_Glance_2022_0.pdf
Assigned Numbers - 2025 - Bluetooth® Document
Machine learning based COVID-19 study performance prediction

Proactive Data Containers (PDC): An Object-centric Data Store for Large-scale Computing Systems

  • 1. Proactive Data Containers (PDC): An Object-centric Data Store for Large-scale Computing Systems Suren Byna Lawrence Berkeley National Lab (LBNL), Berkeley Co-authors Quincey Koziol (LBNL), Venkat Vishwanath (ANL), Jerome Soumagne (THG), Houjun Tang (LBNL), Kimmy Mu (THG), Bin Dong (LBNL), Richard Warren (THG), François Tessier (ANL, now @ CSCS), Teng Wang (LBNL), and Jialin Liu (LBNL)
  • 2. ▪  Extreme parallelism ▪  Massive Data ▪  Hierarchical storage Scalable data management – Three disrupting trends 2
  • 3. 3 Extreme parallelism Summit, ORNL Sierra, LLNL Sunway Taihulight NSC Wuxi, China Trinity, LANL Cori, LBNL Summit - ~2.4M cores - ~143 PFlops - 9.7 MW Sierra - ~1.5M cores - ~94 PFlops - 7.4 MW Taihulight - ~10.6M cores - ~93 PFlops - 15 MW
  • 4. § Simulations –  Multi-physics (FLASH) – 10 PB –  Cosmology (NyX) – 10 PB –  Plasma physics (VPIC) – 1 PB § Experimental and observational data (EOD) –  LHC (100 PB), –  LSST (60 PB), –  Genomics (100 TB to 1 PB) Massive scientific data FLASH NyX VPIC 4 LHC LSST Genomics
  • 5. Hierarchical and heterogeneous storage 5 IO Gap Memory Parallel file system (Lustre, GPFS) Archival Storage (HPSS tape) IO Gap Shared burst buffer Memory Parallel file system (Lustre, GPFS) Archival Storage (HPSS tape) Memory Parallel file system Archival storage (HPSS tape) Shared burst buffer Node-local storage Conventional Current Eg. Cori @ NERSC Upcoming Campaign storage
  • 6. Reading and writing data on scalable systems 6 ▪  Types of parallel I/O •  1 writer/reader, 1 file •  N writers/readers, N files (File-per-process) •  N writers/readers, 1 file •  M writers/readers, 1 file –  Aggregators –  Two-phase I/O •  M aggregators, M files (file-per-aggregator) –  Variations of this mode P0 P1 Pn-1 Pn … file.0 1 Writer/Reader, 1 File P0 P1 Pn-1 Pn … file.0 n Writers/Readers, n Files file.1 file.n-1 file.n P0 P1 Pn-1 Pn … n Writers/Readers, 1 File File.1 P0 P1 Pn-1 Pn … file.0 M Writers/Readers, M Files file.m P0 P1 Pn-1 Pn … M Writers/Readers, 1 File File.1
  • 7. Scalable Storage Systems: Challenges 7 Memory Disk-based storage Archival storage (HPSS tape) Shared burst buffer Hardware Node-local storage Campaign storage Software High-level I/O lib (netCDF, HDF5, etc.) IO middleware (POSIX, MPI-IO) IO forwarding Parallel file systems Applications Usage … Data (in memory) IO software … Files in file system •  Challenges –  POSIX-IO semantics hinder scalability and performance of file systems and IO software –  Multi-level hierarchy complicates data movement, especially if user has to be involved Tune middleware Tune file systems
  • 8. Scalable data management requirements Use case Domain Sim/EOD/ analysis Data size I/O Requirements FLASH High-energy density physics Simulation ~1PB Data transformations, scalable I/O interfaces, correlation among simulation and experimental data CMB / Planck Cosmology Simulation, EOD/ Analysis 10PB Automatic data movement optimizations DECam & LSST Cosmology EOD/Analysis ~10TB Easy interfaces, data transformations E3SM Climate Simulation ~10PB Async I/O, derived variables, automatic data movement TECA Climate Analysis ~10PB Data organization and efficient data movement HipMer Genomics EOD/Analysis ~100TB Scalable I/O interfaces, efficient and automatic data movement 8 Easy interfaces and superior performance Transparent data management Information capture and management 8
  • 9. Next Gen Storage – Proactive Data Containers (PDC) Memory Disk-based storage Archival storage (HPSS tape) Shared burst buffer Hardware Node-local storage Campaign storage Software High-level API Applications Usage … Data (in memory) 9
  • 10. ▪  Object-centric data access interface §  Simple put, get interface §  Array-based variable access ▪  Transparent data management §  Data placement in storage hierarchy §  Automatic data movement ▪  Information capture and management §  Rich metadata §  Connection of results and raw data with relationships Persistent Storage API BB FS Lustre DAOS … PDC System – High-level Architecture 10
  • 11. ▪ Object-level interface –  Create – containers and objects –  Add attributes –  Put object –  Get object –  Delete object ▪  Array-specific interface –  Create regions –  Map regions in PDC objects –  Lock –  Release 11 Object-centric PDC Interface J. Mu, J. Soumagne, et al., “A Transparent Server-managed Object Storage System for HPC”, IEEE Cluster 2018 Proactive Data Container Container Dataset KV-Store Group <root> A B C D E F PDC Locus Dataset KV-Store Group Container Collection Locus Container: X <root> A B C D E F Container: W <root> A B C D E F Container: Z <root> A B C D E F Collection: P Collection: Q PDC Collection Container: X <root> A B C D E F Container: W <root> A B C D E F Container: Z <root> A B C D E F Container: Y <root> A B C D E F Proactive Data Container Container <root> A B C D E F Key
  • 12. ▪ Object-level interface –  Create – containers and objects –  Add attributes –  Put object –  Get object –  Delete object ▪  Array-specific interface –  Create regions –  Map regions in PDC objects –  Lock –  Release 12 Object-centric PDC Interface J. Mu, J. Soumagne, et al., “A Transparent Server-managed Object Storage System for HPC”, IEEE Cluster 2018
  • 13. ▪ Usage of compute resources for I/O –  Shared mode – Compute nodes are shared between applications and I/O services –  Dedicated mode – I/O services on separate nodes ▪  Transparent data movement by PDC servers –  Apps map data buffers to objects and PDC servers place and manage data –  Apps query for data objects using attributes ▪  Superior I/O performance 13 Transparent data movement in storage hierarchy H. Tang, S. Byna, et al., “Toward Scalable and Asynchronous Object-centric Data Management for HPC”, IEEE/ACM CCGrid 2018 0 350 700 1050 124 248 496 992 1984 3968 7936 15872 Time in seconds Number of processes HDF5 read (Lustre) PLFS read (Lustre) PDC read (Lustre) HDF5 read (BB) PDC read (BB) 0 250 500 750 124 248 496 992 1984 3968 7936 15872 Time in seconds Number of processes HDF5 write (Lustre) PLFS write (Lustre) PDC write (Lustre) HDF5 write (BB) PDC write (BB)
  • 14. ▪ Flat name space ▪ Rich metadata –  Pre-defined tags that includes provenance –  User-defined tags for capturing relationships between data objects ▪  Distributed in memory metadata management –  Distributed hash table and bloom filters used for faster access 14 Metadata management H. Tang, S. Byna, et al., “SoMeta: Scalable Object-centric Metadata Management for High Performance Computing”, to be presented at IEEE Cluster 2017
  • 15. ▪ Take home message –  Scalable storage systems impacted by: •  Extreme level of parallelism •  Massive amounts of scientific data •  Transforming storage architectures –  Proactive data containers •  Object-centric interfaces •  Transparent data movement in storage hierarchies •  Scalable management of extensive metadata 15 Conclusions