SlideShare a Scribd company logo
GlusterFS
Challenges and Futures
Jeff Darcy
Storage Developer Conference
September 17, 2012
What Is GlusterFS?
● Just your basic distributed file system
● sharding, replication, striping
● Decentralized
● no central metadata server
● core functionality on clients
● Modular
● “translators”
Translator Stacking
FUSE
DHT
AFR-1
Client-3Client-1
AFR-0
Client-2Client-0
● One to one, one to many, one to zero (?)
● Rearrange, move from client to server, ...
Distribution (now)
Server A
Server BServer C
File X
File Y
Replication (now)
● Based on changelog (“dirty flags”)
● set flags, do operation, clear flags
● use flags to determine repair (“self-heal”) after
failure
● Latency sensitive
● 3+ network round trips per user request
● implementation heavily optimized
Challenge: Multitenancy
● Focus of CloudFSHekaFS
● to be merged with GlusterFS soon-ish
● Isolate name and ID spaces
● Encrypt in flight and at rest
● Auth*
● Quota (next slide)
● Performance isolation
● cgroups
Challenge: Distributed Quota
● Can't trust clients to enforce quota
● Can't just divide equally among servers
● unequal usage (e.g. due to explicit placement)
● EDQUOT on one while still space on another
● “Quota rebalancing daemon”
● monitor/adjust continuously
● interesting problem at high scale
Challenge: Better Rebalancing
● Optimal placement vs. minimal data movement
● Different kinds of weighting
Node A
Node B
Node C
Node D
Node E
Challenge: Replication Latency
● Reaching limits of current approach
● Have to go async?
● but still ordered
● exploit compute/data locality (e.g. Hadoop)
● journaling, conflict resolution
Challenge: Directory Traversal
● Piggyback attrs (and xattrs) on readdir
● Even better: cursor approach
● read everything in opendir
● zero network activity for readdir
● less current, but more consistent
Challenge: Many Small Files
● Prefetch whole directories
● if marked, below size threshold, ...
● Exploit async journal
● only works if compute/data colocated
● Weaken consistency?
● allow create/write/close to be buffered
● directory-level fsync (magic xattr)
Conclusions
● Most of these challenges are not unique to
GlusterFS
● Modularity and incremental progress are
preferable to monolithic “solve all problems at
once”
● GlusterFS provides a good environment in
which to experiment with solutions

More Related Content

ODP
Tiering barcelona
ODP
Gluster intro-tdose
ODP
YDAL Barcelona
ODP
Lisa 2015-gluster fs-introduction
PDF
Disperse xlator ramon_datalab
ODP
Dustin Black - Red Hat Storage Server Administration Deep Dive
PDF
Gluster d2
PDF
Gluster fs current_features_and_roadmap
Tiering barcelona
Gluster intro-tdose
YDAL Barcelona
Lisa 2015-gluster fs-introduction
Disperse xlator ramon_datalab
Dustin Black - Red Hat Storage Server Administration Deep Dive
Gluster d2
Gluster fs current_features_and_roadmap

What's hot (20)

ODP
Scale out backups-with_bareos_and_gluster
PDF
State of the_gluster_-_lceu
PDF
On demand file-caching_-_gustavo_brand
ODP
Dedupe nmamit
ODP
Lcna tutorial-2012
PDF
Sdc 2012-challenges
ODP
20160130 Gluster-roadmap
PDF
Smb gluster devmar2013
ODP
Developing apps and_integrating_with_gluster_fs_-_libgfapi
PDF
OSBConf 2015 | Scale out backups with bareos and gluster by niels de vos
ODP
Gluster fs hadoop_fifth-elephant
PDF
Gluster for sysadmins
PDF
Gluster overview & future directions vault 2015
PDF
Gluster intro-tdose
ODP
Red Hat Gluster Storage : GlusterFS
PDF
Challenges with Gluster and Persistent Memory with Dan Lambright
ODP
20160401 Gluster-roadmap
ODP
Lcna example-2012
ODP
GlusterFs Architecture & Roadmap - LinuxCon EU 2013
PDF
GlusterFS And Big Data
Scale out backups-with_bareos_and_gluster
State of the_gluster_-_lceu
On demand file-caching_-_gustavo_brand
Dedupe nmamit
Lcna tutorial-2012
Sdc 2012-challenges
20160130 Gluster-roadmap
Smb gluster devmar2013
Developing apps and_integrating_with_gluster_fs_-_libgfapi
OSBConf 2015 | Scale out backups with bareos and gluster by niels de vos
Gluster fs hadoop_fifth-elephant
Gluster for sysadmins
Gluster overview & future directions vault 2015
Gluster intro-tdose
Red Hat Gluster Storage : GlusterFS
Challenges with Gluster and Persistent Memory with Dan Lambright
20160401 Gluster-roadmap
Lcna example-2012
GlusterFs Architecture & Roadmap - LinuxCon EU 2013
GlusterFS And Big Data
Ad

Viewers also liked (15)

PDF
Gluster wireshark niels_de_vos
PDF
Hands On Gluster with Jeff Darcy
ODP
Accessing gluster ufo_-_eco_willson
PDF
Qemu gluster fs
PDF
Gluster as Block Store in Containers
ODP
Leases and-caching final
ODP
Gluster d thread_synchronization_using_urcu_lca2016
ODP
Join the super_colony_-_feb2013
ODP
Gdeploy 2.0
ODP
Bug triage in_gluster
PDF
Debugging with-wireshark-niels-de-vos
ODP
Integrating gluster fs,_qemu_and_ovirt-vijay_bellur-linuxcon_eu_2013
ODP
Gsummit apis-2013
ODP
Introduction to highly_availablenfs_server_on_scale-out_storage_systems_based...
ODP
Introduction to Open Source
Gluster wireshark niels_de_vos
Hands On Gluster with Jeff Darcy
Accessing gluster ufo_-_eco_willson
Qemu gluster fs
Gluster as Block Store in Containers
Leases and-caching final
Gluster d thread_synchronization_using_urcu_lca2016
Join the super_colony_-_feb2013
Gdeploy 2.0
Bug triage in_gluster
Debugging with-wireshark-niels-de-vos
Integrating gluster fs,_qemu_and_ovirt-vijay_bellur-linuxcon_eu_2013
Gsummit apis-2013
Introduction to highly_availablenfs_server_on_scale-out_storage_systems_based...
Introduction to Open Source
Ad

Similar to Sdc challenges-2012 (20)

ODP
Gluster 3.3 deep dive
PDF
Distributed File Systems
ODP
The Future of GlusterFS and Gluster.org
PDF
GlusterFS Update and OpenStack Integration
PDF
Gluster fs current_features_and_roadmap
PDF
vBACD - Distributed Petabyte-Scale Cloud Storage with GlusterFS - 2/28
PDF
Gluster fs architecture_&_roadmap-vijay_bellur-linuxcon_eu_2013
PDF
Storage as a Service with Gluster
PDF
The Future of GlusterFS and Gluster.org
PDF
Gluster fs architecture_&_roadmap_atin_punemeetup_2015
PDF
GlusterFS : un file system open source per i big data di oggi e domani - Robe...
PDF
GlusterFs: a scalable file system for today's and tomorrow's big data
PDF
はじめてのGlusterFS
PDF
An-Insight-about-Glusterfs-and-it's-Enforcement-Techniques
ODP
Gluster fs architecture_future_directions_tlv
PDF
Gluster fs architecture_future_directions_tlv
PDF
Red Hat Gluster Storage Performance
PDF
GlusterFS as a DFS
PDF
Codemotion Rome 2015. GlusterFS
PDF
Gluster FS a filesistem for Big Data | Roberto Franchini - Codemotion Rome 2015
Gluster 3.3 deep dive
Distributed File Systems
The Future of GlusterFS and Gluster.org
GlusterFS Update and OpenStack Integration
Gluster fs current_features_and_roadmap
vBACD - Distributed Petabyte-Scale Cloud Storage with GlusterFS - 2/28
Gluster fs architecture_&_roadmap-vijay_bellur-linuxcon_eu_2013
Storage as a Service with Gluster
The Future of GlusterFS and Gluster.org
Gluster fs architecture_&_roadmap_atin_punemeetup_2015
GlusterFS : un file system open source per i big data di oggi e domani - Robe...
GlusterFs: a scalable file system for today's and tomorrow's big data
はじめてのGlusterFS
An-Insight-about-Glusterfs-and-it's-Enforcement-Techniques
Gluster fs architecture_future_directions_tlv
Gluster fs architecture_future_directions_tlv
Red Hat Gluster Storage Performance
GlusterFS as a DFS
Codemotion Rome 2015. GlusterFS
Gluster FS a filesistem for Big Data | Roberto Franchini - Codemotion Rome 2015

More from Gluster.org (20)

PDF
Automating Gluster @ Facebook - Shreyas Siravara
PDF
nfusr: a new userspace NFS client based on libnfs - Shreyas Siravara
PDF
Facebook’s upstream approach to GlusterFS - David Hasson
PDF
Throttling Traffic at Facebook Scale
PDF
GlusterFS w/ Tiered XFS
PDF
Gluster Metrics: why they are crucial for running stable deployments of all s...
PDF
Up and Running with Glusto & Glusto-Tests in 5 Minutes (or less)
PDF
Data Reduction for Gluster with VDO
PDF
Releases: What are contributors responsible for
PDF
RIO Distribution: Reconstructing the onion - Shyamsundar Ranganathan
PDF
Gluster and Kubernetes
PDF
Native Clients, more the merrier with GFProxy!
PDF
Gluster: a SWOT Analysis
PDF
GlusterD-2.0: What's Happening? - Kaushal Madappa
PDF
Scalability and Performance of CNS 3.6
PDF
What Makes Us Fail
PDF
Gluster as Native Storage for Containers - past, present and future
PDF
Heketi Functionality into Glusterd2
PDF
Architecture of the High Availability Solution for Ganesha and Samba with Kal...
PDF
Gluster Containerized Storage for Cloud Applications
Automating Gluster @ Facebook - Shreyas Siravara
nfusr: a new userspace NFS client based on libnfs - Shreyas Siravara
Facebook’s upstream approach to GlusterFS - David Hasson
Throttling Traffic at Facebook Scale
GlusterFS w/ Tiered XFS
Gluster Metrics: why they are crucial for running stable deployments of all s...
Up and Running with Glusto & Glusto-Tests in 5 Minutes (or less)
Data Reduction for Gluster with VDO
Releases: What are contributors responsible for
RIO Distribution: Reconstructing the onion - Shyamsundar Ranganathan
Gluster and Kubernetes
Native Clients, more the merrier with GFProxy!
Gluster: a SWOT Analysis
GlusterD-2.0: What's Happening? - Kaushal Madappa
Scalability and Performance of CNS 3.6
What Makes Us Fail
Gluster as Native Storage for Containers - past, present and future
Heketi Functionality into Glusterd2
Architecture of the High Availability Solution for Ganesha and Samba with Kal...
Gluster Containerized Storage for Cloud Applications

Recently uploaded (20)

PPTX
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
PDF
Advanced Soft Computing BINUS July 2025.pdf
PDF
CIFDAQ's Market Insight: SEC Turns Pro Crypto
PDF
Approach and Philosophy of On baking technology
PDF
Electronic commerce courselecture one. Pdf
PDF
NewMind AI Weekly Chronicles - August'25 Week I
PDF
Advanced methodologies resolving dimensionality complications for autism neur...
PPTX
Big Data Technologies - Introduction.pptx
PDF
Shreyas Phanse Resume: Experienced Backend Engineer | Java • Spring Boot • Ka...
PDF
cuic standard and advanced reporting.pdf
PPTX
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
PPTX
breach-and-attack-simulation-cybersecurity-india-chennai-defenderrabbit-2025....
PDF
Chapter 3 Spatial Domain Image Processing.pdf
PDF
Unlocking AI with Model Context Protocol (MCP)
PDF
GamePlan Trading System Review: Professional Trader's Honest Take
PDF
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
PDF
Advanced IT Governance
PDF
Empathic Computing: Creating Shared Understanding
PDF
NewMind AI Monthly Chronicles - July 2025
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
Advanced Soft Computing BINUS July 2025.pdf
CIFDAQ's Market Insight: SEC Turns Pro Crypto
Approach and Philosophy of On baking technology
Electronic commerce courselecture one. Pdf
NewMind AI Weekly Chronicles - August'25 Week I
Advanced methodologies resolving dimensionality complications for autism neur...
Big Data Technologies - Introduction.pptx
Shreyas Phanse Resume: Experienced Backend Engineer | Java • Spring Boot • Ka...
cuic standard and advanced reporting.pdf
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
breach-and-attack-simulation-cybersecurity-india-chennai-defenderrabbit-2025....
Chapter 3 Spatial Domain Image Processing.pdf
Unlocking AI with Model Context Protocol (MCP)
GamePlan Trading System Review: Professional Trader's Honest Take
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
Advanced IT Governance
Empathic Computing: Creating Shared Understanding
NewMind AI Monthly Chronicles - July 2025

Sdc challenges-2012

  • 1. GlusterFS Challenges and Futures Jeff Darcy Storage Developer Conference September 17, 2012
  • 2. What Is GlusterFS? ● Just your basic distributed file system ● sharding, replication, striping ● Decentralized ● no central metadata server ● core functionality on clients ● Modular ● “translators”
  • 3. Translator Stacking FUSE DHT AFR-1 Client-3Client-1 AFR-0 Client-2Client-0 ● One to one, one to many, one to zero (?) ● Rearrange, move from client to server, ...
  • 4. Distribution (now) Server A Server BServer C File X File Y
  • 5. Replication (now) ● Based on changelog (“dirty flags”) ● set flags, do operation, clear flags ● use flags to determine repair (“self-heal”) after failure ● Latency sensitive ● 3+ network round trips per user request ● implementation heavily optimized
  • 6. Challenge: Multitenancy ● Focus of CloudFSHekaFS ● to be merged with GlusterFS soon-ish ● Isolate name and ID spaces ● Encrypt in flight and at rest ● Auth* ● Quota (next slide) ● Performance isolation ● cgroups
  • 7. Challenge: Distributed Quota ● Can't trust clients to enforce quota ● Can't just divide equally among servers ● unequal usage (e.g. due to explicit placement) ● EDQUOT on one while still space on another ● “Quota rebalancing daemon” ● monitor/adjust continuously ● interesting problem at high scale
  • 8. Challenge: Better Rebalancing ● Optimal placement vs. minimal data movement ● Different kinds of weighting Node A Node B Node C Node D Node E
  • 9. Challenge: Replication Latency ● Reaching limits of current approach ● Have to go async? ● but still ordered ● exploit compute/data locality (e.g. Hadoop) ● journaling, conflict resolution
  • 10. Challenge: Directory Traversal ● Piggyback attrs (and xattrs) on readdir ● Even better: cursor approach ● read everything in opendir ● zero network activity for readdir ● less current, but more consistent
  • 11. Challenge: Many Small Files ● Prefetch whole directories ● if marked, below size threshold, ... ● Exploit async journal ● only works if compute/data colocated ● Weaken consistency? ● allow create/write/close to be buffered ● directory-level fsync (magic xattr)
  • 12. Conclusions ● Most of these challenges are not unique to GlusterFS ● Modularity and incremental progress are preferable to monolithic “solve all problems at once” ● GlusterFS provides a good environment in which to experiment with solutions