Gluster: Where We've Been
                               AB Periasamy
                 Office of the CTO, Red Hat

                         John Mark Walker
                   Gluster Community Guy
Topics
 The Big Idea
 Humble beginnings
      From Bangalore to Milpitas
 Scale-out + Open source == WINNING
      User-space, no metadata server, stackable
 Cloud and commoditization



06/13/12
A Data Explosion!
 74% == Unstructured data annual growth
 63,000 PB == Scale-out storage in 2015
 40% == storage-related expense for cloud
 44x == Unstructured data volume growth by 2020

[Photos: Conference Room; US Head Office; Bengaluru Office (two photos)]
Gluster Community Deployments
Gluster Production Deployments
What Can You Store?
      Media – Docs, Photos, Video
      VM Filesystem – VM Disk Images
      Big Data – Log Files, RFID Data
      Objects – Long Tail Data



The big idea:
   Storage should be
               simple
 Simple, scalable, low-cost
What is GlusterFS,
 Really?
           Gluster is a unified, distributed
            storage system
             DHT, stackable, POSIX, Swift, HDFS
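
The "DHT" and "no metadata server" points can be illustrated with a small sketch: every client hashes the file name and checks which brick owns that part of the hash space, so no central lookup service is needed. This is a simplified illustration, not GlusterFS's actual algorithm. The brick names, the use of CRC32 as the hash, and the even split of a 32-bit hash space are all assumptions made for the example.

```python
import zlib

HASH_SPACE = 2**32  # assumed 32-bit hash space, split evenly across bricks


def build_layout(bricks):
    """Assign each brick a contiguous, non-overlapping range of the hash space."""
    step = HASH_SPACE // len(bricks)
    layout = []
    for i, brick in enumerate(bricks):
        start = i * step
        # The last brick absorbs any remainder so the ranges cover the whole space.
        end = HASH_SPACE - 1 if i == len(bricks) - 1 else start + step - 1
        layout.append((start, end, brick))
    return layout


def hash_name(filename):
    """Map a file name to a 32-bit integer (CRC32 is a stand-in hash here)."""
    return zlib.crc32(filename.encode("utf-8")) & 0xFFFFFFFF


def locate(filename, layout):
    """Return the brick whose range contains the hash of the file name."""
    h = hash_name(filename)
    for start, end, brick in layout:
        if start <= h <= end:
            return brick
    raise ValueError("layout does not cover hash value")


# Hypothetical brick names, for illustration only.
bricks = ["server1:/export/brick1", "server2:/export/brick1", "server3:/export/brick1"]
layout = build_layout(bricks)

for name in ["photo.jpg", "vm-disk.img", "access.log"]:
    print(name, "->", locate(name, layout))
```

Any client holding the same layout computes the same answer, which is why no separate metadata server has to be consulted to find a file.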




Phase 1: Lego Kit for
  Storage
      “People who think that userspace filesystems
        are realistic for anything but toys are just
        misguided.” – Linus Torvalds
      Goal: create a global namespace



volume testvol-posix
    type storage/posix
    option directory /media/datastore
    option volume-id 329e31c1-04cc-4386-8bb8-xxxx
end-volume

volume testvol-access-control
    type features/access-control
    subvolumes testvol-posix
end-volume

volume testvol-locks
    type features/locks
    subvolumes testvol-access-control
end-volume

volume testvol-io-threads
    type performance/io-threads
    subvolumes testvol-locks
end-volume
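
To see how the stack in this volume file fits together, the sketch below parses the four definitions and prints the translators from the top of the stack down to the storage layer. It is a minimal parser written for this example only: it handles just the `volume`, `type`, `option`, `subvolumes`, and `end-volume` keywords shown above, assumes a single linear stack, and is not how GlusterFS itself reads volume files.

```python
VOLFILE = """
volume testvol-posix
    type storage/posix
    option directory /media/datastore
    option volume-id 329e31c1-04cc-4386-8bb8-xxxx
end-volume

volume testvol-access-control
    type features/access-control
    subvolumes testvol-posix
end-volume

volume testvol-locks
    type features/locks
    subvolumes testvol-access-control
end-volume

volume testvol-io-threads
    type performance/io-threads
    subvolumes testvol-locks
end-volume
"""


def parse_volfile(text):
    """Parse volume blocks into {name: {"type", "options", "subvolumes"}}."""
    volumes = {}
    current = None
    for raw in text.splitlines():
        words = raw.split()
        if not words:
            continue  # skip blank lines
        keyword = words[0]
        if keyword == "volume":
            current = {"type": None, "options": {}, "subvolumes": []}
            volumes[words[1]] = current
        elif keyword == "type":
            current["type"] = words[1]
        elif keyword == "option":
            current["options"][words[1]] = " ".join(words[2:])
        elif keyword == "subvolumes":
            current["subvolumes"] = words[1:]
        elif keyword == "end-volume":
            current = None
    return volumes


def stack_from_top(volumes):
    """Start at the volume no other volume references and follow its first subvolume down."""
    referenced = {s for v in volumes.values() for s in v["subvolumes"]}
    tops = [name for name in volumes if name not in referenced]
    name = tops[0]
    chain = []
    while name is not None:
        chain.append((name, volumes[name]["type"]))
        subs = volumes[name]["subvolumes"]
        name = subs[0] if subs else None
    return chain


for name, xlator_type in stack_from_top(parse_volfile(VOLFILE)):
    print(f"{xlator_type:28} ({name})")
```

Each `subvolumes` line points at the layer underneath, so a request enters at io-threads and passes through locks and access-control before reaching the POSIX storage translator.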
Versions 1.x – 2.x
      Hand-crafted volume definition files
        See examples
      Simple configuration files
      Faster than tape? It's good!


Phase 2: Repeatability
  of Use Cases




Community-led
      Learned from community
         Desired features
         Usage profiles
      All about scalable storage of unstructured
       data

GlusterFS 3.0: Putting it
  all together
      Adding, removing features
      Templates – recipes for common use
       cases


GlusterFS 3.1 - 2010
      Elasticity: add and remove volumes w/
       glusterd
      Automation: CLI, scriptable




CLI Magic
 $ gluster peer probe HOSTNAME
 $ gluster volume info
 $ gluster volume create VOLNAME [stripe COUNT] 
           [replica COUNT] [transport tcp | rdma] BRICK
 $ gluster volume delete VOLNAME
 $ gluster volume add-brick VOLNAME NEW-BRICK ...
 $ gluster volume rebalance VOLNAME start
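
Because every step is a plain command, provisioning can be scripted. The sketch below only builds and prints the command lines (a dry run) and uses only subcommands listed above. The helper name `provision_commands`, the host names, the brick path, and the volume name are hypothetical. Bricks are written as HOST:/PATH.

```python
import shlex


def provision_commands(volname, hosts, brick_path, replica=None):
    """Return the gluster command lines for probing peers and creating a volume."""
    commands = []
    # The first host is assumed to be the one running the commands,
    # so only the remaining hosts are probed.
    for host in hosts[1:]:
        commands.append(["gluster", "peer", "probe", host])

    create = ["gluster", "volume", "create", volname]
    if replica is not None:
        create += ["replica", str(replica)]
    create += ["transport", "tcp"]
    create += [f"{host}:{brick_path}" for host in hosts]
    commands.append(create)

    commands.append(["gluster", "volume", "info"])
    return commands


# Hypothetical names, for illustration only.
cmds = provision_commands("testvol", ["server1", "server2"], "/export/brick1", replica=2)
for cmd in cmds:
    print("$", shlex.join(cmd))
```

To execute rather than print, each list could be passed to `subprocess.run(cmd, check=True)` on a node where glusterd is running.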

GlusterFS 3.2 - 2011
      Native NFS server
      Marker framework
      Geo-replication
        Asynchronous


And now for something
  completely different
           Commoditization and the changing
            economics of storage
           Why we're winning



Simple Economics
        Simplicity, scalability, lower cost


  Virtualized     Multi-Tenant   Automated   Commoditized


Scale on Demand   In the Cloud   Scale Out   Open Source




Simplicity Bias
      FC, FCoE, iSCSI → HTTP, Sockets
      Modified BSD OS → Linux / User Space /
                        C, Python & Java
      Appliance based → Application based




Scale-out Open Source
  is the winner




Thank you!
                     AB Periasamy
       Office of the CTO, Red Hat
                   ab@redhat.com

               John Mark Walker
         Gluster Community Guy
          johnmark@redhat.com

Editor's Notes

  • #9: Add examples where complexity has been bad
      - EMC, Cisco, Brocade et al.: certification made a business out of complexity
      - If too complicated, it doesn't scale
  • #10: Discuss approach: how GlusterFS is unique and different from other approaches
      - Lessons from GNU Hurd
      - User-space distributed storage operating system
      - Overcome some parts of the OS: implemented scheduler, POSIX locking, RDMA, MM (cf. JVM, Python, etc.)
      - No metadata separation
  • #11: If you have a bunch of files, storing them should be as simple as running an FTP server
      - In user space, this required FUSE, a POSIX translator, a NAS protocol, and a cluster translator
  • #15: Learned about missing features
      - Found the largest problem and wanted to solve it
      - Patterns emerged: scalable unstructured data storage was the #1 problem people wanted to solve
      - Had a clearer idea of where we wanted to go: a clear direction
  • #16: Standalone NFS replacement; active-active replicated storage; scalable, distributed storage; and then scalable, replicated distributed storage + other combos
  • #17: Elastic features driven by cloud and virt usage
      - Shared storage for virtual guests
      - Flexible, self-service storage
      - Elastic volume management became a requirement
      - Automated provisioning of storage w/ CLI (native NFS server? Or 3.2?)
  • #19: Marker framework: the story of why it's necessary
      - Backup of data in other locales
      - Don't need an entire snapshot
      - Users wanted continuous, unlimited replication
      - Don't want sysadmin intervention: on-demand
      - Queries the FS to find which files have changed
      - Manages a queue, telling rsync exactly which files to change
    Inotify: doesn't scale; if the daemon crashes, it stops tracking changes
      - Would have to write a journaling feature to maintain the change queue
    Geo-replication: can work on high-latency, flaky networks
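
The marker idea in note #19 can be sketched roughly as follows: each directory carries a change marker that is bumped whenever anything beneath it changes, so a crawler can skip unchanged subtrees and hand rsync an exact file list. This is an in-memory illustration under assumed names (`Node`, `xtime`, `touch`, `changed_files`). It is not the GlusterFS marker implementation, and the rsync destination in the closing comment is hypothetical.

```python
class Node:
    """A file or directory carrying an assumed 'xtime' change marker."""

    def __init__(self, name, is_dir=False):
        self.name = name
        self.is_dir = is_dir
        self.xtime = 0
        self.children = {}  # stays empty for plain files


def touch(root, path, now):
    """Record a change to `path` and stamp every ancestor up to the root."""
    node = root
    node.xtime = now
    parts = path.split("/")
    for i, part in enumerate(parts):
        last = i == len(parts) - 1
        child = node.children.get(part)
        if child is None:
            child = Node(part, is_dir=not last)
            node.children[part] = child
        child.xtime = now
        node = child


def changed_files(node, last_sync, prefix=""):
    """Yield files changed after `last_sync`, skipping unchanged subtrees."""
    if node.xtime <= last_sync:
        return  # nothing below this node changed, so do not descend
    for child in node.children.values():
        path = prefix + child.name
        if child.is_dir:
            yield from changed_files(child, last_sync, path + "/")
        elif child.xtime > last_sync:
            yield path


root = Node("", is_dir=True)
touch(root, "logs/app.log", now=1)
touch(root, "media/a.jpg", now=2)
last_sync = 2                       # pretend a sync finished at time 2
touch(root, "logs/app.log", now=3)  # only this file changes afterwards

queue = sorted(changed_files(root, last_sync))
print(queue)
# The queue could then be written to a file and handed to rsync, e.g.:
#   rsync -a --files-from=changed.txt /media/datastore/ backup-host:/backup/
```

Because the markers live with the data rather than in a watching daemon's memory, a crashed crawler can restart and compare markers again, which is the weakness of the inotify approach that the note points out.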