SlideShare a Scribd company logo
A File-System Friendly Key Value Store Cloud-Scale File Serving and Storage MaxiScale FLEX Software Platform @garyorenstein | go@maxiscale.com April 2010 Page  MaxiScale
The Era of Billions 10 billion Tweets 10 billion downloads 3 billion photos/month 12 billion views January April 2010 Page  MaxiScale
Beyond Capacity to Users and Objects Everyone is a content consumer AND creator April 2010 Page  MaxiScale
Impact of The Era of Billions Companies face relentless pressure to store and serve data cost-effectively April 2010 Page  MaxiScale
Cloud Applications Requirements: Fast, Scalable, Affordable “Light” Structured Data Retrieval Page  April 2010 MaxiScale
Basics Anatomy of a Key-Value Store April 2010 Page  Source: http://dominicfallows.co.uk MaxiScale
MaxiScale Key Value Store Benefits Page  April 2010 MaxiScale
Breakthrough FLEX Software Platform Software Infrastructure On commodity hardware Targeted to Web companies Service Providers Enterprises Delivers file serving and storage to web and scale-out applications Web / App Servers File Serving and Storage April 2010 Page  Internet MaxiScale
MaxiScale FLEX Platform: Core Technologies Distributed Platform Single Namespace  Distributed Metadata and Data Peer Sets Optimized File Serving Unified Management Distributed MapReduce-enabled Planned: Custom apps (Hadoop) Linux and Windows Clients Small File Performance (key-value store) High-Density Storage Scale-Out NAS April 2010 Page  MaxiScale Storage I/O Bandwidth CPU
Simplicity of a Single Namespace April 2010 Page  Small File Performance (key-value store) Scale-Out NAS Single mount point / /NFR/ /SFR/ Directories Large files Small objects Flat hierarchy Screaming performance MaxiScale
FLEX Software Platform In Action Utmost system efficiency from client to network to cluster Peer Set April 2010 Page  MaxiScale
Fast Lookups Directly from Disk Return values up to 1MB in a single disk I/O operation Automatically create file and index entry  using POSIX open() call Read index, then metadata and data in single disk I/O Index maps to disk blocks for single disk I/O file retrieval April 2010 Page  Index (RAM) Metadata and Data (Disk) MaxiScale
Performance Counts Customer results 4 nodes 80KB files 736 files/second 63.6 million per day Files/spindle/sec Read tests with 32KB files and small file optimizations April 2010 Page  MaxiScale
Selecting a Key Value Store April 2010 Page  MaxiScale
Accelerating Web content serving Page  Brute-force speed-up Hardware and development cost Performance Efficient key-value lookup without database queries Slash cost Fast lookup Web-scale April 2010 MaxiScale Do-it-yourself key value store RAM Cache RAM Cache Relational DB (MySQL …) Relational DB (MySQL …) MaxiScale key value store
Page  Which key value store and file system are right for your application? High Availability? Key Length? Lookup Latency? Number of Keys? Cost? Size of Values? Data Storage Density? April 2010 MaxiScale
Key value store landscape Page  Circles represent relative usable capacity at load Memcache High RAM cost in large quantities Cost MaxiScale Single disk I/O Reads Petabyte scale Relational DB Long lookup latency File systems Long read latency Solid state disk High flash cost in large quantities Billions of keys; auto- expandable Key length < 1000 characters Lookup latency 5-10ms per key Stored values up to 1MB Highly available storage Latency April 2010 MaxiScale
Placing Key Value Stores in Context April 2010 Page  MaxiScale Relational Databases Key-Value Stores File System Friendly Key-Value Stores File Systems SQL Joins ACID Big Data Billions of Small Objects Application Flexibility (unique data access mechanism) Big Data Billions of Small Objects Filename as key (or system generated key) POSIX compliance Same namespace as file system Large Objects Petabytes of content Directory structures POSIX compliance
Social Networking Case Study Page  April 2010 MaxiScale
Social networking application example Challenges Expensive filer hardware 1000’s of mount points to manage Difficult to scale at 30% year-over-year growth Before MaxiScale April 2010 Page  175 TB  content repository across multiple NAS systems Sharded database instances for profiles, messages… Application Servers Web Servers MaxiScale
Social networking application example Benefits Single storage mount point per server Serve photos and video clips in 5 milliseconds  With MaxiScale April 2010 Page  Application Servers Web Servers Financial savings Capital expense: 60% Operating expense: $105K First-year ROI: 187% News feed,  message,  and profile data Direct photo  retrieval and upload MaxiScale Value
A Deeper Look at Photo Serving Case Study Page  April 2010 MaxiScale
Examples of Photo Serving Requirements Page  April 2010 MaxiScale
Originals in NFR POSIX-compliant hierarchy /nfr/joe@maxiscale.com /vacations /florida High-Resolution Original Images IMG_01234.JPG IMG_04567.JPG IMG_06789.JPG Page  April 2010 MaxiScale
Generate thumbnails /vacations.florida. thumb96 .img01234.jpg /vacations.florida. thumb480 .img01234.jpg /vacations.florida. thumb640 .img01234.jpg Pre-Rendered Thumbnails IMG_01234.JPG 96 pixels wide 480 pixels wide 640 pixels wide Application Process Page  April 2010 MaxiScale
User Profiles Use Key Value Store in SFR to store user profiles Single I/O file retrieval Data record up to 1 MB [email_address] <id>joe@maxiscale.com <password>***** <album set>vacations <album>florida   <images> IMG_01234.JPG,  IMG_04567.JPG,  IMG_06789.JPG <thumbnail bundle 96 pixels> vacations.florida.thumbs96 <album set>family <album>kid Page  April 2010 MaxiScale
MaxiScale Photo Serving Features and Benefits Page  April 2010 MaxiScale MaxiScale Feature Benefit POSIX-compliant file system hierarchy with NFR Segment users, album sets, and albums into traditional directory structure Small File Repository (SFR) Single I/O file retrieval Thumbnails Profiles Key Value Store Single I/O profile retrieval Billions of keys Up to 1 MB value/record length Photo serving feature set leads to reduced infrastructure costs to meet user demand.
Delivering Cloud-Scale File Serving and Storage New scale-out application requirements Horizontal scale, linear performance Full availability and resiliency Utmost cost-efficiency MaxiScale delivers Cloud-Scale file serving and storage software Simple and always online Game changing cost using commodity hardware File System Friendly Key Value Store Page  MaxiScale helps companies handle relentless pressure to store and serve data cost-effectively April 2010 MaxiScale
Thank You @garyorenstein | go@maxiscale.com @maxiscale | www.MaxiScale.com Page  April 2010 MaxiScale

More Related Content

PDF
Storage Made Easy Enterprise File Fabric Brochure
PDF
Enterprise File Fabric for Igneous
PDF
The Enterprise File Fabric for Leonovus User Collaboration Interface (LUCI)
PDF
AWS Summit Singapore 2019 | Snowflake: Your Data. No Limits
PDF
Continuous Data Replication into Cloud Storage with Oracle GoldenGate
PPTX
Get Savvy with Snowflake
PDF
Dropbox - Architecture and Business Prospective
PPTX
Delivering Data Democratization in the Cloud with Snowflake
Storage Made Easy Enterprise File Fabric Brochure
Enterprise File Fabric for Igneous
The Enterprise File Fabric for Leonovus User Collaboration Interface (LUCI)
AWS Summit Singapore 2019 | Snowflake: Your Data. No Limits
Continuous Data Replication into Cloud Storage with Oracle GoldenGate
Get Savvy with Snowflake
Dropbox - Architecture and Business Prospective
Delivering Data Democratization in the Cloud with Snowflake

What's hot (19)

PDF
The Enterprise File Fabric for Scality
PDF
Analyzing Semi-Structured Data At Volume In The Cloud
PDF
Sydney: Certus Data 2.0 Vault Meetup with Snowflake - Data Vault In The Cloud
PDF
The Enterprise File Fabric for OpenIO
PDF
The Enterprise File Fabric for Cloudian | GDPR ready File Sync and Share
PDF
Snowflake for Data Engineering
PPTX
A 30 day plan to start ending your data struggle with Snowflake
PPTX
Data Sharing with Snowflake
PPTX
Introducing the Snowflake Computing Cloud Data Warehouse
PDF
The Enterprise File Fabric for Vecima MediaScaleX
PPTX
SharePoint for FREE
PDF
Demystifying Data Warehousing as a Service - DFW
PPTX
Altis AWS Snowflake Practice
PPTX
Elastic Data Warehousing
PPTX
Snowflake Overview
PPTX
Use Cases for NoSQL in Media
PDF
SLC Snowflake User Group - Mar 12, 2020
PPTX
Demystifying Data Warehouse as a Service
PDF
Snowflake Company Presentation
The Enterprise File Fabric for Scality
Analyzing Semi-Structured Data At Volume In The Cloud
Sydney: Certus Data 2.0 Vault Meetup with Snowflake - Data Vault In The Cloud
The Enterprise File Fabric for OpenIO
The Enterprise File Fabric for Cloudian | GDPR ready File Sync and Share
Snowflake for Data Engineering
A 30 day plan to start ending your data struggle with Snowflake
Data Sharing with Snowflake
Introducing the Snowflake Computing Cloud Data Warehouse
The Enterprise File Fabric for Vecima MediaScaleX
SharePoint for FREE
Demystifying Data Warehousing as a Service - DFW
Altis AWS Snowflake Practice
Elastic Data Warehousing
Snowflake Overview
Use Cases for NoSQL in Media
SLC Snowflake User Group - Mar 12, 2020
Demystifying Data Warehouse as a Service
Snowflake Company Presentation
Ad

Viewers also liked (20)

PDF
Hadoop Inside
PPT
Noble Star Services Sdn Bhd. presentation slides (latest)
PDF
Dallas Android - Android & iOS Development in C# with Xamarin
PPTX
Single page App
PDF
VS Single Page App Design with AngularJS
PDF
Cassandra v3.0 at Rakuten meet-up on 12/2/2015
PDF
DDS-to-JSON and DDS Real-time Data Storage with MongoDB
PDF
Building a Single Page App: One Page at a Time
PDF
Html5-Web-Storage
PDF
User Stories and User Story Mapping by Jason Jones
PPTX
mongoDB 3 type modeling in production
PPTX
Key-Value Pairs
KEY
NoSQL Technology
PDF
Gossip-based algorithms
PDF
NoSQL support in Informix (JSON storage, Mongo DB API)
PDF
Gossip & Key Value Store
PPTX
Five stages of grief: Evolving a multi-page web app to a single page application
PDF
Gremlin: A Graph-Based Programming Language
PPT
RESTful JSON web databases
PDF
Refactoring to a Single Page Application
Hadoop Inside
Noble Star Services Sdn Bhd. presentation slides (latest)
Dallas Android - Android & iOS Development in C# with Xamarin
Single page App
VS Single Page App Design with AngularJS
Cassandra v3.0 at Rakuten meet-up on 12/2/2015
DDS-to-JSON and DDS Real-time Data Storage with MongoDB
Building a Single Page App: One Page at a Time
Html5-Web-Storage
User Stories and User Story Mapping by Jason Jones
mongoDB 3 type modeling in production
Key-Value Pairs
NoSQL Technology
Gossip-based algorithms
NoSQL support in Informix (JSON storage, Mongo DB API)
Gossip & Key Value Store
Five stages of grief: Evolving a multi-page web app to a single page application
Gremlin: A Graph-Based Programming Language
RESTful JSON web databases
Refactoring to a Single Page Application
Ad

Similar to CloudLab: A File System Friendly Key Value Store (20)

PPTX
Watson christofer j_180208
PPT
Running a Megasite on Microsoft Technologies
PDF
Gilbane 2009 -- How Can Content Management Software Keep Pace?
PDF
Enterprise & Media Storage in the Cloud
PPT
Storage 2.0 (Unstructured Data)
PPTX
INFOGOV14 - Trusting Your KM & ECM Strategy to SharePoint
PPT
IUT presentation - English
PDF
Secrets of Enterprise Data Mining 201310
PDF
Introduction Big Data
PPTX
Jonathan Ralton - Trusting Your KM & ECM Strategy To SharePoint
PPTX
Wix Architecture at Scale - QCon London 2014
PPT
Cloud Computing 2010 - Case history - Giancarlo Gaggero - DSI Europe s.r.l.
PPT
Oracle Exec Summary 7000 Unified Storage
PDF
S100299 ibm-cos-orlando-v1804c
PPT
Archiving as a Service - A Model for the Provision of Shared Archiving Servic...
PPTX
A brief history of Azure
PPT
ECM and Open Source Software: A Disruptive Force in ECM Solutions
PPT
Info. Archive Customer Presentation - SSI version
PDF
Spca2014 navigating clouds sp_con14_mackie
PPT
MOSS 2007 Deployment Fundamentals -Part2
Watson christofer j_180208
Running a Megasite on Microsoft Technologies
Gilbane 2009 -- How Can Content Management Software Keep Pace?
Enterprise & Media Storage in the Cloud
Storage 2.0 (Unstructured Data)
INFOGOV14 - Trusting Your KM & ECM Strategy to SharePoint
IUT presentation - English
Secrets of Enterprise Data Mining 201310
Introduction Big Data
Jonathan Ralton - Trusting Your KM & ECM Strategy To SharePoint
Wix Architecture at Scale - QCon London 2014
Cloud Computing 2010 - Case history - Giancarlo Gaggero - DSI Europe s.r.l.
Oracle Exec Summary 7000 Unified Storage
S100299 ibm-cos-orlando-v1804c
Archiving as a Service - A Model for the Provision of Shared Archiving Servic...
A brief history of Azure
ECM and Open Source Software: A Disruptive Force in ECM Solutions
Info. Archive Customer Presentation - SSI version
Spca2014 navigating clouds sp_con14_mackie
MOSS 2007 Deployment Fundamentals -Part2

Recently uploaded (20)

PDF
Hybrid model detection and classification of lung cancer
PPTX
The various Industrial Revolutions .pptx
PDF
Assigned Numbers - 2025 - Bluetooth® Document
PPTX
TLE Review Electricity (Electricity).pptx
PPTX
Modernising the Digital Integration Hub
PDF
Microsoft Solutions Partner Drive Digital Transformation with D365.pdf
PDF
TrustArc Webinar - Click, Consent, Trust: Winning the Privacy Game
PDF
From MVP to Full-Scale Product A Startup’s Software Journey.pdf
PPTX
Group 1 Presentation -Planning and Decision Making .pptx
PDF
NewMind AI Weekly Chronicles – August ’25 Week III
PDF
Transform Your ITIL® 4 & ITSM Strategy with AI in 2025.pdf
PDF
How ambidextrous entrepreneurial leaders react to the artificial intelligence...
PDF
Zenith AI: Advanced Artificial Intelligence
PDF
project resource management chapter-09.pdf
PDF
1 - Historical Antecedents, Social Consideration.pdf
PDF
DP Operators-handbook-extract for the Mautical Institute
PDF
2021 HotChips TSMC Packaging Technologies for Chiplets and 3D_0819 publish_pu...
PDF
A contest of sentiment analysis: k-nearest neighbor versus neural network
PPT
Module 1.ppt Iot fundamentals and Architecture
PDF
ENT215_Completing-a-large-scale-migration-and-modernization-with-AWS.pdf
Hybrid model detection and classification of lung cancer
The various Industrial Revolutions .pptx
Assigned Numbers - 2025 - Bluetooth® Document
TLE Review Electricity (Electricity).pptx
Modernising the Digital Integration Hub
Microsoft Solutions Partner Drive Digital Transformation with D365.pdf
TrustArc Webinar - Click, Consent, Trust: Winning the Privacy Game
From MVP to Full-Scale Product A Startup’s Software Journey.pdf
Group 1 Presentation -Planning and Decision Making .pptx
NewMind AI Weekly Chronicles – August ’25 Week III
Transform Your ITIL® 4 & ITSM Strategy with AI in 2025.pdf
How ambidextrous entrepreneurial leaders react to the artificial intelligence...
Zenith AI: Advanced Artificial Intelligence
project resource management chapter-09.pdf
1 - Historical Antecedents, Social Consideration.pdf
DP Operators-handbook-extract for the Mautical Institute
2021 HotChips TSMC Packaging Technologies for Chiplets and 3D_0819 publish_pu...
A contest of sentiment analysis: k-nearest neighbor versus neural network
Module 1.ppt Iot fundamentals and Architecture
ENT215_Completing-a-large-scale-migration-and-modernization-with-AWS.pdf

CloudLab: A File System Friendly Key Value Store

  • 1. A File-System Friendly Key Value Store Cloud-Scale File Serving and Storage MaxiScale FLEX Software Platform @garyorenstein | go@maxiscale.com April 2010 Page MaxiScale
  • 2. The Era of Billions 10 billion Tweets 10 billion downloads 3 billion photos/month 12 billion views January April 2010 Page MaxiScale
  • 3. Beyond Capacity to Users and Objects Everyone is a content consumer AND creator April 2010 Page MaxiScale
  • 4. Impact of The Era of Billions Companies face relentless pressure to store and serve data cost-effectively April 2010 Page MaxiScale
  • 5. Cloud Applications Requirements: Fast, Scalable, Affordable “Light” Structured Data Retrieval Page April 2010 MaxiScale
  • 6. Basics Anatomy of a Key-Value Store April 2010 Page Source: http://dominicfallows.co.uk MaxiScale
  • 7. MaxiScale Key Value Store Benefits Page April 2010 MaxiScale
  • 8. Breakthrough FLEX Software Platform Software Infrastructure On commodity hardware Targeted to Web companies Service Providers Enterprises Delivers file serving and storage to web and scale-out applications Web / App Servers File Serving and Storage April 2010 Page Internet MaxiScale
  • 9. MaxiScale FLEX Platform: Core Technologies Distributed Platform Single Namespace Distributed Metadata and Data Peer Sets Optimized File Serving Unified Management Distributed MapReduce-enabled Planned: Custom apps (Hadoop) Linux and Windows Clients Small File Performance (key-value store) High-Density Storage Scale-Out NAS April 2010 Page MaxiScale Storage I/O Bandwidth CPU
  • 10. Simplicity of a Single Namespace April 2010 Page Small File Performance (key-value store) Scale-Out NAS Single mount point / /NFR/ /SFR/ Directories Large files Small objects Flat hierarchy Screaming performance MaxiScale
  • 11. FLEX Software Platform In Action Utmost system efficiency from client to network to cluster Peer Set April 2010 Page MaxiScale
  • 12. Fast Lookups Directly from Disk Return values up to 1MB in a single disk I/O operation Automatically create file and index entry using POSIX open() call Read index, then metadata and data in single disk I/O Index maps to disk blocks for single disk I/O file retrieval April 2010 Page Index (RAM) Metadata and Data (Disk) MaxiScale
  • 13. Performance Counts Customer results 4 nodes 80KB files 736 files/second 63.6 million per day Files/spindle/sec Read tests with 32KB files and small file optimizations April 2010 Page MaxiScale
  • 14. Selecting a Key Value Store April 2010 Page MaxiScale
  • 15. Accelerating Web content serving Page Brute-force speed-up Hardware and development cost Performance Efficient key-value lookup without database queries Slash cost Fast lookup Web-scale April 2010 MaxiScale Do-it-yourself key value store RAM Cache RAM Cache Relational DB (MySQL …) Relational DB (MySQL …) MaxiScale key value store
  • 16. Page Which key value store and file system are right for your application? High Availability? Key Length? Lookup Latency? Number of Keys? Cost? Size of Values? Data Storage Density? April 2010 MaxiScale
  • 17. Key value store landscape Page Circles represent relative usable capacity at load Memcache High RAM cost in large quantities Cost MaxiScale Single disk I/O Reads Petabyte scale Relational DB Long lookup latency File systems Long read latency Solid state disk High flash cost in large quantities Billions of keys; auto- expandable Key length < 1000 characters Lookup latency 5-10ms per key Stored values up to 1MB Highly available storage Latency April 2010 MaxiScale
  • 18. Placing Key Value Stores in Context April 2010 Page MaxiScale Relational Databases Key-Value Stores File System Friendly Key-Value Stores File Systems SQL Joins ACID Big Data Billions of Small Objects Application Flexibility (unique data access mechanism) Big Data Billions of Small Objects Filename as key (or system generated key) POSIX compliance Same namespace as file system Large Objects Petabytes of content Directory structures POSIX compliance
  • 19. Social Networking Case Study Page April 2010 MaxiScale
  • 20. Social networking application example Challenges Expensive filer hardware 1000’s of mount points to manage Difficult to scale at 30% year-over-year growth Before MaxiScale April 2010 Page 175 TB content repository across multiple NAS systems Sharded database instances for profiles, messages… Application Servers Web Servers MaxiScale
  • 21. Social networking application example Benefits Single storage mount point per server Serve photos and video clips in 5 milliseconds With MaxiScale April 2010 Page Application Servers Web Servers Financial savings Capital expense: 60% Operating expense: $105K First-year ROI: 187% News feed, message, and profile data Direct photo retrieval and upload MaxiScale Value
  • 22. A Deeper Look at Photo Serving Case Study Page April 2010 MaxiScale
  • 23. Examples of Photo Serving Requirements Page April 2010 MaxiScale
  • 24. Originals in NFR POSIX-compliant hierarchy /nfr/joe@maxiscale.com /vacations /florida High-Resolution Original Images IMG_01234.JPG IMG_04567.JPG IMG_06789.JPG Page April 2010 MaxiScale
  • 25. Generate thumbnails /vacations.florida. thumb96 .img01234.jpg /vacations.florida. thumb480 .img01234.jpg /vacations.florida. thumb640 .img01234.jpg Pre-Rendered Thumbnails IMG_01234.JPG 96 pixels wide 480 pixels wide 640 pixels wide Application Process Page April 2010 MaxiScale
  • 26. User Profiles Use Key Value Store in SFR to store user profiles Single I/O file retrieval Data record up to 1 MB [email_address] <id>joe@maxiscale.com <password>***** <album set>vacations <album>florida <images> IMG_01234.JPG, IMG_04567.JPG, IMG_06789.JPG <thumbnail bundle 96 pixels> vacations.florida.thumbs96 <album set>family <album>kid Page April 2010 MaxiScale
  • 27. MaxiScale Photo Serving Features and Benefits Page April 2010 MaxiScale MaxiScale Feature Benefit POSIX-compliant file system hierarchy with NFR Segment users, album sets, and albums into traditional directory structure Small File Repository (SFR) Single I/O file retrieval Thumbnails Profiles Key Value Store Single I/O profile retrieval Billions of keys Up to 1 MB value/record length Photo serving feature set leads to reduced infrastructure costs to meet user demand.
  • 28. Delivering Cloud-Scale File Serving and Storage New scale-out application requirements Horizontal scale, linear performance Full availability and resiliency Utmost cost-efficiency MaxiScale delivers Cloud-Scale file serving and storage software Simple and always online Game changing cost using commodity hardware File System Friendly Key Value Store Page MaxiScale helps companies handle relentless pressure to store and serve data cost-effectively April 2010 MaxiScale
  • 29. Thank You @garyorenstein | go@maxiscale.com @maxiscale | www.MaxiScale.com Page April 2010 MaxiScale