SlideShare a Scribd company logo
Data storage in clouds
Athens OpenStack User Group #OSATH
12th Meetup, 22th April 2015
Thanassis Parathyras,
aparathyras@stackmasters.eu, @parathyras
Announcements
#OSATH has 200 members !
Greek Mailing List
• openstack-el@lists.openstack.org
• Join at http://lists.openstack.org
OpenStack Summit (https://www.openstack.org)
• 18-22 May, Vancouver
OpenStack CEE Day (http://openstackceeday.com)
• 8 June, Budapest
Outline
• Storage types
• OpenStack Storage Services
– Glance
– Cinder
– Swift
• Higher abstractions
– Data Processing aaS (Sahara)
– DBaaS (Trove)
• Other OSS projects
– GlusterFS
– Ceph
It’s all about DATA
“often called storage or memory,
is a technology consisting of computer components and
recording media used to retain digital data.
It is a core function and fundamental component of
computers.”
ent. Computer data storage, Wikipedia
File Disk Drive CD-ROM
Object Volume Image
Swift Cinder Glance
Amazon S3 Amazon EBS
AWS Marketplace –
Software Infrastructure
Storage types
Glance
OpenStack Image service
• REST API
• CRUD and Search features
• Caching and prefetching
• Supports several formats:
– raw, qcow2, vmdk, vhd, ami (aki, ari), iso, vdi
• Containers:
– bare, ovf, ami (aki, ari)
• Backend to storage (default: Swift)
• Able to aggregate multiple back-ends
– Can also increase availability
Cinder
OpenStack Block Storage service
• Volumes
• Snapshots
• Backups
• Modular architecture to support 50+ back-ends
• LVM is the default (iSCSI)
• Diverse storage types
– iSCSI, Fibre, RBD (Ceph), NFS, GPFS, …
• Manages storage resources separately
Ephemeral vs Block Storage
• Nova manages ephemeral storage coupled with VM state (non-persistent)
• Cinder manages block storage decoupled from VM lifecylce (persistent)
Swift
OpenStack Object Storage service
• REST API
• Data redundancy (3x or more)
• Drive auditing built-in
• RAID not required
• Commodity hardware (not low-end)
• High availability
• Distributed
• Eventual consistency
CAP theorem
• Choose 2 out of 3 (Consistency, Availability, Partition Tolerance)
• Swift implements an eventual consistency model
Swift in action
Access tier
• Proxy server
Storage tier
• Account server
• Container server
• Object server
• Consistency servers (Auditors,
Updaters, Replicators)
Zones
• Selected per deployment
• Determine replica isolation
(disk, server, rack, room)
Swift in detail
Upload
• PUT http://<swift_url>/<acc>/<cont>/fileA
Internals
• Consistent hashing – DHT
• Hash function: md5sum
Ring
• Static mapping to direct data location
• Zones, disks, partitions and replicas
• Across every node in the cluster
1
2
3
45
6
7
8
partition
Swift in detail
Partition power
• Estimate 100 partitions per
disk at max capacity
• Calculate closest power of 2
rounded up
• 2partition_power=partition
number
Disks and partitions
• Swift partition <> filesystem
partitions
• Presume 4 disks named
A, B, C, D
A
A
B
BC
C
D
D
partition
Putting it all together
A. Spawn a
new VM from
an Image
B. Attach
volume(s) to VM
C. Store
application objects
D. Retrieve stored
objects
Lifting more … data
OpenStack services moving up to stack
• Infrastructure (image, volume, snapshot, backup)
• Platform (database, analytics)
Trove – DBaaS
• MySQL, Percona, MariaDB, MongoDB, Couchbase, Cassandra, Redis,
PostgreSQL, Oracle
• Equivelant to Amazon {RDS, DynamoDB}
• Backup/Restore, Resize, Replication, User/DB management, etc.
Sahara – Big data analytics
• Hadoop from Hortonworks or Cloudera, Spark
• Amazon Elastic MapReduce (EMR)
• Manage and configure cluster, HDFS, MapReduce
OSS storage projects
Lots of them, common goals to support scalable, large, software-
defined storage systems
Ceph
• Based on RADOS and CRUSH provides object, block and file-system storage
• 10+ years development effort
• http://www.meetup.com/Ceph-Athens/
GlusterFS
• Simple to use scale-out storage provides unified access to files and objects
• Data stored in native format, no metadata completely algorithmic
Get involved
• Documentation
– http://docs.openstack.org
• Join the community
– http://www.openstack.org/community
• Greek mailing list
– http://lists.openstack.org/cgi-bin/mailman/listinfo/openstack-el
• Contribute
– code (development, blueprints, reviews, bugs)
– docs writing, translations, infrastructure support
Thank you!
Athens OpenStack User Group #OSATH
http://www.meetup.com/Athens-OpenStack-User-Group
Thanassis Parathyras
aparathyras@stackmasters.eu, @parathyras

More Related Content

PPTX
High Availability in OpenStack
PPTX
#OSATH Deploy OpenStack: DevStack
PPTX
Project: OpenStack, #OSATH
PPTX
Docker in OpenStack
PPTX
After summit catch up
PDF
DevStack: A developers cloud in a box.
ODP
Elastic Stack ELK, Beats, and Cloud
PDF
Bacd zenoss
High Availability in OpenStack
#OSATH Deploy OpenStack: DevStack
Project: OpenStack, #OSATH
Docker in OpenStack
After summit catch up
DevStack: A developers cloud in a box.
Elastic Stack ELK, Beats, and Cloud
Bacd zenoss

What's hot (20)

PPTX
Sinfonier: How I turned my grandmother into a data analyst - Fran J. Gomez - ...
PDF
OpenStack-Ansible Project Update
ODP
Puppetpreso
PPTX
Containerization with Microsoft Azure
PPTX
Intro to OpenStack
PPTX
Containerization in microsoft azure
PDF
Chef for OpenStack: OpenStack Spring Summit 2013
PPTX
Log analysis using Logstash,ElasticSearch and Kibana
PDF
Docker 1.12 (dockercon recap)
PDF
London Hug 19/5 - Terraform in Production
PDF
AWS Elastic Container Service - DockerHN
PDF
Mirantis v OpenStack Ansible Dawn of Production
PPTX
Changing the world with ZeroVM and Swift
PPTX
Hammock, a Good Place to Rest
PPTX
Azure container service overview
PDF
Managing Your Cisco Datacenter Network with Ansible
PDF
OpenStack en 10 minutes
PDF
Docker architecture-05
PDF
Hashicorp @ JUST EAT - Part 2
PPT
Drupal and Elasticsearch
Sinfonier: How I turned my grandmother into a data analyst - Fran J. Gomez - ...
OpenStack-Ansible Project Update
Puppetpreso
Containerization with Microsoft Azure
Intro to OpenStack
Containerization in microsoft azure
Chef for OpenStack: OpenStack Spring Summit 2013
Log analysis using Logstash,ElasticSearch and Kibana
Docker 1.12 (dockercon recap)
London Hug 19/5 - Terraform in Production
AWS Elastic Container Service - DockerHN
Mirantis v OpenStack Ansible Dawn of Production
Changing the world with ZeroVM and Swift
Hammock, a Good Place to Rest
Azure container service overview
Managing Your Cisco Datacenter Network with Ansible
OpenStack en 10 minutes
Docker architecture-05
Hashicorp @ JUST EAT - Part 2
Drupal and Elasticsearch
Ad

Similar to Data storage in clouds (20)

PPTX
OpenStack Swift In the Enterprise
PDF
New use cases for Ceph, beyond OpenStack, Luis Rico
PDF
Ceph data services in a multi- and hybrid cloud world
PPTX
Dfs in iaa_s
PDF
Red Hat Storage Day Boston - OpenStack + Ceph Storage
PDF
adp.ceph.openstack.talk
PPTX
Scalable Object Storage with Apache CloudStack and Apache Hadoop
PDF
Ceph Day Amsterdam 2015: Measuring and predicting performance of Ceph clusters
PPTX
Storage as a service OpenStack
PDF
Introduction to OpenStack Storage
PDF
Deep Dive into Openstack Storage, Sean Cohen, Red Hat
PDF
Deep dive into OpenStack storage, Sean Cohen, Red Hat
PPTX
Ceph Day Santa Clara: Ceph at DreamHost
PDF
NAVER Ceph Storage on ssd for Container
PDF
Initial presentation of swift (for montreal user group)
PPTX
Storage as a service and OpenStack Cinder
PDF
Storing your data in the cloud: doing right reversim 2018
PDF
Peanut Butter and jelly: Mapping the deep Integration between Ceph and OpenStack
PDF
OpenStack Cinder, Implementation Today and New Trends for Tomorrow
PPT
Openstack swift - VietOpenStack 6thmeeetup
OpenStack Swift In the Enterprise
New use cases for Ceph, beyond OpenStack, Luis Rico
Ceph data services in a multi- and hybrid cloud world
Dfs in iaa_s
Red Hat Storage Day Boston - OpenStack + Ceph Storage
adp.ceph.openstack.talk
Scalable Object Storage with Apache CloudStack and Apache Hadoop
Ceph Day Amsterdam 2015: Measuring and predicting performance of Ceph clusters
Storage as a service OpenStack
Introduction to OpenStack Storage
Deep Dive into Openstack Storage, Sean Cohen, Red Hat
Deep dive into OpenStack storage, Sean Cohen, Red Hat
Ceph Day Santa Clara: Ceph at DreamHost
NAVER Ceph Storage on ssd for Container
Initial presentation of swift (for montreal user group)
Storage as a service and OpenStack Cinder
Storing your data in the cloud: doing right reversim 2018
Peanut Butter and jelly: Mapping the deep Integration between Ceph and OpenStack
OpenStack Cinder, Implementation Today and New Trends for Tomorrow
Openstack swift - VietOpenStack 6thmeeetup
Ad

More from Thanassis Parathyras (9)

PPTX
Cycleops - Automate deployments on top of bare metal.pptx
PPTX
Add Cycleops to your development cycles - Docker Athens meetup.pptx
PDF
A practitioner's tale on clouds
PPTX
Open cloud infrastructure
PPTX
OpenStack: programmable infrastructure
PPTX
Operating the cloud
PPTX
OpenStack basics
PPTX
#OSATH Deploy OpenStack: DevStack (cont'd)
PPTX
Project CHOReOS DynaRoute Use Case
Cycleops - Automate deployments on top of bare metal.pptx
Add Cycleops to your development cycles - Docker Athens meetup.pptx
A practitioner's tale on clouds
Open cloud infrastructure
OpenStack: programmable infrastructure
Operating the cloud
OpenStack basics
#OSATH Deploy OpenStack: DevStack (cont'd)
Project CHOReOS DynaRoute Use Case

Recently uploaded (20)

PDF
CIFDAQ's Market Insight: SEC Turns Pro Crypto
PDF
Electronic commerce courselecture one. Pdf
PDF
Modernizing your data center with Dell and AMD
PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
PPT
Teaching material agriculture food technology
PDF
Machine learning based COVID-19 study performance prediction
PPTX
Cloud computing and distributed systems.
PDF
[발표본] 너의 과제는 클라우드에 있어_KTDS_김동현_20250524.pdf
PPTX
PA Analog/Digital System: The Backbone of Modern Surveillance and Communication
PDF
Diabetes mellitus diagnosis method based random forest with bat algorithm
PDF
Network Security Unit 5.pdf for BCA BBA.
PDF
Advanced methodologies resolving dimensionality complications for autism neur...
PPTX
Understanding_Digital_Forensics_Presentation.pptx
PDF
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
PDF
Spectral efficient network and resource selection model in 5G networks
PDF
Advanced Soft Computing BINUS July 2025.pdf
PDF
solutions_manual_-_materials___processing_in_manufacturing__demargo_.pdf
PDF
GamePlan Trading System Review: Professional Trader's Honest Take
PPTX
Big Data Technologies - Introduction.pptx
PDF
Empathic Computing: Creating Shared Understanding
CIFDAQ's Market Insight: SEC Turns Pro Crypto
Electronic commerce courselecture one. Pdf
Modernizing your data center with Dell and AMD
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
Teaching material agriculture food technology
Machine learning based COVID-19 study performance prediction
Cloud computing and distributed systems.
[발표본] 너의 과제는 클라우드에 있어_KTDS_김동현_20250524.pdf
PA Analog/Digital System: The Backbone of Modern Surveillance and Communication
Diabetes mellitus diagnosis method based random forest with bat algorithm
Network Security Unit 5.pdf for BCA BBA.
Advanced methodologies resolving dimensionality complications for autism neur...
Understanding_Digital_Forensics_Presentation.pptx
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
Spectral efficient network and resource selection model in 5G networks
Advanced Soft Computing BINUS July 2025.pdf
solutions_manual_-_materials___processing_in_manufacturing__demargo_.pdf
GamePlan Trading System Review: Professional Trader's Honest Take
Big Data Technologies - Introduction.pptx
Empathic Computing: Creating Shared Understanding

Data storage in clouds

  • 1. Data storage in clouds Athens OpenStack User Group #OSATH 12th Meetup, 22th April 2015 Thanassis Parathyras, aparathyras@stackmasters.eu, @parathyras
  • 2. Announcements #OSATH has 200 members ! Greek Mailing List • openstack-el@lists.openstack.org • Join at http://lists.openstack.org OpenStack Summit (https://www.openstack.org) • 18-22 May, Vancouver OpenStack CEE Day (http://openstackceeday.com) • 8 June, Budapest
  • 3. Outline • Storage types • OpenStack Storage Services – Glance – Cinder – Swift • Higher abstractions – Data Processing aaS (Sahara) – DBaaS (Trove) • Other OSS projects – GlusterFS – Ceph
  • 4. It’s all about DATA “often called storage or memory, is a technology consisting of computer components and recording media used to retain digital data. It is a core function and fundamental component of computers.” ent. Computer data storage, Wikipedia
  • 5. File Disk Drive CD-ROM Object Volume Image Swift Cinder Glance Amazon S3 Amazon EBS AWS Marketplace – Software Infrastructure Storage types
  • 6. Glance OpenStack Image service • REST API • CRUD and Search features • Caching and prefetching • Supports several formats: – raw, qcow2, vmdk, vhd, ami (aki, ari), iso, vdi • Containers: – bare, ovf, ami (aki, ari) • Backend to storage (default: Swift) • Able to aggregate multiple back-ends – Can also increase availability
  • 7. Cinder OpenStack Block Storage service • Volumes • Snapshots • Backups • Modular architecture to support 50+ back-ends • LVM is the default (iSCSI) • Diverse storage types – iSCSI, Fibre, RBD (Ceph), NFS, GPFS, … • Manages storage resources separately Ephemeral vs Block Storage • Nova manages ephemeral storage coupled with VM state (non-persistent) • Cinder manages block storage decoupled from VM lifecylce (persistent)
  • 8. Swift OpenStack Object Storage service • REST API • Data redundancy (3x or more) • Drive auditing built-in • RAID not required • Commodity hardware (not low-end) • High availability • Distributed • Eventual consistency CAP theorem • Choose 2 out of 3 (Consistency, Availability, Partition Tolerance) • Swift implements an eventual consistency model
  • 9. Swift in action Access tier • Proxy server Storage tier • Account server • Container server • Object server • Consistency servers (Auditors, Updaters, Replicators) Zones • Selected per deployment • Determine replica isolation (disk, server, rack, room)
  • 10. Swift in detail Upload • PUT http://<swift_url>/<acc>/<cont>/fileA Internals • Consistent hashing – DHT • Hash function: md5sum Ring • Static mapping to direct data location • Zones, disks, partitions and replicas • Across every node in the cluster 1 2 3 45 6 7 8 partition
  • 11. Swift in detail Partition power • Estimate 100 partitions per disk at max capacity • Calculate closest power of 2 rounded up • 2partition_power=partition number Disks and partitions • Swift partition <> filesystem partitions • Presume 4 disks named A, B, C, D A A B BC C D D partition
  • 12. Putting it all together A. Spawn a new VM from an Image B. Attach volume(s) to VM C. Store application objects D. Retrieve stored objects
  • 13. Lifting more … data OpenStack services moving up to stack • Infrastructure (image, volume, snapshot, backup) • Platform (database, analytics) Trove – DBaaS • MySQL, Percona, MariaDB, MongoDB, Couchbase, Cassandra, Redis, PostgreSQL, Oracle • Equivelant to Amazon {RDS, DynamoDB} • Backup/Restore, Resize, Replication, User/DB management, etc. Sahara – Big data analytics • Hadoop from Hortonworks or Cloudera, Spark • Amazon Elastic MapReduce (EMR) • Manage and configure cluster, HDFS, MapReduce
  • 14. OSS storage projects Lots of them, common goals to support scalable, large, software- defined storage systems Ceph • Based on RADOS and CRUSH provides object, block and file-system storage • 10+ years development effort • http://www.meetup.com/Ceph-Athens/ GlusterFS • Simple to use scale-out storage provides unified access to files and objects • Data stored in native format, no metadata completely algorithmic
  • 15. Get involved • Documentation – http://docs.openstack.org • Join the community – http://www.openstack.org/community • Greek mailing list – http://lists.openstack.org/cgi-bin/mailman/listinfo/openstack-el • Contribute – code (development, blueprints, reviews, bugs) – docs writing, translations, infrastructure support
  • 16. Thank you! Athens OpenStack User Group #OSATH http://www.meetup.com/Athens-OpenStack-User-Group Thanassis Parathyras aparathyras@stackmasters.eu, @parathyras