SlideShare a Scribd company logo
1© Copyright 2014 EMC Corporation. All rights reserved.
EMC ViPR Data
Services
Storage Services at Cloud Scale: EMC
Roadmap and Strategy
Download this slide
http://ouo.io/sedyiVIRTUALIZE
EVERYTHING
COMPROMISE
NOTHING
2© Copyright 2014 EMC Corporation. All rights reserved.
Agenda
 ASD Organization
 Market Drivers
 EMC ViPR Overview
 EMC ViPR Data Services
– EMC Object Strategy Overview
– EMC ViPR Object Data Service Deep Dive
– ViPR HDFS Data Service Overview
 Roadmap
3© Copyright 2014 EMC Corporation. All rights reserved.
Advanced Software
Division
4© Copyright 2014 EMC Corporation. All rights reserved.
Advanced Software Division Overview
ASD builds cloud storage and management
software optimized for the cloud and software-
defined data center
 Product portfolio:
– Cloud Data Services
▪ ViPR Data Services
▪ Atmos
▪ Centera
– Automation and Monitoring
▪ ViPR Control Services (ViPR Controller)
▪ SRM Suite
– Network Management
▪ Software Assurance Suite
Amitabh Srivastava
President, ASD
5© Copyright 2014 EMC Corporation. All rights reserved.
Market Overview &
Trends
6© Copyright 2014 EMC Corporation. All rights reserved.
The Big Data Economy
More data sources, richer content, longer utility
40
ZB
Source: IDC 2012 Digital Universe Study
7© Copyright 2014 EMC Corporation. All rights reserved.
Enterprise IT must adapt to support mixed workloads
2012-2016 Workload Growth
8© Copyright 2014 EMC Corporation. All rights reserved.
IT Storage Infrastructure Must Adapt
INEFFICIENT
SCALE
Unconstrained growth,
stranded investments
driving up fixed costs
COMPLEX
MANAGEMENT
Siloed management for
physical, virtual and cloud
creating complexity
DEVELOPMENT
RISK
Legacy infrastructure not
suited for new Web,
mobile and cloud apps
9© Copyright 2014 EMC Corporation. All rights reserved.
Big Data Storage Requirements
In-place analytics and protection of all data types
 Data Unification:
– Big Data storage must support structured, semi-
structured, and unstructured data types.
 In-Place Analytics:
– Analytics, compute workloads need to execute
where the data live.
 Data Compliance:
– More sources of data, more volume, velocity,
etc. exacerbate compliance and long-term
retention requirements
40 ZB
10© Copyright 2014 EMC Corporation. All rights reserved.
Why Software-Defined?
Software-defined accelerates innovation and agility
 Software enables easier and faster
development of new capabilities
– Tightly coupling software and hardware
hinders innovation
– Software updates are possible, but don’t
always go smoothly
 Software-defined brings innovation to
multiple platforms
– New applications and services on existing
hardware adds value
– Targeting multiple hardware platforms
increases value of software
11© Copyright 2014 EMC Corporation. All rights reserved.
EMC ViPR
EMC ViPR and ViPR Data Services
Overview
12© Copyright 2014 EMC Corporation. All rights reserved.
What is Software-defined Storage?
SDS must feature:
– Automation
– Extensibility
– Openness
Software-defined storage abstracts, and pools storage resources and
automates resource delivery and management with intelligent, policy-
driven software
SDS Characteristics:
– It’s Software
– Decouples control plane
from data plane
– Support for multiple storage
platforms
– Programmable API’s
13© Copyright 2014 EMC Corporation. All rights reserved.
EMC ViPR - Software-Defined Storage
ViPR
Data Services
ViPR
Controller
EMC ViPR Platform
Provisioning Self-Service Reporting Automation
Software Control Plane AND Data Plane
Third-Party
Isilon
Atmos
VMAX VNX VPLEX
Commodity
XtremIOCentera
14© Copyright 2014 EMC Corporation. All rights reserved.
Index
Metadata
Policy Management
Data Protection
Replication
EMC ViPR Software Defined Storage
Disk Management
Persistence Layer
Storage Engine
API Interface
Heterogonous storage
Unified
Scale-Out
Extensible
Enterprise Grade
REST, CAS, …
Storage Engine
API Interface
15© Copyright 2014 EMC Corporation. All rights reserved.
Data Services that Span Arrays and Support Hybrid Data Types
ViPR Data Services
 Storage services at cloud scale
– Built in software
– Layered over both traditional and new
storage devices
 Object and HDFS data services in 2013
– Many more to follow, at regular intervals
 Unified platform
– Data services can be used as different
semantic views on the same data e.g.
Object on File, HDFS on Object
16© Copyright 2014 EMC Corporation. All rights reserved.
ViPR Data Services: Architecture
ViPR
Data Path
ViPR
Control Path
• Distributed Infrastructure
• Device Drivers
• Elastic Volumes
• Migration
GEO-SCALE INDEX, METADATA, TRANSACTIONS
… 3rd PARTYOBJECT HDFS KEY-VALUE
Third-Party
Isilon AtmosVMAX VNX VPLEX
Commodity
XtremIOCentera
GEO SCALE INDEX, METADATA, TRANSACTIONS
17© Copyright 2014 EMC Corporation. All rights reserved.
EMC ViPR Software-defined Storage
ViPR Data Services meet the new demands of Big Data
 ViPR unifies storage
– Define data services in software, execute
across heterogeneous infrastructure
 ViPR facilitates analytics
– Use existing storage as a Big Data
repository
 ViPR simplifies compliance
– Simpler tiering, compliance capabilities,
persistence layer flexibility
40 ZB
18© Copyright 2014 EMC Corporation. All rights reserved.
EMC Object Strategy
19© Copyright 2014 EMC Corporation. All rights reserved.
Object Architecture Evolution
2002
•API (Proprietary)
•Single namespace
•Immutable Content
•Unstructured Content
2007
•Rest API
•Geo replication
• namespace
•Multi-tenant
•Unstructured Data
2013-2015
•Universal API Support
•Support Analytics
•Highly efficient Geo storage
•Semi-structured Data
•Flexible Form Factors
20© Copyright 2014 EMC Corporation. All rights reserved.
Today’s Product Offerings
•Single system to buy and grow
•Node-based, scale-out
architecture
•Homogeneous object
•Apply multiple data services across
arrays
•Support hybrid data types (eg:
Object/file, HDFS/Object)
Purpose-built object appliance
Centera / Atmos
Software-defined platform-ViPR 1.0
21© Copyright 2014 EMC Corporation. All rights reserved.
ViPR Object Data
Service Deep Dive
22© Copyright 2014 EMC Corporation. All rights reserved.
ViPR Object Data Service
 Similar to Object model popularized by
Amazon S3
– API support: Amazon S3, OpenStack Swift,
EMC Atmos
– Future API support: EMC Centera CAS
 Extensions
– Byte-range updates, atomic append, rich
ACLs
– Object on File
▪ Access a set of objects as files, directly on the
underlying file storage device, with native file system
performance
23© Copyright 2014 EMC Corporation. All rights reserved.
VIRTUAL STORAGE POOLS
Leverages ViPR Controller provisioning
Configuring Object Storage
Isilon
3rd Party
VNX
5500
VIRTUAL ARRAY
EXCHANGE DATABASEVDI
VIRTUAL ARRAY
3rd Party
VNX
7500
VMAX
40K
24© Copyright 2014 EMC Corporation. All rights reserved.
VIRTUAL STORAGE POOLS
Supports physical segregation
Configuring Object Storage
Isilon
3rd Party
VNX
5500
VIRTUAL ARRAYVIRTUAL ARRAY
CommodityCommodity Commodity
25© Copyright 2014 EMC Corporation. All rights reserved.
Within the Object Store
Buckets provide logical segregation
VIRTUAL ARRAY
CommodityCommodity Commodity
App
1
App
2
App
N
 Data is distributed and
intermingled across the storage
 Buckets grow and shrink on
demand
26© Copyright 2014 EMC Corporation. All rights reserved.
Initial processing
How it works
Isilon
3rd Party
VNX
5500
VIRTUAL ARRAY
• Scalable number of data
nodes process object requests
• Scale-out architecture with full
failover between nodes
• Virtualized for ease of
deployment
27© Copyright 2014 EMC Corporation. All rights reserved.
Writes to containers
How it works
Isilon
3rd Party
VNX
5500
VIRTUAL ARRAY
• Data stored in append-only containers
• Distributed index to track data
locations
• Write first updates container
• Then updates the index
• Success if index update succeeds
28© Copyright 2014 EMC Corporation. All rights reserved.
ViPR Object on File Capability
Extend Object capabilities to file-based storage
 A bucket of Objects can be
dynamically toggled, via API calls,
between different access modes:
– Object mode: Full Object API. No File
access.
– File mode: Native Read/Write access to
Files, directly on the underlying device,
via NFS mount. No Object access.
– Dual mode: Read access as either Object
or File.
29© Copyright 2014 EMC Corporation. All rights reserved.
Object on File: Canonical Example
Next generation video sharing Web application
uses higher latency Object storage to store and
expose uploaded video content1
Traditional video processing apps use lower
latency File storage to curate video content
2
ViPR
Data Path
GEO SCALE INDEX, METADATA, TRANSACTIONS
30© Copyright 2014 EMC Corporation. All rights reserved.
Summary
 ViPR provides storage services at cloud scale
– Built in software
– Layered over both traditional and new storage devices
 ViPR Object data service available today
– HDFS 2H 2013, many more to follow, at regular intervals
 Unified platform
– Data services can be used as different semantic views on
the same data e.g. Object on File, HDFS on Object
31© Copyright 2014 EMC Corporation. All rights reserved.
Feature Roadmap
ViPR Object Data Service
32© Copyright 2014 EMC Corporation. All rights reserved.
Non-Disclosure and Disclaimer
• The information on the following slides is EMC Confidential
and must not be shared with unauthorized parties without prior
written consent of EMC.
• EMC makes no representation and undertakes no obligations
with regard to product planning information, anticipated
product characteristics, performance specifications, or
anticipated release dates (collectively, “Roadmap Information”).
• Roadmap Information is provided by EMC as an
accommodation to the recipient solely for purposes of
discussion and without intending to be bound thereby.
ROADMAP CAN CHANGE. This information is EMC Confidential and must not be shared with unauthorized parties without prior written consent of EMC. EMC
makes no representation and undertakes no obligations with regard to product planning information, anticipated product characteristics, performance specifications,
or anticipated release dates (collectively, “Roadmap Information”). Roadmap Information is provided by EMC as an accommodation to the recipient solely for
purposes of discussion and without intending to be bound thereby.
33© Copyright 2014 EMC Corporation. All rights reserved.
EMC Object: Choice & Flexibility
•Single system to buy and grow
•Node-based, scale-out architecture
•Homogeneous object
•Support multiple data types (eg:
Object/file, HDFS/Object)
•Choice of hardware
•Support low-cost commodity
•Support multiple data types (eg:
Object/file, HDFS/Object)
Purpose-built appliance
(ViPR “v.Next” )
Software-defined platform
(ViPR 2.0)
34© Copyright 2014 EMC Corporation. All rights reserved.
ViPR Object Data Service Roadmap Overview
 Geo Replication
 Geo Distribution
 Compliance
 Commodity hardware support
35© Copyright 2014 EMC Corporation. All rights reserved.
Geo Replication
Increase data protection, access and performance
 Data replicated across sites
optimizes access and reduces
latency.
 Improved data access
 Increased data protection
 Reduced storage overhead
Site 1
Site 3
Site 2
36© Copyright 2014 EMC Corporation. All rights reserved.
Geo Distribution
Anywhere access to content
 Single namespace
– Access all data through a namespace
 Distribute and replicate across
multiple geographically distributed
sites
– Distribute index, meta-data, and data
 Active/Active architecture
– Write to and read from any location
https://accesspoint.yourcompany.com
37© Copyright 2014 EMC Corporation. All rights reserved.
Compliance
Compliance archiving and long-term retention
 Enabled at object or bucket level
 Time-based retention enforcement
 Immutability and checksum
 Platform lockdown
 Privileged delete
 SEC 17a-4f & HIPAA
38© Copyright 2014 EMC Corporation. All rights reserved.
Object on Commodity Hardware
Apply and manage full-featured Object capabilities
VIRTUAL ARRAY
CommodityCommodity Commodity
 Running on top commodity
white boxes
 End to end commodity H/W
bootstrap, setup, rolling
upgrades, and configuration
management.
 Enabling Object, HDFS, Key-
value, native file in the future
…
39© Copyright 2014 EMC Corporation. All rights reserved.
Object on Commodity Hardware
Apply and manage full-featured Object capabilities
VIRTUAL ARRAY
CommodityCommodity Commodity
 Today: Multiple object platforms
 Vision: Platform semantics
become bucket attributes
 Application capture on a bucket
basis
EMC Centera EMC Atmos
EMC CENTERA EMC ATMOS
40© Copyright 2014 EMC Corporation. All rights reserved.
Protected writes on commodity
Object on Commodity
VIRTUAL ARRAY
• Write updates multiple
containers
• Stored on different nodes/drives
• Then updates the index
• Success if index update succeeds
CommodityCommodity Commodity
41© Copyright 2014 EMC Corporation. All rights reserved.
How ViPR efficiently utilizes commodity
Object on Commodity
VIRTUAL ARRAY
• Open containers accept data
• Full containers are immutable
• Transform full containers
• Bulk operation for efficiency
• Bulk re-protection for efficiency
CommodityCommodity Commodity
OpenFull
+
42© Copyright 2014 EMC Corporation. All rights reserved.
Object on Commodity
A common index for unified Object storage
VIRTUAL ARRAY
CommodityCommodity Commodity
 Extend indexing for all object types
 Flexible metadata search
 First class citizen or for analytics set
…
EMC Centera EMC Atmos
Centera Atmos
43© Copyright 2014 EMC Corporation. All rights reserved.
Object Investments & Roadmap
1H 2013 2H 2013 2014
Increase Object Count
Improve Self-Healing
Enhanced Reporting
Serviceability
S3, Atmos, Swift APIs
Isilon, VNX, Third Party arrays
Object Data Service
HDFS Data Service (v1.1)
Gen3 Flex-240 Hardware
GeoDrive Windows updates
Atmos CDP updates
SDK updates
Watch4Net integration
EMC World announcement
Customer EAP
EMC Syncplicity integration
SEC 17a-4f compliance for
REST data
GeoDrive Windows update
Gen3 Flex-360 Hardware
IPv6
Envelope encryption
Tech refresh
GeoDrive Windows updates
Atmos CDP updates
Geo-Replication & Distribution
Compliance
Bring Your Own Commodity
Centera SDK API, Co-existence
& Transformation (Centera
arrays)
Geo-distribution
namespace
HDFS
Emc vi pr data services

More Related Content

PPTX
Emc vi pr global data services
PPTX
Emc vi pr hdfs data service technical overview
PPTX
EMC Academic Alliance overview
 
PPTX
Big Data – General Introduction
 
PPTX
EMC HADOOP Storage Strategy
PDF
EMC ViPR Services Storage Engine Architecture
 
PPTX
EMC Hadoop Starter Kit - ViPR Edition
PPTX
Emc vi pr controller tecnical customer presentation
Emc vi pr global data services
Emc vi pr hdfs data service technical overview
EMC Academic Alliance overview
 
Big Data – General Introduction
 
EMC HADOOP Storage Strategy
EMC ViPR Services Storage Engine Architecture
 
EMC Hadoop Starter Kit - ViPR Edition
Emc vi pr controller tecnical customer presentation

What's hot (20)

PPTX
Emc vi pr software defined storage
PDF
EMC Big Data | Hadoop Starter Kit | EMC Forum 2014
 
PDF
The Future of Storage : EMC Software Defined Solution
 
PPTX
Emc vi pr hdfs data service technical overview
PPTX
EMC ScaleIO Overview
PPTX
Cloud Infrastructure and Services (CIS) - Webinar
 
PDF
EMC Starter Kit - IBM BigInsights - EMC Isilon
PPTX
EMC EC Overview
PPTX
EMC config Hadoop
PDF
EMC Academic Alliance Presentation
PDF
VIPR SOFTWARE-DEFINED STORAGE
PPTX
Emc vi pr controller
PDF
S100293 hybrid-cloud-orlando-v1804a
PDF
Visibility & Security for the Virtualized Enterprise
 
PDF
Cloud Models, Considerations, & Adoption Techniques
 
PDF
Soyez Big Data ready avec Isilon
 
PPTX
Scale IO Software Defined Block Storage
PDF
S100297 ilm-archive-orlando-v1804c
PDF
S100298 pendulum-swings-orlando-v1804a
PDF
S cv3179 spectrum-integration-openstack-edge2015-v5
Emc vi pr software defined storage
EMC Big Data | Hadoop Starter Kit | EMC Forum 2014
 
The Future of Storage : EMC Software Defined Solution
 
Emc vi pr hdfs data service technical overview
EMC ScaleIO Overview
Cloud Infrastructure and Services (CIS) - Webinar
 
EMC Starter Kit - IBM BigInsights - EMC Isilon
EMC EC Overview
EMC config Hadoop
EMC Academic Alliance Presentation
VIPR SOFTWARE-DEFINED STORAGE
Emc vi pr controller
S100293 hybrid-cloud-orlando-v1804a
Visibility & Security for the Virtualized Enterprise
 
Cloud Models, Considerations, & Adoption Techniques
 
Soyez Big Data ready avec Isilon
 
Scale IO Software Defined Block Storage
S100297 ilm-archive-orlando-v1804c
S100298 pendulum-swings-orlando-v1804a
S cv3179 spectrum-integration-openstack-edge2015-v5
Ad

Viewers also liked (20)

DOC
Senha Teste
PPT
Advanced administration and problem determination
PPTX
Conjuntos
PPTX
Shampoo
XLS
Definitivas Diez Tres
PDF
Dj Deville Dpk Page 2 (Pictures)
PPT
Prima e dopo la SEO
RTF
Dokumen
PPT
Past Historical
PPT
Steve Lewis Talks About Lewis Floor Home
PPT
Prueba Power Point
PDF
Induc1
DOC
Proyecto 2010
DOC
Carrocerias karrantzara
PPTX
Hominnidos
PPT
Presentation1
PDF
InvitacióN Robert MüLler
PAGES
Pages Report
PPT
Dora
PDF
Resume
Senha Teste
Advanced administration and problem determination
Conjuntos
Shampoo
Definitivas Diez Tres
Dj Deville Dpk Page 2 (Pictures)
Prima e dopo la SEO
Dokumen
Past Historical
Steve Lewis Talks About Lewis Floor Home
Prueba Power Point
Induc1
Proyecto 2010
Carrocerias karrantzara
Hominnidos
Presentation1
InvitacióN Robert MüLler
Pages Report
Dora
Resume
Ad

Similar to Emc vi pr data services (20)

PPTX
Emc vi pr controller customer presentation
PPTX
Emc vi pr software defined storage
PPTX
Software Defined Datacenter als 'route' naar het 3e IT platform
PDF
L'agilité du cloud public dans votre datacenter avec ECS & Neutrino
 
PPTX
EMC Atmos for service providers
PPTX
EMC Vipr srm-technical Deep dive
PPTX
ECS/Cloud Object Storage - DevOps Day
PPTX
EMC Hybrid Cloud Solutions with VMware
PDF
Srm suite technical presentation nrm - tim piqueur
PPTX
True Storage Virtualization with Software-Defined Storage
PPTX
Emc ecs 2 technical deep dive workshop
PDF
EMC's IT Transformation Journey ( EMC Forum 2014 )
 
PPTX
Emc data domain technical deep dive workshop
PDF
Sa info archive-s_aksenenko
PDF
ViPR Services Storage Engine Architecture
 
PPTX
Emc vipr srm workshop
PDF
VMWare and SoftLayer Hybrid IT
PPTX
NetApp Se training storage grid webscale technical overview
PPTX
Emc data domain
PPTX
Increase Your Mission Critical Application Performance without Breaking the B...
Emc vi pr controller customer presentation
Emc vi pr software defined storage
Software Defined Datacenter als 'route' naar het 3e IT platform
L'agilité du cloud public dans votre datacenter avec ECS & Neutrino
 
EMC Atmos for service providers
EMC Vipr srm-technical Deep dive
ECS/Cloud Object Storage - DevOps Day
EMC Hybrid Cloud Solutions with VMware
Srm suite technical presentation nrm - tim piqueur
True Storage Virtualization with Software-Defined Storage
Emc ecs 2 technical deep dive workshop
EMC's IT Transformation Journey ( EMC Forum 2014 )
 
Emc data domain technical deep dive workshop
Sa info archive-s_aksenenko
ViPR Services Storage Engine Architecture
 
Emc vipr srm workshop
VMWare and SoftLayer Hybrid IT
NetApp Se training storage grid webscale technical overview
Emc data domain
Increase Your Mission Critical Application Performance without Breaking the B...

More from solarisyougood (20)

PPTX
Emc recoverpoint technical
PPTX
Emc vmax3 technical deep workshop
PPTX
Cisco prime network 4.1 technical overview
PPTX
Designing your xen desktop 7.5 environment with training guide
PPT
Ibm aix technical deep dive workshop advanced administration and problem dete...
PPT
Ibm power ha v7 technical deep dive workshop
PPT
Power8 hardware technical deep dive workshop
PPT
Power systems virtualization with power kvm
PPTX
Power vc for powervm deep dive tips & tricks
PPT
Ibm flash system v9000 technical deep dive workshop
PPTX
Emc vnx2 technical deep dive workshop
PPTX
Emc isilon technical deep dive workshop
PPTX
Emc vplex deep dive
PPTX
Cisco mds 9148 s training workshop
PPTX
Cisco cloud computing deploying openstack
PPTX
Se training storage grid webscale technical overview
PPTX
Vmware 2015 with vsphereHigh performance application platforms
PPTX
Deep dive into new features in v realizeoperations 6.0
PPTX
Reference design for v mware nsx
PPTX
Nsx security deep dive
Emc recoverpoint technical
Emc vmax3 technical deep workshop
Cisco prime network 4.1 technical overview
Designing your xen desktop 7.5 environment with training guide
Ibm aix technical deep dive workshop advanced administration and problem dete...
Ibm power ha v7 technical deep dive workshop
Power8 hardware technical deep dive workshop
Power systems virtualization with power kvm
Power vc for powervm deep dive tips & tricks
Ibm flash system v9000 technical deep dive workshop
Emc vnx2 technical deep dive workshop
Emc isilon technical deep dive workshop
Emc vplex deep dive
Cisco mds 9148 s training workshop
Cisco cloud computing deploying openstack
Se training storage grid webscale technical overview
Vmware 2015 with vsphereHigh performance application platforms
Deep dive into new features in v realizeoperations 6.0
Reference design for v mware nsx
Nsx security deep dive

Recently uploaded (20)

PPTX
A Presentation on Artificial Intelligence
PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
PDF
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
PDF
Unlocking AI with Model Context Protocol (MCP)
PDF
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
PDF
Approach and Philosophy of On baking technology
PDF
Review of recent advances in non-invasive hemoglobin estimation
PDF
Electronic commerce courselecture one. Pdf
PDF
Chapter 3 Spatial Domain Image Processing.pdf
PDF
KodekX | Application Modernization Development
PDF
cuic standard and advanced reporting.pdf
PDF
Empathic Computing: Creating Shared Understanding
PPTX
Digital-Transformation-Roadmap-for-Companies.pptx
PPTX
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
PDF
NewMind AI Monthly Chronicles - July 2025
PDF
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
PDF
Agricultural_Statistics_at_a_Glance_2022_0.pdf
DOCX
The AUB Centre for AI in Media Proposal.docx
PDF
Network Security Unit 5.pdf for BCA BBA.
PPTX
20250228 LYD VKU AI Blended-Learning.pptx
A Presentation on Artificial Intelligence
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
Unlocking AI with Model Context Protocol (MCP)
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
Approach and Philosophy of On baking technology
Review of recent advances in non-invasive hemoglobin estimation
Electronic commerce courselecture one. Pdf
Chapter 3 Spatial Domain Image Processing.pdf
KodekX | Application Modernization Development
cuic standard and advanced reporting.pdf
Empathic Computing: Creating Shared Understanding
Digital-Transformation-Roadmap-for-Companies.pptx
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
NewMind AI Monthly Chronicles - July 2025
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
Agricultural_Statistics_at_a_Glance_2022_0.pdf
The AUB Centre for AI in Media Proposal.docx
Network Security Unit 5.pdf for BCA BBA.
20250228 LYD VKU AI Blended-Learning.pptx

Emc vi pr data services

  • 1. 1© Copyright 2014 EMC Corporation. All rights reserved. EMC ViPR Data Services Storage Services at Cloud Scale: EMC Roadmap and Strategy Download this slide http://ouo.io/sedyiVIRTUALIZE EVERYTHING COMPROMISE NOTHING
  • 2. 2© Copyright 2014 EMC Corporation. All rights reserved. Agenda  ASD Organization  Market Drivers  EMC ViPR Overview  EMC ViPR Data Services – EMC Object Strategy Overview – EMC ViPR Object Data Service Deep Dive – ViPR HDFS Data Service Overview  Roadmap
  • 3. 3© Copyright 2014 EMC Corporation. All rights reserved. Advanced Software Division
  • 4. 4© Copyright 2014 EMC Corporation. All rights reserved. Advanced Software Division Overview ASD builds cloud storage and management software optimized for the cloud and software- defined data center  Product portfolio: – Cloud Data Services ▪ ViPR Data Services ▪ Atmos ▪ Centera – Automation and Monitoring ▪ ViPR Control Services (ViPR Controller) ▪ SRM Suite – Network Management ▪ Software Assurance Suite Amitabh Srivastava President, ASD
  • 5. 5© Copyright 2014 EMC Corporation. All rights reserved. Market Overview & Trends
  • 6. 6© Copyright 2014 EMC Corporation. All rights reserved. The Big Data Economy More data sources, richer content, longer utility 40 ZB Source: IDC 2012 Digital Universe Study
  • 7. 7© Copyright 2014 EMC Corporation. All rights reserved. Enterprise IT must adapt to support mixed workloads 2012-2016 Workload Growth
  • 8. 8© Copyright 2014 EMC Corporation. All rights reserved. IT Storage Infrastructure Must Adapt INEFFICIENT SCALE Unconstrained growth, stranded investments driving up fixed costs COMPLEX MANAGEMENT Siloed management for physical, virtual and cloud creating complexity DEVELOPMENT RISK Legacy infrastructure not suited for new Web, mobile and cloud apps
  • 9. 9© Copyright 2014 EMC Corporation. All rights reserved. Big Data Storage Requirements In-place analytics and protection of all data types  Data Unification: – Big Data storage must support structured, semi- structured, and unstructured data types.  In-Place Analytics: – Analytics, compute workloads need to execute where the data live.  Data Compliance: – More sources of data, more volume, velocity, etc. exacerbate compliance and long-term retention requirements 40 ZB
  • 10. 10© Copyright 2014 EMC Corporation. All rights reserved. Why Software-Defined? Software-defined accelerates innovation and agility  Software enables easier and faster development of new capabilities – Tightly coupling software and hardware hinders innovation – Software updates are possible, but don’t always go smoothly  Software-defined brings innovation to multiple platforms – New applications and services on existing hardware adds value – Targeting multiple hardware platforms increases value of software
  • 11. 11© Copyright 2014 EMC Corporation. All rights reserved. EMC ViPR EMC ViPR and ViPR Data Services Overview
  • 12. 12© Copyright 2014 EMC Corporation. All rights reserved. What is Software-defined Storage? SDS must feature: – Automation – Extensibility – Openness Software-defined storage abstracts, and pools storage resources and automates resource delivery and management with intelligent, policy- driven software SDS Characteristics: – It’s Software – Decouples control plane from data plane – Support for multiple storage platforms – Programmable API’s
  • 13. 13© Copyright 2014 EMC Corporation. All rights reserved. EMC ViPR - Software-Defined Storage ViPR Data Services ViPR Controller EMC ViPR Platform Provisioning Self-Service Reporting Automation Software Control Plane AND Data Plane Third-Party Isilon Atmos VMAX VNX VPLEX Commodity XtremIOCentera
  • 14. 14© Copyright 2014 EMC Corporation. All rights reserved. Index Metadata Policy Management Data Protection Replication EMC ViPR Software Defined Storage Disk Management Persistence Layer Storage Engine API Interface Heterogonous storage Unified Scale-Out Extensible Enterprise Grade REST, CAS, … Storage Engine API Interface
  • 15. 15© Copyright 2014 EMC Corporation. All rights reserved. Data Services that Span Arrays and Support Hybrid Data Types ViPR Data Services  Storage services at cloud scale – Built in software – Layered over both traditional and new storage devices  Object and HDFS data services in 2013 – Many more to follow, at regular intervals  Unified platform – Data services can be used as different semantic views on the same data e.g. Object on File, HDFS on Object
  • 16. 16© Copyright 2014 EMC Corporation. All rights reserved. ViPR Data Services: Architecture ViPR Data Path ViPR Control Path • Distributed Infrastructure • Device Drivers • Elastic Volumes • Migration GEO-SCALE INDEX, METADATA, TRANSACTIONS … 3rd PARTYOBJECT HDFS KEY-VALUE Third-Party Isilon AtmosVMAX VNX VPLEX Commodity XtremIOCentera GEO SCALE INDEX, METADATA, TRANSACTIONS
  • 17. 17© Copyright 2014 EMC Corporation. All rights reserved. EMC ViPR Software-defined Storage ViPR Data Services meet the new demands of Big Data  ViPR unifies storage – Define data services in software, execute across heterogeneous infrastructure  ViPR facilitates analytics – Use existing storage as a Big Data repository  ViPR simplifies compliance – Simpler tiering, compliance capabilities, persistence layer flexibility 40 ZB
  • 18. 18© Copyright 2014 EMC Corporation. All rights reserved. EMC Object Strategy
  • 19. 19© Copyright 2014 EMC Corporation. All rights reserved. Object Architecture Evolution 2002 •API (Proprietary) •Single namespace •Immutable Content •Unstructured Content 2007 •Rest API •Geo replication • namespace •Multi-tenant •Unstructured Data 2013-2015 •Universal API Support •Support Analytics •Highly efficient Geo storage •Semi-structured Data •Flexible Form Factors
  • 20. 20© Copyright 2014 EMC Corporation. All rights reserved. Today’s Product Offerings •Single system to buy and grow •Node-based, scale-out architecture •Homogeneous object •Apply multiple data services across arrays •Support hybrid data types (eg: Object/file, HDFS/Object) Purpose-built object appliance Centera / Atmos Software-defined platform-ViPR 1.0
  • 21. 21© Copyright 2014 EMC Corporation. All rights reserved. ViPR Object Data Service Deep Dive
  • 22. 22© Copyright 2014 EMC Corporation. All rights reserved. ViPR Object Data Service  Similar to Object model popularized by Amazon S3 – API support: Amazon S3, OpenStack Swift, EMC Atmos – Future API support: EMC Centera CAS  Extensions – Byte-range updates, atomic append, rich ACLs – Object on File ▪ Access a set of objects as files, directly on the underlying file storage device, with native file system performance
  • 23. 23© Copyright 2014 EMC Corporation. All rights reserved. VIRTUAL STORAGE POOLS Leverages ViPR Controller provisioning Configuring Object Storage Isilon 3rd Party VNX 5500 VIRTUAL ARRAY EXCHANGE DATABASEVDI VIRTUAL ARRAY 3rd Party VNX 7500 VMAX 40K
  • 24. 24© Copyright 2014 EMC Corporation. All rights reserved. VIRTUAL STORAGE POOLS Supports physical segregation Configuring Object Storage Isilon 3rd Party VNX 5500 VIRTUAL ARRAYVIRTUAL ARRAY CommodityCommodity Commodity
  • 25. 25© Copyright 2014 EMC Corporation. All rights reserved. Within the Object Store Buckets provide logical segregation VIRTUAL ARRAY CommodityCommodity Commodity App 1 App 2 App N  Data is distributed and intermingled across the storage  Buckets grow and shrink on demand
  • 26. 26© Copyright 2014 EMC Corporation. All rights reserved. Initial processing How it works Isilon 3rd Party VNX 5500 VIRTUAL ARRAY • Scalable number of data nodes process object requests • Scale-out architecture with full failover between nodes • Virtualized for ease of deployment
  • 27. 27© Copyright 2014 EMC Corporation. All rights reserved. Writes to containers How it works Isilon 3rd Party VNX 5500 VIRTUAL ARRAY • Data stored in append-only containers • Distributed index to track data locations • Write first updates container • Then updates the index • Success if index update succeeds
  • 28. 28© Copyright 2014 EMC Corporation. All rights reserved. ViPR Object on File Capability Extend Object capabilities to file-based storage  A bucket of Objects can be dynamically toggled, via API calls, between different access modes: – Object mode: Full Object API. No File access. – File mode: Native Read/Write access to Files, directly on the underlying device, via NFS mount. No Object access. – Dual mode: Read access as either Object or File.
  • 29. 29© Copyright 2014 EMC Corporation. All rights reserved. Object on File: Canonical Example Next generation video sharing Web application uses higher latency Object storage to store and expose uploaded video content1 Traditional video processing apps use lower latency File storage to curate video content 2 ViPR Data Path GEO SCALE INDEX, METADATA, TRANSACTIONS
  • 30. 30© Copyright 2014 EMC Corporation. All rights reserved. Summary  ViPR provides storage services at cloud scale – Built in software – Layered over both traditional and new storage devices  ViPR Object data service available today – HDFS 2H 2013, many more to follow, at regular intervals  Unified platform – Data services can be used as different semantic views on the same data e.g. Object on File, HDFS on Object
  • 31. 31© Copyright 2014 EMC Corporation. All rights reserved. Feature Roadmap ViPR Object Data Service
  • 32. 32© Copyright 2014 EMC Corporation. All rights reserved. Non-Disclosure and Disclaimer • The information on the following slides is EMC Confidential and must not be shared with unauthorized parties without prior written consent of EMC. • EMC makes no representation and undertakes no obligations with regard to product planning information, anticipated product characteristics, performance specifications, or anticipated release dates (collectively, “Roadmap Information”). • Roadmap Information is provided by EMC as an accommodation to the recipient solely for purposes of discussion and without intending to be bound thereby. ROADMAP CAN CHANGE. This information is EMC Confidential and must not be shared with unauthorized parties without prior written consent of EMC. EMC makes no representation and undertakes no obligations with regard to product planning information, anticipated product characteristics, performance specifications, or anticipated release dates (collectively, “Roadmap Information”). Roadmap Information is provided by EMC as an accommodation to the recipient solely for purposes of discussion and without intending to be bound thereby.
  • 33. 33© Copyright 2014 EMC Corporation. All rights reserved. EMC Object: Choice & Flexibility •Single system to buy and grow •Node-based, scale-out architecture •Homogeneous object •Support multiple data types (eg: Object/file, HDFS/Object) •Choice of hardware •Support low-cost commodity •Support multiple data types (eg: Object/file, HDFS/Object) Purpose-built appliance (ViPR “v.Next” ) Software-defined platform (ViPR 2.0)
  • 34. 34© Copyright 2014 EMC Corporation. All rights reserved. ViPR Object Data Service Roadmap Overview  Geo Replication  Geo Distribution  Compliance  Commodity hardware support
  • 35. 35© Copyright 2014 EMC Corporation. All rights reserved. Geo Replication Increase data protection, access and performance  Data replicated across sites optimizes access and reduces latency.  Improved data access  Increased data protection  Reduced storage overhead Site 1 Site 3 Site 2
  • 36. 36© Copyright 2014 EMC Corporation. All rights reserved. Geo Distribution Anywhere access to content  Single namespace – Access all data through a namespace  Distribute and replicate across multiple geographically distributed sites – Distribute index, meta-data, and data  Active/Active architecture – Write to and read from any location https://accesspoint.yourcompany.com
  • 37. 37© Copyright 2014 EMC Corporation. All rights reserved. Compliance Compliance archiving and long-term retention  Enabled at object or bucket level  Time-based retention enforcement  Immutability and checksum  Platform lockdown  Privileged delete  SEC 17a-4f & HIPAA
  • 38. 38© Copyright 2014 EMC Corporation. All rights reserved. Object on Commodity Hardware Apply and manage full-featured Object capabilities VIRTUAL ARRAY CommodityCommodity Commodity  Running on top commodity white boxes  End to end commodity H/W bootstrap, setup, rolling upgrades, and configuration management.  Enabling Object, HDFS, Key- value, native file in the future …
  • 39. 39© Copyright 2014 EMC Corporation. All rights reserved. Object on Commodity Hardware Apply and manage full-featured Object capabilities VIRTUAL ARRAY CommodityCommodity Commodity  Today: Multiple object platforms  Vision: Platform semantics become bucket attributes  Application capture on a bucket basis EMC Centera EMC Atmos EMC CENTERA EMC ATMOS
  • 40. 40© Copyright 2014 EMC Corporation. All rights reserved. Protected writes on commodity Object on Commodity VIRTUAL ARRAY • Write updates multiple containers • Stored on different nodes/drives • Then updates the index • Success if index update succeeds CommodityCommodity Commodity
  • 41. 41© Copyright 2014 EMC Corporation. All rights reserved. How ViPR efficiently utilizes commodity Object on Commodity VIRTUAL ARRAY • Open containers accept data • Full containers are immutable • Transform full containers • Bulk operation for efficiency • Bulk re-protection for efficiency CommodityCommodity Commodity OpenFull +
  • 42. 42© Copyright 2014 EMC Corporation. All rights reserved. Object on Commodity A common index for unified Object storage VIRTUAL ARRAY CommodityCommodity Commodity  Extend indexing for all object types  Flexible metadata search  First class citizen or for analytics set … EMC Centera EMC Atmos Centera Atmos
  • 43. 43© Copyright 2014 EMC Corporation. All rights reserved. Object Investments & Roadmap 1H 2013 2H 2013 2014 Increase Object Count Improve Self-Healing Enhanced Reporting Serviceability S3, Atmos, Swift APIs Isilon, VNX, Third Party arrays Object Data Service HDFS Data Service (v1.1) Gen3 Flex-240 Hardware GeoDrive Windows updates Atmos CDP updates SDK updates Watch4Net integration EMC World announcement Customer EAP EMC Syncplicity integration SEC 17a-4f compliance for REST data GeoDrive Windows update Gen3 Flex-360 Hardware IPv6 Envelope encryption Tech refresh GeoDrive Windows updates Atmos CDP updates Geo-Replication & Distribution Compliance Bring Your Own Commodity Centera SDK API, Co-existence & Transformation (Centera arrays) Geo-distribution namespace HDFS

Editor's Notes

  • #5: The Advanced Software Division (ASD) was formed in January 2012 as a result of the merger of the Cloud infrastructure Group (CIG) with the Infrastructure Management Group (IMG). This brought together EMC’s object and cloud storage platforms with all of EMC’s management software. ASD now develops cloud storage and management software for private and public cloud and optimizing storage management and automation. Amitabh Srivastava is the President of the Advanced Software Division at EMC. Before joining EMC in April 2011, Srivastava was the Senior Vice President of Server and Cloud Division at Microsoft Corporation, where he was responsible for Windows Azure and Windows Server products.
  • #7: The intelligent economy produces a constant stream of data that is being monitored and analyzed. IDC estimates that the digital universe will be 40ZB by 2020. That’s a 40 followed by 21 zeroes. Social interactions, mobile devices, facilities, equipment, R&D, simulations, and physical infrastructure all contribute to the flow of information. In aggregate, this is what is called Big Data. The Big Data economy, is characterized by: More Sources of data Communities Mobile Devices Sensors Imaging Equipment Richer Content Pictures Videos Data Streams Longer utility Durable value – information and information about information (metadata) has value for a long time after its creation. All this data can have business value. Regulatory burdens – always a contributor to the need to retain data for longer and longer periods of time, often indefinitely.
  • #8: Traditional applications such a customer relationship management (CRM) and enterprise resource planning (ERP) applications still make up the majority of enterprise workflows and they’re growing 70% per year. But next-gen Web, mobile and cloud applications are growing 700% per year. Even more importantly, these workloads generate tremendous amounts of unstructured data, with expected growth of more than 100x over the next decade.
  • #9: Based on the requirements on the previous slide, enterprise IT storage infrastructure needs to adapt. The IT architecture remains silo’ed and many of its processes are manual. The combination results in massive complexity, which makes it difficult to Scale efficiently – Having separate storage siloes results in a lot of unused capacity and higher fixed costs. Management complexity – IT has multiple storage management control points and higher operational costs. Development risk – Legacy infrastructure is simply not built to support next-gen, Web and mobile applications. Development projects take longer, require more code and rely on tight hooks to infrastructure. This limits IT agility and IT’s ability to compete with public clouds
  • #10: The era of Big Data places new demands on data storage. Storage must contend with varying data types, all of which need to be stored securely for a long periods of time and be available for analysis. Data Unification: There is an increasing focus on data unification meaning that the storage infrastructure for Big Data has to cater to structured, semi-structured, and unstructured data types. In-Place Analytics: There is a growing emphasis on in-place analytics in which the compute workloads such as Hadoop Map/Reduce operations are run right where the data lives. Data Compliance: This market is fraught with challenges stemming from regulatory and compliance requirements. As the platform that hosts data the instant it is created, storage is not immune to these challenges — and how data gets stored in the long term.
  • #11: The premise of “software-defined” is to reduce the tight coupling of software and hardware. It’s simply faster and easier to innovate through software without tight hooks into hardware. This also gives customers choice of persistence layers. It also brings new features to existing hardware, similarly to how virtualization let a Windows serve support multiple applications – even apps not written to windows. It also increases the value of the software itself since it is now available to more platforms. An ISV can write an app once and target multiple platforms, expanding their addressable market.
  • #13: SDS characteristics: It’s software – many competing solutions feature a physical appliance or software deployed on a switch. We call this “hardware-defined software-defined storage”. SDS should be software without requiring a truck, 2 wheeler or physical installation. Decouples control plane from data plane – this is the essence of software-defined. Separating the control and data paths means you can centrally manage heterogeneous physical devices, centralize all data provisioning and data management tasks, and leverage storage data services but not necessitate overhead on the data path. You can continue to use the unique data services embedded in the storage arrays. Support for multiple storage platforms – SDS should support file, block, object from multiple vendors Programmable API’s – SDS should be API driven. This makes storage a programmatic resource of the SDDC SDS must feature: Automation – the value is in reducing the # of management control points and automating repetitive manual tasks Extensibility – SDS should enable anyone to extend their storage environment support additional arrays and create new data services that run on top. Openness – That doesn’t mean “open source”, it means it can’t be proprietary. The platform should feature open, easily accessible APIs. You can’t be extensible if you’re not open. At least not effectively.
  • #14: The second data service that we will provide in 2013 is HDFS. HDFS is becoming increasingly popular as a file system layer for distributed applications, beyond Hadoop Scenarios: High aggregate throughput access to data, e.g. MapReduce. In some cases, low latency access. Concerns: Scale, durability, cost, management
  • #15: ViPR SDS abstracts the data services – the storage engine and APIs - from physical resources and creates a unified storage platform that has the following characteristics: Heterogeneous The storage engine that can target a variety of storage devices Unified Cloud data types can be layered over traditional storage Flexible Data services are implemented in software, hardware agnostic Scale-out Data services leverage node-based, scale-out architecture for linear scale, efficiency Extensible Partners and customers can extend and customize the platform and develop new data services Enterprise-grade Leverage and extend the capabilities of underlying storage devices – for example, you can store objects on a file-based arrays and make use of snapshots for enterprise-level data protection.
  • #16: ViPR aggregates multi-vendor heterogeneous storage into a unified storage platform, that, in turn, can be leveraged as a logical scale-out layer which can serve as the underlying infrastructure for hosting a range of data services to support collecting, managing and utilizing unstructured content at massive scale. ViPR Data Services are implemented in software and feature a simple, lightweight, low-touch, scale-out design. Data services are storage abstractions that reflect the combination of a data type (file, object or block of data), access protocols (iSCSI, NFS, REST, etc.), and durability, availability, and security characteristics (snapshots, replication, etc.) In ViPR, block, file, object, and HDFS are all data services, though ViPR is not in the data path for file and block (these can be thought of us “control services”). Object and HDFS are available in 2013 with more to follow. Data services can be used to provide different semantic views of the same data. You can manipulate a file as a file or as an object without having to move the data to a different platform that features that semantic.
  • #17: This depicts the architecture for ViPR and highlights the data services functionality. At the bottom are the physical arrays that ViPR can manage. Above the arrays is the ViPR controller which has features that enable a distributed infrastructure (Cassandra, a distributed DB and Zookeeper to manage status of different nodes in the system) and device drivers to hook into APIs of arrays so the Controller can automate provisioning, management, etc. The ViPR Controller can also offer lifecycle services such as elastic volumes. ViPR can use the control path functionality to spin up data services. A good example are elastic volumes – similar to Amazon EBS. ViPR can provision volumes at scale. On top of that are ViPR data services. The real differentiation and IP we’ve built: Geo-scale index – performant distributed indexing mechanism to find data very quickly. Geo-scale metadata – store metadata associated with customer data separately from data. It’s stored in an efficient way so we can efficiently run operations against the metadata. Transactions – executing a large # of user transactions concurrently with very little latency Key bottleneck for object stores are the metadata database and the index. They’re bottlenecks since they are usually stored in a database and databases can’t scale. ViPR does not store metadata or the index in a separate database. The geo-scale index and geo-scale metadata are stored on the actual arrays. The improves performance, particularly at large, distributed scale. ViPR data services provides primitives like read/write. ViPR provides a RESTful API and EMC has also build different semantic heads on top of that API. For example, the Object data service will support the Atmos, OpenStack Swift and Amazon S3 APIs. We’ll also add, in the future, file heads, Key Value (tables) and expose the ViPR API to allow others to build new data services.
  • #18: ViPR and ViPR data services address the new requirements for Big Data mentioned previously. Data Unification: ViPR abstracts storage into a software layer and manages with intelligent, policy-driven software. ViPR data services are software that span arrays and support different semantic views of the same data. ViPR creates a unified storage platform that includes block, file, object, HDFS and more data services. In-Place Analytics: Again, data services provide a way to manipulate data by different mechanisms in place. Using the HDFS data service, organizations can bring analytics to the data rather then bringing the data to a separate analytics infrastructure. Data Compliance: ViPR creates a unified platform that combines the features of array-level data protection with policy-driven data management to simplify tiering and compliance.
  • #20: Object storage EMC pioneered object storage over a decade ago with the release of EMC Centera, which today remains the leader in compliance archiving. At this early stage, object storage featured a single namespace, a proprietary API and scaled more efficiently that file-based archive solutions. CAS was valuable for storing unstructured content and guaranteeing the authenticity and immutability of content. EMC continued its innovation in object storage with the introduction of EMC Atmos which is an industry leader for cloud storage and next-generation Web, mobile and REST-based applications. Atmos built on the strengths of Centera – linear scale, efficiency, API access - by adding support for a RESTful API, multi-tenancy and adding geo-replication and distribution of content. It also featured a single namespace and multi-tenancy. EMC raised the bar again for object storage with the coming availability of EMC ViPR Object Data Services that extend object capabilities to heterogeneous file-based storage. The strategy foe EMC moving forward is to provide universal API access, multiple access methods and support analytics on unstructured data. It’s more than object storage, it’s providing cloud storage that can offer universal access to object, file, block and HDFS data services on highly-efficient scale-out geo-distributed storage that can support unstructured and semi-structured data. And, by being truly software-defined, gives customers hardware options - supporting scale-out appliances, commodity, and legacy infrastructure.
  • #21: Object storage is not one-size-fits all. EMC is investing in purpose-built hardware/software platforms and software-defined solutions that will give customers the freedom to choose the object storage platform that meets their unique requirements. Today, customers can choose a dedicated object platform such as Centera and Atmos, or begin right away with the software-defined approach. If customers are purely focused on object storage, a purpose-built appliance gives them a single system to buy and scale and they don’t have to buy a compute and network infrastructure separately. It’s a homogenous object platform that’s more plug and play. If customers have more general needs beyond object, the software-defined approach of ViPR is the choice. Today, ViPR can support storing objects on file-based arrays such as Isilon, VNX and NetApp and support the Atmos API as well as OpenStack Swift and Amazon S3. Customers would choose the software-defined approach today if they want to use their existing file-based storage to support objects and RESTful APIs , Atmos, OpenStack and amazon S3. The software-defined approach also appeals to customers that see value in applying multiple data services across their arrays. In 2014, ViPR will add support for the CAS API and Centera hardware. Later in the future, ViPR will also support Atmos hardware. Customers can invest with confidence in Centera and Atmos, knowing there is a path to ViPR and a co-existence strategy that leverages their investments and bring new capabilities to existing Centera and Atmos deployments.
  • #23: The ViPR object data services is very similar to the Amazon S3 model but ViPR adds extensions such as byte-range updates, atomic append, rich ACLs, etc. On top of the Object data service, ViPR includes several API heads, Amazon S3, OpenStack Swift and our own EMC Atmos. ViPR will add support for the CAS API in the first half of 2014. A key capability of the Object Data Service is its ability to store objects on file based storage. Now, an enterprise or service provider can use their existing file-based infrastructure to store and manipulate objects as files or objects on existing file-based storage.
  • #24: The ViPR object data service leverages the same configuration and provisioning process provided by the ViPR Controller. A storage administrator defines Virtual Storage Arrays and configures Virtual Storage Pools. The administrator configures the object Virtual Storage Pool with certain characteristics and defines how the actual objects are stored on the underlying arrays.
  • #25: The object data services supports physical segregation of the objects. An object virtual storage pool can write to multiple, physically separate arrays. In this example, one pool is writing to open virtual storage arrays that is writing to multiple commodity arrays. The other object virtual storage pool is writing to a different virtual array that is writing to three distinct physical file arrays.
  • #26: In addition to physical segregation, buckets provide logical segregation within the object store. Just like in S3, a user can create buckets which logically segregate applications or sets of data. These buckets can grown and shrink on-demand. The actual data objects are distributed and intermingled across the physical devices that comprise the virtual storage array.
  • #27: The actual processing of objects is done by a virtual set of data nodes. The data nodes are simply VMs the process object requests and feature a scale-out architecture to facilitate full failover between nodes. Being VMs, they’re easy to deploy.
  • #28: A very unique aspect of the ViPR object data service is that it stores objects in “chunks” or containers. Containers are append-only, once a container is full, it becomes immutable. Another important differentiator that was mentioned previously is the distributed index to track data locations. Rather than maintaining a separate index in a database, which can’t scale well, the index is located on the actual arrays in containers. In this way, the index can scale massively and be geo-distributed. A key bottleneck of existing object storage platforms is the indexing and metadata databases. By storing the index and metadata on the actual arrays, the ViPR object data service can locate objects in a massive scale, geo-distributed object store much faster. The way an write works, is the that write first updates the container, then updates the index. Only after the index is successfully updated will the write be successful. One thing to note about container immutability. Over time, authorized deletes may result in a container which has very few indexed objects in it. There is a garbage collection process that will identify a container with a large % of objects that are no longer in the index. In this case, the ViPR object data service can create a new append-only container, move the objects and update the index. This ensures efficient storage.
  • #29: For the first time, organizations, via the ViPR object data service, extend object storage to existing file based storage such as EMC VNX and EMC Isilon and third party file-based storage such as NetApp FAS. A bucket of objects can be dynamically toggled between three different access modes: Object mode: Full Object API. No File access. File mode: Native Read/Write access to Files, directly on the underlying device, via NFS mount. No Object access. Dual mode: Read access as either Object or File. Enterprises can deploy REST-based applications (based on S3, Atmos or OpenStack Swift) on their existing file-based storage. This fits a couple different use cases: This gives them the ability to deploy a private cloud with existing file-based capacity instead of buying a dedicated object storage platform. It allows them to protect objects with existing NAS data protection capabilities such as snapshots, replication. Perhaps there are compliance concerns or a backup process they want to extend to object data. Support mixed-use workflows. They may have a need to manipulate the same data with a REST-based app and a file-based app. Example on the next slide.
  • #30: In this example, a next-gen video sharing Web application ingests objects into a higher latency object store, then exposes the uploaded video content. Before exposing that content, however, a video processing application that is written to a lower latency file storage platform such as an Isilon, needs to perform analytics, curation, watermarking, etc. before posting back to the Web. Consequently the organization must copy all the uploaded object data into a file-based storage, perform the curation, then copy back to the object store. This is a long, complex process. With ViPR Object, now the objects can be ingested via REST (S3, Atmos or OpenStack Swift) directly to the Isilon (or VNX or other filer). An administrator can simply toggle to file mode, mount NFS, perform the curation and save the files. Toggle back to object mode and expose the curated objects to the Web. The data never has to be moved or copied and only one storage platform is required.
  • #33: Product information covers confidential information Contains future product information Can only be presented under NDA
  • #34: Software-defined storage generally, and ViPR data services specifically enables EMC to innovate horizontally across various storage technologies. EMC can introduce new capabilities in software that work across heterogeneous storage. And customers will have the choice of using a vendor-provided storage array or their own storage on their own terms. The array model will continue and customers will continue to buy Atmos and Centera arrays, however, the software be ViPR Software (Controller and Data Services). In building out the roadmap for object storage and ViPR, the mission is to provide choice and flexibility for our customers. EMC understands that, in addition to the software-defined approach of ViPR, there will still be customers that will want to buy an purpose-built appliance, particular Atmos customers, although many Centera customers also will want a purpose-built object appliance. For those Atmos and Centera customers that want to stay with a purpose-built appliance, they can simply stay with their existing investment. In the future, ViPR will add capabilities such as geo-replication, geo-distribution, compliance, etc. an Atmos customer will simply be able to refresh their Atmos investment. The next tech refresh of Atmos (v.Next or “Atmos 3.0”) will feature ViPR Controller and ViPR Data Services (specifically the ViPR Object data service) on Dense 480 hardware. In addition to all the functionality that an Atmos customer uses and expects, they can also expect access to new data services such as HDFS and to take advantage of the new indexing and metadata that will add speed and efficiency. Other customers may choose the purely software-defined approach. This will give them the ability to choose different hardware platforms as well as low-cost commodity. This is for a customer that needs to go beyond homogeneous object and support multiple data types across multiple arrays.
  • #35: The following slides provide a high-level preview of roadmap for the Object data service. The four core capabilities in addition to adding support for Centera arrays and the Centera CAS API are: Geo Replication Geo Distribution Compliance Commodity hardware support
  • #36: ViPR will support multi-site in the next major release and the ability to replicate objects across multiple sites. More details on replication mechanisms (mirroring, distributed erasure coding) will be available in 2014.
  • #37: With multi-site, also comes the ability to distribute and replicate content and the index and metadata across the sites and access everything through a single namespace. This will also deliver the active-active architecture that Atmos customers use.
  • #38: Atmos recently added support for SEC compliance. And, in order for ViPR Object to support Centera, it will need compliance features. ViPR Object will add those capabilities at object or bucket level.
  • #39: ViPR will add support for storing objects on commodity hardware. Today, ViPR Object and HDFS data services assume an underlying shared file system. Commodity drives like JBODs are block-only so they do not provide this file system. So EMC is creating its own distributed file persistence layer to allow the ViPR software stack to sit directly on top of JBOD. In the future, ViPR will add support for HDFS and Key-Value and native file on commodity.
  • #40: Today, there are many object platforms from multiple vendors. The vision for ViPR is to represent platform semantics as bucket attributes. The uniqueness of a Centera or an Atmos platform can be represented in buckets. Buckets can also be used to capture specific application functionality. This is the benefit of designing things in software. The freedom from hardware provides a great deal of flexibility.
  • #41: To make commodity support possible, ViPR supported protected writes. Configurable by an administrator, a write will update multiple containers that are stored on different nodes and drives, then updates the index. The index will only update after all the writes have succeeded. The writes are not confirmed/written until the index updates succeeds. This is certainly overkill for existing object platforms or NAS systems (since they have a file system) but is a requirement for supporting commodity disks.
  • #42: As mentioned earlier, ViPR efficiently stores objects in append-only containers. This also ensures efficient utilization on commodity. As stated previously, this can be overkill for an existing object or NAS filer that has its own persistent file system, but is important for efficient storage on JBOD. To repeat, only open containers can accept objects. One a container is full, it becomes immutable. One thing to note about container immutability. Over time, authorized deletes may result in a container which has very few indexed objects in it. There is a garbage collection process, a bulk operation that will identify a container with a large % of objects that are no longer in the index. In this case, the ViPR object data service can create a new append-only container, move the objects and update the index. This ensures efficient storage.
  • #43: ViPR presents a common index for unified object storage. Regardless of whether the logical bucket represents an Atmos, a Centera, S3 application, etc., ViPR has a unified index extended to all object types. This will enable more flexible metadata search and treats object storage as a first-class citizen. It also makes the different object types available for analytics (eg HDFS on Object).
  • #44: Centera: Primary Use Case: On premise compliance archive solution. Meets stringent compliance requirements of various verticals, Six 9’s reliability. Deploy and forget. Best of breed reliability, performance, and compliance. Roadmap: Hardware refreshes. Increased scale. Atmos: Primary Use Case: Active cloud storage infrastructure for next generation geo distributed applications. Best of breed geo replication and efficiency. Roadmap: Hardware refreshes. Compliance. Operational efficiency. ViPR Object Service: Primary: State of art hyper-scale object service on all arrays. Unifies object layers over Centera, Atmos and File. Enables mix and match Roadmap: Geo, Compliance, HDFS and other Data Service