SlideShare a Scribd company logo
Big Data:
Movement, Warehousing, & Virtualization
Presented to TSAM Data Management Stream – July 14th, 2011
Overview

• Major Industry Trends

• Data Virtualization & Distributed Storage

• Impacts to Our Industry

• Solution Alignment to Technology




                                              2
Trend #1: Cost Structure of Storage

•  Cost                                     2009
                                            -  67 TB in 4U
  •  Distributed commodity storage is       -  24.5x multiple
                                            -  Reliability as key
     25x cheaper than Tier 1 SAN               differentiator
  •  High reliability (replication) it is   -  With replication
                                               (55x)
     closer to 55x                          -  Equivalent
                                               Performance


•  Performance
  •  Distributed is now faster
  •  Flash Exacerbates
                                                                    Source: BackBlaze.com



                                            2011
•  Decreasing Differentiators               -  145 TB in 4U (Disk)
                                            -  27 TB in 4U (Flash)
  •  Perceived Reliability                  -  26x multiple
  •  Enterprise Management                  -  w/ Data Fabric /
                                               Virtualization is
  •  Legacy Compatibility                      as reliable
                                            -  Higher
                                               Performance
                                                                                            3
Trend #2: Moving from Blocks to Data
•  Blocks are a legacy to tape storage

•  Deeply embedded in the OS / Driver fabric and most legacy DB
   architectures

•  Horribly inefficient for modern requirements
  •  Replication / Synchronization (>100x retransmission)
  •  Networks are not designed for blocks
  •  Applications have to Load / Store
  •  Wall Street data usage is different than standard Fortune 500 (more dynamic data
     and higher churn rates)
  •  WAN Optimization can not fully solve

•  Atomic Data is an emerging model
  •  DB Rows / Messages are the historical Atomic example
  •  PaaS interfaces are ALL data and file driven

•  What is YOUR interface?
                                                                                        4
Trend #3: End of Single Location
•  Single Location Warehouse’s are Challenged
  •  Time to Query
  •  User Experience & SLA
  •  Data volumes and WAN bandwidth
  •  Regulatory and Security
  •  Integrated System Dependencies
  •  Clients / customers / applications are all in motion (mobile platform & need for

•  Impact of Moving from Single Location
  •  Dynamic data synchronization
     • 1 Second global SLA for data synchronization – emerging standard for risk
     • Mechanisms for distribute data sync are different
        •  PUSH = the new Data Fabric
        •  PULL = existing WAN Optimization
        •  Need for a new model for WAN optimization (beyond zlib / dedupe)
     • Networks can’t handle file copy (block) it must be data
  •  Elasticity in data movement – the “fabric” must be able to buffer
  •  Turns the file and database replication and model on it’s head: 1 to many

                                                                                        5
Data Virtualization & Distributed Storage

•  Data Virtualization Layers
 •  Data (storage, DB, cache, streaming
    sources, state, etc…)
 •  Data Fabric (data movement,
    reliability, buffering, WAN services)
 •  Data transformation (EII) and
    coordination services (virtualization)
 •  Data Access / Interface
                   &
•  Distributed Storage Model
 •  Data (storage, DB, cache, streaming
    sources, state, etc…)
 •  Data Fabric (data movement,
    reliability, buffering, WAN services)
 •  Legacy Interfaces

                                             6
Impact of the New Model

• Database Vendor Market
 •  New Architectures (column store & distributed) can have the same
    reliability, enterprise features and far better performance
 •  Monolithic DB solutions no longer need to rely upon storage for
    DR / reliability
• Cost Structure – One size does NOT fit all
• Platform
 •  Cloud – Public / Private
 •  Existing Infrastructure
 •  Is there any difference
• Elasticity of Compute


                                                                       7
Adoption

•  Early Adopters of the Model in the Enterprise
 •  Big Data and Mining:
    • Options
    • Back testing
    • Regulatory and compliance
    • Real-time risk
    • Global position & Instrument Master
    • Best Execution
 •  Hot-Hot DR
 •  Global Data Availability

•  Flexible Computing Utilizing Cloud Technologies
 •  Complex derivative pricing
 •  Grid – DR
 •  Seamless integration of remote locations / venues

                                                        8
About Tervela: Data In Motion
The Tervela Data Fabric                                                    Products
The fastest, most reliable, and cost
effective data transport system for globally
                                                                            TMX: Message Switch
distributed, mission-critical applications.                                 Message transport through the fabric

 •  10-100x performance increase
  over traditional solutions                                                TPE: Persistence Engine
                                                                            Embedded storage within the fabric
 •  Beyond 5x9’s
  built-in fault tolerance & high availability
                                                                            TPM: Provisioning & Management
                                                                            Central management of the fabric
 •  50% faster to deliver new apps
  simple development tools & embedded services                              Data Fabric
                                                                            Optimized for Distributed Data and
 •  Data-layer security                                                     Applications
  integrated data entitlements & protection
                                                                            Client APIs
                                                                            C, C++, C#, Java, JMS, PaaS

                                                 Virtual Data Fabric Appliance
                                                 Free Download
                                                 www.tervela.com/download


                                                                                                                   9
Q&A




      10
Big Data:
Movement, Warehousing, & virtualization
Presented to TSAM Data Management Stream – July 14th, 2011




                                                             11

More Related Content

PPTX
Fi nf068c73aef66f694f31a049aff3f4
PDF
My sql competitive update
PDF
Using Distributed In-Memory Computing for Fast Data Analysis
PDF
MT42 The impact of high performance Oracle workloads on the evolution of the ...
PDF
Scalar, nimble, brocade, commvault, star trek into darkness, toronto, 05 16 2013
PPTX
4 Ways To Save Big Money in Your Data Center and Private Cloud
PDF
Enterprise Laptop Backup- Druva inSync
PDF
[IC Manage] Workspace Acceleration & Network Storage Reduction
Fi nf068c73aef66f694f31a049aff3f4
My sql competitive update
Using Distributed In-Memory Computing for Fast Data Analysis
MT42 The impact of high performance Oracle workloads on the evolution of the ...
Scalar, nimble, brocade, commvault, star trek into darkness, toronto, 05 16 2013
4 Ways To Save Big Money in Your Data Center and Private Cloud
Enterprise Laptop Backup- Druva inSync
[IC Manage] Workspace Acceleration & Network Storage Reduction

What's hot (20)

PDF
Data Domain Architecture
PDF
MT44 Dell EMC Data Protection: What You Need to Know About Data Protection Ev...
PDF
Webinar presentation
PPTX
Hitachi Accelerated Flash Storage
PDF
8 Strategies For Building A Modern DataCenter
PDF
Exadata meeting business challenges! - Doug Cackett
PDF
Greenplum hadoop
PDF
Ugif 12 2011-informix iwa
PDF
EMC Unified Analytics Platform. Gintaras Pelenis
PDF
IBM BladeCenter: Build smarter IT. IBM Systems and Technology
PDF
Application acceleration from the data storage perspective
PDF
IBM Systems solution for SAP NetWeaver Business Warehouse Accelerator
PDF
MT41 Dell EMC VMAX: Ask the Experts
PDF
Future Proofing MySQL by Robert Hodges, Continuent
PDF
Charon Vax Workflow One Case Study Final
PDF
Why is Virtualization Creating Storage Sprawl? By Storage Switzerland
PDF
Software defined storage rev. 2.0
PPSX
INCONTRI AL CINEMA - HUS VM: la nuova piattaforma unificata di Hitachi Data...
PDF
Oracle en Entel Summit 2010
PDF
IBM SONAS Brochure
Data Domain Architecture
MT44 Dell EMC Data Protection: What You Need to Know About Data Protection Ev...
Webinar presentation
Hitachi Accelerated Flash Storage
8 Strategies For Building A Modern DataCenter
Exadata meeting business challenges! - Doug Cackett
Greenplum hadoop
Ugif 12 2011-informix iwa
EMC Unified Analytics Platform. Gintaras Pelenis
IBM BladeCenter: Build smarter IT. IBM Systems and Technology
Application acceleration from the data storage perspective
IBM Systems solution for SAP NetWeaver Business Warehouse Accelerator
MT41 Dell EMC VMAX: Ask the Experts
Future Proofing MySQL by Robert Hodges, Continuent
Charon Vax Workflow One Case Study Final
Why is Virtualization Creating Storage Sprawl? By Storage Switzerland
Software defined storage rev. 2.0
INCONTRI AL CINEMA - HUS VM: la nuova piattaforma unificata di Hitachi Data...
Oracle en Entel Summit 2010
IBM SONAS Brochure
Ad

Viewers also liked (15)

PDF
Objets connectés et quantified self 21082013
PPTX
Vivez plus longtemps et mieux avec le m-health
PPTX
IESA - culture digitale - cours 1
PPTX
Data warehousing and data mining
PDF
E-HEALTH 2016 - Sierre - Switzerland
PDF
Business Intelligence : Offres du marché et benchmarking
PPTX
IESA - Introduction à la Veille Stratégique Digitale
PPTX
IESA culture digitale - cours 2
PPTX
Data Warehousing 3 Feet Deep
PDF
Alfresco 4.0 en français
PPTX
Data warehouse
PDF
HUBREPORT - Future of Data & CRM [EXTRAIT]
PPTX
DATA WAREHOUSING
PPT
Data Warehouse Modeling
PDF
BigData_Chp1: Introduction à la Big Data
Objets connectés et quantified self 21082013
Vivez plus longtemps et mieux avec le m-health
IESA - culture digitale - cours 1
Data warehousing and data mining
E-HEALTH 2016 - Sierre - Switzerland
Business Intelligence : Offres du marché et benchmarking
IESA - Introduction à la Veille Stratégique Digitale
IESA culture digitale - cours 2
Data Warehousing 3 Feet Deep
Alfresco 4.0 en français
Data warehouse
HUBREPORT - Future of Data & CRM [EXTRAIT]
DATA WAREHOUSING
Data Warehouse Modeling
BigData_Chp1: Introduction à la Big Data
Ad

Similar to Big Data: Movement, Warehousing, & Virtualization (20)

PDF
Big data movement webcast
PDF
Storage simplicity value_110810
PDF
Oow Ppt 2
PDF
Extending The Value Of Oracle Crm On Demand Through Cloud Based Extensibility
PDF
Harvard university i tv3.2
PDF
Simplifying Your Cloud Architecture with a Logical Data Fabric (APAC)
PDF
Data Virtualization: An Essential Component of a Cloud Data Lake
PPTX
Cloud fest 2012_jc02
PDF
IBM Aspera overview
PPTX
Webinar: Hyperconvergence is Broken, Learn How to Fix it!
PDF
Modernize Your Oracle Environment with an Agile Data Infrastructure
PDF
Gluster open stack dev summit 042011
PDF
Cloud - NDT - Presentation
PDF
Data management in cloud computing trainee
PDF
The Shifting Landscape of Data Integration
PPT
Elastic Caching for a Smarter Planet - Make Every Transaction Count
PDF
Info Sec 2010 Possibilities And Security Challenges Of Cloud Computing (Han...
PPTX
stackArmor - Security MicroSummit - McAfee
PDF
Symantec Appliances Strategy Launch
PDF
Oracle Storage Cloud Conference
Big data movement webcast
Storage simplicity value_110810
Oow Ppt 2
Extending The Value Of Oracle Crm On Demand Through Cloud Based Extensibility
Harvard university i tv3.2
Simplifying Your Cloud Architecture with a Logical Data Fabric (APAC)
Data Virtualization: An Essential Component of a Cloud Data Lake
Cloud fest 2012_jc02
IBM Aspera overview
Webinar: Hyperconvergence is Broken, Learn How to Fix it!
Modernize Your Oracle Environment with an Agile Data Infrastructure
Gluster open stack dev summit 042011
Cloud - NDT - Presentation
Data management in cloud computing trainee
The Shifting Landscape of Data Integration
Elastic Caching for a Smarter Planet - Make Every Transaction Count
Info Sec 2010 Possibilities And Security Challenges Of Cloud Computing (Han...
stackArmor - Security MicroSummit - McAfee
Symantec Appliances Strategy Launch
Oracle Storage Cloud Conference

Recently uploaded (20)

PDF
project resource management chapter-09.pdf
PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
PDF
Microsoft Solutions Partner Drive Digital Transformation with D365.pdf
PDF
Web App vs Mobile App What Should You Build First.pdf
PDF
From MVP to Full-Scale Product A Startup’s Software Journey.pdf
PDF
A novel scalable deep ensemble learning framework for big data classification...
PDF
Univ-Connecticut-ChatGPT-Presentaion.pdf
PPTX
Programs and apps: productivity, graphics, security and other tools
PDF
gpt5_lecture_notes_comprehensive_20250812015547.pdf
PDF
ENT215_Completing-a-large-scale-migration-and-modernization-with-AWS.pdf
PDF
Enhancing emotion recognition model for a student engagement use case through...
PDF
Hindi spoken digit analysis for native and non-native speakers
PPTX
Group 1 Presentation -Planning and Decision Making .pptx
PDF
August Patch Tuesday
PDF
WOOl fibre morphology and structure.pdf for textiles
PPTX
TLE Review Electricity (Electricity).pptx
PDF
1 - Historical Antecedents, Social Consideration.pdf
PDF
DASA ADMISSION 2024_FirstRound_FirstRank_LastRank.pdf
PDF
NewMind AI Weekly Chronicles - August'25-Week II
PDF
Assigned Numbers - 2025 - Bluetooth® Document
project resource management chapter-09.pdf
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
Microsoft Solutions Partner Drive Digital Transformation with D365.pdf
Web App vs Mobile App What Should You Build First.pdf
From MVP to Full-Scale Product A Startup’s Software Journey.pdf
A novel scalable deep ensemble learning framework for big data classification...
Univ-Connecticut-ChatGPT-Presentaion.pdf
Programs and apps: productivity, graphics, security and other tools
gpt5_lecture_notes_comprehensive_20250812015547.pdf
ENT215_Completing-a-large-scale-migration-and-modernization-with-AWS.pdf
Enhancing emotion recognition model for a student engagement use case through...
Hindi spoken digit analysis for native and non-native speakers
Group 1 Presentation -Planning and Decision Making .pptx
August Patch Tuesday
WOOl fibre morphology and structure.pdf for textiles
TLE Review Electricity (Electricity).pptx
1 - Historical Antecedents, Social Consideration.pdf
DASA ADMISSION 2024_FirstRound_FirstRank_LastRank.pdf
NewMind AI Weekly Chronicles - August'25-Week II
Assigned Numbers - 2025 - Bluetooth® Document

Big Data: Movement, Warehousing, & Virtualization

  • 1. Big Data: Movement, Warehousing, & Virtualization Presented to TSAM Data Management Stream – July 14th, 2011
  • 2. Overview • Major Industry Trends • Data Virtualization & Distributed Storage • Impacts to Our Industry • Solution Alignment to Technology 2
  • 3. Trend #1: Cost Structure of Storage •  Cost 2009 -  67 TB in 4U •  Distributed commodity storage is -  24.5x multiple -  Reliability as key 25x cheaper than Tier 1 SAN differentiator •  High reliability (replication) it is -  With replication (55x) closer to 55x -  Equivalent Performance •  Performance •  Distributed is now faster •  Flash Exacerbates Source: BackBlaze.com 2011 •  Decreasing Differentiators -  145 TB in 4U (Disk) -  27 TB in 4U (Flash) •  Perceived Reliability -  26x multiple •  Enterprise Management -  w/ Data Fabric / Virtualization is •  Legacy Compatibility as reliable -  Higher Performance 3
  • 4. Trend #2: Moving from Blocks to Data •  Blocks are a legacy to tape storage •  Deeply embedded in the OS / Driver fabric and most legacy DB architectures •  Horribly inefficient for modern requirements •  Replication / Synchronization (>100x retransmission) •  Networks are not designed for blocks •  Applications have to Load / Store •  Wall Street data usage is different than standard Fortune 500 (more dynamic data and higher churn rates) •  WAN Optimization can not fully solve •  Atomic Data is an emerging model •  DB Rows / Messages are the historical Atomic example •  PaaS interfaces are ALL data and file driven •  What is YOUR interface? 4
  • 5. Trend #3: End of Single Location •  Single Location Warehouse’s are Challenged •  Time to Query •  User Experience & SLA •  Data volumes and WAN bandwidth •  Regulatory and Security •  Integrated System Dependencies •  Clients / customers / applications are all in motion (mobile platform & need for •  Impact of Moving from Single Location •  Dynamic data synchronization • 1 Second global SLA for data synchronization – emerging standard for risk • Mechanisms for distribute data sync are different •  PUSH = the new Data Fabric •  PULL = existing WAN Optimization •  Need for a new model for WAN optimization (beyond zlib / dedupe) • Networks can’t handle file copy (block) it must be data •  Elasticity in data movement – the “fabric” must be able to buffer •  Turns the file and database replication and model on it’s head: 1 to many 5
  • 6. Data Virtualization & Distributed Storage •  Data Virtualization Layers •  Data (storage, DB, cache, streaming sources, state, etc…) •  Data Fabric (data movement, reliability, buffering, WAN services) •  Data transformation (EII) and coordination services (virtualization) •  Data Access / Interface & •  Distributed Storage Model •  Data (storage, DB, cache, streaming sources, state, etc…) •  Data Fabric (data movement, reliability, buffering, WAN services) •  Legacy Interfaces 6
  • 7. Impact of the New Model • Database Vendor Market •  New Architectures (column store & distributed) can have the same reliability, enterprise features and far better performance •  Monolithic DB solutions no longer need to rely upon storage for DR / reliability • Cost Structure – One size does NOT fit all • Platform •  Cloud – Public / Private •  Existing Infrastructure •  Is there any difference • Elasticity of Compute 7
  • 8. Adoption •  Early Adopters of the Model in the Enterprise •  Big Data and Mining: • Options • Back testing • Regulatory and compliance • Real-time risk • Global position & Instrument Master • Best Execution •  Hot-Hot DR •  Global Data Availability •  Flexible Computing Utilizing Cloud Technologies •  Complex derivative pricing •  Grid – DR •  Seamless integration of remote locations / venues 8
  • 9. About Tervela: Data In Motion The Tervela Data Fabric Products The fastest, most reliable, and cost effective data transport system for globally TMX: Message Switch distributed, mission-critical applications. Message transport through the fabric •  10-100x performance increase over traditional solutions TPE: Persistence Engine Embedded storage within the fabric •  Beyond 5x9’s built-in fault tolerance & high availability TPM: Provisioning & Management Central management of the fabric •  50% faster to deliver new apps simple development tools & embedded services Data Fabric Optimized for Distributed Data and •  Data-layer security Applications integrated data entitlements & protection Client APIs C, C++, C#, Java, JMS, PaaS Virtual Data Fabric Appliance Free Download www.tervela.com/download 9
  • 10. Q&A 10
  • 11. Big Data: Movement, Warehousing, & virtualization Presented to TSAM Data Management Stream – July 14th, 2011 11