1
Moving Cold Data to Hadoop
2
2 Trends
Forcing a revolution in enterprise architecture
3
Trend 1: Industry Leaders Compete and Win with Data
More Data Beats Better Algorithms
Collecting interaction data from ecommerce, social media, offline, and call centers enables a “customer 360 view” and consumer intimacy
Competitive Advantage is Decided by 0.5%
Consumer financial services: 1% improvement in fraud detection means hundreds of millions of dollars
Advertising and retail: 0.5% improvement in lift means millions of dollars increase in profitability
4
Trend 2: Big Data is Overwhelming Traditional Systems
[Diagram: Enterprise Data Architecture, showing enterprise users, operational systems, analytical systems, and outside sources]
Production requirements for operational systems:
• Mission-critical reliability
• Transaction guarantees
• Deep security
• Real-time performance
• Backup and recovery
Production requirements for analytical systems:
• Interactive SQL
• Rich analytics
• Workload management
• Data governance
• Backup and recovery
5
And 2 Realities
6
Reality 1: Hadoop Relieves the Pressure from Enterprise Systems
[Diagram: enterprise users, operational systems, analytical systems]
• Data staging
• Archive
• Data transformation
• Data exploration
• Streaming, interactions
Keys for Production Success
1 Reliability and DR
2 Interoperability
3 High performance
4 Supports operations and analytics
7
Reality 2: Architecture Matters for Success
FOUNDATION
• Data protection & security
• High performance
• Multi-tenancy
• Real-time operational & analytical apps
• Open standards for integration
NEW APPLICATIONS | SLAs | TRUSTED INFORMATION | LOWER TCO
8
Data Warehouse Optimization
9
TDWI: Evolving Data Warehouse Architectures
Hadoop Uses in Data Warehouse Environment
1 Data Staging & Archive
2 ETL
3 Big Data Analytics
Source: TDWI April 2014
10
The MapR Advantage
• Scale Reliability Across the Enterprise
– Advanced multi-tenancy
– Business continuity – HA, DR
• Speed
– 2-7x faster than other Hadoop distros
– Ultra-fast data ingest (100M data points per sec)
– NFS & R/W file system
• Real-time & Self-Service Data Exploration
– On-the-fly SQL without up-front schema
– Fast lookups and queries
Best Hadoop Platform for Data Warehouse Optimization & Analytics
[Diagram: MapR Data Platform (MapR-FS, MapR-DB)]
• Security, Streaming, NoSQL & Search, Provisioning & coordination, ML, Graph, Workflow & Data Governance, Batch, SQL
• Integrated commercial engines, tools, compute engines
• Batch, Interactive, Real-time, Online, Others
• Management, Operations, Governance, Audits, Security
11
Attunity Solutions
Right Data. Right Place. Right Time.
12
Attunity – Growing, Modular Portfolio
Delivering Big Data for Analytics
13
Data Warehouse Optimization with Hadoop
1 Assess and identify data and workloads to rebalance on Hadoop
2 Develop a roadmap to move data and workloads
3 Implement the roadmap incrementally and iteratively
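As a rough illustration of step 1, here is a minimal sketch of flagging cold tables as candidates to move. It assumes per-table usage statistics (size and last access date) have already been collected by some means. The table names, sample figures, the 180-day idle threshold, and the `find_cold_tables` helper are all hypothetical and are not taken from any Attunity or MapR product.

```python
from dataclasses import dataclass
from datetime import date, timedelta


@dataclass(frozen=True)
class TableUsage:
    name: str            # hypothetical table name
    size_gb: float       # storage footprint in the warehouse
    last_accessed: date  # date of the most recent query on the table


def find_cold_tables(tables, today, max_idle_days=180):
    """Return tables not queried within max_idle_days, largest first."""
    cutoff = today - timedelta(days=max_idle_days)
    cold = [t for t in tables if t.last_accessed < cutoff]
    return sorted(cold, key=lambda t: t.size_gb, reverse=True)


# Made-up sample data, for illustration only.
usage = [
    TableUsage("orders_2009", 820.0, date(2013, 1, 15)),
    TableUsage("orders_current", 140.0, date(2014, 5, 30)),
    TableUsage("clickstream_2011", 2300.0, date(2013, 6, 2)),
]
cold_tables = find_cold_tables(usage, today=date(2014, 6, 1))
for t in cold_tables:
    print(f"{t.name}: {t.size_gb:.0f} GB, idle since {t.last_accessed}")
```

Sorting by size puts the largest idle tables first, which is one plausible way to prioritize a roadmap; a real assessment would likely also weigh workload cost and query patterns.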
14
Attunity Visibility – The Data Dashboard
Completely analyze workloads and data usage
Reduce costs | Optimize performance | Justify investments
User Activity | Data Usage | Workload Performance
15
Attunity Replicate
• Real-time data movement
• Change Data Capture (CDC)
• Broadest platform support
• Files - MF - RDBMS - Hadoop
• Non-intrusive architecture
• Automation of standard maintenance tasks
• “Click-to-Load” design
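To make the Change Data Capture idea concrete, here is a minimal sketch of applying a stream of captured changes to a target copy instead of reloading the full table. It assumes changes arrive in order as `(operation, key, row)` tuples; that event format and the `apply_changes` helper are hypothetical and do not represent Attunity Replicate's actual interface.

```python
def apply_changes(target, changes):
    """Apply insert/update/delete events, in order, to a keyed target copy."""
    for op, key, row in changes:
        if op in ("insert", "update"):
            target[key] = row
        elif op == "delete":
            target.pop(key, None)  # ignore deletes for keys already absent
        else:
            raise ValueError(f"unknown operation: {op}")
    return target


# The target starts as a copy of the source taken at initial load time.
replica = {1: {"status": "open"}, 2: {"status": "open"}}
events = [
    ("update", 1, {"status": "closed"}),
    ("insert", 3, {"status": "open"}),
    ("delete", 2, None),
]
apply_changes(replica, events)
print(replica)
```

Only the changed rows travel to the target, which is why CDC is usually described as low-impact on the source system.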
16
MapR and Attunity
17
MapR and Attunity Are a Great Partnership
• Complementary set of enterprise-grade features
– Focus on Data
• Movement
• Identification
• Usage
• High availability
• Scale
• Data Warehouse Optimization
– Experience across broad set of use cases/workloads
• Customer 360 view
• Telco
• Internet of Things (IoT)
18
Additional Resources
• Go to: www.Attunity.com/mapr
• Find us on Twitter:
– @mapR
– @attunity
• Watch our video
• View the Moving Cold Data to Hadoop webinar

Which data should you move to Hadoop?
