© 2015 Pivotal Software, Inc. All rights reserved.
Agenda
• Hortonworks Data Platform Overview
• Pivotal Big Data Suite Overview
• Pivotal HAWQ
• Demo
• Pivotal HAWQ & HDP Business Value and Use Cases
• Q&A
Your Hosts
Parham Parvizi
PRODUCT MANAGER, PIVOTAL HAWQ, PIVOTAL
Parham Parvizi is a Product Manager at Pivotal, where he is
responsible for driving the technical product roadmap of the
company's flagship SQL on Hadoop product, Pivotal HAWQ.
Shivaji Dutta
DEVELOPER EVANGELIST / SR PARTNER SOLUTIONS ENGINEERING, HORTONWORKS
Shivaji is a Sr. Partner Engineer at Hortonworks. He has over
18 years of software development and consulting experience.
Page 4 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
Hadoop for the Enterprise:
Implement a Modern Data Architecture with HDP
Customer Momentum
• 437 customers (as of March 31, 2015)
• 105 customers added in Q1 2015
Hortonworks Data Platform
• Completely open multi-tenant platform for any app and any data.
• A centralized architecture of consistent enterprise services for
resource management, security, operations, and governance.
Partner for Customer Success
• Open source community leadership focused on enterprise needs
• Unrivaled world-class support
• Founded in 2011
• Original 24 architects, developers, and operators of Hadoop from Yahoo!
• 600+ Employees
• 1100+ Ecosystem Partners
HDP Makes Hadoop Enterprise-Ready
Hortonworks Data Platform
Multi-tenant data platform built on a centralized architecture of shared enterprise services
Applications: existing applications, new analytics, partner applications
Data access: batch, interactive, real-time
YARN: data operating system
Storage
Shared enterprise services: governance, security, operations, resource management
Key Benefits
• Consolidates all data sets
• Delivers real-time insights
• Integrates with data center
• Scalable and affordable
SQL Engines on HDP
• Apache Hive + Tez + ORC
• Apache Phoenix
• Spark SQL (Tech Preview)
• HAWQ
Pivotal and Hortonworks
• Joint engineering
– Pivotal HD and HDP based on a common core
– Pivotal HAWQ certified on HDP
• Co-founders of Open Data Platform
PIVOTAL AND HORTONWORKS
ARE STRONG DRIVERS OF
OPEN SOURCE SOFTWARE
Diagram: ODP (OpenDataPlatform.org), Enterprise Hardening; Hortonworks Data Platform (HDP); Pivotal Hadoop Distribution; other apps/tools (e.g., analytics apps and visualization tools)
Pivotal HAWQ on HDP Certified
Pivotal – Business Groups
BUSINESS VALUE FROM DATA
Transforming companies into data-driven enterprises with open, agile, cloud-ready end-to-end solutions
PLATFORM AT YOUR SERVICE
Pioneering an open vision for cloud-based, agile application development
A BETTER WAY TO BUILD PRODUCTS
World-class application development services, ‘Pivots’, & transformative methodologies
Cloud Foundry, Big Data Suite, Pivotal Labs
HAWQ + HDP
HDP: Open Enterprise Hadoop
Pivotal HAWQ: 100% ANSI SQL, Performance, Complex Query, Accessible
HAWQ + HDP
• Discover New Relationships
• Enable Data Science
• Analyze External Sources
• Query All Data Types!
• Manage Multiple Workloads
• Petabyte Scale Analytics
• Sub-second Performance
• Leverage Existing Skills & Tools
• Easily Integrate with Other Tools
• Well Integrated with Hortonworks Data Platform
Hardened, 10+ Years Tested, Production Proven
Accessibility + Usability
Multi-level Fault Tolerance
Granular Authorization
Resource Pools
High multi-tenancy
100% ANSI SQL Standard
OLAP Extensions
JDBC / ODBC Connectivity
MPP Architecture
Online Expansion
HDFS
Petabyte Scale
Cost Based Optimizer
Dynamic Pipelining
ACID + Transactional
Multi-Language UDF Support
Built-in Data Science Library
Extensible (PXF)
Query External Sources
HDFS Native File Formats
Compression + Partitioning
core
compliance
5 Reasons Why Customers Will Prefer HDP + HAWQ
1. Advanced Analytics Performance
• Up to 30x SQL on Hadoop performance advantage
• Faster time to insight
• Massive MPP scalability to petabytes
Benefits: Near real-time latency, complex queries and advanced analytics at scale
2. 100% ANSI SQL Compliant
• ANSI SQL-92, -99, -2003
• All 99 TPC-DS queries tested, no modifications
• Plus, OLAP extensions
• Complete ACID integrity and reliability
Benefits: 100% SQL compliant; no risk to SQL applications; all native on HDP via HAWQ
3. Integrated Machine Learning
• Advanced machine learning for big data
• Local, in-database operation
• Exceptional MPP/parallel performance
• Open source, Postgres-based
Benefits: Advanced, highly scalable machine learning, directly on HDP data
4. Flexible Deployment
• HDP and Pivotal HD, easily managed via Ambari
• On premises, in cloud, or PaaS
• HBase, Avro, Parquet, ORC and more
• Plus, connectors to make HAWQ data available to other SQL query tools
Benefits: Flexibility, accessibility, portability
5. Query Optimization Options
• Cost-based query optimization
• Robust query plan optimization
• Complex big data management
Benefits: Optimize performance and costs; maximize HDP cluster resources; offload EDW without compromise
HAWQ over Competition - Impala
Pivotal HAWQ
• 100% TPC-DS compatible
• HAWQ completed 58/99 TPC-DS queries 12 hours faster!
• Multi-dimensional queries with subqueries, dynamic partition elimination, large table joins, and roll-ups
• Higher concurrency
Impala
• Only partial ANSI SQL compatibility: 58/99 TPC-DS queries
• Exposure to application errors due to SQL incompatibilities
• Single-dimension queries
• No nesting, small table joins, no roll-ups
• Limited performance range
• No machine learning!
Chart axes: ANSI SQL compliance (0% to 100%) and query complexity & speed requirements (- to +)
TPC-DS Results vs. Impala
HAWQ faster: 88% of queries, 12 hours
Impala faster: 12% of queries
Chart: Subset of TPC-DS Queries, Comparison of HAWQ vs. Impala. X-axis: TPC-DS queries. Y-axis: query runtime difference (s) (+ HAWQ faster / - Impala faster).
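The percentages quoted for the comparison can be turned into approximate query counts. The sketch below is only illustrative arithmetic on the slide figures; it assumes the 88% / 12% split applies to the 58-query subset mentioned on the previous slide, which the deck does not state explicitly. All variable names are made up for this example.

```python
# Figures quoted on the slides.
TOTAL_TPCDS_QUERIES = 99   # full TPC-DS query set
COMPARED_QUERIES = 58      # subset used in the HAWQ vs. Impala comparison
HAWQ_FASTER_SHARE = 0.88   # "HAWQ Faster: 88% of queries"

# Assumption: the 88% / 12% split refers to the 58 compared queries.
coverage = COMPARED_QUERIES / TOTAL_TPCDS_QUERIES
hawq_faster = round(COMPARED_QUERIES * HAWQ_FASTER_SHARE)
impala_faster = COMPARED_QUERIES - hawq_faster

print(f"Compared subset covers {coverage:.1%} of the TPC-DS queries")
print(f"HAWQ faster on about {hawq_faster} queries, Impala on about {impala_faster}")
```

Under that assumption, the split works out to roughly 51 queries in favor of HAWQ and 7 in favor of Impala.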
HAWQ Integration with Hive & Value-Add
• Query all Hive tables via PXF
• Easily move between HAWQ and Hive
• Value-Add:
– Sub-second application performance and faster time to insight
– Integration with traditional BI reporting tools and complex machine-generated SQL
– Data-science-driven applications requiring built-in machine learning
– Large queries across multiple datasets to find new relationships and patterns
– Siloed analytical applications with many ad-hoc users and high multi-tenancy
– Complex SQL statements with multi-level selects, partitions and rollups
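As a rough illustration of querying a Hive table via PXF, the sketch below builds the kind of external-table DDL that HAWQ uses to expose a Hive table. The statement shape (a `pxf://` location with `PROFILE=Hive` and the `pxfwritable_import` formatter) follows the general PXF pattern and should be checked against the HAWQ documentation for your version. The helper name, host, port, table and column names are all hypothetical, and identifiers are interpolated directly, so this is for illustration only and not for untrusted input.

```python
def pxf_hive_external_table_ddl(table, columns, hive_db, hive_table, host, port):
    """Build a CREATE EXTERNAL TABLE statement that reads a Hive table through PXF.

    Hypothetical helper: every argument value is supplied by the caller, and the
    DDL layout is an assumption to verify against the HAWQ/PXF documentation.
    """
    # Render the column definitions, e.g. "id int, amount float8".
    column_list = ", ".join(f"{name} {sql_type}" for name, sql_type in columns)
    # PXF location pointing at <hive_db>.<hive_table> using the Hive profile.
    location = f"pxf://{host}:{port}/{hive_db}.{hive_table}?PROFILE=Hive"
    return (
        f"CREATE EXTERNAL TABLE {table} ({column_list})\n"
        f"LOCATION ('{location}')\n"
        "FORMAT 'custom' (formatter='pxfwritable_import');"
    )


# Example values are placeholders, not taken from the presentation.
ddl = pxf_hive_external_table_ddl(
    table="sales_from_hive",
    columns=[("id", "int"), ("amount", "float8")],
    hive_db="default",
    hive_table="sales",
    host="namenode.example.com",
    port=51200,
)
print(ddl)
```

Once such a table exists, it can be queried and joined with native HAWQ tables using ordinary SQL, which is what makes moving between HAWQ and Hive straightforward.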
Benefits Summary: BDS + HDP
Collaboration on support
Full SQL on Hadoop
Performance Leadership
Common ODP core
Focus on solution life-cycle
Exceptional performance, applications run without SQL errors, leverage existing SQL skills
No vendor lock-in protects investment and grows ecosystem
Apache open source availability of software
Open Enterprise Hadoop Powering Digital Transformation
Working together to digitally transform companies into innovative enterprises

Pivotal HAWQ and Hortonworks Data Platform: Modern Data Architecture for IT Transformation
