SlideShare a Scribd company logo
Managing Apache HAWQ
with Apache AMBARI
Apache Ambari Meetup - June 27, 2016
Alexander Denissov
Bhuvnesh Chaudhary
Mithun Mathew
Apache HAWQ
(incubating)
Apache Ambari
Hadoop-native SQL query engine and advanced analytics
MPP database that offers:
1
2
3
4
5
interactive query execution
high performance
machine learning algorithms
tools for Data Analysts and Data Scientists
processing for large and complex data sets
APACHE HAWQ (incubating)
APACHE HAWQ (incubating) ARCHITECTURE
HAWQ - AMBARI INTEGRATION SCOPE
Installation and configuration
Topology and configuration recommendations and validations
Kerberos and High Availability support
HAWQ Master - HAWQ Standby failover
Service and Component Alerts
Visual Widgets
HAWQ - AMBARI INTEGRATION EFFORT
Praises
Ambari’s pluggable architecture makes integrations like this possible and easy
Kerberos setup is fully metadata driven — major kudos!
Challenges
HAWQ is not part of the HDP stack and is not available in Ambari out-of-the box
Advanced features and wizards require JavaScript code modifications
Driven by the team of engineers at Pivotal
Developed integrations from basic to more advanced
Invaluable support from Ambari Community
THANK YOU!
RECOMMEND SERVICE TOPOLOGY
VALIDATE SERVICE TOPOLOGY
RECOMMEND AND VALIDATE CONFIGS
HAWQ SERVICE SUMMARY PAGE
HAWQ SERVICE ACTIONS
ACTIVATE HAWQ STANDBY WIZARD
Activate HAWQ Standby
Wizard
(Manual Operation)
ACTIVATE HAWQ STANDBY WIZARD
HAWQ Standby Master
promoted to HAWQ Master
Add HAWQ Standby Master
action becomes visible
HAWQ ALERTS
Status of HAWQ Components
Communication issues
between HAWQ Components
HAWQ AMBARI FUTURE INTEGRATION
Support automated upgrade independent of stack
Ongoing related work: AMBARI-14854, AMBARI-12885
Ambari requires service restart for pushing configuration changes.
What if, the service can reload configurations without restart?
Ongoing related work: AMBARI-17241
HAWQ Upgrade
Dynamic Configuration Reload
Display query history
Manage resource queues
HAWQ View
Currently Ambari does not support configuration
changes without restarting service
Some parameters do NOT require restart!
HDFS dfs.heartbeat.interval, dfs.namenode.heartbeat.recheck-interval
HAWQ default_hash_table_bucket_number, hawq_rm_memory_limit_perseg
DYNAMIC CONFIGURATION RELOAD
Currently Ambari does not support configuration
changes without restarting service
Some parameters do NOT require restart!
HDFS dfs.heartbeat.interval, dfs.namenode.heartbeat.recheck-interval
HAWQ default_hash_table_bucket_number, hawq_rm_memory_limit_perseg
DOWNTIME!!!
Consequence of Restarting the Service:
DYNAMIC CONFIGURATION RELOAD
No more DOWNTIME!!!
DYNAMIC CONFIGURATION RELOAD
resources/common-services/HAWQ/2.0.0/configurations/hawq-site.xml
<property>	
				<name>default_hash_table_bucket_number</name>	
				<value>6</value>	
				<supports-reload>true</supports-reload>	
</property>
resources/common-services/HAWQ/2.0.0/package/scripts/hawqmaster.py
class	HawqMaster(Script):	
		def	start(self,	env):	
				…	
		def	stop(self,	env):	
				…	
		def	reload(self,	env):	
				self.configure(env)	
				Execute(‘hawq	master	reload’,	…)
HOW TO USE
Ambari UI
Show Reload
Button
Desired Configs
Updater
HeartBeat
Processor
ServiceComponentHost
(updates requires_reload)
Request Handling
and Execution
Ambari
Agent
Reload
Method
Ambari Server
Ambari Web
Ambari
Agent
Reload
Method
Ambari
Agent
Reload
Method
Ambari
Agent
Reload
Method
REST
API
POST
PUT
GET
Heartbeats
invalidates
requires_reload
user specifies
COLLABORATION DIAGRAM
Feedback?
AMBARI-17241
Reload vs Restart - Are they mutually exclusive?
THINGS TO DECIDE
Seriously, purple?

More Related Content

PDF
Pivotal HAWQ 소개
PDF
SQL and Machine Learning on Hadoop using HAWQ
PPTX
Pivotal Strata NYC 2015 Apache HAWQ Launch
PDF
HAWQ: a massively parallel processing SQL engine in hadoop
PDF
Multitenancy At Bloomberg - HBase and Oozie
PPTX
The DAP - Where YARN, HBase, Kafka and Spark go to Production
PPTX
How to Use Apache Zeppelin with HWX HDB
PDF
Hadoop integration with SAP HANA
Pivotal HAWQ 소개
SQL and Machine Learning on Hadoop using HAWQ
Pivotal Strata NYC 2015 Apache HAWQ Launch
HAWQ: a massively parallel processing SQL engine in hadoop
Multitenancy At Bloomberg - HBase and Oozie
The DAP - Where YARN, HBase, Kafka and Spark go to Production
How to Use Apache Zeppelin with HWX HDB
Hadoop integration with SAP HANA

What's hot (20)

PPTX
Integrating Apache Spark and NiFi for Data Lakes
PPT
Running Zeppelin in Enterprise
PDF
Getting involved with Open Source at the ASF
PDF
Predictive Analytics and Machine Learning …with SAS and Apache Hadoop
PDF
SAP HANA SPS10- Hadoop Integration
PPTX
Format Wars: from VHS and Beta to Avro and Parquet
PDF
Hortonworks Technical Workshop: What's New in HDP 2.3
PPTX
Hadoop first ETL on Apache Falcon
PPTX
Enabling Diverse Workload Scheduling in YARN
PDF
Apache Spark—Apache HBase Connector: Feature Rich and Efficient Access to HBa...
PPTX
GPORCA: Query Optimization as a Service
PPTX
Apache Ambari: Managing Hadoop and YARN
PPTX
Double Your Hadoop Hardware Performance with SmartSense
PDF
Hortonworks tech workshop in-memory processing with spark
PPTX
Securing Hadoop with Apache Ranger
PDF
GridGain 6.0: Open Source In-Memory Computing Platform - Nikita Ivanov
PPTX
What's new in Ambari
PDF
Best Practices for Hadoop Data Analysis with Tableau and Hortonworks Data Pla...
PPTX
Hortonworks for Financial Analysts Presentation
PDF
HBaseCon2017 Spark HBase Connector: Feature Rich and Efficient Access to HBas...
Integrating Apache Spark and NiFi for Data Lakes
Running Zeppelin in Enterprise
Getting involved with Open Source at the ASF
Predictive Analytics and Machine Learning …with SAS and Apache Hadoop
SAP HANA SPS10- Hadoop Integration
Format Wars: from VHS and Beta to Avro and Parquet
Hortonworks Technical Workshop: What's New in HDP 2.3
Hadoop first ETL on Apache Falcon
Enabling Diverse Workload Scheduling in YARN
Apache Spark—Apache HBase Connector: Feature Rich and Efficient Access to HBa...
GPORCA: Query Optimization as a Service
Apache Ambari: Managing Hadoop and YARN
Double Your Hadoop Hardware Performance with SmartSense
Hortonworks tech workshop in-memory processing with spark
Securing Hadoop with Apache Ranger
GridGain 6.0: Open Source In-Memory Computing Platform - Nikita Ivanov
What's new in Ambari
Best Practices for Hadoop Data Analysis with Tableau and Hortonworks Data Pla...
Hortonworks for Financial Analysts Presentation
HBaseCon2017 Spark HBase Connector: Feature Rich and Efficient Access to HBas...
Ad

Viewers also liked (20)

PPTX
Pivotal hawq internals
PPTX
Ambari Views - Overview
PPTX
PXF BDAM 2016
PDF
gsoc_mentor for Shivram Mani
PPTX
PXF HAWQ Unmanaged Data
PPTX
Apache Zeppelin Meetup Christian Tzolov 1/21/16
PPTX
Hawq Hcatalog Integration
PPTX
Apache HAWQ : An Introduction
PPTX
Pivotal HAWQ - High Availability (2014)
PPTX
Build & test Apache Hawq
PDF
Massively Parallel Processing with Procedural Python - Pivotal HAWQ
PPTX
Apache HAWQ and Apache MADlib: Journey to Apache
PDF
Phd tutorial hawq_v0.1
PPTX
SQL On Hadoop
PPTX
Pivotal HAWQ and Hortonworks Data Platform: Modern Data Architecture for IT T...
PDF
Webinar turbo charging_data_science_hawq_on_hdp_final
PDF
How to manage Hortonworks HDB Resources with YARN
PDF
Greenplum Architecture
PPTX
Apache HAWQ Architecture
PDF
MPP vs Hadoop
Pivotal hawq internals
Ambari Views - Overview
PXF BDAM 2016
gsoc_mentor for Shivram Mani
PXF HAWQ Unmanaged Data
Apache Zeppelin Meetup Christian Tzolov 1/21/16
Hawq Hcatalog Integration
Apache HAWQ : An Introduction
Pivotal HAWQ - High Availability (2014)
Build & test Apache Hawq
Massively Parallel Processing with Procedural Python - Pivotal HAWQ
Apache HAWQ and Apache MADlib: Journey to Apache
Phd tutorial hawq_v0.1
SQL On Hadoop
Pivotal HAWQ and Hortonworks Data Platform: Modern Data Architecture for IT T...
Webinar turbo charging_data_science_hawq_on_hdp_final
How to manage Hortonworks HDB Resources with YARN
Greenplum Architecture
Apache HAWQ Architecture
MPP vs Hadoop
Ad

Similar to Managing Apache HAWQ with Apache AMBARI (17)

PPTX
PPTX
Ambari hadoop-ops-meetup-2013-09-19.final
PPTX
Simplified Cluster Operation and Troubleshooting
PPTX
Simplified Cluster Operation & Troubleshooting
PPTX
Apache Ambari: Simplified Hadoop Cluster Operation & Troubleshooting
PPTX
Discover HDP 2.1: Using Apache Ambari to Manage Hadoop Clusters
PDF
SQL and Machine Learning on Hadoop
PDF
My ambariexperience
PPTX
Apache Ambari - What's New in 2.2
PPTX
Apache Ambari Meetup at Dataworks Summit 2017
PPTX
Apache Ambari - HDP Cluster Upgrades Operational Deep Dive and Troubleshooting
PPTX
Streamline Hadoop DevOps with Apache Ambari
PPTX
Managing Enterprise Hadoop Clusters with Apache Ambari
PPTX
Managing Enterprise Hadoop Clusters with Apache Ambari
PDF
Hortonworks Technical Workshop: Apache Ambari
PPTX
Streamline Hadoop DevOps with Apache Ambari
PPTX
Apache Ambari - What's New in 2.4
Ambari hadoop-ops-meetup-2013-09-19.final
Simplified Cluster Operation and Troubleshooting
Simplified Cluster Operation & Troubleshooting
Apache Ambari: Simplified Hadoop Cluster Operation & Troubleshooting
Discover HDP 2.1: Using Apache Ambari to Manage Hadoop Clusters
SQL and Machine Learning on Hadoop
My ambariexperience
Apache Ambari - What's New in 2.2
Apache Ambari Meetup at Dataworks Summit 2017
Apache Ambari - HDP Cluster Upgrades Operational Deep Dive and Troubleshooting
Streamline Hadoop DevOps with Apache Ambari
Managing Enterprise Hadoop Clusters with Apache Ambari
Managing Enterprise Hadoop Clusters with Apache Ambari
Hortonworks Technical Workshop: Apache Ambari
Streamline Hadoop DevOps with Apache Ambari
Apache Ambari - What's New in 2.4

Recently uploaded (20)

PDF
Getting Started with Data Integration: FME Form 101
PDF
DASA ADMISSION 2024_FirstRound_FirstRank_LastRank.pdf
PDF
Univ-Connecticut-ChatGPT-Presentaion.pdf
PDF
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
PDF
Zenith AI: Advanced Artificial Intelligence
PPTX
Group 1 Presentation -Planning and Decision Making .pptx
PPTX
1. Introduction to Computer Programming.pptx
PDF
TrustArc Webinar - Click, Consent, Trust: Winning the Privacy Game
PPT
What is a Computer? Input Devices /output devices
PDF
1 - Historical Antecedents, Social Consideration.pdf
PDF
Assigned Numbers - 2025 - Bluetooth® Document
PDF
How ambidextrous entrepreneurial leaders react to the artificial intelligence...
PPTX
Tartificialntelligence_presentation.pptx
PDF
Developing a website for English-speaking practice to English as a foreign la...
PPT
Module 1.ppt Iot fundamentals and Architecture
PPTX
O2C Customer Invoices to Receipt V15A.pptx
PDF
Architecture types and enterprise applications.pdf
PDF
STKI Israel Market Study 2025 version august
PPTX
MicrosoftCybserSecurityReferenceArchitecture-April-2025.pptx
PDF
Microsoft Solutions Partner Drive Digital Transformation with D365.pdf
Getting Started with Data Integration: FME Form 101
DASA ADMISSION 2024_FirstRound_FirstRank_LastRank.pdf
Univ-Connecticut-ChatGPT-Presentaion.pdf
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
Zenith AI: Advanced Artificial Intelligence
Group 1 Presentation -Planning and Decision Making .pptx
1. Introduction to Computer Programming.pptx
TrustArc Webinar - Click, Consent, Trust: Winning the Privacy Game
What is a Computer? Input Devices /output devices
1 - Historical Antecedents, Social Consideration.pdf
Assigned Numbers - 2025 - Bluetooth® Document
How ambidextrous entrepreneurial leaders react to the artificial intelligence...
Tartificialntelligence_presentation.pptx
Developing a website for English-speaking practice to English as a foreign la...
Module 1.ppt Iot fundamentals and Architecture
O2C Customer Invoices to Receipt V15A.pptx
Architecture types and enterprise applications.pdf
STKI Israel Market Study 2025 version august
MicrosoftCybserSecurityReferenceArchitecture-April-2025.pptx
Microsoft Solutions Partner Drive Digital Transformation with D365.pdf

Managing Apache HAWQ with Apache AMBARI