Ambari Metrics System (AMS) & Grafana
Aravindan (avijayan@apache)
Prajwal (prao@hortonworks)
2 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Agenda
• Intro to Grafana
• Grafana integration
• Demo
• AMS - Overview
• AMS - Features & highlights
• AMS - API Overview
• Support utilities
Contributors: Sid Wagle, Dmytro Sen, Myroslav Papirkovskyy, Aravindan Vijayan & Prajwal Rao
Grafana
Grafana Introduction
• Open source dashboard builder integrated with AMS.
• Available from Ambari 2.2.2.
• Pre-defined host-level and service-level (HDFS, HBase, YARN, etc.) dashboards.
• Added to Ambari through the API after upgrade.
AMS - Grafana Integration
• AMS is added as a Grafana datasource
• Service dashboards are created if not already present
– System, HDFS, HBase, YARN (2.2.2)
– Hive, Kafka, HBase per-user and per-table metrics, Storm (2.4.0)
• Grafana supports Basic Auth and HTTPS
Demo
AMS Overview
Ambari Metrics Service - Motivation
• Limited Ganglia capabilities
• OpenTSDB – GPL license and needs a Hadoop cluster
• Need service-level aggregation as well as time-based aggregation
• Alerts based on metrics system
• Ability to scale past 1,000 nodes
• Ability to perform analytics based on a use case
• Allow fine-grained control over aspects like retention, collection intervals, and aggregation
• Pluggable and extensible
First version released with Ambari 2.0.0
AMS Architecture
• Custom Sinks – HDFS, YARN, HBase, Storm, Kafka, Flume, Accumulo
• Monitors – lightweight daemon for system metrics
• Collector – API daemon + HBase (embedded / distributed)
• Phoenix schema designed for fast reads
• Managed HBase
• Grafana support from version 2.2.2
[Figure: high-level component architecture. Components shown: Ambari, Grafana, Metrics Collector (Collector API, Phoenix, HBase), HDP services with Sinks, and system Monitors.]
AMS Arch Details - HA
[Figure: HA architecture. Components shown: Cluster ZooKeeper; Metrics Monitor; Metrics Sinks (YARN, Kafka, Flume, HBase, Storm, Hive, NiFi, HDFS); two Metrics Collector instances, each with a Collector API, Aggregators, Phoenix, HBase (Master + RS), and a Helix Participant.]
Ambari Metrics Service
AMS - Distributed / HA architecture
• Active-Active Collector configuration (HBase fully managed)
• Partition / failover of aggregate functions using Apache Helix
• Deterministic client-side distribution of write load with failover
• Automatic discovery for Sinks and Ambari (no restart needed)
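The "deterministic client side distribution of write load with failover" idea can be illustrated with a small sketch. This is not the AMS sink implementation: the hashing scheme, the function name `pick_collector`, and the host names are assumptions made for illustration only. Each sink hashes its own host name to choose a collector, so the same sink always picks the same collector, and it moves on to the next collector in a fixed order when its first choice is down.

```python
import hashlib


def pick_collector(sink_host, collectors, is_alive=lambda collector: True):
    """Return the collector this sink should write to, or None if none is usable.

    Hypothetical helper: the choice depends only on the sink's host name and
    the set of collectors, so every client computes the same answer.
    """
    if not collectors:
        return None
    ordered = sorted(collectors)  # identical ordering on every client
    digest = hashlib.sha256(sink_host.encode("utf-8")).hexdigest()
    start = int(digest, 16) % len(ordered)
    # Failover: walk the ring from the preferred collector until a live one is found.
    for offset in range(len(ordered)):
        candidate = ordered[(start + offset) % len(ordered)]
        if is_alive(candidate):
            return candidate
    return None
```

Because the collector list is sorted before hashing, the order in which a client happens to discover collectors does not change the result.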
Ambari Metrics Service
Performance improvements - Under the hood
• Automatic pre-defined Region splits based on cluster services
• Aggregator v2 implemented to utilize the Region splits for query parallelism
• Ambari Metrics Cache to improve availability and reduce request/response footprint
• Major HBase/Phoenix - HDP improvements influenced by AMS:
– FIFO compaction policy enabled for all AMS tables
– Normalizer to do automatic region split/merge for large tables
– Native HBase timestamp support to optimize date range queries in Phoenix
Feature Highlights, API
Ambari Metrics Service
Feature Highlights sorted by release version
• Ambari native widgets for service-level metrics (2.1.0)
• Ambari Metrics UI - Grafana integration (2.2.2)
• Pre-built Grafana dashboards to improve cluster debuggability (2.2.2)
• Alerts based on metrics system in Ambari (2.2.2)
• Container metrics stored in AMS HBase with Zeppelin dashboards (2.4.0)
• Fully distributed mode with failover (2.5.0)
Ambari Metrics Service - Aggregators
Aggregations (Schema) by request interval

Request interval    Host aggregates (across time)    Cluster aggregates (across hosts)
< 2 hours           Precision data                   30 second
2 hrs – 1 day       5 minute                         5 minute
1 day – 30 days     1 hour                           1 hour
> 30 days           1 day                            1 day
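The mapping from request interval to aggregation level on the Aggregators slide can be sketched as a small lookup. This is a minimal illustration, not the collector's code: the function name `aggregate_level`, the use of epoch seconds, and the handling of the exact boundary values are assumptions, and the pairing of intervals with levels is read off the flattened slide.

```python
TWO_HOURS = 2 * 60 * 60
ONE_DAY = 24 * 60 * 60
THIRTY_DAYS = 30 * ONE_DAY


def aggregate_level(start_time, end_time):
    """Pick the aggregation level for a query window given in epoch seconds.

    Hypothetical helper. "precision" stands for the finest data available:
    precision data for host metrics, 30 second data for cluster aggregates.
    """
    interval = end_time - start_time
    if interval < 0:
        raise ValueError("end_time must not be earlier than start_time")
    if interval < TWO_HOURS:
        return "precision"
    if interval <= ONE_DAY:  # boundary handling is an assumption
        return "5 minute"
    if interval <= THIRTY_DAYS:
        return "1 hour"
    return "1 day"
```

Coarser levels for longer windows keep the number of returned points manageable, which is presumably why the slide ties the level to the requested interval.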
Ambari Metrics Service - API
Metrics API
• GET
– Timeseries & Point in Time data
– Aggregate & Post Processing functions
– Precision
• POST
– Used by Monitors and Sinks
Metadata API
• Discovered metrics info
• All hosted applications
Ambari Metrics Service - API
Metrics GET API
Endpoint: <Metrics Collector Host>:6188/ws/v1/timeline/metrics
Parameters (each marked Required or Optional on the slide): metricNames, hostNames, appId, startTime, endTime, precision, instanceId
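As a rough illustration of the Metrics GET API, the sketch below assembles a request URL from the endpoint and parameter names shown on the slide. Several things here are assumptions rather than facts from the deck: the helper name `build_metrics_url`, the `http` scheme, the comma-separated encoding of list values, the example host and metric names, and the choice to treat `metricNames` as the only mandatory argument.

```python
from urllib.parse import urlencode


def build_metrics_url(collector_host, metric_names, host_names=None, app_id=None,
                      start_time=None, end_time=None, precision=None,
                      instance_id=None, scheme="http"):
    """Build a Metrics GET URL for the collector (hypothetical helper)."""
    # Assumption: multiple metric or host names are sent as one comma-separated value.
    params = [("metricNames", ",".join(metric_names))]
    optional = [
        ("hostNames", ",".join(host_names) if host_names else None),
        ("appId", app_id),
        ("startTime", start_time),
        ("endTime", end_time),
        ("precision", precision),
        ("instanceId", instance_id),
    ]
    # Only parameters that were actually supplied end up in the query string.
    params += [(name, str(value)) for name, value in optional if value is not None]
    return f"{scheme}://{collector_host}:6188/ws/v1/timeline/metrics?{urlencode(params)}"


if __name__ == "__main__":
    # Example values are placeholders, not metrics guaranteed to exist.
    print(build_metrics_url("ams.example.com", ["cpu_user", "mem_free"],
                            app_id="HOST", start_time=1, end_time=2))
```

The resulting URL can be passed to any HTTP client; the sketch stops short of sending the request so that it runs without a live collector.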
Ambari Metrics Service - API
Metadata API
• Metrics Metadata
• Hosts Metadata
Debugging utils for AMS-Grafana
[Figure: debugging workflow. Components shown: Older AMS instance, Export script, Exported metrics, Mimicked AMS endpoint (Flask server), Import script, Grafana datasource.]
ambari/ambari-server/src/main/resources/scripts/export_ams_metrics.py
Future
• AMS - External mode
• AMS - Tee to external storage by providing a pluggable sink interface
• AMS - Failover with temp storage for HDFS
• Grafana - Support for AMS HA
• Grafana - More analytic dashboards
• Grafana - Alerts integration with Grafana 3.2.0
Q & A

Apache Ambari Meetup - AMS & Grafana

Editor's Notes

  • #9 (Motivation): Ganglia – default metrics system for Ambari. Difficult to customize, no failover, single-disk writes, unreliable, cannot scale past 1,000 nodes without special hardware. OpenTSDB – no pre-aggregations.
  • #10 (AMS Architecture): SQL schema. No secondary indices. Scales for writes.
  • #11 (AMS Arch Details - HA): Ambari 2.5.0 will support AMS HA. Helix – task partitioning and service discovery. Restarting daemons is not needed; collectors are auto-discovered on the client side.
  • #20 (Debugging utils for AMS-Grafana): Some context. Target audience.