SlideShare a Scribd company logo
© Hortonworks Inc. 2015
Apache Hadoop YARN 2015
Present and Future
Vinod Kumar Vavilapalli
vinodkv [at] apache.org
@tshooter
Page 1
© Hortonworks Inc. 2015
Who am I?
• 7.75 Hadoop-years old
– Don’t fall for the job postings asking
for 10 years #Hadoop Experience yet

• Past
– 2007: Last thing at School – a two
node Tomcat cluster. Three months
later, first thing at job, brought down a
800 node cluster ;)
– Team that ran Hadoop @ Yahoo!
• Present: @Hortonworks
• Two hats
– Hortonworks: Hadoop MapReduce
and YARN Development lead
– Apache: Apache Hadoop PMC,
Apache Member
• Worked/working on
– YARN, Hadoop MapReduce,
HadoopOnDemand,
CapacityScheduler, Hadoop security
– Apache Ambari: Kickstarted the
project’s first release
– Stinger: High performance data
processing with Hadoop/Hive
• Lots of trouble shooting on
clusters (@tshooter)
• 99% + code in Apache, Hadoop
– Open Source
– Community driven
Page 2
Architecting the Future of Big Data
© Hortonworks Inc. 2015
Agenda
• Apache Hadoop YARN : Overview
• Past
• Present
• Future
Page 3
Architecting the Future of Big Data
© Hortonworks Inc. 2015
Overview
The Why and the What
Architecting the Future of Big Data
Page 4
© Hortonworks Inc. 2015
Why Hadoop YARN?
• Resource Management
• A messy problem
– Multiple apps, frameworks, their life-
cycles and evolution
• Varied expectations
– On isolation, capacity allocations,
scheduling
– Admin: “Best use of my cluster”
– Users: “Get me as much as possible,
as fast as possible”
• Tenancy
– “I am running this cluster for one
user”
– It almost never stops there
– Groups, Teams, Users
• Adhoc structures get bad real fast
• What’s different?
– Centered around Data
• ‘iIities
– Admission policies. Sharing. Security.
Elasticity. SLAs. ROI
Page 5
Architecting the Future of Big Data
Data
?
Applications
Admins Users
© Hortonworks Inc. 2015
What is Hadoop YARN?
Page 6
HDFS (Scalable, Reliable Storage)
YARN (Cluster Resource Management)
Applications (Running Natively in Hadoop)
• Store all your data in one place … (HDFS)
• Interact with that data in multiple ways … (YARN Platform + Apps)
• Scale as you go, shared, multi-tenant, secure … (The Hadoop Stack)
Queues Admins/Users
Cluster Resources
Pipelines
© Hortonworks Inc. 2015
Past
A quick history
Architecting the Future of Big Data
Page 7
© Hortonworks Inc. 2015
A brief Timeline before the BigBang
• Sub-project of Apache Hadoop
• Releases tied to Hadoop releases
• Gmail like alphas and betas 
– In production at several large sites for
MapReduce already by that time
Page 8
Architecting the Future of Big Data
1st line of Code Open sourced First 2.0 alpha First 2.0 beta
June-July 2010 August 2011 May 2012 August 2013
© Hortonworks Inc. 2015
Apache Hadoop YARN releases
• 15 October, 2013
• The 1st GA release of Apache Hadoop 2.x
• YARN
– First stable and supported release of YARN
– Binary Compatibility for MapReduce applications built on Hadoop-1.x
– YARN level APIs solidified for the future
– Performance
– Scale from the get-go!
• Support for running Hadoop on Microsoft Windows
• Substantial amount of integration testing with rest of projects in the
ecosystem
Page 9
Architecting the Future of Big Data
Apache Hadoop 2.2
© Hortonworks Inc. 2015
Releases (contd)
• 24 February, 2014
• First post GA release for the year 2014
• Number of bug-fixes, enhancements
• Alpha features in YARN
– ResourceManager Failover
– Application History
Page 10
Architecting the Future of Big Data
Apache Hadoop 2.3
© Hortonworks Inc. 2015
Releases (contd)
• 07 April, 2014
• YARN
– ResourceManager Fail-over
– Preemption aided Scheduling
– Application History and Timeline Service V1
Page 11
Architecting the Future of Big Data
Apache Hadoop 2.4
© Hortonworks Inc. 2015
Releases (contd)
• 11 August, 2014
• YARN
– YARN's REST APIs
– Submitting & killing applications.
– Timeline Service V1 Security
Page 12
Architecting the Future of Big Data
Apache Hadoop 2.5
© Hortonworks Inc. 2015
Present
Architecting the Future of Big Data
Page 13
© Hortonworks Inc. 2015
Apache Hadoop releases (contd)
• 18 November 2014
• Last major release at the time of this talk
• YARN
– Support for rolling upgrades
– Support for long running services
– Support for node labels
– Alpha/Beta features: Time-based resource reservations, running applications
natively in Docker containers
Page 14
Architecting the Future of Big Data
Apache Hadoop 2.6
© Hortonworks Inc. 2015
Rolling Upgrades
At a click of a button
Architecting the Future of Big Data
Page 15
© Hortonworks Inc. 2015
Work preserving ResourceManager restart
Page 16
Architecting the Future of Big Data
• ResourceManager remembers some state
• Reconstructs the remaining from nodes and apps
© Hortonworks Inc. 2015
Work preserving NodeManager restart
Page 17
Architecting the Future of Big Data
• NodeManager remembers state on each machine
• Reconnects to running containers
© Hortonworks Inc. 2015
ResourceManager Fail-over
• Active/Standby Mode
• Depends on fast-recovery
Page 18
Architecting the Future of Big Data
ZooKeeper
© Hortonworks Inc. 2015
YARN Rolling Upgrades Workflow
Page 19
Architecting the Future of Big Data
• Servers first
– Masters followed by Slaves
• Upgrade of Applications/Frameworks is decoupled!
© Hortonworks Inc. 2015
YARN Rolling Upgrades Snapshot
Page 20
Architecting the Future of Big Data
© Hortonworks Inc. 2015
Stack Rolling Upgrades
Page 21
Architecting the Future of Big Data
Rolling Updates Session by Sanjay Radia
Thursday April 16, 2015 11:45-12:25
@ Silver Hall
© Hortonworks Inc. 2015
Services on YARN
Architecting the Future of Big Data
Page 22
© Hortonworks Inc. 2015
Long running services
• You could run them already before
2.6!
• Enhancements needed
– Logs
– Security
– Management/monitoring
– Sharing and Placement
– Discovery
• Resource sharing across
workload types
• Fault tolerance of long running
services
– Work preserving AM restart
– AM forgetting faults
• Service registry
• Project Slider:
http://slider.incubator.apache.org/
• HBase, Storm, Kafka already!
Page 23
Architecting the Future of Big Data
“Bringing Long Running Services to Hadoop YARN”
by Steve Loughran
Thursday April 16, 2015 12:40-13:20
@ Copper Hall
© Hortonworks Inc. 2015
Cluster Management Features
Architecting the Future of Big Data
Page 24
© Hortonworks Inc. 2015
Preemption aided Scheduling
• Admins
– “Make the best use of cluster resources”
• Users
– “Give me resources fast”
• Solution
– Elastic queues
– Loan idle capacities to others
– Take it back on demand
– Balance across queues: In
– Balance across users in a queue: WIP
Page 25
Architecting the Future of Big Data
© Hortonworks Inc. 2015
Fine-grain isolation for multi-tenancy
• Memory
– Custom monitoring
– Inelastic Resource
• CPU
– Cgroups on Linux
– Elastic Resource
• Support on Windows
– WIP
Page 26
Architecting the Future of Big Data
© Hortonworks Inc. 2015
Multi-resource scheduling
• Multi-dimensional bin-packing
– Application A says “I want 8GB RAM
and 2 CPUs”
– Application B says “I want 1GB RAM
and 10 CPUs”
• Today – memory & cpu
– Physical memory / virtual memory
– Cpu Cores – Virtual cores
• Scheduling constrained based on
the “bottleneck” resource
– Watch out for utilization drop on the
non-scarce resource
Page 27
Architecting the Future of Big Data
© Hortonworks Inc. 2015
Node Labels
• Partitions
– Admin: “I have machines of different
types”
– Impact on capacity planning: “Hey,
we bought those Windows machines”
• Types
– Exclusive: “This is my Precious!”
– Non-exclusive: “I get binding
preference. Use it for others when
idle”
• Constraints
– “Take me to a machine running JDK
version 9”
– No impact on capacity planning
– WIP
Page 28
Architecting the Future of Big Data
Default Partition
Partition B
Linux
Partition C
Windows
JDK 8 JDK 7 JDK 7
© Hortonworks Inc. 2015
Operational and Developer tooling
Architecting the Future of Big Data
Page 29
© Hortonworks Inc. 2015
Application History and Timeline Service
• Before
– Few MR specific implementations:
History and web-UI
• Not just MR anymore!
• History
– “Why was my application slow?”
– “Where did my containers run?”
– MapReduce specific Job History
Server
– Need a generic solution beyond
ResourceManager Restart
• Run analytics on historical apps!
– “User with most resource utilization”
– “Largest application run”
• Application Timeline
– Framework specific event collection
and UIs
– “Show me the Counters for my
running MapReduce task”
– “Show me the slowest Storm stream
processing bolt while it is running”
• Present
– A LevelDB based implementation
– Integrated into MapReduce, Apache
Tez, Apache Hive
Page 30
Architecting the Future of Big Data
© Hortonworks Inc. 2015
Other features
• Web Services
– No need for installed Hadoop Clients
– Submit an app
– Monitor / Kill it
• Multi-homing Environments
– Clients on a public networks
– Cluster traffic on a private network
– Fault tolerance
– Security
Page 31
Architecting the Future of Big Data
© Hortonworks Inc. 2015
Future
Architecting the Future of Big Data
Page 32
© Hortonworks Inc. 2015
Apache Hadoop releases (contd)
• Hadoop 2.7
– Likely April 19-24 week, 2014
– Moving to JDK 7 and beyond
• Future
Page 33
Architecting the Future of Big Data
Apache Hadoop 2.7,
2.8 and beyond
© Hortonworks Inc. 2015
Future: Timeline Service Next Generation
• Next generation
– Today’s solution helped understand the space
– Limited scalability and availability
• Analyzing Hadoop Clusters is a big-data problem
– Don’t want to throw away the Hadoop application metadata
– Large scale
– Enable near real-time analysis: “Find me the user who is hammering the
FileSystem with rouge applications. Now.”
• Timeline data stored in HBase and accessible to queries
Page 34
Architecting the Future of Big Data
© Hortonworks Inc. 2015
Future: Improved Usability
• Generic run-time information
– “What is my actual usage by the running container?”
– “How many rack local containers did I get”
– “How healthy is the scheduler”
– “Why is my application stuck? What limits did it hit?”
• With Timeline Service
– Why is my application slow?
– Why is my cluster slow?
– Why is my application failing?
– Why is my cluster down?
– What happened with my application? Succeeded?
– What happened in my clusters?
• Collect and use past data
– To schedule my application better
– To do better capacity planning
Page 35
Architecting the Future of Big Data
© Hortonworks Inc. 2015
Future: Containerized Applications
• Running Containerized
Applications on YARN
• Docker
• Multiple use-cases
– Run my existing service on YARN
– Slider + Docker
– Run my existing MapReduce
application on YARN via a docker
image
Page 36
Architecting the Future of Big Data
© Hortonworks Inc. 2015
Future: Scheduling
• Support priorities across
applications within the same
queue
• Policy Driven scheduling
– “I want app level fairness in queue A,
user level fairness in queue B, and
throughput focus in all other queues”
• Node anti-affinity
– “Do not run two copies of my service
daemon on the same machine”
• Gang scheduling
– “Run all of my app at once”
• Dynamic scheduling of containers
based on actual utilization
• Stabilized App Reservations
– “Create a reservation for my app with
X resources to run at 6AM tomorrow”
• Time based policies
– “10% cluster capacity for queue A
from 6-9AM, but 20% from 9-12AM”
• Prioritized queues
– Admin’s queue takes precedence
over everything else
• Lot more ..
Page 37
Architecting the Future of Big Data
© Hortonworks Inc. 2015
Future: More Resource Types
• Node level Isolation and Cluster
level Scheduling
• Disks
– Space
– IOPS: Read/Write
• Network
– Incoming bandwidth
– Outgoing bandwidth
Page 38
Architecting the Future of Big Data
© Hortonworks Inc. 2015
Thank you!
Page 39
Architecting the Future of Big Data
Sandbox: Hadoop in a VM!
Questions Time!

More Related Content

PDF
Apache Hadoop YARN - Enabling Next Generation Data Applications
PPTX
Hadoop Summit Europe 2015 - YARN Present and Future
PPTX
Apache Hadoop YARN: Present and Future
PPTX
Hadoop Summit Europe Talk 2014: Apache Hadoop YARN: Present and Future
PPTX
YARN - Presented At Dallas Hadoop User Group
PDF
Apache Hadoop YARN - The Future of Data Processing with Hadoop
PPTX
Writing Yarn Applications Hadoop Summit 2012
PPTX
Enabling Diverse Workload Scheduling in YARN
Apache Hadoop YARN - Enabling Next Generation Data Applications
Hadoop Summit Europe 2015 - YARN Present and Future
Apache Hadoop YARN: Present and Future
Hadoop Summit Europe Talk 2014: Apache Hadoop YARN: Present and Future
YARN - Presented At Dallas Hadoop User Group
Apache Hadoop YARN - The Future of Data Processing with Hadoop
Writing Yarn Applications Hadoop Summit 2012
Enabling Diverse Workload Scheduling in YARN

What's hot (20)

PDF
Apache Hadoop YARN – Multi-Tenancy, Capacity Scheduler & Preemption - Stamped...
PPTX
Apache Hadoop YARN: best practices
PDF
Yarns About Yarn
PDF
Scale 12 x Efficient Multi-tenant Hadoop 2 Workloads with Yarn
PPTX
YARN - Next Generation Compute Platform fo Hadoop
PPTX
NextGen Apache Hadoop MapReduce
PPTX
Towards SLA-based Scheduling on YARN Clusters
PDF
Introduction to YARN Apps
PPTX
Apache Hadoop YARN: Past, Present and Future
PDF
Yarn
PPTX
Running Non-MapReduce Big Data Applications on Apache Hadoop
PPTX
Apache Tez - Accelerating Hadoop Data Processing
PDF
Apache Hadoop YARN
ODP
An Introduction to Apache Hadoop Yarn
PDF
Hadoop 2 - Going beyond MapReduce
PDF
Developing Applications with Hadoop 2.0 and YARN by Abhijit Lele
PPTX
Apache Tez - A New Chapter in Hadoop Data Processing
PDF
Hive Now Sparks
PPTX
Get most out of Spark on YARN
PPTX
Moving towards enterprise ready Hadoop clusters on the cloud
Apache Hadoop YARN – Multi-Tenancy, Capacity Scheduler & Preemption - Stamped...
Apache Hadoop YARN: best practices
Yarns About Yarn
Scale 12 x Efficient Multi-tenant Hadoop 2 Workloads with Yarn
YARN - Next Generation Compute Platform fo Hadoop
NextGen Apache Hadoop MapReduce
Towards SLA-based Scheduling on YARN Clusters
Introduction to YARN Apps
Apache Hadoop YARN: Past, Present and Future
Yarn
Running Non-MapReduce Big Data Applications on Apache Hadoop
Apache Tez - Accelerating Hadoop Data Processing
Apache Hadoop YARN
An Introduction to Apache Hadoop Yarn
Hadoop 2 - Going beyond MapReduce
Developing Applications with Hadoop 2.0 and YARN by Abhijit Lele
Apache Tez - A New Chapter in Hadoop Data Processing
Hive Now Sparks
Get most out of Spark on YARN
Moving towards enterprise ready Hadoop clusters on the cloud
Ad

Viewers also liked (14)

PDF
BIGDATA- Survey on Scheduling Methods in Hadoop MapReduce
PDF
MACHINE LEARNING ON MAPREDUCE FRAMEWORK
PPTX
HADOOP TECHNOLOGY ppt
PPSX
PPTX
Introduction to Yarn
PPTX
Apache Hadoop YARN: Understanding the Data Operating System of Hadoop
DOCX
Hadoop Report
ODP
Hadoop demo ppt
PPTX
Big Data & Hadoop Tutorial
PPT
Seminar Presentation Hadoop
PPTX
Hadoop introduction , Why and What is Hadoop ?
PDF
Hadoop Overview & Architecture
 
PPTX
Big data and Hadoop
PPTX
Introduction to YARN and MapReduce 2
BIGDATA- Survey on Scheduling Methods in Hadoop MapReduce
MACHINE LEARNING ON MAPREDUCE FRAMEWORK
HADOOP TECHNOLOGY ppt
Introduction to Yarn
Apache Hadoop YARN: Understanding the Data Operating System of Hadoop
Hadoop Report
Hadoop demo ppt
Big Data & Hadoop Tutorial
Seminar Presentation Hadoop
Hadoop introduction , Why and What is Hadoop ?
Hadoop Overview & Architecture
 
Big data and Hadoop
Introduction to YARN and MapReduce 2
Ad

Similar to Apache Hadoop YARN 2015: Present and Future (20)

PPTX
Apache Hadoop YARN: Present and Future
PPTX
Get Started Building YARN Applications
PPTX
Hadoop Summit San Jose 2015: YARN - Past, Present and Future
PPTX
Munich HUG 21.11.2013
PPTX
Dataworks Berlin Summit 18' - Apache hadoop YARN State Of The Union
PPTX
Apache Hadoop YARN: state of the union
PPTX
MHUG - YARN
PPTX
Mrinal devadas, Hortonworks Making Sense Of Big Data
PPTX
201305 hadoop jpl-v3
PPTX
Hadoop In Action
PDF
Apache Ratis - In Search of a Usable Raft Library
PDF
Deploying and Managing Hadoop Clusters with AMBARI
PPTX
YARN Ready - Integrating to YARN using Slider Webinar
PDF
Transitioning Compute Models: Hadoop MapReduce to Spark
PPTX
Hadoop operations-2014-strata-new-york-v5
PPTX
YARN - Hadoop Next Generation Compute Platform
PDF
Bikas saha:the next generation of hadoop– hadoop 2 and yarn
PDF
Running Hadoop as Service in AltiScale Platform
PPTX
One Click Hadoop Clusters - Anywhere (Using Docker)
PPTX
Apache Hadoop YARN: Present and Future
Apache Hadoop YARN: Present and Future
Get Started Building YARN Applications
Hadoop Summit San Jose 2015: YARN - Past, Present and Future
Munich HUG 21.11.2013
Dataworks Berlin Summit 18' - Apache hadoop YARN State Of The Union
Apache Hadoop YARN: state of the union
MHUG - YARN
Mrinal devadas, Hortonworks Making Sense Of Big Data
201305 hadoop jpl-v3
Hadoop In Action
Apache Ratis - In Search of a Usable Raft Library
Deploying and Managing Hadoop Clusters with AMBARI
YARN Ready - Integrating to YARN using Slider Webinar
Transitioning Compute Models: Hadoop MapReduce to Spark
Hadoop operations-2014-strata-new-york-v5
YARN - Hadoop Next Generation Compute Platform
Bikas saha:the next generation of hadoop– hadoop 2 and yarn
Running Hadoop as Service in AltiScale Platform
One Click Hadoop Clusters - Anywhere (Using Docker)
Apache Hadoop YARN: Present and Future

More from DataWorks Summit (20)

PPTX
Data Science Crash Course
PPTX
Floating on a RAFT: HBase Durability with Apache Ratis
PPTX
Tracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFi
PDF
HBase Tales From the Trenches - Short stories about most common HBase operati...
PPTX
Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...
PPTX
Managing the Dewey Decimal System
PPTX
Practical NoSQL: Accumulo's dirlist Example
PPTX
HBase Global Indexing to support large-scale data ingestion at Uber
PPTX
Scaling Cloud-Scale Translytics Workloads with Omid and Phoenix
PPTX
Building the High Speed Cybersecurity Data Pipeline Using Apache NiFi
PPTX
Supporting Apache HBase : Troubleshooting and Supportability Improvements
PPTX
Security Framework for Multitenant Architecture
PDF
Presto: Optimizing Performance of SQL-on-Anything Engine
PPTX
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...
PPTX
Extending Twitter's Data Platform to Google Cloud
PPTX
Event-Driven Messaging and Actions using Apache Flink and Apache NiFi
PPTX
Securing Data in Hybrid on-premise and Cloud Environments using Apache Ranger
PPTX
Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...
PDF
Computer Vision: Coming to a Store Near You
PPTX
Big Data Genomics: Clustering Billions of DNA Sequences with Apache Spark
Data Science Crash Course
Floating on a RAFT: HBase Durability with Apache Ratis
Tracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFi
HBase Tales From the Trenches - Short stories about most common HBase operati...
Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...
Managing the Dewey Decimal System
Practical NoSQL: Accumulo's dirlist Example
HBase Global Indexing to support large-scale data ingestion at Uber
Scaling Cloud-Scale Translytics Workloads with Omid and Phoenix
Building the High Speed Cybersecurity Data Pipeline Using Apache NiFi
Supporting Apache HBase : Troubleshooting and Supportability Improvements
Security Framework for Multitenant Architecture
Presto: Optimizing Performance of SQL-on-Anything Engine
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...
Extending Twitter's Data Platform to Google Cloud
Event-Driven Messaging and Actions using Apache Flink and Apache NiFi
Securing Data in Hybrid on-premise and Cloud Environments using Apache Ranger
Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...
Computer Vision: Coming to a Store Near You
Big Data Genomics: Clustering Billions of DNA Sequences with Apache Spark

Recently uploaded (20)

DOCX
The AUB Centre for AI in Media Proposal.docx
PDF
Unlocking AI with Model Context Protocol (MCP)
PDF
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
PDF
Chapter 3 Spatial Domain Image Processing.pdf
PDF
cuic standard and advanced reporting.pdf
PDF
NewMind AI Monthly Chronicles - July 2025
PPT
“AI and Expert System Decision Support & Business Intelligence Systems”
PDF
Modernizing your data center with Dell and AMD
PDF
Advanced Soft Computing BINUS July 2025.pdf
PDF
Advanced IT Governance
PDF
Advanced methodologies resolving dimensionality complications for autism neur...
PDF
KodekX | Application Modernization Development
PDF
Electronic commerce courselecture one. Pdf
PDF
CIFDAQ's Market Insight: SEC Turns Pro Crypto
PDF
Mobile App Security Testing_ A Comprehensive Guide.pdf
PDF
Spectral efficient network and resource selection model in 5G networks
PDF
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
PDF
Per capita expenditure prediction using model stacking based on satellite ima...
PPTX
20250228 LYD VKU AI Blended-Learning.pptx
PDF
GDG Cloud Iasi [PUBLIC] Florian Blaga - Unveiling the Evolution of Cybersecur...
The AUB Centre for AI in Media Proposal.docx
Unlocking AI with Model Context Protocol (MCP)
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
Chapter 3 Spatial Domain Image Processing.pdf
cuic standard and advanced reporting.pdf
NewMind AI Monthly Chronicles - July 2025
“AI and Expert System Decision Support & Business Intelligence Systems”
Modernizing your data center with Dell and AMD
Advanced Soft Computing BINUS July 2025.pdf
Advanced IT Governance
Advanced methodologies resolving dimensionality complications for autism neur...
KodekX | Application Modernization Development
Electronic commerce courselecture one. Pdf
CIFDAQ's Market Insight: SEC Turns Pro Crypto
Mobile App Security Testing_ A Comprehensive Guide.pdf
Spectral efficient network and resource selection model in 5G networks
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
Per capita expenditure prediction using model stacking based on satellite ima...
20250228 LYD VKU AI Blended-Learning.pptx
GDG Cloud Iasi [PUBLIC] Florian Blaga - Unveiling the Evolution of Cybersecur...

Apache Hadoop YARN 2015: Present and Future

  • 1. © Hortonworks Inc. 2015 Apache Hadoop YARN 2015 Present and Future Vinod Kumar Vavilapalli vinodkv [at] apache.org @tshooter Page 1
  • 2. © Hortonworks Inc. 2015 Who am I? • 7.75 Hadoop-years old – Don’t fall for the job postings asking for 10 years #Hadoop Experience yet  • Past – 2007: Last thing at School – a two node Tomcat cluster. Three months later, first thing at job, brought down a 800 node cluster ;) – Team that ran Hadoop @ Yahoo! • Present: @Hortonworks • Two hats – Hortonworks: Hadoop MapReduce and YARN Development lead – Apache: Apache Hadoop PMC, Apache Member • Worked/working on – YARN, Hadoop MapReduce, HadoopOnDemand, CapacityScheduler, Hadoop security – Apache Ambari: Kickstarted the project’s first release – Stinger: High performance data processing with Hadoop/Hive • Lots of trouble shooting on clusters (@tshooter) • 99% + code in Apache, Hadoop – Open Source – Community driven Page 2 Architecting the Future of Big Data
  • 3. © Hortonworks Inc. 2015 Agenda • Apache Hadoop YARN : Overview • Past • Present • Future Page 3 Architecting the Future of Big Data
  • 4. © Hortonworks Inc. 2015 Overview The Why and the What Architecting the Future of Big Data Page 4
  • 5. © Hortonworks Inc. 2015 Why Hadoop YARN? • Resource Management • A messy problem – Multiple apps, frameworks, their life- cycles and evolution • Varied expectations – On isolation, capacity allocations, scheduling – Admin: “Best use of my cluster” – Users: “Get me as much as possible, as fast as possible” • Tenancy – “I am running this cluster for one user” – It almost never stops there – Groups, Teams, Users • Adhoc structures get bad real fast • What’s different? – Centered around Data • ‘iIities – Admission policies. Sharing. Security. Elasticity. SLAs. ROI Page 5 Architecting the Future of Big Data Data ? Applications Admins Users
  • 6. © Hortonworks Inc. 2015 What is Hadoop YARN? Page 6 HDFS (Scalable, Reliable Storage) YARN (Cluster Resource Management) Applications (Running Natively in Hadoop) • Store all your data in one place … (HDFS) • Interact with that data in multiple ways … (YARN Platform + Apps) • Scale as you go, shared, multi-tenant, secure … (The Hadoop Stack) Queues Admins/Users Cluster Resources Pipelines
  • 7. © Hortonworks Inc. 2015 Past A quick history Architecting the Future of Big Data Page 7
  • 8. © Hortonworks Inc. 2015 A brief Timeline before the BigBang • Sub-project of Apache Hadoop • Releases tied to Hadoop releases • Gmail like alphas and betas  – In production at several large sites for MapReduce already by that time Page 8 Architecting the Future of Big Data 1st line of Code Open sourced First 2.0 alpha First 2.0 beta June-July 2010 August 2011 May 2012 August 2013
  • 9. © Hortonworks Inc. 2015 Apache Hadoop YARN releases • 15 October, 2013 • The 1st GA release of Apache Hadoop 2.x • YARN – First stable and supported release of YARN – Binary Compatibility for MapReduce applications built on Hadoop-1.x – YARN level APIs solidified for the future – Performance – Scale from the get-go! • Support for running Hadoop on Microsoft Windows • Substantial amount of integration testing with rest of projects in the ecosystem Page 9 Architecting the Future of Big Data Apache Hadoop 2.2
  • 10. © Hortonworks Inc. 2015 Releases (contd) • 24 February, 2014 • First post GA release for the year 2014 • Number of bug-fixes, enhancements • Alpha features in YARN – ResourceManager Failover – Application History Page 10 Architecting the Future of Big Data Apache Hadoop 2.3
  • 11. © Hortonworks Inc. 2015 Releases (contd) • 07 April, 2014 • YARN – ResourceManager Fail-over – Preemption aided Scheduling – Application History and Timeline Service V1 Page 11 Architecting the Future of Big Data Apache Hadoop 2.4
  • 12. © Hortonworks Inc. 2015 Releases (contd) • 11 August, 2014 • YARN – YARN's REST APIs – Submitting & killing applications. – Timeline Service V1 Security Page 12 Architecting the Future of Big Data Apache Hadoop 2.5
  • 13. © Hortonworks Inc. 2015 Present Architecting the Future of Big Data Page 13
  • 14. © Hortonworks Inc. 2015 Apache Hadoop releases (contd) • 18 November 2014 • Last major release at the time of this talk • YARN – Support for rolling upgrades – Support for long running services – Support for node labels – Alpha/Beta features: Time-based resource reservations, running applications natively in Docker containers Page 14 Architecting the Future of Big Data Apache Hadoop 2.6
  • 15. © Hortonworks Inc. 2015 Rolling Upgrades At a click of a button Architecting the Future of Big Data Page 15
  • 16. © Hortonworks Inc. 2015 Work preserving ResourceManager restart Page 16 Architecting the Future of Big Data • ResourceManager remembers some state • Reconstructs the remaining from nodes and apps
  • 17. © Hortonworks Inc. 2015 Work preserving NodeManager restart Page 17 Architecting the Future of Big Data • NodeManager remembers state on each machine • Reconnects to running containers
  • 18. © Hortonworks Inc. 2015 ResourceManager Fail-over • Active/Standby Mode • Depends on fast-recovery Page 18 Architecting the Future of Big Data ZooKeeper
  • 19. © Hortonworks Inc. 2015 YARN Rolling Upgrades Workflow Page 19 Architecting the Future of Big Data • Servers first – Masters followed by Slaves • Upgrade of Applications/Frameworks is decoupled!
  • 20. © Hortonworks Inc. 2015 YARN Rolling Upgrades Snapshot Page 20 Architecting the Future of Big Data
  • 21. © Hortonworks Inc. 2015 Stack Rolling Upgrades Page 21 Architecting the Future of Big Data Rolling Updates Session by Sanjay Radia Thursday April 16, 2015 11:45-12:25 @ Silver Hall
  • 22. © Hortonworks Inc. 2015 Services on YARN Architecting the Future of Big Data Page 22
  • 23. © Hortonworks Inc. 2015 Long running services • You could run them already before 2.6! • Enhancements needed – Logs – Security – Management/monitoring – Sharing and Placement – Discovery • Resource sharing across workload types • Fault tolerance of long running services – Work preserving AM restart – AM forgetting faults • Service registry • Project Slider: http://slider.incubator.apache.org/ • HBase, Storm, Kafka already! Page 23 Architecting the Future of Big Data “Bringing Long Running Services to Hadoop YARN” by Steve Loughran Thursday April 16, 2015 12:40-13:20 @ Copper Hall
  • 24. © Hortonworks Inc. 2015 Cluster Management Features Architecting the Future of Big Data Page 24
  • 25. © Hortonworks Inc. 2015 Preemption aided Scheduling • Admins – “Make the best use of cluster resources” • Users – “Give me resources fast” • Solution – Elastic queues – Loan idle capacities to others – Take it back on demand – Balance across queues: In – Balance across users in a queue: WIP Page 25 Architecting the Future of Big Data
  • 26. © Hortonworks Inc. 2015 Fine-grain isolation for multi-tenancy • Memory – Custom monitoring – Inelastic Resource • CPU – Cgroups on Linux – Elastic Resource • Support on Windows – WIP Page 26 Architecting the Future of Big Data
  • 27. © Hortonworks Inc. 2015 Multi-resource scheduling • Multi-dimensional bin-packing – Application A says “I want 8GB RAM and 2 CPUs” – Application B says “I want 1GB RAM and 10 CPUs” • Today – memory & cpu – Physical memory / virtual memory – Cpu Cores – Virtual cores • Scheduling constrained based on the “bottleneck” resource – Watch out for utilization drop on the non-scarce resource Page 27 Architecting the Future of Big Data
  • 28. © Hortonworks Inc. 2015 Node Labels • Partitions – Admin: “I have machines of different types” – Impact on capacity planning: “Hey, we bought those Windows machines” • Types – Exclusive: “This is my Precious!” – Non-exclusive: “I get binding preference. Use it for others when idle” • Constraints – “Take me to a machine running JDK version 9” – No impact on capacity planning – WIP Page 28 Architecting the Future of Big Data Default Partition Partition B Linux Partition C Windows JDK 8 JDK 7 JDK 7
  • 29. © Hortonworks Inc. 2015 Operational and Developer tooling Architecting the Future of Big Data Page 29
  • 30. © Hortonworks Inc. 2015 Application History and Timeline Service • Before – Few MR specific implementations: History and web-UI • Not just MR anymore! • History – “Why was my application slow?” – “Where did my containers run?” – MapReduce specific Job History Server – Need a generic solution beyond ResourceManager Restart • Run analytics on historical apps! – “User with most resource utilization” – “Largest application run” • Application Timeline – Framework specific event collection and UIs – “Show me the Counters for my running MapReduce task” – “Show me the slowest Storm stream processing bolt while it is running” • Present – A LevelDB based implementation – Integrated into MapReduce, Apache Tez, Apache Hive Page 30 Architecting the Future of Big Data
  • 31. © Hortonworks Inc. 2015 Other features • Web Services – No need for installed Hadoop Clients – Submit an app – Monitor / Kill it • Multi-homing Environments – Clients on a public networks – Cluster traffic on a private network – Fault tolerance – Security Page 31 Architecting the Future of Big Data
  • 32. © Hortonworks Inc. 2015 Future Architecting the Future of Big Data Page 32
  • 33. © Hortonworks Inc. 2015 Apache Hadoop releases (contd) • Hadoop 2.7 – Likely April 19-24 week, 2014 – Moving to JDK 7 and beyond • Future Page 33 Architecting the Future of Big Data Apache Hadoop 2.7, 2.8 and beyond
  • 34. © Hortonworks Inc. 2015 Future: Timeline Service Next Generation • Next generation – Today’s solution helped understand the space – Limited scalability and availability • Analyzing Hadoop Clusters is a big-data problem – Don’t want to throw away the Hadoop application metadata – Large scale – Enable near real-time analysis: “Find me the user who is hammering the FileSystem with rouge applications. Now.” • Timeline data stored in HBase and accessible to queries Page 34 Architecting the Future of Big Data
  • 35. © Hortonworks Inc. 2015 Future: Improved Usability • Generic run-time information – “What is my actual usage by the running container?” – “How many rack local containers did I get” – “How healthy is the scheduler” – “Why is my application stuck? What limits did it hit?” • With Timeline Service – Why is my application slow? – Why is my cluster slow? – Why is my application failing? – Why is my cluster down? – What happened with my application? Succeeded? – What happened in my clusters? • Collect and use past data – To schedule my application better – To do better capacity planning Page 35 Architecting the Future of Big Data
  • 36. © Hortonworks Inc. 2015 Future: Containerized Applications • Running Containerized Applications on YARN • Docker • Multiple use-cases – Run my existing service on YARN – Slider + Docker – Run my existing MapReduce application on YARN via a docker image Page 36 Architecting the Future of Big Data
  • 37. © Hortonworks Inc. 2015 Future: Scheduling • Support priorities across applications within the same queue • Policy Driven scheduling – “I want app level fairness in queue A, user level fairness in queue B, and throughput focus in all other queues” • Node anti-affinity – “Do not run two copies of my service daemon on the same machine” • Gang scheduling – “Run all of my app at once” • Dynamic scheduling of containers based on actual utilization • Stabilized App Reservations – “Create a reservation for my app with X resources to run at 6AM tomorrow” • Time based policies – “10% cluster capacity for queue A from 6-9AM, but 20% from 9-12AM” • Prioritized queues – Admin’s queue takes precedence over everything else • Lot more .. Page 37 Architecting the Future of Big Data
  • 38. © Hortonworks Inc. 2015 Future: More Resource Types • Node level Isolation and Cluster level Scheduling • Disks – Space – IOPS: Read/Write • Network – Incoming bandwidth – Outgoing bandwidth Page 38 Architecting the Future of Big Data
  • 39. © Hortonworks Inc. 2015 Thank you! Page 39 Architecting the Future of Big Data Sandbox: Hadoop in a VM! Questions Time!

Editor's Notes

  • #6: YARN is not the first general Resource Management platform. So what’s different? It’s data!
  • #7: Queues reflect org structures. Hierarchical in nature.