SlideShare a Scribd company logo
Datacenter Operating
System
Presented By:
Keshav Yadav
www.linkedin.com/in/keshavyadavlpu
What is DCOS ?
• Some have declared that “the datacenter is
the new computer”
• Claim: this new computer increasingly
needs an operating system
• Not necessarily a new host OS, but a
common software layer that manages
resources and provides shared services for
the whole datacenter, like an OS does for
one host
Why Datacenters Need an OS
• Growing number of applications
– Parallel processing systems: MapReduce,
Dryad, Pregel, Percolator, Dremel, MR Online
– Storage systems: GFS, BigTable, Dynamo,
SCADS
– Web apps and supporting services
• Growing number of users
– 200+ for Facebook’s Hadoop data
warehouse, running near-interactive ad hoc
queries
What Operating Systems Provide
• Resource sharing across applications &
users
• Data sharing between programs
• Programming abstractions (e.g. threads,
IPC)
• Debugging facilities (e.g. ptrace, gdb)
Result: OSes enable a highly interoperable
software ecosystem that we now take for
granted
Today’s Datacenter OS
• Hadoop MapReduce as common
execution and resource sharing platform
• Hadoop InputFormat API for data sharing
• Abstractions for productivity programmers,
but not for system builders
• Very challenging to debug across all the
layers
Tomorrow’s Datacenter OS
• Resource sharing:
– Lower-level interfaces for fine-grained sharing
(Mesos is a first step in this direction)
– Optimization for a variety of metrics (e.g.
energy)
– Integration with network scheduling
mechanisms (e.g. Seawall [NSDI ‘11], NOX,
Orchestra)
Tomorrow’s Datacenter OS
• Data sharing:
– Standard interfaces for cluster file systems,
key-value stores, etc
– In-memory data sharing (e.g. Spark, DFS
cache), and a unified system to manage this
memory
– Streaming data abstractions (analogous to
pipes)
– Lineage instead of replication for reliability
(RDDs)
Tomorrow’s Datacenter OS
• Programming abstractions:
– Tools that can be used to build the next
MapReduce / BigTable in a week (e.g.
BOOM)
– Efficient implementations of communication
primitives (e.g. shuffle, broadcast)
– New distributed programming models
Tomorrow’s Datacenter OS
• Debugging facilities:
– Tracing and debugging tools that work across
the cluster software stack (e.g. X-Trace,
Dapper)
– Replay debugging that takes advantage of
limited languages / computational models
– Unified monitoring infrastructure and APIs
Putting it AllTogether
• A successful datacenter OS might let
users:
– Build a Hadoop-like software stack in a week
using the OS’s abstractions, while gaining
other benefits (e.g. cross-stack replay
debugging)
– Share data efficiently between independently
developed programming models and
applications
– Understand cluster behavior without having to
log into individual nodes
– Dynamically share the cluster with other users
Future Of DCOS• Focus on paradigms, not only performance
– Industry is spending a lot of time on performance
• Explore clean-slate approaches
– Much datacenter software is written from scratch
– People using Erlang, Scala, functional models
(MR)
• Bring cluster computing to non-experts
– Most impactful (datacenter as the new
workstation)
– Hard to make a Google-scale stack usable
without a Google-scale ops team
Conclusion
• Datacenters need an OS-like software
stack for the same reasons single
computers did: manageability, efficiency &
programmability
• An OS is already emerging in an ad-hoc
way
• Researchers can help by taking a long-
term approach towards these problems
Data Center Operating System

More Related Content

PPTX
Data Management on Hadoop at Yahoo!
PPTX
Big data and hadoop
PPTX
Hadoop
PPTX
Big data vahidamiri-datastack.ir
PDF
Short introduction to ML frameworks on Hadoop
PPTX
Big data technology unit 3
PPTX
Big Data Unit 4 - Hadoop
Data Management on Hadoop at Yahoo!
Big data and hadoop
Hadoop
Big data vahidamiri-datastack.ir
Short introduction to ML frameworks on Hadoop
Big data technology unit 3
Big Data Unit 4 - Hadoop

What's hot (19)

PPT
Cloud computing and Hadoop introduction
PPTX
Data lake-itweekend-sharif university-vahid amiry
PPTX
عصر کلان داده، چرا و چگونه؟
PPTX
Extending your Hadoop Implementation to the Cloud
PPT
Hadoop mapreduce and yarn frame work- unit5
ODP
RDBMS and Hadoop
PPTX
Apache hadoop technology : Beginners
PPT
Hw09 Rethinking The Data Warehouse With Hadoop And Hive
PDF
P.Maharajothi,II-M.sc(computer science),Bon secours college for women,thanjavur.
PPTX
Big data architecture on cloud computing infrastructure
PPTX
سکوهای ابری و مدل های برنامه نویسی در ابر
DOCX
Cassandra data modelling best practices
PPTX
PPTX
Hadoop
PPTX
Hadoop
PPTX
4. hadoop גיא לבנברג
PPTX
Big data analysis using hadoop cluster
PPTX
PPT on Hadoop
PPTX
Hadoop Architecture
Cloud computing and Hadoop introduction
Data lake-itweekend-sharif university-vahid amiry
عصر کلان داده، چرا و چگونه؟
Extending your Hadoop Implementation to the Cloud
Hadoop mapreduce and yarn frame work- unit5
RDBMS and Hadoop
Apache hadoop technology : Beginners
Hw09 Rethinking The Data Warehouse With Hadoop And Hive
P.Maharajothi,II-M.sc(computer science),Bon secours college for women,thanjavur.
Big data architecture on cloud computing infrastructure
سکوهای ابری و مدل های برنامه نویسی در ابر
Cassandra data modelling best practices
Hadoop
Hadoop
4. hadoop גיא לבנברג
Big data analysis using hadoop cluster
PPT on Hadoop
Hadoop Architecture
Ad

Similar to Data Center Operating System (20)

PPTX
Cloud Services for Big Data Analytics
PPTX
Cloud Services for Big Data Analytics
PPTX
Introduction to Apache Hadoop
PPTX
Lecture 3.31 3.32.pptx
PPT
Big Data & Hadoop
PPT
Map reducecloudtech
PPT
Hadoop tutorial
PPT
Hadoop Tutorial.ppt
PPTX
Research on vector spatial data storage scheme based
PDF
Hadoop - Architectural road map for Hadoop Ecosystem
PPTX
Infinitely Scalable Clusters - Grid Computing on Public Cloud - London
PPTX
Big Data and Cloud Computing
PPTX
BIg Data Analytics-Module-2 as per vtu syllabus.pptx
PPTX
Module-2_HADOOP.pptx
PPTX
BIg Data Analytics-Module-2 vtu engineering.pptx
PPTX
Matching Data Intensive Applications and Hardware/Software Architectures
PPTX
Matching Data Intensive Applications and Hardware/Software Architectures
PPTX
project--2 nd review_2
PPTX
project--2 nd review_2
PPTX
High Performance Computing and Big Data
Cloud Services for Big Data Analytics
Cloud Services for Big Data Analytics
Introduction to Apache Hadoop
Lecture 3.31 3.32.pptx
Big Data & Hadoop
Map reducecloudtech
Hadoop tutorial
Hadoop Tutorial.ppt
Research on vector spatial data storage scheme based
Hadoop - Architectural road map for Hadoop Ecosystem
Infinitely Scalable Clusters - Grid Computing on Public Cloud - London
Big Data and Cloud Computing
BIg Data Analytics-Module-2 as per vtu syllabus.pptx
Module-2_HADOOP.pptx
BIg Data Analytics-Module-2 vtu engineering.pptx
Matching Data Intensive Applications and Hardware/Software Architectures
Matching Data Intensive Applications and Hardware/Software Architectures
project--2 nd review_2
project--2 nd review_2
High Performance Computing and Big Data
Ad

Recently uploaded (20)

PDF
Automation-in-Manufacturing-Chapter-Introduction.pdf
PPTX
MET 305 2019 SCHEME MODULE 2 COMPLETE.pptx
PPTX
Infosys Presentation by1.Riyan Bagwan 2.Samadhan Naiknavare 3.Gaurav Shinde 4...
PPTX
CH1 Production IntroductoryConcepts.pptx
PDF
The CXO Playbook 2025 – Future-Ready Strategies for C-Suite Leaders Cerebrai...
PDF
keyrequirementskkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkk
PPTX
MCN 401 KTU-2019-PPE KITS-MODULE 2.pptx
PPTX
Lecture Notes Electrical Wiring System Components
DOCX
573137875-Attendance-Management-System-original
PPTX
web development for engineering and engineering
PDF
composite construction of structures.pdf
PDF
PRIZ Academy - 9 Windows Thinking Where to Invest Today to Win Tomorrow.pdf
PDF
R24 SURVEYING LAB MANUAL for civil enggi
PPTX
Construction Project Organization Group 2.pptx
PDF
TFEC-4-2020-Design-Guide-for-Timber-Roof-Trusses.pdf
PPTX
OOP with Java - Java Introduction (Basics)
DOCX
ASol_English-Language-Literature-Set-1-27-02-2023-converted.docx
PPTX
Engineering Ethics, Safety and Environment [Autosaved] (1).pptx
PDF
BMEC211 - INTRODUCTION TO MECHATRONICS-1.pdf
PPT
Mechanical Engineering MATERIALS Selection
Automation-in-Manufacturing-Chapter-Introduction.pdf
MET 305 2019 SCHEME MODULE 2 COMPLETE.pptx
Infosys Presentation by1.Riyan Bagwan 2.Samadhan Naiknavare 3.Gaurav Shinde 4...
CH1 Production IntroductoryConcepts.pptx
The CXO Playbook 2025 – Future-Ready Strategies for C-Suite Leaders Cerebrai...
keyrequirementskkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkk
MCN 401 KTU-2019-PPE KITS-MODULE 2.pptx
Lecture Notes Electrical Wiring System Components
573137875-Attendance-Management-System-original
web development for engineering and engineering
composite construction of structures.pdf
PRIZ Academy - 9 Windows Thinking Where to Invest Today to Win Tomorrow.pdf
R24 SURVEYING LAB MANUAL for civil enggi
Construction Project Organization Group 2.pptx
TFEC-4-2020-Design-Guide-for-Timber-Roof-Trusses.pdf
OOP with Java - Java Introduction (Basics)
ASol_English-Language-Literature-Set-1-27-02-2023-converted.docx
Engineering Ethics, Safety and Environment [Autosaved] (1).pptx
BMEC211 - INTRODUCTION TO MECHATRONICS-1.pdf
Mechanical Engineering MATERIALS Selection

Data Center Operating System

  • 1. Datacenter Operating System Presented By: Keshav Yadav www.linkedin.com/in/keshavyadavlpu
  • 2. What is DCOS ? • Some have declared that “the datacenter is the new computer” • Claim: this new computer increasingly needs an operating system • Not necessarily a new host OS, but a common software layer that manages resources and provides shared services for the whole datacenter, like an OS does for one host
  • 3. Why Datacenters Need an OS • Growing number of applications – Parallel processing systems: MapReduce, Dryad, Pregel, Percolator, Dremel, MR Online – Storage systems: GFS, BigTable, Dynamo, SCADS – Web apps and supporting services • Growing number of users – 200+ for Facebook’s Hadoop data warehouse, running near-interactive ad hoc queries
  • 4. What Operating Systems Provide • Resource sharing across applications & users • Data sharing between programs • Programming abstractions (e.g. threads, IPC) • Debugging facilities (e.g. ptrace, gdb) Result: OSes enable a highly interoperable software ecosystem that we now take for granted
  • 5. Today’s Datacenter OS • Hadoop MapReduce as common execution and resource sharing platform • Hadoop InputFormat API for data sharing • Abstractions for productivity programmers, but not for system builders • Very challenging to debug across all the layers
  • 6. Tomorrow’s Datacenter OS • Resource sharing: – Lower-level interfaces for fine-grained sharing (Mesos is a first step in this direction) – Optimization for a variety of metrics (e.g. energy) – Integration with network scheduling mechanisms (e.g. Seawall [NSDI ‘11], NOX, Orchestra)
  • 7. Tomorrow’s Datacenter OS • Data sharing: – Standard interfaces for cluster file systems, key-value stores, etc – In-memory data sharing (e.g. Spark, DFS cache), and a unified system to manage this memory – Streaming data abstractions (analogous to pipes) – Lineage instead of replication for reliability (RDDs)
  • 8. Tomorrow’s Datacenter OS • Programming abstractions: – Tools that can be used to build the next MapReduce / BigTable in a week (e.g. BOOM) – Efficient implementations of communication primitives (e.g. shuffle, broadcast) – New distributed programming models
  • 9. Tomorrow’s Datacenter OS • Debugging facilities: – Tracing and debugging tools that work across the cluster software stack (e.g. X-Trace, Dapper) – Replay debugging that takes advantage of limited languages / computational models – Unified monitoring infrastructure and APIs
  • 10. Putting it AllTogether • A successful datacenter OS might let users: – Build a Hadoop-like software stack in a week using the OS’s abstractions, while gaining other benefits (e.g. cross-stack replay debugging) – Share data efficiently between independently developed programming models and applications – Understand cluster behavior without having to log into individual nodes – Dynamically share the cluster with other users
  • 11. Future Of DCOS• Focus on paradigms, not only performance – Industry is spending a lot of time on performance • Explore clean-slate approaches – Much datacenter software is written from scratch – People using Erlang, Scala, functional models (MR) • Bring cluster computing to non-experts – Most impactful (datacenter as the new workstation) – Hard to make a Google-scale stack usable without a Google-scale ops team
  • 12. Conclusion • Datacenters need an OS-like software stack for the same reasons single computers did: manageability, efficiency & programmability • An OS is already emerging in an ad-hoc way • Researchers can help by taking a long- term approach towards these problems

Editor's Notes

  • #3: Doesn’t have to be a host OS, but rather a software stack that performs the same functions as the host OS on a single computer
  • #4: Point out that apps are developed independently and assume they have dedicated (slices of) machines
  • #5: Go back to DC being the new computer
  • #9: Mention lower level storage interfaces such as block store
  • #13: Note about how it may be easier to have impact here than in a “real” OS