SlideShare a Scribd company logo
Copyright © 2012, Elsevier Inc. All rights reserved. 1
Chapter 6
Warehouse-Scale Computers
to Exploit Request-Level and
Data-Level Parallelism:
Computer Architecture
A Quantitative Approach, Fifth Edition
2
Copyright © 2012, Elsevier Inc. All rights reserved.
Introduction
 Warehouse-scale computer (WSC)
 Provides Internet services

Search, social networking, online maps, video sharing, online
shopping, email, cloud computing, etc.
 Differences with HPC “clusters”:

Clusters have higher performance processors and network

Clusters emphasize thread-level parallelism, WSCs
emphasize request-level parallelism
 Differences with datacenters:

Datacenters consolidate different machines and software into
one location

Datacenters emphasize virtual machines and hardware
heterogeneity in order to serve varied customers
Introduction
3
Copyright © 2012, Elsevier Inc. All rights reserved.
Introduction
 Important design factors for WSC:
 Cost-performance

Small savings add up
 Energy efficiency

Affects power distribution and cooling

Work per joule
 Dependability via redundancy
 Network I/O
 Interactive and batch processing workloads
 Ample computational parallelism is not important

Most jobs are totally independent

“Request-level parallelism”
 Operational costs count

Power consumption is a primary, not secondary, constraint when designing
system
 Scale and its opportunities and problems

Can afford to build customized systems since WSC require volume purchase
Introduction
4
Copyright © 2012, Elsevier Inc. All rights reserved.
Prgrm’g Models and Workloads
 Batch processing framework: MapReduce
 Map: applies a programmer-supplied function to each
logical input record

Runs on thousands of computers

Provides new set of key-value pairs as intermediate values
 Reduce: collapses values using another
programmer-supplied function
Programming
Models
and
Workloads
for
WSCs
5
Copyright © 2012, Elsevier Inc. All rights reserved.
Prgrm’g Models and Workloads
 Example:
 map (String key, String value):

// key: document name

// value: document contents

for each word w in value
 EmitIntermediate(w,”1”); // Produce list of all words
 reduce (String key, Iterator values):

// key: a word

// value: a list of counts

int result = 0;

for each v in values:
 result += ParseInt(v); // get integer from key-value pair

Emit(AsString(result));
Programming
Models
and
Workloads
for
WSCs
6
Copyright © 2012, Elsevier Inc. All rights reserved.
Prgrm’g Models and Workloads
 MapReduce runtime environment schedules
map and reduce task to WSC nodes
 Availability:
 Use replicas of data across different servers
 Use relaxed consistency:

No need for all replicas to always agree
 Workload demands
 Often vary considerably
Programming
Models
and
Workloads
for
WSCs
7
Copyright © 2012, Elsevier Inc. All rights reserved.
Computer Architecture of WSC
 WSC often use a hierarchy of networks for
interconnection
 Each 19” rack holds 48 1U servers connected
to a rack switch
 Rack switches are uplinked to switch higher
in hierarchy
 Uplink has 48 / n times lower bandwidth, where n
= # of uplink ports

“Oversubscription”
 Goal is to maximize locality of communication
relative to the rack
Computer
Ar4chitecture
of
WSC
8
Copyright © 2012, Elsevier Inc. All rights reserved.
Storage
 Storage options:
 Use disks inside the servers, or
 Network attached storage through Infiniband
 WSCs generally rely on local disks
 Google File System (GFS) uses local disks and
maintains at least three relicas
Computer
Ar4chitecture
of
WSC
9
Copyright © 2012, Elsevier Inc. All rights reserved.
Array Switch
 Switch that connects an array of racks
 Array switch should have 10 X the bisection
bandwidth of rack switch
 Cost of n-port switch grows as n2
 Often utilize content addressible memory chips
and FPGAs
Computer
Ar4chitecture
of
WSC
10
Copyright © 2012, Elsevier Inc. All rights reserved.
WSC Memory Hierarchy
 Servers can access DRAM and disks on other
servers using a NUMA-style interface
Computer
Ar4chitecture
of
WSC
11
Copyright © 2012, Elsevier Inc. All rights reserved.
Infrastructure and Costs of WSC
 Location of WSC
 Proximity to Internet backbones, electricity cost,
property tax rates, low risk from earthquakes,
floods, and hurricanes
 Power distribution
Physcical
Infrastrcuture
and
Costs
of
WSC
12
Copyright © 2012, Elsevier Inc. All rights reserved.
Infrastructure and Costs of WSC
 Cooling
 Air conditioning used to cool server room
 64 F – 71 F

Keep temperature higher (closer to 71 F)
 Cooling towers can also be used

Minimum temperature is “wet bulb temperature”
Physcical
Infrastrcuture
and
Costs
of
WSC
13
Copyright © 2012, Elsevier Inc. All rights reserved.
Infrastructure and Costs of WSC
 Cooling system also uses water (evaporation and
spills)
 E.g. 70,000 to 200,000 gallons per day for an 8 MW facility
 Power cost breakdown:
 Chillers: 30-50% of the power used by the IT equipment
 Air conditioning: 10-20% of the IT power, mostly due to fans
 How man servers can a WSC support?
 Each server:

“Nameplate power rating” gives maximum power consumption

To get actual, measure power under actual workloads
 Oversubscribe cumulative server power by 40%, but
monitor power closely
Physcical
Infrastrcuture
and
Costs
of
WSC
14
Copyright © 2012, Elsevier Inc. All rights reserved.
Measuring Efficiency of a WSC
 Power Utilization Effectiveness (PEU)
 = Total facility power / IT equipment power
 Median PUE on 2006 study was 1.69
 Performance
 Latency is important metric because it is seen by
users
 Bing study: users will use search less as
response time increases
 Service Level Objectives (SLOs)/Service Level
Agreements (SLAs)

E.g. 99% of requests be below 100 ms
Physcical
Infrastrcuture
and
Costs
of
WSC
15
Copyright © 2012, Elsevier Inc. All rights reserved.
Cost of a WSC
 Capital expenditures (CAPEX)
 Cost to build a WSC
 Operational expenditures (OPEX)
 Cost to operate a WSC
Physcical
Infrastrcuture
and
Costs
of
WSC
16
Copyright © 2012, Elsevier Inc. All rights reserved.
Cloud Computing
 WSCs offer economies of scale that cannot
be achieved with a datacenter:
 5.7 times reduction in storage costs
 7.1 times reduction in administrative costs
 7.3 times reduction in networking costs
 This has given rise to cloud services such as
Amazon Web Services

“Utility Computing”

Based on using open source virtual machine and
operating system software
Cloud
Computing

More Related Content

PPTX
Introduction to Warehouse-Scale Computers
PPTX
Cloud infrastructure, Virtualization tec
PDF
Datacenter as computer
PDF
S00193ed1v01y200905cac006
PDF
An introduction to the Design of Warehouse-Scale Computers
PPTX
Physical infrastructure & cost of warehouse-scale computers - Faheem Abbas
PPTX
Warehouse scale computer
Introduction to Warehouse-Scale Computers
Cloud infrastructure, Virtualization tec
Datacenter as computer
S00193ed1v01y200905cac006
An introduction to the Design of Warehouse-Scale Computers
Physical infrastructure & cost of warehouse-scale computers - Faheem Abbas
Warehouse scale computer

Similar to Warehouse-Scale Computers to Exploit Request-Level and Data-Level Parallelism: .pptx (20)

PPTX
Fast and energy-efficient eNVM based memory organisation at L3-L1 layers for ...
PPTX
Google cluster architecture
PPT
Google Cloud Computing on Google Developer 2008 Day
PPT
Presentation-1.ppt
PPT
Cloud Computing,雲端運算-中研院網格計畫主持人林誠謙
PDF
The Datacenter As A Computer Designing Warehouse Scale Machines Luiz André Ba...
PPT
Cyberinfrastructure and Applications Overview: Howard University June22
PPTX
Introduction to Cloud Data Center and Network Issues
PPT
CENTRE FOR DATA CENTER WITH DIAGRAMS.ppt
PPTX
Cloud Computing
PDF
Hpc, grid and cloud computing - the past, present, and future challenge
PPSX
Computer project
PDF
From the Archives: Future of Supercomputing at Altparty 2009
PDF
Running Accurate, Scalable, and Reproducible Simulations of Distributed Syste...
ODP
Cloud Computing ...changes everything
PDF
BigDataCSEKeyNote_2012
PPT
Cloud computing infrastructure
PDF
Architecting a 35 PB distributed parallel file system for science
PDF
From WWW to Cloud Oct 2009.Pptx
Fast and energy-efficient eNVM based memory organisation at L3-L1 layers for ...
Google cluster architecture
Google Cloud Computing on Google Developer 2008 Day
Presentation-1.ppt
Cloud Computing,雲端運算-中研院網格計畫主持人林誠謙
The Datacenter As A Computer Designing Warehouse Scale Machines Luiz André Ba...
Cyberinfrastructure and Applications Overview: Howard University June22
Introduction to Cloud Data Center and Network Issues
CENTRE FOR DATA CENTER WITH DIAGRAMS.ppt
Cloud Computing
Hpc, grid and cloud computing - the past, present, and future challenge
Computer project
From the Archives: Future of Supercomputing at Altparty 2009
Running Accurate, Scalable, and Reproducible Simulations of Distributed Syste...
Cloud Computing ...changes everything
BigDataCSEKeyNote_2012
Cloud computing infrastructure
Architecting a 35 PB distributed parallel file system for science
From WWW to Cloud Oct 2009.Pptx
Ad

Recently uploaded (20)

PPTX
Geodesy 1.pptx...............................................
PPT
CRASH COURSE IN ALTERNATIVE PLUMBING CLASS
PDF
BMEC211 - INTRODUCTION TO MECHATRONICS-1.pdf
PDF
Automation-in-Manufacturing-Chapter-Introduction.pdf
PDF
R24 SURVEYING LAB MANUAL for civil enggi
DOCX
ASol_English-Language-Literature-Set-1-27-02-2023-converted.docx
PDF
PRIZ Academy - 9 Windows Thinking Where to Invest Today to Win Tomorrow.pdf
PDF
July 2025 - Top 10 Read Articles in International Journal of Software Enginee...
PDF
keyrequirementskkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkk
PPTX
CH1 Production IntroductoryConcepts.pptx
PDF
Enhancing Cyber Defense Against Zero-Day Attacks using Ensemble Neural Networks
PPTX
CYBER-CRIMES AND SECURITY A guide to understanding
PPTX
M Tech Sem 1 Civil Engineering Environmental Sciences.pptx
PPTX
Internet of Things (IOT) - A guide to understanding
PPTX
Welding lecture in detail for understanding
PPTX
MET 305 2019 SCHEME MODULE 2 COMPLETE.pptx
PDF
SM_6th-Sem__Cse_Internet-of-Things.pdf IOT
PDF
composite construction of structures.pdf
PPT
Project quality management in manufacturing
PPTX
Construction Project Organization Group 2.pptx
Geodesy 1.pptx...............................................
CRASH COURSE IN ALTERNATIVE PLUMBING CLASS
BMEC211 - INTRODUCTION TO MECHATRONICS-1.pdf
Automation-in-Manufacturing-Chapter-Introduction.pdf
R24 SURVEYING LAB MANUAL for civil enggi
ASol_English-Language-Literature-Set-1-27-02-2023-converted.docx
PRIZ Academy - 9 Windows Thinking Where to Invest Today to Win Tomorrow.pdf
July 2025 - Top 10 Read Articles in International Journal of Software Enginee...
keyrequirementskkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkk
CH1 Production IntroductoryConcepts.pptx
Enhancing Cyber Defense Against Zero-Day Attacks using Ensemble Neural Networks
CYBER-CRIMES AND SECURITY A guide to understanding
M Tech Sem 1 Civil Engineering Environmental Sciences.pptx
Internet of Things (IOT) - A guide to understanding
Welding lecture in detail for understanding
MET 305 2019 SCHEME MODULE 2 COMPLETE.pptx
SM_6th-Sem__Cse_Internet-of-Things.pdf IOT
composite construction of structures.pdf
Project quality management in manufacturing
Construction Project Organization Group 2.pptx
Ad

Warehouse-Scale Computers to Exploit Request-Level and Data-Level Parallelism: .pptx

  • 1. Copyright © 2012, Elsevier Inc. All rights reserved. 1 Chapter 6 Warehouse-Scale Computers to Exploit Request-Level and Data-Level Parallelism: Computer Architecture A Quantitative Approach, Fifth Edition
  • 2. 2 Copyright © 2012, Elsevier Inc. All rights reserved. Introduction  Warehouse-scale computer (WSC)  Provides Internet services  Search, social networking, online maps, video sharing, online shopping, email, cloud computing, etc.  Differences with HPC “clusters”:  Clusters have higher performance processors and network  Clusters emphasize thread-level parallelism, WSCs emphasize request-level parallelism  Differences with datacenters:  Datacenters consolidate different machines and software into one location  Datacenters emphasize virtual machines and hardware heterogeneity in order to serve varied customers Introduction
  • 3. 3 Copyright © 2012, Elsevier Inc. All rights reserved. Introduction  Important design factors for WSC:  Cost-performance  Small savings add up  Energy efficiency  Affects power distribution and cooling  Work per joule  Dependability via redundancy  Network I/O  Interactive and batch processing workloads  Ample computational parallelism is not important  Most jobs are totally independent  “Request-level parallelism”  Operational costs count  Power consumption is a primary, not secondary, constraint when designing system  Scale and its opportunities and problems  Can afford to build customized systems since WSC require volume purchase Introduction
  • 4. 4 Copyright © 2012, Elsevier Inc. All rights reserved. Prgrm’g Models and Workloads  Batch processing framework: MapReduce  Map: applies a programmer-supplied function to each logical input record  Runs on thousands of computers  Provides new set of key-value pairs as intermediate values  Reduce: collapses values using another programmer-supplied function Programming Models and Workloads for WSCs
  • 5. 5 Copyright © 2012, Elsevier Inc. All rights reserved. Prgrm’g Models and Workloads  Example:  map (String key, String value):  // key: document name  // value: document contents  for each word w in value  EmitIntermediate(w,”1”); // Produce list of all words  reduce (String key, Iterator values):  // key: a word  // value: a list of counts  int result = 0;  for each v in values:  result += ParseInt(v); // get integer from key-value pair  Emit(AsString(result)); Programming Models and Workloads for WSCs
  • 6. 6 Copyright © 2012, Elsevier Inc. All rights reserved. Prgrm’g Models and Workloads  MapReduce runtime environment schedules map and reduce task to WSC nodes  Availability:  Use replicas of data across different servers  Use relaxed consistency:  No need for all replicas to always agree  Workload demands  Often vary considerably Programming Models and Workloads for WSCs
  • 7. 7 Copyright © 2012, Elsevier Inc. All rights reserved. Computer Architecture of WSC  WSC often use a hierarchy of networks for interconnection  Each 19” rack holds 48 1U servers connected to a rack switch  Rack switches are uplinked to switch higher in hierarchy  Uplink has 48 / n times lower bandwidth, where n = # of uplink ports  “Oversubscription”  Goal is to maximize locality of communication relative to the rack Computer Ar4chitecture of WSC
  • 8. 8 Copyright © 2012, Elsevier Inc. All rights reserved. Storage  Storage options:  Use disks inside the servers, or  Network attached storage through Infiniband  WSCs generally rely on local disks  Google File System (GFS) uses local disks and maintains at least three relicas Computer Ar4chitecture of WSC
  • 9. 9 Copyright © 2012, Elsevier Inc. All rights reserved. Array Switch  Switch that connects an array of racks  Array switch should have 10 X the bisection bandwidth of rack switch  Cost of n-port switch grows as n2  Often utilize content addressible memory chips and FPGAs Computer Ar4chitecture of WSC
  • 10. 10 Copyright © 2012, Elsevier Inc. All rights reserved. WSC Memory Hierarchy  Servers can access DRAM and disks on other servers using a NUMA-style interface Computer Ar4chitecture of WSC
  • 11. 11 Copyright © 2012, Elsevier Inc. All rights reserved. Infrastructure and Costs of WSC  Location of WSC  Proximity to Internet backbones, electricity cost, property tax rates, low risk from earthquakes, floods, and hurricanes  Power distribution Physcical Infrastrcuture and Costs of WSC
  • 12. 12 Copyright © 2012, Elsevier Inc. All rights reserved. Infrastructure and Costs of WSC  Cooling  Air conditioning used to cool server room  64 F – 71 F  Keep temperature higher (closer to 71 F)  Cooling towers can also be used  Minimum temperature is “wet bulb temperature” Physcical Infrastrcuture and Costs of WSC
  • 13. 13 Copyright © 2012, Elsevier Inc. All rights reserved. Infrastructure and Costs of WSC  Cooling system also uses water (evaporation and spills)  E.g. 70,000 to 200,000 gallons per day for an 8 MW facility  Power cost breakdown:  Chillers: 30-50% of the power used by the IT equipment  Air conditioning: 10-20% of the IT power, mostly due to fans  How man servers can a WSC support?  Each server:  “Nameplate power rating” gives maximum power consumption  To get actual, measure power under actual workloads  Oversubscribe cumulative server power by 40%, but monitor power closely Physcical Infrastrcuture and Costs of WSC
  • 14. 14 Copyright © 2012, Elsevier Inc. All rights reserved. Measuring Efficiency of a WSC  Power Utilization Effectiveness (PEU)  = Total facility power / IT equipment power  Median PUE on 2006 study was 1.69  Performance  Latency is important metric because it is seen by users  Bing study: users will use search less as response time increases  Service Level Objectives (SLOs)/Service Level Agreements (SLAs)  E.g. 99% of requests be below 100 ms Physcical Infrastrcuture and Costs of WSC
  • 15. 15 Copyright © 2012, Elsevier Inc. All rights reserved. Cost of a WSC  Capital expenditures (CAPEX)  Cost to build a WSC  Operational expenditures (OPEX)  Cost to operate a WSC Physcical Infrastrcuture and Costs of WSC
  • 16. 16 Copyright © 2012, Elsevier Inc. All rights reserved. Cloud Computing  WSCs offer economies of scale that cannot be achieved with a datacenter:  5.7 times reduction in storage costs  7.1 times reduction in administrative costs  7.3 times reduction in networking costs  This has given rise to cloud services such as Amazon Web Services  “Utility Computing”  Based on using open source virtual machine and operating system software Cloud Computing