SlideShare a Scribd company logo
A Scalable, Commodity Data Center 
Network Architecture
Overview 
• Structure and Properties of a Data Center 
• Desired properties in a DC Architecture 
• Fat tree based solution 
• Monsoon: layer 2 flat routing
Common data center topology 
Internet 
Data Center 
Core Layer-3 router 
Aggregation Layer-2/3 switch 
Access Layer-2 switch 
Servers
Problem With common DC topology 
• Single point of failure 
• Over subscript of links higher up in the topology 
– Trade off between cost and provisioning
Properties of solutions 
• Backwards compatible with existing 
infrastructure 
– No changes in application 
– Support of layer 2 (Ethernet) 
• Cost effective 
– Low power consumption & heat emission 
– Cheap infrastructure 
• Allows host communication at line speed
Cost of maintaining switches
Need for Layer 2 In DC 
• Certain monitoring apps require server with 
same role to be on the same vlan 
• Using same ip on dual homed servers 
• Allowing growth of server farms.
Review of Layer 2 & Layer 3 
• Layer 2 
– One spanning tree for entire network 
• Prevents looping 
• Ignores alternate paths 
• Layer 3 
– Shortest path routing between source and destination 
– Best-effort delivery
FAT Tree based Solution 
• Connect end-host together using a fat tree 
topology 
– Infrastructure consist of cheap devices 
• Each port supports same speed as endhost 
– All devices can transmit at line speed if packets are 
distributed along existing paths 
– A k-port fat tree can support k3/4 hosts
Fat-Tree Topology
Problems with a vanilla Fat-tree 
• Layer 3 will only use one of the existing equal 
cost paths 
• Packet re-ordering occurs if layer 3 blindly takes 
advantage of path diversity
FAT-tree Modified 
• Enforce special addressing scheme in DC 
– Allows host attached to same switch to route only 
through switch 
– Allows inter-pod traffic to stay within pod 
– unused.PodNumber.switchnumber.Endhost 
• Use two level look-ups to distribute traffic and 
maintain packet ordering.
2 Level look-ups 
• First level is prefix lookup 
– Used to route down the topology to endhost 
• Second level is a suffix lookup 
– Used to route up towards core 
– Diffuses and spreads out traffic 
– Maintains packet ordering by using the same ports for 
the same endhost
Diffusion Optimizations 
• Flow classification 
– Eliminates local congestion 
– Assign to traffic to ports on a per-flow basis instead of 
a per-host basis 
• Flow scheduling 
– Eliminates global congestion 
– Prevent long lived flows from sharing the same links 
– Assign long lived flows to different links
Results: Network Utilization
Results: Heat & Power Consumption
Draw Backs 
• No inherent support for VLan traffic 
• Data center is fixed in size 
• Ignored connectivity to the internet 
• Waste of address space 
– Requires NAT at border
Monsoon approach 
• Layer 2 based using future commodity switches 
• Hierarchy has 2: 
– access switches (top of rack) 
– load balancing switches 
• Eliminate spanning tree 
– Flat routing 
– Allows network to take advantage of path diversity 
• Prevent MAC address learning 
– 4D architecture to distribute data plane information 
– TOR: Only need to learn address for the intermediate switches 
– Core: learn for TOR switches 
• Support efficient grouping of hosts (VLAN replacement)
Moonson
Monsoon Components 
• Top-of-Rack switch: 
– Aggregate traffic from 20 end host in a rack 
– Performs ip to mac translation 
• Intermediate Switch 
– Disperses traffic 
– Balances traffic among switches 
– Used for valiant load balancing 
• Decision Element 
– Places routes in switches 
– Maintain a directory services of IP to MAC 
• Endhost 
– Performs ip to mac lookup
How routing works 
• End-host checks flow cache for MAC of flow 
– If not found ask monsoon agent to resolve 
– Agent returns list of MACs for server and MACs for 
intermediate routers 
• Send traffic to Top of Router 
– Traffic is triple encapsulated 
• Traffic is sent to intermediate destination 
• Traffic is sent to Top of rack switch of destination
Monsoon Agent Lookup
Forwarding
Other Work in the Data Center Space 
• Network Security 
– Policy aware switching 
• Data Center Cabling 
– 60GHz Data-Center Networking: Wireless

More Related Content

PPTX
Viloria osi layer4-7
PPTX
Introduction to LTE
PPTX
A Tutorial on IEEE 802.11ax High.pptx
PDF
TPDDL Smart Grid Journey
PPTX
Contiki Operating system tutorial
PDF
Modul Instalasi FTTH
PPTX
Difference between cellular and adhoc networks
Viloria osi layer4-7
Introduction to LTE
A Tutorial on IEEE 802.11ax High.pptx
TPDDL Smart Grid Journey
Contiki Operating system tutorial
Modul Instalasi FTTH
Difference between cellular and adhoc networks

What's hot (20)

PDF
4 Structure Cabling System Design
PPT
UMTS system architecture, protocols & processes
PDF
SysTick, Timer & Watchdog
PPTX
Advanced multiplexing technology
PDF
Taller cableado estruturado final
PDF
Routing and switching essentials companion guide
PPTX
Internet of things(iot)
PDF
Gsm signaling
PPT
PPT
Local multipoint distribution service(lmds)
PPTX
PPTX
flexray technology in modern cars
PDF
Cell Planning
PPTX
Ngn presentation
PPTX
WIDE AREA NETWORK
PPTX
CCCNA R&S-02-The TCP-IP and OSI Networking Models
PPTX
Iot architecture
PPT
Advance ethernet
PDF
25 February 2014 - novel electro-optic infrared technology themed competitio...
4 Structure Cabling System Design
UMTS system architecture, protocols & processes
SysTick, Timer & Watchdog
Advanced multiplexing technology
Taller cableado estruturado final
Routing and switching essentials companion guide
Internet of things(iot)
Gsm signaling
Local multipoint distribution service(lmds)
flexray technology in modern cars
Cell Planning
Ngn presentation
WIDE AREA NETWORK
CCCNA R&S-02-The TCP-IP and OSI Networking Models
Iot architecture
Advance ethernet
25 February 2014 - novel electro-optic infrared technology themed competitio...
Ad

Viewers also liked (20)

PPTX
FATTREE: A scalable Commodity Data Center Network Architecture
PDF
Data Center Network Topologies
PPTX
Introduction to Data Center Network Architecture
PDF
Reference Architecture-Validated & Tested Approach to Define Network Design
PDF
Modern Data Center Network Architecture - The house that Clos built
PPTX
Data center network architectures v1.3
ZIP
DataCenter:: Infrastructure Presentation
PDF
Gartner TOP 10 Strategic Technology Trends 2017
PDF
データセンター向け高機能スイッチ Cisco Nexus スイッチ ガイド
PPTX
Continuous Deployment
PPTX
Intro to Cloud Architecture
PPTX
Intro to AWS
PPTX
Intro to Stylecop
PDF
Dynamic Data Center for Hosters, by Stefan Simon
PPTX
.NET Database Toolkit
ODP
Qdr infini band products technical presentation
PPTX
Second Project PPT
PPTX
3D IT Architecture - Data Center
PDF
Presentation data center and cloud architecture
PPTX
HTRC Architecture Overview
FATTREE: A scalable Commodity Data Center Network Architecture
Data Center Network Topologies
Introduction to Data Center Network Architecture
Reference Architecture-Validated & Tested Approach to Define Network Design
Modern Data Center Network Architecture - The house that Clos built
Data center network architectures v1.3
DataCenter:: Infrastructure Presentation
Gartner TOP 10 Strategic Technology Trends 2017
データセンター向け高機能スイッチ Cisco Nexus スイッチ ガイド
Continuous Deployment
Intro to Cloud Architecture
Intro to AWS
Intro to Stylecop
Dynamic Data Center for Hosters, by Stefan Simon
.NET Database Toolkit
Qdr infini band products technical presentation
Second Project PPT
3D IT Architecture - Data Center
Presentation data center and cloud architecture
HTRC Architecture Overview
Ad

Similar to A Scalable, Commodity Data Center Network Architecture (20)

PPT
Theo's slides
PPT
Theo's slides
PPTX
Data Center Networks
PPTX
Data Center Network Multipathing
PPTX
Cloud interconnection networks basic .pptx
PDF
Graphs are at the Heart of the Cloud
PPTX
DC Moving I migracion a otro datacenterd
PDF
Dcnintroduction 141010054657-conversion-gate01
PDF
Juniper Networks: Q Fabric Architecture
DOCX
Enterprise Data Center Networking (with citations)
PDF
Topic 15: Datacenter Design and Networking
PDF
Brocade-The Ethernet Fabrics-wp
PPTX
LinkedIn's Approach to Programmable Data Center
PDF
5G-USA-Telemetry
PDF
Juniper: Data Center Evolution
PPTX
Lecture notes - Data Centers________.pptx
PDF
Kubernetes networking in AWS
PDF
multilayer-campus-architectures-and-design-principles
PDF
Cloud & Data Center Networking
PDF
Datacenterarchitecture
Theo's slides
Theo's slides
Data Center Networks
Data Center Network Multipathing
Cloud interconnection networks basic .pptx
Graphs are at the Heart of the Cloud
DC Moving I migracion a otro datacenterd
Dcnintroduction 141010054657-conversion-gate01
Juniper Networks: Q Fabric Architecture
Enterprise Data Center Networking (with citations)
Topic 15: Datacenter Design and Networking
Brocade-The Ethernet Fabrics-wp
LinkedIn's Approach to Programmable Data Center
5G-USA-Telemetry
Juniper: Data Center Evolution
Lecture notes - Data Centers________.pptx
Kubernetes networking in AWS
multilayer-campus-architectures-and-design-principles
Cloud & Data Center Networking
Datacenterarchitecture

Recently uploaded (20)

PDF
Sims 4 Historia para lo sims 4 para jugar
PDF
Automated vs Manual WooCommerce to Shopify Migration_ Pros & Cons.pdf
PPTX
international classification of diseases ICD-10 review PPT.pptx
PDF
Best Practices for Testing and Debugging Shopify Third-Party API Integrations...
DOCX
Unit-3 cyber security network security of internet system
PDF
Introduction to the IoT system, how the IoT system works
PDF
💰 𝐔𝐊𝐓𝐈 𝐊𝐄𝐌𝐄𝐍𝐀𝐍𝐆𝐀𝐍 𝐊𝐈𝐏𝐄𝐑𝟒𝐃 𝐇𝐀𝐑𝐈 𝐈𝐍𝐈 𝟐𝟎𝟐𝟓 💰
PDF
The Internet -By the Numbers, Sri Lanka Edition
PPTX
Internet___Basics___Styled_ presentation
PPT
tcp ip networks nd ip layering assotred slides
PDF
Cloud-Scale Log Monitoring _ Datadog.pdf
PDF
SASE Traffic Flow - ZTNA Connector-1.pdf
PPTX
Introuction about WHO-FIC in ICD-10.pptx
PPTX
E -tech empowerment technologies PowerPoint
PPTX
522797556-Unit-2-Temperature-measurement-1-1.pptx
PPTX
innovation process that make everything different.pptx
PPTX
Funds Management Learning Material for Beg
PPTX
INTERNET------BASICS-------UPDATED PPT PRESENTATION
PDF
Testing WebRTC applications at scale.pdf
PPTX
June-4-Sermon-Powerpoint.pptx USE THIS FOR YOUR MOTIVATION
Sims 4 Historia para lo sims 4 para jugar
Automated vs Manual WooCommerce to Shopify Migration_ Pros & Cons.pdf
international classification of diseases ICD-10 review PPT.pptx
Best Practices for Testing and Debugging Shopify Third-Party API Integrations...
Unit-3 cyber security network security of internet system
Introduction to the IoT system, how the IoT system works
💰 𝐔𝐊𝐓𝐈 𝐊𝐄𝐌𝐄𝐍𝐀𝐍𝐆𝐀𝐍 𝐊𝐈𝐏𝐄𝐑𝟒𝐃 𝐇𝐀𝐑𝐈 𝐈𝐍𝐈 𝟐𝟎𝟐𝟓 💰
The Internet -By the Numbers, Sri Lanka Edition
Internet___Basics___Styled_ presentation
tcp ip networks nd ip layering assotred slides
Cloud-Scale Log Monitoring _ Datadog.pdf
SASE Traffic Flow - ZTNA Connector-1.pdf
Introuction about WHO-FIC in ICD-10.pptx
E -tech empowerment technologies PowerPoint
522797556-Unit-2-Temperature-measurement-1-1.pptx
innovation process that make everything different.pptx
Funds Management Learning Material for Beg
INTERNET------BASICS-------UPDATED PPT PRESENTATION
Testing WebRTC applications at scale.pdf
June-4-Sermon-Powerpoint.pptx USE THIS FOR YOUR MOTIVATION

A Scalable, Commodity Data Center Network Architecture

  • 1. A Scalable, Commodity Data Center Network Architecture
  • 2. Overview • Structure and Properties of a Data Center • Desired properties in a DC Architecture • Fat tree based solution • Monsoon: layer 2 flat routing
  • 3. Common data center topology Internet Data Center Core Layer-3 router Aggregation Layer-2/3 switch Access Layer-2 switch Servers
  • 4. Problem With common DC topology • Single point of failure • Over subscript of links higher up in the topology – Trade off between cost and provisioning
  • 5. Properties of solutions • Backwards compatible with existing infrastructure – No changes in application – Support of layer 2 (Ethernet) • Cost effective – Low power consumption & heat emission – Cheap infrastructure • Allows host communication at line speed
  • 7. Need for Layer 2 In DC • Certain monitoring apps require server with same role to be on the same vlan • Using same ip on dual homed servers • Allowing growth of server farms.
  • 8. Review of Layer 2 & Layer 3 • Layer 2 – One spanning tree for entire network • Prevents looping • Ignores alternate paths • Layer 3 – Shortest path routing between source and destination – Best-effort delivery
  • 9. FAT Tree based Solution • Connect end-host together using a fat tree topology – Infrastructure consist of cheap devices • Each port supports same speed as endhost – All devices can transmit at line speed if packets are distributed along existing paths – A k-port fat tree can support k3/4 hosts
  • 11. Problems with a vanilla Fat-tree • Layer 3 will only use one of the existing equal cost paths • Packet re-ordering occurs if layer 3 blindly takes advantage of path diversity
  • 12. FAT-tree Modified • Enforce special addressing scheme in DC – Allows host attached to same switch to route only through switch – Allows inter-pod traffic to stay within pod – unused.PodNumber.switchnumber.Endhost • Use two level look-ups to distribute traffic and maintain packet ordering.
  • 13. 2 Level look-ups • First level is prefix lookup – Used to route down the topology to endhost • Second level is a suffix lookup – Used to route up towards core – Diffuses and spreads out traffic – Maintains packet ordering by using the same ports for the same endhost
  • 14. Diffusion Optimizations • Flow classification – Eliminates local congestion – Assign to traffic to ports on a per-flow basis instead of a per-host basis • Flow scheduling – Eliminates global congestion – Prevent long lived flows from sharing the same links – Assign long lived flows to different links
  • 16. Results: Heat & Power Consumption
  • 17. Draw Backs • No inherent support for VLan traffic • Data center is fixed in size • Ignored connectivity to the internet • Waste of address space – Requires NAT at border
  • 18. Monsoon approach • Layer 2 based using future commodity switches • Hierarchy has 2: – access switches (top of rack) – load balancing switches • Eliminate spanning tree – Flat routing – Allows network to take advantage of path diversity • Prevent MAC address learning – 4D architecture to distribute data plane information – TOR: Only need to learn address for the intermediate switches – Core: learn for TOR switches • Support efficient grouping of hosts (VLAN replacement)
  • 20. Monsoon Components • Top-of-Rack switch: – Aggregate traffic from 20 end host in a rack – Performs ip to mac translation • Intermediate Switch – Disperses traffic – Balances traffic among switches – Used for valiant load balancing • Decision Element – Places routes in switches – Maintain a directory services of IP to MAC • Endhost – Performs ip to mac lookup
  • 21. How routing works • End-host checks flow cache for MAC of flow – If not found ask monsoon agent to resolve – Agent returns list of MACs for server and MACs for intermediate routers • Send traffic to Top of Router – Traffic is triple encapsulated • Traffic is sent to intermediate destination • Traffic is sent to Top of rack switch of destination
  • 24. Other Work in the Data Center Space • Network Security – Policy aware switching • Data Center Cabling – 60GHz Data-Center Networking: Wireless

Editor's Notes

  • #4: Before describing the problems with current middlebox deployment approaches, let me first describe the commonly used 3-tier data center network topology. At the top is the core-tier, whose layer-3 routers connect the data center to the Internet or to the rest of the campus network. At the bottom is the access tier, containing the layer-2 switches into which servers are plugged in. In between the access and core tiers are the layer 2/3 switches of the aggregation tier. Middleboxes are commonly deployed at the aggregation tier. Multiple redundant links connect together the various switches and servers. To prevent forwarding loops, we use mechanisms like spanning tree construction to block out some of the links. For example, the topology as shown here.
  • #7: Netgear ~ 3K Procurve – 4.5K
  • #9: In this talk, I shall next explain the problems with current middlebox deployment mechanisms. I shall then describe how our solution, the policy-aware switching layer simplifies middlebox deployment and achieves the properties I mentioned earlier. I shall briefly discuss related work, and our prototype implementation and evaluation of the policy-aware switching layer.