SlideShare a Scribd company logo
ITRI Cloud Operating System
      and OpenStack
                           Tzi-cker Chiueh 闕志克
                  Cloud Computing Research Center
                  for Mobile Applications (CCMA)
                  雲端運算行動應用研究中心




                                                1
Cloud Data Center Solution

• Renting rather than buying IT infrastructure  Build-up
  of cloud-scale data centers  Need for inexpensive
  integrated cloud data center solutions
• The user pain point: integration
    Is it possible to build a cloud data center like “take a HW
       box,
    install OS on it, and have an AWS-like IaaS ready to go”?
• A total IaaS solution for both public and private clouds

2
Container Computer Architecture
 Physical Server
 VM0   VM1         VMn



                                                                   Layer-3
                                                                   Border
                                                                   Routers



                         Layer-2-Only Data
                         Center Network
                                                        IEL
                                                   Load Balancing
                                                   Traffic Shaping
                                                 Intrusion Detection
                                                      NAT/VPN

Compute Server

                                                   Storage
                                                   Server
                                             3
What is Cloud OS?
                                         ITRI Cloud OS
                                         ITRI Cloud OS

    Virtual Data Center Management
     Virtual Data Center Management                          Physical Data Center Management
                                                              Physical Data Center Management


                                        Photo      Video         Web
                                        Sharing    Streaming     Conference
                                                                                        CHT
PCHome                                  VDC        VDC           VDC
                 Provision and Deploy



                                                  Physical
                                                  Cluster
                                                                                      •Cloud Service
•Cloud Application                                                                     Infrastructure
                                                                                       Administrator
 Developer
                                                                                      •Carrier
•Cloud Service
 Provider
                                                                  Monitor, Diagnose and Configure
4           Monitor and Configure                                 Physical Resources
            Virtual Resources
Cloud OS Service Model
    • Virtual data center consists of one or multiple virtual clusters,
      each of which comprises one or multiple VMs
    • Users provide a Virtual Cluster specification
       –   No. of VM instances each with CPU performance and memory size requirement
       –   Per-VM storage space requirement
       –   External network bandwidth requirement
       –   Security policy
       –   Backup policy
       –   Load balancing policy
       –   Network configuration, e.g. public IP address and private IP address range
       –   OS image and application image


5
VDCM – Assets (VDC, VC, VM)




2012/8/11       OpenStack APAC Conference 2012   6
PDCM Event Monitor




2012/8/11        OpenStack APAC Conference 2012   7
PDCM Network Topology




2012/8/11         OpenStack APAC Conference 2012   8
Key Cloud OS 1.0 Features – 1
    • Physical resource management (PRM): BIOS
       – Centralized installation of all systems software
       – Start up, shut down, and recover a data center computer
    • Data center storage management: file management
       – Main storage (DMS) : Forming a highly available global storage pool from: a set
         of commodity JBOD storage servers
       – Secondary storage (DSS): Offering streamlined disk-based snapshot/backup with
         configurable policy, and scalable de-duplication
    • Virtualization management: process management
       – Resource provisioning management (RPM): allocate physical data center
         resources for a given virtual data center and auto-scaling
       – Dynamic virtual resource management (DVMM): use VM migration to support
         consolidation, load balancing and high availability

9
Key Cloud OS 1.0 Features – 2
 • Physical data center management (PDCM): system administration
     –   Comprehensive SNMP-based monitoring
     –   Integrated virtual/physical resource mapping view
     –   Unified event logging
     –   Integrated trouble ticking support
 • Virtual data center management (VDCM): system administration
     – VDC/VC/VM specification
     – Real-time resource usage and performance monitoring
 • Security: security
     – Inter-VDC isolation
     – Centralized L3 and distributed L7 and web application firewalling
 • Internet edge logic
     – Supporting inter-VM load balancing within a VC
     – DDoS attack mediation
     – Distributed traffic shaping
10
Building Cloud Data Center
                              Virtualization
  Primary/Secondary       Management (VMWare)             Physical Data Center
  Storage Management                                      Management (Tivoli)
  (EMC)


Physical
Resource                                                       Virtual Data Center
Management                                                     Management (CA)
(Dell)


                                                          Server Load Balancing/
                                                          Traffic Shaping (F5)
     System Integration
           (IBM)            Security (Checkpoint)   (XXX): represents leader
                                                    In the corresponding space
11
ITRI Cloud OS’s Way
                            Virtualization
  Primary/Secondary       Management (ITRI)    Physical Data Center
  Storage Management                           Management (ITRI)
  (ITRI)


Physical
Resource                                           Virtual Data Center
Management                                         Management (ITRI)
(ITRI)


                                                Internet Edge
                                                Logic (ITRI)
     System Integration
           (ITRI)            Security (ITRI)
12
Strong Data Protection
• N-way data replication vs. RAID
     – End to end data availability: disk, server, and network failures
• Periodic snapshots for local data backup with de-duplication
• Wide-area data backup
     – Snapshot frequency: a couple of hours to days
• Wide-area data replication (Cloud OS 2.0)
     – Snapshot frequency: a couple of seconds to minutes




13
High Availability
 •  High availability support for Cloud OS subsystems
   – Active-passive: Linux HA + DRBD + edit logging/recovery
   – Active-active: MySQL and server load balancer
 • Disk state-preserving fail-over for applications running inside
    VDCs
   – Shared persistent state + VM restart + take-over
 • Memory state-preserving fail-over for applications running
    inside VDCs (Cloud OS 2.0)
   – Shared memory/persistent state + VM resume + take-over


14
Multi-Dimensional Load Balancing

                                   Multi-homing load balancing

                                   Intra-VDC inter-VM load balancing


                                                Network-wide link-level
                                                load balancing




                                                                 Inter-storage-server
                                                                 load balancing


           Inter-physical-server          Thermal capacity
           load balancing                 load balancing
15
Cloud Security
• Any security breaches that are possible for a physical data
  center are equally likely for a virtual data center
     – L4/L7 and Web Application Firewall
• New security concerns
     – Interference between tenants on the same physical machines
        • Inter-VDC isolation vs. VLAN isolation




16
OpenStack
• Open Stack core:
     – Nova: VM provisioning
     – Glance: VM image upload and delivery
     – Swift: Object data storage
• RPM vs. Nova
     –   Boot from remote cloned volume
     –   Dynamic load balancing
     –   Power consolidation
     –   Dedicated physical machine pool
17
     –   Auto-scaling
OpenStack-Compatible Cloud OS
                                                                        OpenStack API

Security          PRM/PDCM                        Security                PRM/PDCM
Directory                                        Directory
 Server            VDCM/RS                        Server                      VDCM/RS
  IEL                                                IEL
               APIs with VDC concept                                    APIs with VDC concept
                                                Nova
DMS/DSS               VMM                                                     VMM lite
                                               Volume
                                            DMS/DSS                           CPU API
                                        OpenStack compatible
                                                               Volume                  Network
 Walrus                NC                                       API     OpenStack Nova   API
                                            Glance/Swift

     create/start/stop/delete/migrate VMs                      create/start/stop/delete/migrate VMs

  Xen       Xen      Xen      Xen                      KVM              Xen     Qemu     HyperV
Cloud OS 2.0
 • OpenStack Compatible:
     – Nova’s compute, volume and network API
     – OpenStack web service API
     – Target date: 10/1/2012
 •   Data center federation: Support for multi-site data centers
 •   Network virtualization: Support for hybrid cloud
 •   Wide-area data replication
 •   Memory de-duplication




19
Cloud Data Center Network
     • Cloud data centers are Big and Shared
     • Scalable and available data center fabrics
        – Not all links are used
        – No load-sensitive routing
        – Fail-over latency is high (> 5 seconds)
     • Network virtualization: Each virtual data center (VDC) gets to
     define its own network
        –   All VMs in a VDC belong to one flat subnet
        –   Each VDC has its own private IP address space
        –   Each VDC has a set of public IP addresses
        –   Each VDC has a set of external VPN connections
        –   Per-VDC Internet traffic shaping policy, intra-VDC and inter-VDC firewalling
            policy, and server load balancing policy
20
Peregrine
     • A unified Layer-2-only network for LAN and SA
     • Centralized control plane and distributed data plane
     • Use only Commodity Ethernet switches
        – Army of commodity switches vs. few high-port-density switches
        – Requirements on switches: run fast and has programmable routing table
     • Centralized load-balancing routing using real-time traffic matrix
        – Support for incremental and QoS-aware routing
     • Fast fail-over using pre-computed primary/back routes
     • Native support for network virtualization
        – Private IP address space reuse
        – Multi-tenancy VPN, NAT and traffic shaping
        – Intra-VDC or inter-VDC firewall

21
Software Architecture




22
Load Balancing Routing

     • Collection of real-time traffic matrix
       – Traffic volume between each pair of VMs
       – Traffic volume between each pair of PMs
     • Load balancing routing algorithm
       – Loads on the physical links
       – Number of hops
       – Forwarding table entries
       – Prioritization
     • Computed routes are installed on switches
23
When a Network Link Fails




27
Private IP Address Space Reuse
     • Requirement: Every VDC has a VDC ID and its own full 24-bit
       private IP address space (10.x.x.x), even though multiple VDCs
       run on top of the same data center network
     • Two approaches:
        – Ethernet over TCP/UDP:
           • Every Ethernet packet is encapsulated inside an TCP/UDP packet or
             TCP/UDP connection as an Ethernet link
           • Needs to implement in software such Ethernet switch functions as source
             learning, flooding, VLAN, etc.
           • Can work with arbitrary IP networks
        – Multi-tenancy-aware IP-MAC mapping: our approach
           • Runs directly on L2 networks, no need for Ethernet switch emulation
           • Inter-virtual-data-center isolation

25
Peregrine Summary
 • Peregrine is a network system technology, not a network device
   technology, and consists of
     – A hypervisor module running on every compute node
     – A route server and an ARP server
     – A VDC-aware VPN
 • Runs directly on commodity Ethernet switches and NICs: fully
   leverages the benefit of I/O virtualization, which encourages
   direct NIC access from VM
 • Under development: Refactor Peregrine as a Quantum plug-in



26
Conclusion
 • Cloud computing is all about consolidation of IT infrastructures
   and usage-based resource allocation
     – Data center as a computer paradigm
 • Cloud-scale data center industry is emerging
     – Integration is a real user pain point
     – An integrated solution with lesser components is much more desirable
       than an un-integrated set of more capable components
 • ITRI’s integrated data center solution, Container Computer 1.0 +
   Cloud OS 1.0, is expected to provide 70% of the functionalities at
   1/3 cost of leading solutions from US
     – Virtual data center service abstraction


27
Thank You!

     Questions and Comments?
          tcc@itri.org.tw



28

More Related Content

PDF
2010 09-24-闕志克老師-cloud computing where do we go
PPT
Distributed Block-level Storage Management for OpenStack, by Danile lee
PDF
Presentation introduction to cloud computing and technical issues
PPTX
Deep Dive Into Windows Server 2012 Hyper-V
PDF
2 25008 domain_ten11.29.12_v2_opt
PPT
Overview of VMware & VMware Education from IBM
PDF
12th Japan CloudStack User Group Meetup MidoNet with scalable virtual router
PDF
Open solaris customer presentation
2010 09-24-闕志克老師-cloud computing where do we go
Distributed Block-level Storage Management for OpenStack, by Danile lee
Presentation introduction to cloud computing and technical issues
Deep Dive Into Windows Server 2012 Hyper-V
2 25008 domain_ten11.29.12_v2_opt
Overview of VMware & VMware Education from IBM
12th Japan CloudStack User Group Meetup MidoNet with scalable virtual router
Open solaris customer presentation

What's hot (20)

PPT
Ibm blade center_foundation_for_cloud_seller_presentation
PDF
virtualization tutorial at ACM bangalore Compute 2009
PDF
IBM BladeCenter Foundation for Cloud: Integration Guide
PPSX
Windows Server 2012 - Dynamische opslag met Storage Pools
PPTX
Openstack Diablo Survey
PPTX
Architecting a Private Cloud - Cloud Expo
PDF
Building your private cloud the ncs experience harrison lee
PDF
Virtualization Primer for Java Developers
PDF
Simplifying network management with Platespin
PPTX
CloudStack DC Meetup - Apache CloudStack Overview and 4.1/4.2 Preview
PDF
Shalini xs10
PDF
The unified data center for cloud david yen
PDF
The Network\'s IN the (virtualised) Server: Virtualized Io In Heterogeneous M...
PPTX
Private cloud infrastructure configure and deploy 24 hiapc fabrizio volpe
PPTX
Windows Azure Uzerinden Alinabilen Hizmetler
PDF
Software Developer Conference 2012 - Paper Presentation - Cloud File Systems
PDF
Novell Success Stories: Collaboration in Education
PPTX
Postgres Plus Cloud Database
PDF
Adaptive Computing Using PlateSpin Orchestrate
PDF
Finding Virtual Coins in the Couch
Ibm blade center_foundation_for_cloud_seller_presentation
virtualization tutorial at ACM bangalore Compute 2009
IBM BladeCenter Foundation for Cloud: Integration Guide
Windows Server 2012 - Dynamische opslag met Storage Pools
Openstack Diablo Survey
Architecting a Private Cloud - Cloud Expo
Building your private cloud the ncs experience harrison lee
Virtualization Primer for Java Developers
Simplifying network management with Platespin
CloudStack DC Meetup - Apache CloudStack Overview and 4.1/4.2 Preview
Shalini xs10
The unified data center for cloud david yen
The Network\'s IN the (virtualised) Server: Virtualized Io In Heterogeneous M...
Private cloud infrastructure configure and deploy 24 hiapc fabrizio volpe
Windows Azure Uzerinden Alinabilen Hizmetler
Software Developer Conference 2012 - Paper Presentation - Cloud File Systems
Novell Success Stories: Collaboration in Education
Postgres Plus Cloud Database
Adaptive Computing Using PlateSpin Orchestrate
Finding Virtual Coins in the Couch
Ad

Viewers also liked (20)

DOC
Field name
DOC
Formato planeacion
PDF
φυλλα εργασιας διακοπες Acrobat
PPTX
M6d cassandrapresentation
PPTX
Media the cut
PPTX
Stop motion evaluation
PPTX
Con8896 securely enabling mobile access for business transformation - final
ODS
Agenda robert guzman
PDF
PPT
Centro de desarrollo_infantil_kamelot_original
PPTX
Tarea de heramientas de colaboracion digital
DOCX
Ideas generation
PDF
affTA11 - LampiranC
PPTX
AOL_Baku_address
PPTX
Week 7Aef4 07
PPT
Forss: Seksuaalinen häirintä ja väkivalta netissä
PDF
Teatro de la sensacion danza moderna infantil-jornadas de puertas abiertas
PDF
affTA09 - LampiranA
PDF
Heike Hoeffler: Advisory Capacities for the Contemporary Agricultural Policy ...
Field name
Formato planeacion
φυλλα εργασιας διακοπες Acrobat
M6d cassandrapresentation
Media the cut
Stop motion evaluation
Con8896 securely enabling mobile access for business transformation - final
Agenda robert guzman
Centro de desarrollo_infantil_kamelot_original
Tarea de heramientas de colaboracion digital
Ideas generation
affTA11 - LampiranC
AOL_Baku_address
Week 7Aef4 07
Forss: Seksuaalinen häirintä ja väkivalta netissä
Teatro de la sensacion danza moderna infantil-jornadas de puertas abiertas
affTA09 - LampiranA
Heike Hoeffler: Advisory Capacities for the Contemporary Agricultural Policy ...
Ad

Similar to Osac2012 (20)

PPT
Danile lee -open stackblocklevelstorage
PDF
Cloud and Grids
PPTX
Citrix CloudStack - Build Your Own Scalable Infrastructure Cloud with CloudStack
PPTX
Softchoice Webinar Series: VMware vSphere 5.1 Changes
PDF
2012 CloudStack Design Camp in Taiwan--- CloudStack Overview-1
PPTX
Cloud computing by prabhunath sharma
PPT
CCitDG Presenation
PPTX
Cloud stack overview
PDF
Covmug v sphere 4.1 what's new
PDF
CloudStack Architecture Future
PPTX
Cloud computing NIC 2012
PPTX
NIC 2013 - Configure and Deploy Private Cloud
PPTX
Mhta.private.cloud.final.16.9
PDF
Layer 7 & Burton Group: New Cloud Security Model Requirements
PPTX
Nicira chef webinar-merged
PPTX
The role of hyper-v in nist model
PDF
Christian ferber cloud platform_cloudportal
PDF
MPLS 2010: Network Enabled Cloud and Service Models
PPTX
Infrastructure components configure and deploy 24 hiapc fabrizio volpe
Danile lee -open stackblocklevelstorage
Cloud and Grids
Citrix CloudStack - Build Your Own Scalable Infrastructure Cloud with CloudStack
Softchoice Webinar Series: VMware vSphere 5.1 Changes
2012 CloudStack Design Camp in Taiwan--- CloudStack Overview-1
Cloud computing by prabhunath sharma
CCitDG Presenation
Cloud stack overview
Covmug v sphere 4.1 what's new
CloudStack Architecture Future
Cloud computing NIC 2012
NIC 2013 - Configure and Deploy Private Cloud
Mhta.private.cloud.final.16.9
Layer 7 & Burton Group: New Cloud Security Model Requirements
Nicira chef webinar-merged
The role of hyper-v in nist model
Christian ferber cloud platform_cloudportal
MPLS 2010: Network Enabled Cloud and Service Models
Infrastructure components configure and deploy 24 hiapc fabrizio volpe

More from OpenCity Community (20)

PDF
开源讲义.pdf
PDF
物联网操作系统漫谈-GIAC大会.pdf
PDF
2017开源年会-企业开源那些事儿-更新.pdf
PDF
社会化研发
PDF
Containers & CaaS
PPT
OaaS:Open as a Strategy
PDF
Hello openstack 2014
PDF
Docker openstack-2014
PDF
Learn OpenStack from trystack.cn
PDF
OpenStack系列公开课2 -20130508
PDF
OpenStack ecosystem
PDF
How to master OpenStack in 2 hours
PDF
Learn OpenStack from trystack.cn ——Folsom in practice
PDF
Quantum Networks
PDF
云计算思考
PDF
Openstorage Openstack
PDF
Openstack的研究与实践
PDF
Open Stack Cheng Du Swift Alex Yang
PDF
Nova与虚拟机管理
PDF
Look Into Libvirt Osier Yang
开源讲义.pdf
物联网操作系统漫谈-GIAC大会.pdf
2017开源年会-企业开源那些事儿-更新.pdf
社会化研发
Containers & CaaS
OaaS:Open as a Strategy
Hello openstack 2014
Docker openstack-2014
Learn OpenStack from trystack.cn
OpenStack系列公开课2 -20130508
OpenStack ecosystem
How to master OpenStack in 2 hours
Learn OpenStack from trystack.cn ——Folsom in practice
Quantum Networks
云计算思考
Openstorage Openstack
Openstack的研究与实践
Open Stack Cheng Du Swift Alex Yang
Nova与虚拟机管理
Look Into Libvirt Osier Yang

Recently uploaded (20)

PPTX
MYSQL Presentation for SQL database connectivity
PDF
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
PPTX
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
PDF
Bridging biosciences and deep learning for revolutionary discoveries: a compr...
PDF
Unlocking AI with Model Context Protocol (MCP)
PDF
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
PDF
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
PDF
Advanced methodologies resolving dimensionality complications for autism neur...
PDF
NewMind AI Weekly Chronicles - August'25 Week I
PDF
Reach Out and Touch Someone: Haptics and Empathic Computing
PPTX
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
PDF
Empathic Computing: Creating Shared Understanding
PDF
cuic standard and advanced reporting.pdf
PDF
The Rise and Fall of 3GPP – Time for a Sabbatical?
PDF
Dropbox Q2 2025 Financial Results & Investor Presentation
PDF
Review of recent advances in non-invasive hemoglobin estimation
PDF
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
PDF
CIFDAQ's Market Insight: SEC Turns Pro Crypto
DOCX
The AUB Centre for AI in Media Proposal.docx
PDF
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
MYSQL Presentation for SQL database connectivity
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
Bridging biosciences and deep learning for revolutionary discoveries: a compr...
Unlocking AI with Model Context Protocol (MCP)
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
Advanced methodologies resolving dimensionality complications for autism neur...
NewMind AI Weekly Chronicles - August'25 Week I
Reach Out and Touch Someone: Haptics and Empathic Computing
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
Empathic Computing: Creating Shared Understanding
cuic standard and advanced reporting.pdf
The Rise and Fall of 3GPP – Time for a Sabbatical?
Dropbox Q2 2025 Financial Results & Investor Presentation
Review of recent advances in non-invasive hemoglobin estimation
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
CIFDAQ's Market Insight: SEC Turns Pro Crypto
The AUB Centre for AI in Media Proposal.docx
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows

Osac2012

  • 1. ITRI Cloud Operating System and OpenStack Tzi-cker Chiueh 闕志克 Cloud Computing Research Center for Mobile Applications (CCMA) 雲端運算行動應用研究中心 1
  • 2. Cloud Data Center Solution • Renting rather than buying IT infrastructure  Build-up of cloud-scale data centers  Need for inexpensive integrated cloud data center solutions • The user pain point: integration Is it possible to build a cloud data center like “take a HW box, install OS on it, and have an AWS-like IaaS ready to go”? • A total IaaS solution for both public and private clouds 2
  • 3. Container Computer Architecture Physical Server VM0 VM1 VMn Layer-3 Border Routers Layer-2-Only Data Center Network IEL Load Balancing Traffic Shaping Intrusion Detection NAT/VPN Compute Server Storage Server 3
  • 4. What is Cloud OS? ITRI Cloud OS ITRI Cloud OS Virtual Data Center Management Virtual Data Center Management Physical Data Center Management Physical Data Center Management Photo Video Web Sharing Streaming Conference CHT PCHome VDC VDC VDC Provision and Deploy Physical Cluster •Cloud Service •Cloud Application Infrastructure Administrator Developer •Carrier •Cloud Service Provider Monitor, Diagnose and Configure 4 Monitor and Configure Physical Resources Virtual Resources
  • 5. Cloud OS Service Model • Virtual data center consists of one or multiple virtual clusters, each of which comprises one or multiple VMs • Users provide a Virtual Cluster specification – No. of VM instances each with CPU performance and memory size requirement – Per-VM storage space requirement – External network bandwidth requirement – Security policy – Backup policy – Load balancing policy – Network configuration, e.g. public IP address and private IP address range – OS image and application image 5
  • 6. VDCM – Assets (VDC, VC, VM) 2012/8/11 OpenStack APAC Conference 2012 6
  • 7. PDCM Event Monitor 2012/8/11 OpenStack APAC Conference 2012 7
  • 8. PDCM Network Topology 2012/8/11 OpenStack APAC Conference 2012 8
  • 9. Key Cloud OS 1.0 Features – 1 • Physical resource management (PRM): BIOS – Centralized installation of all systems software – Start up, shut down, and recover a data center computer • Data center storage management: file management – Main storage (DMS) : Forming a highly available global storage pool from: a set of commodity JBOD storage servers – Secondary storage (DSS): Offering streamlined disk-based snapshot/backup with configurable policy, and scalable de-duplication • Virtualization management: process management – Resource provisioning management (RPM): allocate physical data center resources for a given virtual data center and auto-scaling – Dynamic virtual resource management (DVMM): use VM migration to support consolidation, load balancing and high availability 9
  • 10. Key Cloud OS 1.0 Features – 2 • Physical data center management (PDCM): system administration – Comprehensive SNMP-based monitoring – Integrated virtual/physical resource mapping view – Unified event logging – Integrated trouble ticking support • Virtual data center management (VDCM): system administration – VDC/VC/VM specification – Real-time resource usage and performance monitoring • Security: security – Inter-VDC isolation – Centralized L3 and distributed L7 and web application firewalling • Internet edge logic – Supporting inter-VM load balancing within a VC – DDoS attack mediation – Distributed traffic shaping 10
  • 11. Building Cloud Data Center Virtualization Primary/Secondary Management (VMWare) Physical Data Center Storage Management Management (Tivoli) (EMC) Physical Resource Virtual Data Center Management Management (CA) (Dell) Server Load Balancing/ Traffic Shaping (F5) System Integration (IBM) Security (Checkpoint) (XXX): represents leader In the corresponding space 11
  • 12. ITRI Cloud OS’s Way Virtualization Primary/Secondary Management (ITRI) Physical Data Center Storage Management Management (ITRI) (ITRI) Physical Resource Virtual Data Center Management Management (ITRI) (ITRI) Internet Edge Logic (ITRI) System Integration (ITRI) Security (ITRI) 12
  • 13. Strong Data Protection • N-way data replication vs. RAID – End to end data availability: disk, server, and network failures • Periodic snapshots for local data backup with de-duplication • Wide-area data backup – Snapshot frequency: a couple of hours to days • Wide-area data replication (Cloud OS 2.0) – Snapshot frequency: a couple of seconds to minutes 13
  • 14. High Availability • High availability support for Cloud OS subsystems – Active-passive: Linux HA + DRBD + edit logging/recovery – Active-active: MySQL and server load balancer • Disk state-preserving fail-over for applications running inside VDCs – Shared persistent state + VM restart + take-over • Memory state-preserving fail-over for applications running inside VDCs (Cloud OS 2.0) – Shared memory/persistent state + VM resume + take-over 14
  • 15. Multi-Dimensional Load Balancing Multi-homing load balancing Intra-VDC inter-VM load balancing Network-wide link-level load balancing Inter-storage-server load balancing Inter-physical-server Thermal capacity load balancing load balancing 15
  • 16. Cloud Security • Any security breaches that are possible for a physical data center are equally likely for a virtual data center – L4/L7 and Web Application Firewall • New security concerns – Interference between tenants on the same physical machines • Inter-VDC isolation vs. VLAN isolation 16
  • 17. OpenStack • Open Stack core: – Nova: VM provisioning – Glance: VM image upload and delivery – Swift: Object data storage • RPM vs. Nova – Boot from remote cloned volume – Dynamic load balancing – Power consolidation – Dedicated physical machine pool 17 – Auto-scaling
  • 18. OpenStack-Compatible Cloud OS OpenStack API Security PRM/PDCM Security PRM/PDCM Directory Directory Server VDCM/RS Server VDCM/RS IEL IEL APIs with VDC concept APIs with VDC concept Nova DMS/DSS VMM VMM lite Volume DMS/DSS CPU API OpenStack compatible Volume Network Walrus NC API OpenStack Nova API Glance/Swift create/start/stop/delete/migrate VMs create/start/stop/delete/migrate VMs Xen Xen Xen Xen KVM Xen Qemu HyperV
  • 19. Cloud OS 2.0 • OpenStack Compatible: – Nova’s compute, volume and network API – OpenStack web service API – Target date: 10/1/2012 • Data center federation: Support for multi-site data centers • Network virtualization: Support for hybrid cloud • Wide-area data replication • Memory de-duplication 19
  • 20. Cloud Data Center Network • Cloud data centers are Big and Shared • Scalable and available data center fabrics – Not all links are used – No load-sensitive routing – Fail-over latency is high (> 5 seconds) • Network virtualization: Each virtual data center (VDC) gets to define its own network – All VMs in a VDC belong to one flat subnet – Each VDC has its own private IP address space – Each VDC has a set of public IP addresses – Each VDC has a set of external VPN connections – Per-VDC Internet traffic shaping policy, intra-VDC and inter-VDC firewalling policy, and server load balancing policy 20
  • 21. Peregrine • A unified Layer-2-only network for LAN and SA • Centralized control plane and distributed data plane • Use only Commodity Ethernet switches – Army of commodity switches vs. few high-port-density switches – Requirements on switches: run fast and has programmable routing table • Centralized load-balancing routing using real-time traffic matrix – Support for incremental and QoS-aware routing • Fast fail-over using pre-computed primary/back routes • Native support for network virtualization – Private IP address space reuse – Multi-tenancy VPN, NAT and traffic shaping – Intra-VDC or inter-VDC firewall 21
  • 23. Load Balancing Routing • Collection of real-time traffic matrix – Traffic volume between each pair of VMs – Traffic volume between each pair of PMs • Load balancing routing algorithm – Loads on the physical links – Number of hops – Forwarding table entries – Prioritization • Computed routes are installed on switches 23
  • 24. When a Network Link Fails 27
  • 25. Private IP Address Space Reuse • Requirement: Every VDC has a VDC ID and its own full 24-bit private IP address space (10.x.x.x), even though multiple VDCs run on top of the same data center network • Two approaches: – Ethernet over TCP/UDP: • Every Ethernet packet is encapsulated inside an TCP/UDP packet or TCP/UDP connection as an Ethernet link • Needs to implement in software such Ethernet switch functions as source learning, flooding, VLAN, etc. • Can work with arbitrary IP networks – Multi-tenancy-aware IP-MAC mapping: our approach • Runs directly on L2 networks, no need for Ethernet switch emulation • Inter-virtual-data-center isolation 25
  • 26. Peregrine Summary • Peregrine is a network system technology, not a network device technology, and consists of – A hypervisor module running on every compute node – A route server and an ARP server – A VDC-aware VPN • Runs directly on commodity Ethernet switches and NICs: fully leverages the benefit of I/O virtualization, which encourages direct NIC access from VM • Under development: Refactor Peregrine as a Quantum plug-in 26
  • 27. Conclusion • Cloud computing is all about consolidation of IT infrastructures and usage-based resource allocation – Data center as a computer paradigm • Cloud-scale data center industry is emerging – Integration is a real user pain point – An integrated solution with lesser components is much more desirable than an un-integrated set of more capable components • ITRI’s integrated data center solution, Container Computer 1.0 + Cloud OS 1.0, is expected to provide 70% of the functionalities at 1/3 cost of leading solutions from US – Virtual data center service abstraction 27
  • 28. Thank You! Questions and Comments? tcc@itri.org.tw 28