SQL Server 2008 Fast Track
  Data Warehouse 2.0
   Speaker: Phil Hummel of WinWire Technologies
    Presentation developed by: Bruce Campbell
  Western Region Data Warehouse Specialist, Microsoft



          Silicon Valley SQL Server User Group
                    February 16, 2009




             Mark Ginnebaugh, User Group Leader,
                   mark@designmind.com
Agenda

• DW vs. OLTP
• Balanced Architecture Approach for DW
• Fast Track Defined
• Fast Track Reference Architectures
• Next Steps
Microsoft DW & BI Stack
                   DELIVERY


                              PerformancePoint
                              Services

               END USER TOOLS



BI & DW PLATFORM (RDBMS, ETL, OLAP, Reporting)
DW                        versus                 OLTP
Database                                        Database
• Designed for analytical operations:           • Designed for operational requirements:
  Strategic focus                                 Tactical focus
• Optimized for bulk load and large, complex,   • Optimized for transactions: “single row”
  unpredictable queries                           entry and retrieval
• Fewer concurrent users relative to OLTP       • Thousands of concurrent users
Storage
• Primary focus on Read operations              Storage
• Optimized for disk scan over seek             • Emphasizes transactional
  operations                                      performance
• Storage optimization focused on disk          • Optimized for disk seek over scan
  scan rate (MB/s)                                operations
                                                • Storage optimization focused on
                                                  I/O operations/s (IOPs)
Sequential I/O
            Sequential I/O                                               Random I/O

• Scans on large data stores are                       •    OLTP usually random-read centric.
  usually read with sequential read                         Discrete lookups benefit from index
  patterns and not random read                              optimization and random read
  patterns                                                  capability.
• Scalable, predictable performance                    •    Not as predictable & scalable for
                                                            data warehousing
• Requires 1/3 or fewer drives to
  match server I/O consumption                         •    Requires large number of drives to
  capability.                                               match server I/O consumption
                                                            capability.




All databases contain both scans and seeks, along with other types of reads and writes. DW workloads indicate
                            that the vast majority of reads, though not all, are sequential.
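
A minimal sketch of why the two columns optimize for different metrics: throughput in MB/s is simply IOPS multiplied by the size of each IO. The IOPS figure and the IO sizes below are hypothetical illustration values, not numbers from this deck.

```python
def throughput_mb_s(iops: float, io_size_kb: float) -> float:
    """Convert an IO rate and IO size into throughput in MB/s."""
    return iops * io_size_kb / 1024


# Hypothetical inputs: the same number of IOs per second, but small
# random reads (OLTP-style) versus large sequential reads (DW scans).
random_rate = throughput_mb_s(iops=200, io_size_kb=8)
sequential_rate = throughput_mb_s(iops=200, io_size_kb=512)

print(f"random 8 KB reads:       {random_rate} MB/s")
print(f"sequential 512 KB reads: {sequential_rate} MB/s")
```

At equal IOPS the large sequential reads move far more data, which is why a scan-centric design is sized in MB/s rather than IOPS.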
Some SQL Data Warehouses today
Big SAN
Big 64-core Server
Connected together




       What’s wrong with this picture?
Answer: system out of balance
• This server can consume 16 GB/Sec of IO, but the
  SAN can only deliver 2 GB/Sec
   – Even when the SAN is dedicated to the SQL Data
     Warehouse, which it often isn’t
   – Lots of disks for Random IOPS BUT
   – Limited controllers, and therefore limited IO bandwidth
• System is typically IO bound
• Queries are slow




Result: significant investment, not delivering performance
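
The imbalance can be put into numbers with a small sketch. The 16 GB/s and 2 GB/s figures come from the slide; the function name is made up for illustration.

```python
def io_utilization(server_consume_gbps: float, storage_deliver_gbps: float) -> float:
    """Fraction of the server's IO appetite that storage can satisfy (capped at 1.0)."""
    return min(1.0, storage_deliver_gbps / server_consume_gbps)


server_gbps = 16.0  # what the 64-core server can consume (from the slide)
san_gbps = 2.0      # what the SAN can deliver (from the slide)

utilization = io_utilization(server_gbps, san_gbps)
shortfall = server_gbps - san_gbps

print(f"Storage feeds {utilization:.1%} of what the CPUs could consume")
print(f"Unserved IO bandwidth: {shortfall} GB/s")
```

Under these figures the CPUs can be fed at only one eighth of the rate they could consume, so the server spends most of a scan waiting on IO.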
The Alternative: A Balanced System
•   Design a server + storage configuration that can deliver all the IO
    bandwidth that CPUs can consume when executing a SQL Relational
    DW workload
•   Avoid sharing storage devices among servers
•   Avoid overinvesting in disk drives
    – Focus on scan performance, not IOPS
•   Layout and manage data to maximize range scan performance and
    minimize fragmentation
Potential Performance Bottlenecks

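[Diagram: the IO path from the SQL Server CPU cores through Windows and the server cache, the FC HBAs (ports A/B), the FC switch, and the storage controller (ports A/B, cache) to the LUNs and their physical disks. Each stage has a rate that can become the bottleneck:]

CPU Feed Rate, SQL Server Read Ahead Rate, HBA Port Rate, Switch Port Rate, SP Port Rate, LUN Read Rate, Disk Feed Rate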
SQL Server Fast Track Data Warehouse
Solution to help customers and partners accelerate their data warehouse
deployments

• A method for designing a cost-effective, balanced
  system for Data Warehouse workloads
• Reference hardware configurations developed in
  conjunction with hardware partners using this
  method
• Best practices for data layout, loading and
  management


        Relational Database Only – Not SSAS, IS, RS
Fast Track Data Warehouse Components


                       Software:
                       • SQL Server 2008 Enterprise
                       • Windows Server 2008

                       Configuration guidelines:
                       • Physical table structures
                       • Indexes
                       • Compression
                       • SQL Server settings
                       • Windows Server settings
                       • Loading

                       Hardware:
                       • Tight specifications for servers,
                       storage and networking
                       • ‘Per core’ building block
Fast Track Scope
[Diagram: BI system landscape in three groups, with the Reference Architecture Scope marked by a dashed outline.]
• Supporting Systems: Integration Services ETL; Dedicated SAN, Storage Array
• BI Data Storage Systems: Analysis Services Cubes; Data Warehouse (Data Staging, Bulk Loading)
• Presentation Layer Systems: Web Analytic Tools, Reporting Services, SharePoint Services, Microsoft Office SharePoint, PerformancePoint, Excel Services
• Data Path: feeds Presentation Data to the presentation layer
Two SQL DW Infrastructure Options:
SQL Classic DW or Fast Track SQL DW

SQL Classic DW Architecture
• Leverages shared SAN
• Enterprise shared SAN storage, shared network bandwidth
• Shares storage and network with OLTP applications

Fast Track SQL DW Architecture
• Architecture modeled after DW appliances (Teradata, DATAllegro, etc.): “appliance-like”
• Uses dedicated SAN arrays and dedicated network bandwidth
• SQL 2008 Data Warehouse SMP server
• SAN arrays at 1:4 to CPU cores (one array per four cores)
• 8 data disks per array (4 RAID 1 pairs), 2 log and 1 hot spare
• Simultaneous SQL Server reads
• Arrays: EMC AX4, HP MSA2312, IBM 3400
• Supports “scan-centric” DW workloads that are index light
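
The per-core building block can be sketched as a sizing helper. The 1:4 array-to-core ratio and the 8 data disks (4 RAID 1 pairs) per array are taken from the slide; the function and constant names are hypothetical, and log disks and hot spares are left out of the count.

```python
CORES_PER_ARRAY = 4        # "SAN Arrays 1:4 cpu cores" (from the slide)
DATA_DISKS_PER_ARRAY = 8   # "8 Data Disk / Array" (from the slide)
DISKS_PER_RAID1_PAIR = 2


def size_storage(cpu_cores: int) -> dict:
    """Return array, data disk and RAID 1 pair counts for a given core count."""
    if cpu_cores <= 0 or cpu_cores % CORES_PER_ARRAY != 0:
        raise ValueError("core count must be a positive multiple of 4")
    arrays = cpu_cores // CORES_PER_ARRAY
    data_disks = arrays * DATA_DISKS_PER_ARRAY
    return {
        "arrays": arrays,
        "data_disks": data_disks,
        "raid1_pairs": data_disks // DISKS_PER_RAID1_PAIR,
    }


for cores in (8, 12, 24, 48):
    print(cores, size_storage(cores))
```

For example, 12 cores map to 3 arrays and 24 data disks, and 48 cores to 12 arrays and 96 data disks.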
Optimizing storage layout for scan intensive workloads
•   LUN configuration is based on RAID1 pairs
    – Optimal for scan type access patterns
•   Striping across storage is accomplished via SQL Server data files
•   Observed throughput for a single RAID pair >= 130 MB/s

[Diagram: one storage enclosure. Row SP A: RAID GP01 (disks 01, 02; LUN1, LUN2), RAID GP02 (disks 03, 04; LUN3, LUN4), RAID GP05 (disks 09, 10; LUN0, logs). Row SP B: RAID GP03 (disks 05, 06; LUN5, LUN6), RAID GP04 (disks 07, 08; LUN7, LUN8). One hot spare.]
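
Because the 130 MB/s figure is a lower bound per RAID 1 pair, it gives a floor for the aggregate scan rate of an enclosure. A minimal sketch, assuming the scan is spread evenly across all data pairs (the function name is hypothetical):

```python
MIN_PAIR_MB_S = 130  # observed lower bound per RAID 1 pair (from the slide)


def min_scan_rate_mb_s(raid1_data_pairs: int) -> int:
    """Lower bound on aggregate scan throughput across the given data pairs."""
    if raid1_data_pairs < 0:
        raise ValueError("pair count cannot be negative")
    return raid1_data_pairs * MIN_PAIR_MB_S


# The enclosure in the figure has four data RAID groups (GP01 to GP04);
# GP05 holds logs and is not counted toward scan throughput here.
print(min_scan_rate_mb_s(4), "MB/s floor for one enclosure")
```

Observed rates may be higher, since the slide states the per-pair figure as ">=".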
Storage Layout Implications for SQL Server

[Diagram: data files laid out one per LUN across LUN 1, LUN 2, LUN 3, ... LUN 16.]
• Permanent_DB, Permanent FG: Permanent_1.ndf, Permanent_2.ndf, Permanent_3.ndf, ... Permanent_16.ndf
• Stage Database, Stage FG: Stage_1.ndf, Stage_2.ndf, Stage_3.ndf, ... Stage_16.ndf
• TempDB (figure label: Local Drive 1): TempDB.mdf (25GB), TempDB_02.ndf (25GB), TempDB_03.ndf (25GB), ... TempDB_16.ndf (25GB)
• Log LUN 1: Permanent DB Log, Stage DB Log
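
One way to produce a one-file-per-LUN layout like the one in the figure is to generate the T-SQL from a short script. This is only a sketch: the file names follow the figure, while the filegroup name, the mount point root and the file size are assumptions chosen for illustration.

```python
def add_file_statements(database, filegroup, prefix, lun_count, mount_root, size_gb):
    """Build one ALTER DATABASE ... ADD FILE statement per LUN."""
    statements = []
    for lun in range(1, lun_count + 1):
        logical_name = f"{prefix}_{lun}"
        # Hypothetical path scheme: one mount point folder per LUN.
        path = f"{mount_root}\\LUN{lun:02d}\\{logical_name}.ndf"
        statements.append(
            f"ALTER DATABASE [{database}] ADD FILE "
            f"(NAME = N'{logical_name}', FILENAME = N'{path}', SIZE = {size_gb}GB) "
            f"TO FILEGROUP [{filegroup}];"
        )
    return statements


# Assumed values: filegroup name, C:\FT mount root and 100 GB file size.
for statement in add_file_statements(
    "Permanent_DB", "Permanent_FG", "Permanent", 16, r"C:\FT", 100
):
    print(statement)
```

SQL Server fills the files of a filegroup proportionally, so equally sized files on separate LUNs spread a table scan across all of them, which is the striping through data files that the layout relies on.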
Creating Sequential Data Layout
• Goal: Align logical and physical ordering of data
  within a Filegroup
• Two primary ways Fast Track optimizes allocation
  for Sequential Scan
  – Minimize Fragmentation
  – Manage Load processing
Maximum Consumption Rate
Theoretical throughput for IO stack
•   Using a 2x quad-core server as a building block / starting point
•   Ensure that the per-core data consumption rate can be delivered by all elements of the IO stack
•   Sticker on the new car: “Miles Per Gallon”

[Diagram: Maximum theoretical throughput for IO stack components sized for an 8 CPU core Fast Track system (assumes 200 MB/s per core). Server: Windows Server OS, two CPU sockets (4 core each), MCR 1.6 GB/s, HBAs min 2 GB/s. Fiber switch: min 2 GB/s. Two storage enclosures, with component rates labeled 500 MB/s (four labels) and 300 MB/s (eight labels).]
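
The balance check behind this slide can be sketched in a few lines. The 200 MB/s per core, the 8 cores and the 2 GB/s minimums come from the figure; treating the four 500 MB/s labels as storage processor ports and the eight 300 MB/s labels as LUNs is an assumption about how to read it, and the function names are hypothetical.

```python
PER_CORE_MB_S = 200  # assumed per-core consumption rate (from the slide)


def mcr_mb_s(cores: int, per_core_mb_s: int = PER_CORE_MB_S) -> int:
    """Maximum Consumption Rate: what the CPUs can consume in total."""
    return cores * per_core_mb_s


def bottleneck(stage_rates: dict) -> tuple:
    """Return the (stage, rate) pair with the lowest throughput."""
    stage = min(stage_rates, key=stage_rates.get)
    return stage, stage_rates[stage]


stages = {
    "cpu": mcr_mb_s(8),               # 8 cores x 200 MB/s = 1.6 GB/s
    "hba": 2000,                      # "min 2 GB/s" in the figure
    "switch": 2000,                   # "min 2 GB/s" in the figure
    "storage_processors": 4 * 500,    # assumed grouping of the 500 MB/s labels
    "luns": 8 * 300,                  # assumed grouping of the 300 MB/s labels
}

print(bottleneck(stages))
```

In a balanced system the slowest stage is the CPU itself, meaning every other element of the stack can deliver at least the MCR.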
Scaling the IO stack
[Diagram: the building block scaled out. One server with eight CPU sockets (4 core each) and eight HBAs connects through a fiber switch to eight storage enclosures. Each enclosure has two storage processors and five RAID-1 pairs.]
Fast Track Data Warehouse Reference Configurations

Server                    | CPU                                              | CPU Cores | SAN              | Data Drive Count   | Initial Capacity* | Max Capacity**
HP Proliant DL 385 G6     | (2) AMD Opteron Istanbul six core 2.6 GHz        | 12        | (3) HP MSA2312fc | (24) 300GB 15k SAS | 6TB               | 12TB
HP Proliant DL 380 G6     | (2) Intel Xeon® 5500 Series Quad core            | 8         | (2) HP MSA2312   | (16) 300GB 15k SAS | 4TB               | 8TB
HP Proliant DL 585 G6     | (4) AMD Opteron Istanbul six core 2.6 GHz        | 24        | (6) HP MSA2312fc | (48) 300GB 15k SAS | 12TB              | 24TB
HP Proliant DL 580 G5     | (4) Intel Xeon® 7400 Series six core             | 24        | (6) HP MSA2312   | (48) 300GB 15k SAS | 12TB              | 24TB
HP Proliant DL 785 G6     | (8) AMD Opteron Istanbul six core 2.8 GHz        | 48        | (12) HP MSA2312  | (96) 300GB 15k SAS | 24TB              | 48TB
Dell PowerEdge R710       | (2) Intel Xeon Nehalem quad core 2.66 GHz        | 8         | (2) EMC AX4      | (16) 300GB 15k FC  | 4TB               | 8TB
Dell PowerEdge R900       | (4) Intel Xeon Dunnington six core 2.67 GHz      | 24        | (6) EMC AX4      | (48) 300GB 15k FC  | 12TB              | 24TB
IBM X3650 M2              | (2) Intel Xeon Nehalem quad core 2.67 GHz        | 8         | (2) IBM DS3400   | (16) 200GB 15k FC  | 4TB               | 8TB
IBM X3850 M2              | (4) Intel Xeon Dunnington six core 2.67 GHz      | 24        | (6) IBM DS3400   | (24) 300GB 15k FC  | 12TB              | 24TB
IBM X3950 M2              | (8) Intel Xeon Nehalem four core 2.13 GHz        | 32        | (8) IBM DS3400   | (32) 300GB 15k SAS | 16TB              | 32TB
Bull Novascale R460 E2    | (2) Intel Xeon Nehalem quad core 2.66 GHz        | 8         | (2) EMC AX4      | (16) 300GB 15k FC  | 4TB               | 8TB
Bull Novascale R480 E1    | (4) Intel Xeon Dunnington six core 2.67 GHz      | 24        | (6) EMC AX4      | (48) 300GB 15k FC  | 12TB              | 24TB

* Core-balanced compressed capacity based on 300GB 15k SAS, not including hot spares and log drives. Assumes 25% (of raw disk space) allocated for TempDB.
** Represents storage array fully populated with 300GB 15k SAS and use of a 2.5:1 compression ratio. This includes the addition of one storage expansion tray per enclosure. 30% of this storage should be reserved for DBA operations.
SQL Server Fast Track Data Warehouse 2.0 for
              HP – now on G6 Platform
Five AMD and Intel based Reference configurations available for HP:

AMD Based Reference Architectures

2 Processor Configuration
    – Server: HP ProLiant DL385 G6 with 2 6-core AMD Opteron CPUs
    – Storage server: MSA Storage
    – Scalability: 4 – 12 TB

4 Processor Configuration
    – Server: HP ProLiant DL 585 G6 with 4 6-core AMD Opteron CPUs
    – Storage server: MSA Storage
    – Scalability: 12 – 24 TB

8 processor Configuration
    – Server: HP ProLiant DL 785 G6 with 8 6-core AMD
      Opteron CPUs
    – Storage server: MSA Storage
    – Scalability: 24 – 48TB
SQL Server Fast Track Data Warehouse 2.0 for
            HP – now on G6 Platform
Intel Based Reference Architectures

2 Processor Configuration
    – Server: HP ProLiant DL380 G6 with 2 4-core Intel Xeon® 5500
      Series CPUs
    – Storage server: MSA Storage
    – Scalability: 4 – 8 TB

4 Processor Configuration
    – Server: HP ProLiant DL 580 G5 with 4 6-core Intel Xeon®
      7400 Series CPUs
    – Storage server: MSA Storage
    – Scalability: 12 – 24 TB
New Fast Track Data Warehouse 2.0 for IBM

Three Reference configurations available for IBM:
2 Processor Configuration
       –    Server: IBM System x3650 M2 with 2 Quad-core Intel Xeon CPUs
       –    Storage server: IBM System Storage DS3400
       –    Scalability: 4 – 8 TB

4 Processor Configuration
       –    Server: IBM System x3850 M2 with 4 6-core Intel Xeon CPUs
       –    Storage server: IBM System Storage DS3400
       –    Scalability: 12 – 24 TB

8 processor Configuration
       –    Server: IBM System x3950 M2 with 8 Quad-core Intel Xeon CPUs
       –    Storage server: IBM System Storage DS3400
       –    Scalability: 16 – 32TB
SQL Server Fast Track Data Warehouse 2.0 for
                    DELL
          Two Reference configurations available for DELL:

          2 Processor Configuration
              – Server: Dell PowerEdge R710 with 2 Quad-core Intel
                Xeon processors
              – 8 CPU Cores
              – 32GB Memory
              – Storage server: EMC CLARiiON AX4
              – Scalability: 4 – 8 TB

          4 Processor Configuration
              – Server: Dell PowerEdge R900 with 4 6-core Intel Xeon
                processors
              – 24 CPU Cores
              – 96 GB Memory
              – Storage server: EMC CLARiiON AX4
              – Scalability: 12 – 24 TB
SQL Server Fast Track Data Warehouse for BULL
        Two Reference configurations available for BULL:
        2 Processor Configuration
            – Server: Bull Novascale R460 E2 with 2 Quad-core Intel Xeon
              processors
            – Storage server: EMC CLARiiON AX4
            – Scalability: 4 – 8 TB

        4 Processor Configuration
            – Server: Bull Novascale R480 E1 with 4 6-core Intel Xeon
              processors
            – Storage server: EMC CLARiiON AX4
            – Scalability: 12 – 24 TB

        •   Also included in the Rack:
            –   SQL Server Analysis Services
            –   SQL Server Reporting Services
            –   SQL Server Integration Services
            –   HA Server
            –   Administration Server (with Management Studio, Backup
                Server)
Fast Track Data Warehouse Benefits
• Lower TCO
  – Minimizes risk of overspending on un-balanced hardware
    configurations
  – Commodity Hardware
• Choice
  – HW platform
  – Implementation vendor
• Reduced Risk
  – Validated by Microsoft
  – Encapsulates best practices
  – Known performance & scalability
Summary
Fast Track Data Warehouse offers customers:
• Faster time to solution
• High scale: up to 48TB
• Low TCO with better price performance; industry standard hardware
• Better performance out of the box and predictable performance
• Reduced risk through balanced hardware & best practices
• Integration with Madison Hub & Spoke Architecture

SQL Server Fast Track Data Warehouse has 2 components:
• Twelve reference architectures from HP, Dell, Bull, EMC and IBM
• System Integrators with industry solution templates: Avanade, HP, Hitachi, Cognizant and EMC
Next Steps
• Proof Steps
  – Quick Start DW Roadmap Service
  – Architectural Design Session
  – Madison Technology Preview (MTP)
  – Review Madison, SQL Server Classic or Fast Track
    DW HW/SW configurations and pricing
© 2009 Microsoft Corporation. All rights reserved. Microsoft, Windows, Windows Vista and other product names are or may be registered trademarks and/or trademarks in the U.S. and/or other countries.
The information herein is for informational purposes only and represents the current view of Microsoft Corporation as of the date of this presentation. Because Microsoft must respond to changing market conditions, it should not be interpreted to be a commitment on the part of Microsoft, and Microsoft cannot guarantee the accuracy of any information provided after the date of this presentation.
                                 MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN THIS PRESENTATION.

More Related Content

PDF
Realtime Apache Hadoop at Facebook
PDF
Real-Time Loading to Sybase IQ
PPTX
Shared Personalization Service - How To Scale to 15K RPS, Patrice Pelland
PDF
Top 6 Reasons to Use a Distributed Data Grid
PDF
SQL Server 2008 R2 Parallel Data Warehouse
PPT
Goods Receipt Document Imaging
PDF
Databus - Abhishek Bhargava & Maheswaran Veluchamy - DevOps Bangalore Meetup...
PPT
Document Imaging in Finance
Realtime Apache Hadoop at Facebook
Real-Time Loading to Sybase IQ
Shared Personalization Service - How To Scale to 15K RPS, Patrice Pelland
Top 6 Reasons to Use a Distributed Data Grid
SQL Server 2008 R2 Parallel Data Warehouse
Goods Receipt Document Imaging
Databus - Abhishek Bhargava & Maheswaran Veluchamy - DevOps Bangalore Meetup...
Document Imaging in Finance

What's hot (20)

PDF
Liquidity Risk Management powered by SAP HANA
PPTX
SPSMEL 2012 - SQL 2012 AlwaysOn Availability Groups for SharePoint 2010 / 2013
PPTX
Evolved BI with SQL Server 2012
PDF
SQL Server Workshop Paul Bertucci
PPTX
HBaseCon 2012 | Low Latency OLAP with HBase - Cosmin Lehene, Adobe
KEY
Cloud Computing & Scaling Web Apps
PDF
ITCamp 2012 - Adrian Stoian - Migrating from CFG MGR 2007 to CFG MGR 2012
PDF
Right Availability in RAC environment. Playing with Oracle clusterware infras...
PPTX
The Art & Sience of Optimization
PDF
Ta3
PDF
A Survey of Petabyte Scale Databases and Storage Systems Deployed at Facebook
PDF
SAP HANA Distinguished Engineer (HDE) Webinar: Overview of SAP HANA On-Premis...
PPTX
High Performance Cloud Computing
PDF
SQL Server User Group 02/2009
PDF
Twee remedies tegen systeemuitval en datacorruptie
PDF
Hadoop World 2011: The Hadoop Stack - Then, Now and in the Future - Eli Colli...
PPTX
Edition based redefinition joords
PDF
SQL Server High Availability
PPTX
Cloudera Sessions - Clinic 1 - Getting Started With Hadoop
Liquidity Risk Management powered by SAP HANA
SPSMEL 2012 - SQL 2012 AlwaysOn Availability Groups for SharePoint 2010 / 2013
Evolved BI with SQL Server 2012
SQL Server Workshop Paul Bertucci
HBaseCon 2012 | Low Latency OLAP with HBase - Cosmin Lehene, Adobe
Cloud Computing & Scaling Web Apps
ITCamp 2012 - Adrian Stoian - Migrating from CFG MGR 2007 to CFG MGR 2012
Right Availability in RAC environment. Playing with Oracle clusterware infras...
The Art & Sience of Optimization
Ta3
A Survey of Petabyte Scale Databases and Storage Systems Deployed at Facebook
SAP HANA Distinguished Engineer (HDE) Webinar: Overview of SAP HANA On-Premis...
High Performance Cloud Computing
SQL Server User Group 02/2009
Twee remedies tegen systeemuitval en datacorruptie
Hadoop World 2011: The Hadoop Stack - Then, Now and in the Future - Eli Colli...
Edition based redefinition joords
SQL Server High Availability
Cloudera Sessions - Clinic 1 - Getting Started With Hadoop
Ad

Viewers also liked (20)

PPTX
Instagram/ Redes Sociales
ODP
Exploring the Open Source Movement
PPTX
Drones: límites legales y cuestiones básicas
PDF
Folleto - Unidad Adolfo López Mateos - INV, 1964
PDF
Energie-Apéro in Poschiavo und Chur
PDF
Презентация компании Siemens "Системный рекордер видеонаблюдение Vectis iX"
PDF
Por Escrito
PDF
Diretrizes para o Manejo da Histoplasmose 2007
PDF
Reporte de estadias smart cubo
PPTX
Integrating bpi and innovation
PDF
Guia ved
PDF
PPTX
EPLAN Education NL
PDF
Campaña presentada al concurso Versus 2013 del Club de Creativos
PPT
Praxis PXI - Análisis Telefónía Móvil - 12.12.2013
PDF
Thomas Metten - "Wissenstransfer oder Wissenstransformation? Kulturwissenscha...
PDF
Modul web php
PDF
Catálogo Ingeniar Solutions Portafolio
PPT
Clase7 UBA PFII
PDF
Macchine intelligenti che imparano da sole
Instagram/ Redes Sociales
Exploring the Open Source Movement
Drones: límites legales y cuestiones básicas
Folleto - Unidad Adolfo López Mateos - INV, 1964
Energie-Apéro in Poschiavo und Chur
Презентация компании Siemens "Системный рекордер видеонаблюдение Vectis iX"
Por Escrito
Diretrizes para o Manejo da Histoplasmose 2007
Reporte de estadias smart cubo
Integrating bpi and innovation
Guia ved
EPLAN Education NL
Campaña presentada al concurso Versus 2013 del Club de Creativos
Praxis PXI - Análisis Telefónía Móvil - 12.12.2013
Thomas Metten - "Wissenstransfer oder Wissenstransformation? Kulturwissenscha...
Modul web php
Catálogo Ingeniar Solutions Portafolio
Clase7 UBA PFII
Macchine intelligenti che imparano da sole
Ad

SQL Server 2008 Fast Track Data Warehouse

  • 1. SQL Server 2008 Fast Track Data Warehouse 2.0 Speaker: Phil Hummel of WinWire Technologies Presentation developed by: Bruce Campbell Western Region Data Warehouse Specialist, Microsoft Silicon Valley SQL Server User Group February 16, 2009 Mark Ginnebaugh, User Group Leader, mark@designmind.com
  • 2. Agenda • DW vs. OLTP • Balanced Architecture Approach for DW Fast Track Defined • Fast Track Reference Architectures • Next Steps
  • 3. Microsoft DW & BI Stack DELIVERY PerformancePoint Services END USER TOOLS BI & DW PLATFORM (RDBMS, ETL, OLAP, Reporting)
  • 4. DW versus OLTP
      DW
        Database
          • Designed for analytical operations: strategic focus
          • Optimized for bulk load and large, complex, unpredictable queries
          • Fewer concurrent users relative to OLTP
        Storage
          • Primary focus on Read operations
          • Optimized for disk scan over seek operations
          • Storage optimization focused on disk scan rate (MB/s)
      OLTP
        Database
          • Designed for operational requirements: tactical focus
          • Optimized for transactions: “single row” entry and retrieval
          • Thousands of concurrent users
        Storage
          • Emphasizes transactional performance
          • Optimized for disk seek over scan operations
          • Storage optimization focused on I/O operations/s (IOPs)
  • 5. Sequential I/O
      Sequential I/O
        • Scans on large data stores are usually read with sequential read patterns and not random read patterns
        • Scalable, predictable performance
        • Requires 1/3 or fewer drives to match server I/O consumption capability
      Random I/O
        • OLTP is usually random-read centric. Discrete lookups benefit from index optimization and random read capability
        • Not as predictable & scalable for data warehousing
        • Requires a large number of drives to match server I/O consumption capability
      All databases contain both scans and seeks, along with other types of reads and writes. DW workloads indicate that the vast majority of reads, though not all, are sequential.
  • 6. Some SQL Data Warehouses today Big SAN Big 64-core Server Connected together What’s wrong with this picture?
  • 7. Answer: system out of balance • This server can consume 16 GB/Sec of IO, but the SAN can only deliver 2 GB/Sec – Even when the SAN is dedicated to the SQL Data Warehouse, which it often isn’t – Lots of disks for Random IOPS BUT – Limited controllers Limited IO bandwidth • System is typically IO bound • Queries are slow Result: significant investment, not delivering performance
  • 8. The Alternative: A Balanced System • Design a server + storage configuration that can deliver all the IO bandwidth that CPUs can consume when executing a SQL Relational DW workload • Avoid sharing storage devices among servers • Avoid overinvesting in disk drives – Focus on scan performance, not IOPS • Layout and manage data to maximize range scan performance and minimize fragmentation
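The imbalance described on slides 7 and 8 can be put into numbers. The following is a minimal Python sketch and is not part of the original deck: the function name and the returned dictionary keys are illustrative assumptions, and only the 16 GB/s and 2 GB/s figures come from slide 7.

```python
def effective_scan_rate(cpu_consume_gbps: float, storage_deliver_gbps: float) -> dict:
    """Compare what the CPUs could consume with what storage can deliver.

    The slower side caps the achievable scan rate. All names here are
    hypothetical; they do not correspond to any SQL Server setting.
    """
    achievable = min(cpu_consume_gbps, storage_deliver_gbps)
    return {
        "achievable_gbps": achievable,
        # Fraction of the CPUs' consumption capacity that storage can feed.
        "cpu_fed_fraction": achievable / cpu_consume_gbps,
        "io_bound": storage_deliver_gbps < cpu_consume_gbps,
    }


# Figures from slide 7: the server can consume 16 GB/s, the SAN delivers 2 GB/s.
result = effective_scan_rate(16.0, 2.0)
print(result)
```

With the slide 7 figures, storage feeds only one eighth of what the CPUs could consume, which is the sense in which the system is IO bound.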
  • 9. Potential Performance Bottlenecks
      [Diagram: IO path from disks and LUNs through the storage controller (cache, ports A and B), the FC switch and the FC HBAs (ports A and B) to the server (Windows, SQL Server cache, CPU cores)]
      Rates to check along the path:
        • CPU Feed Rate
        • SQL Server Read Ahead Rate
        • HBA Port Rate
        • Switch Port Rate
        • SP Port Rate
        • LUN Read Rate
        • Disk Feed Rate
  • 10. SQL Server Fast Track Data Warehouse Solution to help customers and partners accelerate their data warehouse deployments • A method for designing a cost-effective, balanced system for Data Warehouse workloads • Reference hardware configurations developed in conjunction with hardware partners using this method • Best practices for data layout, loading and management Relational Database Only – Not SSAS, IS, RS
  • 11. Fast Track Data Warehouse Components
      Software:
        • SQL Server 2008 Enterprise
        • Windows Server 2008
      Configuration guidelines:
        • Physical table structures
        • Indexes
        • Compression
        • SQL Server settings
        • Windows Server settings
        • Loading
      Hardware:
        • Tight specifications for servers, storage and networking
        • ‘Per core’ building block
  • 12. Fast Track Scope
      [Diagram: Fast Track scope along the data path. Labels: Supporting Systems; BI Data Storage Systems; Presentation Layer Systems; Integration Services (ETL); Analysis Services (Cubes); Reporting Services; SharePoint Services; Microsoft Office SharePoint; PerformancePoint; Excel Services; Web Analytic Tools; Dedicated SAN, Storage Array; Data Warehouse; Data Staging, Bulk Loading; Presentation Data. The Reference Architecture Scope is marked with a dashed outline.]
  • 13. Two SQL DW Infrastructure Options: SQL Classic DW or Fast Track SQL DW
      SQL Classic DW Architecture
        • Leverages Shared SAN
        • Enterprise Shared SAN Storage
        • Shared Network Bandwidth
        • SAN shared between OLTP Applications and the SQL 2008 Data Warehouse
      Fast Track SQL DW Architecture
        • Architecture modeled after DW Appliances (Teradata, DATAllegro, etc.): “Appliance Like”
        • Uses Dedicated SAN arrays and Network
        • Dedicated Network Bandwidth
        • SAN Arrays 1:4 cpu cores
        • SMP Server
        • 8 Data Disk / Array – 4 Raid 1 Pairs, 2 Log and 1 Hot Spare
        • Simultaneous SQL Server Reads
        • EMC AX4 – HP MSA2312 – IBM 3400
      SQL Fast Track DW supports “Scan Centric” DW workloads that are index light
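The per-array figures on slide 13 (one SAN array per 4 CPU cores, 8 data disks as 4 RAID1 pairs, 2 log disks, 1 hot spare) lend themselves to a small sizing helper. This is a rough sketch under stated assumptions: the class and function names are hypothetical, and it assumes the log disks and hot spare are counted per array, which the slide implies but does not state outright.

```python
from dataclasses import dataclass

CORES_PER_ARRAY = 4        # slide 13: "SAN Arrays 1:4 cpu cores"
DATA_DISKS_PER_ARRAY = 8   # slide 13: 8 data disks per array (4 RAID1 pairs)
LOG_DISKS_PER_ARRAY = 2    # assumption: counted per array
HOT_SPARES_PER_ARRAY = 1   # assumption: counted per array


@dataclass
class StorageSizing:
    arrays: int
    raid1_pairs: int
    data_disks: int
    log_disks: int
    hot_spares: int


def size_storage(cpu_cores: int) -> StorageSizing:
    """Derive array and disk counts from the CPU core count."""
    if cpu_cores <= 0 or cpu_cores % CORES_PER_ARRAY != 0:
        raise ValueError("core count must be a positive multiple of 4")
    arrays = cpu_cores // CORES_PER_ARRAY
    data_disks = arrays * DATA_DISKS_PER_ARRAY
    return StorageSizing(
        arrays=arrays,
        raid1_pairs=data_disks // 2,  # each RAID1 pair mirrors two disks
        data_disks=data_disks,
        log_disks=arrays * LOG_DISKS_PER_ARRAY,
        hot_spares=arrays * HOT_SPARES_PER_ARRAY,
    )


sizing = size_storage(8)
print(sizing)
```

For 8 cores this yields 2 arrays and 16 data disks, which agrees with the 8-core rows of the reference configuration table on slide 19.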
  • 14. Optimizing storage layout for scan intensive workloads
      • LUN configuration is based on RAID1 pairs
          – Optimal for scan type access patterns
      • Striping across storage is accomplished via SQL Server data files
      • Observed throughput for a single RAID pair >= 130 MB/s
      [Diagram: one storage enclosure with storage processors A and B. RAID groups GP01 to GP04 (disks 01 to 08) carry LUN1 to LUN8, RAID GP05 (disks 09 and 10) carries LUN0 (Logs), plus a hot spare.]
  • 15. Storage Layout Implications for SQL Server
      Data files, one per LUN:
        Database / filegroup           | LUN 1             | LUN 2                | LUN 3                | ... | LUN16
        Permanant_DB (Permanent FG)    | Permanent_1.ndf   | Permanent_2.ndf      | Permanent_3.ndf      | ... | Permanent_16.ndf
        Stage database (Stage FG)      | Stage_1.ndf       | Stage_2.ndf          | Stage_3.ndf          | ... | Stage_16.ndf
        TempDB                         | TempDB.mdf (25GB) | TempDB_02.ndf (25GB) | TempDB_03.ndf (25GB) | ... | TempDB_16.ndf (25GB)
      Other labels in the diagram:
        • Local Drive 1
        • Log LUN 1: Permanent DB Log, Stage DB Log
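The naming pattern on slide 15 (one data file per LUN for each filegroup) can be generated rather than typed by hand. The snippet below is only an illustrative Python sketch: the function name is hypothetical, and it produces file names and LUN labels, not actual SQL Server DDL.

```python
def filegroup_files(prefix: str, lun_count: int) -> list[tuple[str, str]]:
    """Pair each LUN with one data file named <prefix>_<n>.ndf.

    Mirrors the slide 15 pattern (Permanent_1.ndf on LUN 1, and so on).
    """
    return [(f"LUN {n}", f"{prefix}_{n}.ndf") for n in range(1, lun_count + 1)]


permanent = filegroup_files("Permanent", 16)
stage = filegroup_files("Stage", 16)

for lun, filename in permanent[:3]:
    print(lun, filename)
```

Spreading each filegroup over every LUN in this way is what slide 14 means by striping across storage via SQL Server data files.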
  • 16. Creating Sequential Data Layout • Goal: Align logical and physical ordering of data within a Filegroup • Two primary ways Fast Track optimizes allocation for Sequential Scan – Minimize Fragmentation – Manage Load processing
  • 17. Maximum Consumption Rate
      • Using a 2x quad-core server as a building block / starting point (assumes 200 MB/s per core)
      • Ensure that the per-core data consumption rate can be delivered by all elements of the IO stack
      • Sticker on the new car: “Miles Per Gallon”
      [Figure: Theoretical throughput for IO stack. Maximum theoretical throughput for IO stack components sized for an 8 CPU core Fast Track system, MCR 1.6 GB/s. Components shown: server (Windows Server OS, two 4-core CPU sockets, HBAs), fiber switch, storage enclosures. Rate labels in the figure: 300 MB/s, 500 MB/s, Min 2 GB/s.]
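The MCR arithmetic on slide 17 is simple enough to check in a few lines. The sketch below assumes the 200 MB/s per-core figure from the slide; the function names and the component throughput values in `hypothetical_stack` are invented for illustration and are not taken from the deck.

```python
PER_CORE_MBPS = 200  # slide 17 assumption: each core consumes 200 MB/s


def max_consumption_rate(cores: int, per_core_mbps: int = PER_CORE_MBPS) -> int:
    """Maximum Consumption Rate (MCR) in MB/s for a given core count."""
    return cores * per_core_mbps


def undersized_components(mcr_mbps: int, stack: dict[str, int]) -> list[str]:
    """Return the IO stack elements whose aggregate rate is below the MCR."""
    return [name for name, rate in stack.items() if rate < mcr_mbps]


mcr = max_consumption_rate(8)  # 2x quad-core building block

# Hypothetical aggregate throughput per IO stack element, in MB/s.
hypothetical_stack = {
    "hba_ports_total": 1600,
    "switch_ports_total": 2000,
    "storage_processors_total": 1200,
}

weak = undersized_components(mcr, hypothetical_stack)
print(mcr, weak)
```

For 8 cores the MCR comes out at 1600 MB/s, matching the 1.6 GB/s on the slide; any element reported by `undersized_components` would be the bottleneck that keeps the system from being balanced.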
  • 18. Scaling the IO stack
      [Diagram: a server with multiple 4-core CPU sockets and HBAs, connected through a fiber switch to multiple storage enclosures, each containing storage processors and RAID-1 pairs]
  • 19. Fast Track Data Warehouse Reference Configurations
      Server | CPU | CPU Cores | SAN | Data Drive Count | Initial Capacity* | Max Capacity**
      HP Proliant DL 385 G6 | (2) AMD Opteron Istanbul six core 2.6 GHz | 12 | (3) HP MSA2312fc | (24) 300GB 15k SAS | 6TB | 12TB
      HP Proliant DL 380 G6 | (2) Intel Xeon® 5500 Series quad core | 8 | (2) HP MSA2312 | (16) 300GB 15k SAS | 4TB | 8TB
      HP Proliant DL 585 G6 | (4) AMD Opteron Istanbul six core 2.6 GHz | 24 | (6) HP MSA2312fc | (48) 300GB 15k SAS | 12TB | 24TB
      HP Proliant DL 580 G5 | (4) Intel Xeon® 7400 Series six core | 24 | (6) HP MSA2312 | (48) 300GB 15k SAS | 12TB | 24TB
      HP Proliant DL 785 G6 | (8) AMD Opteron Istanbul six core 2.8 GHz | 48 | (12) HP MSA2312 | (96) 300GB 15k SAS | 24TB | 48TB
      Dell PowerEdge R710 | (2) Intel Xeon Nehalem quad core 2.66 GHz | 8 | (2) EMC AX4 | (16) 300GB 15k FC | 4TB | 8TB
      Dell PowerEdge R900 | (4) Intel Xeon Dunnington six core 2.67 GHz | 24 | (6) EMC AX4 | (48) 300GB 15k FC | 12TB | 24TB
      IBM X3650 M2 | (2) Intel Xeon Nehalem quad core 2.67 GHz | 8 | (2) IBM DS3400 | (16) 200GB 15k FC | 4TB | 8TB
      IBM X3850 M2 | (4) Intel Xeon Dunnington six core 2.67 GHz | 24 | (6) IBM DS3400 | (24) 300GB 15k FC | 12TB | 24TB
      IBM X3950 M2 | (8) Intel Xeon Nehalem four core 2.13 GHz | 32 | (8) IBM DS3400 | (32) 300GB 15k SAS | 16TB | 32TB
      Bull Novascale R460 E2 | (2) Intel Xeon Nehalem quad core 2.66 GHz | 8 | (2) EMC AX4 | (16) 300GB 15k FC | 4TB | 8TB
      Bull Novascale R480 E1 | (4) Intel Xeon Dunnington six core 2.67 GHz | 24 | (6) EMC AX4 | (48) 300GB 15k FC | 12TB | 24TB
      * Core-balanced compressed capacity based on 300GB 15k SAS, not including hot spares and log drives. Assumes 25% (of raw disk space) allocated for Temp DB.
      ** Represents storage array fully populated with 300GB 15k SAS and use of a 2.5:1 compression ratio. This includes the addition of one storage expansion tray per enclosure. 30% of this storage should be reserved for DBA operations.
  • 20. SQL Server Fast Track Data Warehouse 2.0 for HP – now on G6 Platform
      Five AMD and Intel based Reference configurations available for HP.
      AMD Based Reference Architectures
        2 Processor Configuration
          – Server: HP ProLiant DL385 G6 with 2 6-core AMD Opteron CPUs
          – Storage server: MSA Storage
          – Scalability: 4 – 12 TB
        4 Processor Configuration
          – Server: HP ProLiant DL 585 G6 with 4 6-core AMD Opteron CPUs
          – Storage server: MSA Storage
          – Scalability: 12 – 24 TB
        8 Processor Configuration
          – Server: HP ProLiant DL 785 G6 with 8 6-core AMD Opteron CPUs
          – Storage server: MSA Storage
          – Scalability: 24 – 48 TB
  • 21. SQL Server Fast Track Data Warehouse 2.0 for HP – now on G6 Platform
      Intel Based Reference Architectures
        2 Processor Configuration
          – Server: HP ProLiant DL380 G6 with 2 4-core Intel Xeon® 5500 Series CPUs
          – Storage server: MSA Storage
          – Scalability: 4 – 8 TB
        4 Processor Configuration
          – Server: HP ProLiant DL 580 G5 with 4 6-core Intel Xeon® 7400 Series CPUs
          – Storage server: MSA Storage
          – Scalability: 12 – 24 TB
  • 22. New Fast Track Data Warehouse 2.0 for IBM
      Three Reference configurations available for IBM:
        2 Processor Configuration
          – Server: IBM System x3650 M2 with 2 Quad-core Intel Xeon CPUs
          – Storage server: IBM System Storage DS3400
          – Scalability: 4 – 8 TB
        4 Processor Configuration
          – Server: IBM System x3850 M2 with 4 6-core Intel Xeon CPUs
          – Storage server: IBM System Storage DS3400
          – Scalability: 12 – 24 TB
        8 Processor Configuration
          – Server: IBM System x3950 M2 with 8 Quad-core Intel Xeon CPUs
          – Storage server: IBM System Storage DS3400
          – Scalability: 16 – 32 TB
  • 23. SQL Server Fast Track Data Warehouse 2.0 for DELL
      Two Reference configurations available for DELL:
        2 Processor Configuration
          – Server: Dell Power Edge R710 with 2 Quad-core Intel Xeon processors
          – 8 CPU Cores
          – 32 GB Memory
          – Storage server: EMC CLARiiON AX4
          – Scalability: 4 – 8 TB
        4 Processor Configuration
          – Server: Dell Power Edge R900 with 4 6-core Intel Xeon processors
          – 24 CPU Cores
          – 96 GB Memory
          – Storage server: EMC CLARiiON AX4
          – Scalability: 12 – 24 TB
  • 24. SQL Server Fast Track Data Warehouse for BULL
      Two Reference configurations available for BULL:
        2 Processor Configuration
          – Server: Bull Novascale R460 E2 with 2 Quad-core Intel Xeon processors
          – Storage server: EMC CLARiiON AX4
          – Scalability: 4 – 8 TB
        4 Processor Configuration
          – Server: Bull Novascale R480 E1 with 4 6-core Intel Xeon processors
          – Storage server: EMC CLARiiON AX4
          – Scalability: 12 – 24 TB
      • Also included in the Rack:
          – SQL Server Analysis Services
          – SQL Server Reporting Services
          – SQL Server Integration Services
          – HA Server
          – Administration Server (with Management Studio, Backup Server
  • 25. Fast Track Data Warehouse Benefits • Lower TCO – Minimizes risk of overspending on un-balanced hardware configurations – Commodity Hardware • Choice – HW platform – Implementation vendor • Reduced Risk – Validated by Microsoft – Encapsulates best practices – Known performance & scalability
  • 26. Summary
      Fast Track Data Warehouse offers customers:
        • Faster time to solution
        • High scale: up to 48TB
        • Low TCO with better price performance; industry standard hardware
        • Better performance out of the box and predictable performance
        • Reduced risk through balanced hardware & best practices
        • Integration with Madison Hub & Spoke Architecture
      SQL Server Fast Track Data Warehouse has 2 components:
        • Twelve reference architectures from HP, Dell, Bull, EMC and IBM
        • System Integrators with industry solution templates – Avanade, HP, Hitachi, Cognizant and EMC
  • 27. Next Steps • Proof Steps – Quick Start DW Roadmap Service – Architectural Design Session – Madison Technology Preview (MTP) – Review Madison, SQL Server Classic or Fast Track DW HW/SW configurations and pricing
  • 28. © 2009 Microsoft Corporation. All rights reserved. Microsoft, Windows, Windows Vista and other product names are or may be registered trademarks and/or trademarks in the U.S. and/or other countries. The information herein is for informational purposes only and represents the current view of Microsoft Corporation as of the date of this presentation. Because Microsoft must respond to changing market conditions, it should not be interpreted to be a commitment on the part of Microsoft, and Microsoft cannot guarantee the accuracy of any information provided after the date of this presentation. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN THIS PRESENTATION.