SlideShare a Scribd company logo
Running A Megasite On Microsoft Technologies Casey Jacobs Aber Whitcomb Director of Engineering CTO Microsoft.com MySpace.com Chris St.Amand Jim Benedetto   Sr. System Engineer VP of Technology Microsoft.com MySpace.com NGW046
Agenda Introduction – Quick Facts MySpace.com – Growing Up Upcoming Technology Enablers Open Panel Discussion
Introduction
Brief History Of Microsoft.com Microsoft combines Web platform, ops, and content teams Standardization effort begins, consolidation hosted systems Focus on MSCOM Network Programming and campaign-to-Web integration Single MSCOM group formed Brand, content, site std’s, Privacy, brand compliance Microsoft launches www.microsoft.com  Information & support publishing; hosting Enable an innovative customer experience online & in-product Product Info, Support, Dev /  ITPro Experience, Customer Intelligence, Profile Mgmt & Enterprise Downloads 2001 4M UUsers / day 2003 6.5M UUsers / day 1995 30k users / day 2006 17.1M UUsers / day
Microsoft.com Quick Facts Infrastructure and Application Footprint 5 Internet Data Centers & 3 CDN Partnerships 110 Web Sites, 1000’s App's and 2138 Databases  80+ Gigabit/sec Bandwidth Solutions at High Scale www.Microsoft.com  13M UUsers/Day & 70M Page Views/Day 10K Req/Sec, 300K CC Conn’s on 80 Servers 350 Vroots, 190 IIS Web App’s & 12 App Pools Microsoft Update 250M UScans/Day, 12K ASP.NET Req/Sec, 1.1M ConCurrent  28.2 Billion Downloads for CY 2005 Egress – MS, Akamai & Savvis (30-80+ Gbit/Sec)
MySpace Company Overview Launched Sept, 2003 Latest as of February 2006 64+ MM Registered Users 38 MM UUsers & 2.3M Concurrent 260K New Registered Users/Day 23 Billion Page* Views/Month Demographics 50.2% Female / 49.8% Male Primary Age Demo: 14-34 Site Trends 260K New Users/Day 430M Total Images  Millions of Songs Streamed/Day 1000’s of New MP3’s/Day 20 Million Comments Posted Media Metrix February 2006 Audience Rankings Source  comScore Media Metrix February  - 2006 23,566 #2 MySpace Hotmail Google Ebay MSN Yahoo 9,632 #4 29,508 #1 14,695 #3 7,329 #5 6,812 #6 Pageviews in ‘000s Internet Rank
MySpace.com  Quick Facts Infrastructure and Application Footprint 3 Internet Data Centers Server Breakdown 2682 Web and 650 Database Servers 90 Cache Servers 16gb RAM 650 Dart servers 60 DB Servers 150 Media servers 3000 disks in SAN architecture Egress Management 17,000 mb/s bandwidth 15,000 mb/s on CDN
MySpace.com Growing up in the Internet World
0 users The beginning Two tiered architecture Single Database Load balanced web servers Great for rapid development Less complexity means faster time to market and less operational costs Works for small to medium sized websites, not big ones 0 Users
500k Users A Single database is not enough Max out a single database Split reads and writes across separate databases Use transactional replication so multiple databases can service reads 500k Users
1 Million Vertical partitioning Transactional replication doesn’t work for all workloads and data types Use a combination of Vertical Partitioning and replication 1M Users
2 Million SAN Start to reconsider SCSI arrays for the long-term SCSI arrays have good performance but reliability issues SANS provide better performance, uptime, and redundancy Move to a clarion and enjoy better these benefits 2M Users
3 Million Horizontal partitioning Vertical Partitions see performance problems Decide we need to re-architect the database Horizontal partitioning is the answer but is difficult to do while in production 3M Users
Horizontal Partitioning All features reside on  a single database server Data is partitioned by user ID Some data cannot be partitioned especially on a social networking site 3M Users
5 Million Network bottlenecks Various areas of the network  become saturated Gig uplinks are maxed out Switch to Autonomous network and BGP Get multiple gig links and 10G links Load balancer is maxed out “ Must load balance the load balancers” Use DNS 5M Users
7 Million Site dependencies Separating features on the front end isolates potential bottlenecks Using subdomains  is easiest way 7M Users
10 Million Scalable storage Trying to partition storage on the backend is time consuming and inefficient Maxing out SANs is very costly We realize scalable storage is key 10M Users
15 Million DB’s versus Caching Databases still having perf issues Databases are expensive Have a lot of transactional overhead Caching tier High speed cache is perfect for reads LRU algorithm is self managing Drastically reduces database load
MySpace Where we are today
Upcoming Technology Enablers What’s Next for Microsoft.com and MySpace.com?
SQL Server 2005 Product technology enablers Peer-To-Peer Replication System & Data Center Autonomy Zero “perceived” Application  Downtime from Consumers Eliminates Single Point of Failure for R/W Databases Mirroring (SP1) Targeting Replacement of Log Shipping Fail-Over pairs 3 Systems in TAP Program (Technet, Learning & Genuine)  Reduced Failover Downtime Log Shipping:  5-15min Avg Mirroring < 1min (planned) Table Partitioning Reduced Storage Costs Scale Up at Lower Costs
MySpace Scaling SQL Server V1:  Single Instance – < 1 Million Users Single SQL Server Instance Supports All Users and Features V2:  Single Instance Replicating to Read Only Full Copies < 2 Million Users Single server handles all write transactions, read transactions spread across multiple transactional  replication copies V3:  Vertical Partitioning - < 4 Million Users Each Feature/Page of the site on its own SQL Server
MySpace Scaling SQL Server V4:  Horizontal Partitioning - < 8 Million Users All features/pages brought back to single database schema Standard schema across all databases User ranges partitioned across databases V5:  Horizontally Partitioned Core with Replicated Content, Vertically Partitioned Features Databases, “Shared Content” Databases - > 8 Million Users Primary Myspace schema exists across large farm of servers  Small amounts of content replicated to all horizontally partitioned servers to allow for features spanning all  user ranges V6: Migration to SQL Server 2005 - >26 Million Users
SQL Server 2005 64 bit Memory Pressure under 4GB 32 Limit  Servers loaded with 32Gigs of RAM <4 Gig Addressable to the memory pools we were stressing Manifestations Connection Timeouts Servers going “dark”, requiring restart Rejected Connections Problem Eliminated on 64bit Arch Connection/Sort memory pools now able to address all 32Gigs of RAM
Virtualizing Storage What is it? Software layer between your disks & hosts Advantages Provisioning is very simple, makes capacity planning more predictable Much better performance Can easily add more capacity to a LUN What do we use? 3par 14 week bake off
Longhorn And IIS 7.0 Product technology enablers UNC Content Store Simplified Content Mgmt Reduced Disk Footprint File Replication (DC to DC) Latent/Long links improved 80X (10Mbps vs 850Mbps) Enabler of Geo-Hosting Options Centralized IIS Config’s Copy “Host-Host” capability Eliminate complex scripting of meta-base & config’s Dynamic Content Compression Further reduced Egress Improved Web Perf Delivery
IIS 7.0 Failed Request Tracing
Geo-Targeting Solutions Demographic management Objective – Enable Targeted Release of App’s and Content  Avoid demographic support spikes and further align to marketing campaigns Microsoft Confidential. © 2006 Microsoft Corporation. All rights reserved. This presentation is for internal Microsoft use only.  Sensitivity to Time/Frequency of customer online experiences Improve ability to reach last 30% of client population
Open Panel Discussion
© 2006 Microsoft Corporation. All rights reserved. This presentation is for informational purposes only. Microsoft makes no warranties, express or implied, in this summary.
 

More Related Content

PPT
Web Center Services and Framework
DOC
Updating current Network Design It18 roshan basnet
PPT
Web Server Hardware and Software
PPTX
Exchange 2013
PPTX
Microsoft Windows 7 Improved Network Access
PPTX
Web server hardware and software
PPTX
Web ,app and db server presentation
PPTX
Web server
Web Center Services and Framework
Updating current Network Design It18 roshan basnet
Web Server Hardware and Software
Exchange 2013
Microsoft Windows 7 Improved Network Access
Web server hardware and software
Web ,app and db server presentation
Web server

What's hot (20)

PDF
Fm12 instant web_publish_en
PPTX
New File Server Features Of Windows Server 2008
PPT
Web Server Technologies II: Web Applications & Server Maintenance
PPTX
What is Server? (Web Server vs Application Server)
PPTX
SharePoint Saturday Michigan Keynote - Top 5 Infrastructure Concerns for a Sh...
PPTX
Web server hardware and software
PPT
Web Server Technologies I: HTTP & Getting Started
PPT
Web server
PPT
Asia Pacific SharePoint Capacity Planning by Joel Oleson
PPTX
Sitecore experience platform session 1
PDF
Getting Rid Of Legacy And SharePoint Migration and Assement by Joel Oleson
PDF
New Exchange Server 2013 Architecture
PPT
Web servers – features, installation and configuration
PPSX
Web server
PPT
Web servers (l6)
PPT
Test111
PPTX
Web server hardware and software
PPT
Internet server components
PDF
Exl393 exchange 2013 architecture schnoll (rm221)
PDF
Implementing Advanced Caching and Replication Techniques in ...
Fm12 instant web_publish_en
New File Server Features Of Windows Server 2008
Web Server Technologies II: Web Applications & Server Maintenance
What is Server? (Web Server vs Application Server)
SharePoint Saturday Michigan Keynote - Top 5 Infrastructure Concerns for a Sh...
Web server hardware and software
Web Server Technologies I: HTTP & Getting Started
Web server
Asia Pacific SharePoint Capacity Planning by Joel Oleson
Sitecore experience platform session 1
Getting Rid Of Legacy And SharePoint Migration and Assement by Joel Oleson
New Exchange Server 2013 Architecture
Web servers – features, installation and configuration
Web server
Web servers (l6)
Test111
Web server hardware and software
Internet server components
Exl393 exchange 2013 architecture schnoll (rm221)
Implementing Advanced Caching and Replication Techniques in ...
Ad

Viewers also liked (9)

PPTX
Global Recruitment Capabilities, LinkedIn Recruiter Integration & the Intervi...
PDF
IaaS, SaaS, PasS : Cloud Computing
PPTX
T3 Cover Letters
PPTX
Tutorial 2 - Skills Matrix
PPTX
What's New in SPS11 Overview
PPTX
Cloud Computing - Amazon Web Services
PDF
Technical Recruitment Overview & Tips
PPTX
Implementa procesos de calidad de datos para obtener visibilidad de 360 grado...
PPTX
Big Data - 25 Amazing Facts Everyone Should Know
Global Recruitment Capabilities, LinkedIn Recruiter Integration & the Intervi...
IaaS, SaaS, PasS : Cloud Computing
T3 Cover Letters
Tutorial 2 - Skills Matrix
What's New in SPS11 Overview
Cloud Computing - Amazon Web Services
Technical Recruitment Overview & Tips
Implementa procesos de calidad de datos para obtener visibilidad de 360 grado...
Big Data - 25 Amazing Facts Everyone Should Know
Ad

Similar to Running a Megasite on Microsoft Technologies (20)

PPTX
Designing a Scalable Twitter - Patterns for Designing Scalable Real-Time Web ...
PDF
Solving enterprise challenges through scale out storage &amp; big compute final
PPT
Gavin M
PPTX
Membase Meetup 2010
PPT
MOSS 2007 Deployment Fundamentals -Part2
PPTX
Architecting Cloudy Applications
PPTX
Handling Data in Mega Scale Systems
PDF
Caching for Microservices Architectures: Session I
PPT
Infrastructure Strategies 2007
PPT
Oracle Coherence: in-memory datagrid
PPTX
How leading financial services organisations are winning with tech
PPT
StackOverflow Architectural Overview
PPTX
Building Scalable .NET Web Applications
PPT
Spring 2007 SharePoint Connections Oleson Advanced Administration and Plannin...
PDF
FILES IN TODAY’S WORLD - #MFSummit2017
PPTX
Scaling wix with microservices architecture devoxx London 2015
PDF
Move fast and make things with microservices
PPT
Optimize Your It Environment With An Hp Blade System Solution
PPTX
La creación de una capa operacional con MongoDB
PDF
Marketing Automation at Scale: How Marketo Solved Key Data Management Challen...
Designing a Scalable Twitter - Patterns for Designing Scalable Real-Time Web ...
Solving enterprise challenges through scale out storage &amp; big compute final
Gavin M
Membase Meetup 2010
MOSS 2007 Deployment Fundamentals -Part2
Architecting Cloudy Applications
Handling Data in Mega Scale Systems
Caching for Microservices Architectures: Session I
Infrastructure Strategies 2007
Oracle Coherence: in-memory datagrid
How leading financial services organisations are winning with tech
StackOverflow Architectural Overview
Building Scalable .NET Web Applications
Spring 2007 SharePoint Connections Oleson Advanced Administration and Plannin...
FILES IN TODAY’S WORLD - #MFSummit2017
Scaling wix with microservices architecture devoxx London 2015
Move fast and make things with microservices
Optimize Your It Environment With An Hp Blade System Solution
La creación de una capa operacional con MongoDB
Marketing Automation at Scale: How Marketo Solved Key Data Management Challen...

More from goodfriday (20)

PPT
Narine Presentations 20051021 134052
PDF
Triunemar05
PDF
09 03 22 easter
PDF
Holy Week Easter 2009
PDF
Holt Park Easter 09 Swim
PDF
Easter Letter
PDF
April2009
PDF
Swarthmore Lentbrochure20092
PDF
Eastercard2009
PDF
Easterservices2009
PDF
Bulletin Current
PDF
Easter2009
PDF
Bulletin
PDF
March 2009 Newsletter
PDF
Mar 29 2009
PDF
Lent Easter 2009
PDF
Easterpowersports09
PDF
Easter Trading 09
PDF
Easter Brochure 2009
PDF
March April 2009 Calendar
Narine Presentations 20051021 134052
Triunemar05
09 03 22 easter
Holy Week Easter 2009
Holt Park Easter 09 Swim
Easter Letter
April2009
Swarthmore Lentbrochure20092
Eastercard2009
Easterservices2009
Bulletin Current
Easter2009
Bulletin
March 2009 Newsletter
Mar 29 2009
Lent Easter 2009
Easterpowersports09
Easter Trading 09
Easter Brochure 2009
March April 2009 Calendar

Recently uploaded (20)

PDF
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
PPTX
MYSQL Presentation for SQL database connectivity
PPTX
Cloud computing and distributed systems.
PPTX
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
PDF
Per capita expenditure prediction using model stacking based on satellite ima...
PPT
Teaching material agriculture food technology
PDF
Empathic Computing: Creating Shared Understanding
PDF
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
PDF
cuic standard and advanced reporting.pdf
PPTX
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
PDF
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
PDF
Dropbox Q2 2025 Financial Results & Investor Presentation
PDF
Building Integrated photovoltaic BIPV_UPV.pdf
PDF
Unlocking AI with Model Context Protocol (MCP)
PDF
Network Security Unit 5.pdf for BCA BBA.
PDF
The Rise and Fall of 3GPP – Time for a Sabbatical?
PDF
Reach Out and Touch Someone: Haptics and Empathic Computing
PPTX
sap open course for s4hana steps from ECC to s4
PPTX
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
PDF
Approach and Philosophy of On baking technology
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
MYSQL Presentation for SQL database connectivity
Cloud computing and distributed systems.
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
Per capita expenditure prediction using model stacking based on satellite ima...
Teaching material agriculture food technology
Empathic Computing: Creating Shared Understanding
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
cuic standard and advanced reporting.pdf
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
Dropbox Q2 2025 Financial Results & Investor Presentation
Building Integrated photovoltaic BIPV_UPV.pdf
Unlocking AI with Model Context Protocol (MCP)
Network Security Unit 5.pdf for BCA BBA.
The Rise and Fall of 3GPP – Time for a Sabbatical?
Reach Out and Touch Someone: Haptics and Empathic Computing
sap open course for s4hana steps from ECC to s4
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
Approach and Philosophy of On baking technology

Running a Megasite on Microsoft Technologies

  • 1. Running A Megasite On Microsoft Technologies Casey Jacobs Aber Whitcomb Director of Engineering CTO Microsoft.com MySpace.com Chris St.Amand Jim Benedetto Sr. System Engineer VP of Technology Microsoft.com MySpace.com NGW046
  • 2. Agenda Introduction – Quick Facts MySpace.com – Growing Up Upcoming Technology Enablers Open Panel Discussion
  • 4. Brief History Of Microsoft.com Microsoft combines Web platform, ops, and content teams Standardization effort begins, consolidation hosted systems Focus on MSCOM Network Programming and campaign-to-Web integration Single MSCOM group formed Brand, content, site std’s, Privacy, brand compliance Microsoft launches www.microsoft.com Information & support publishing; hosting Enable an innovative customer experience online & in-product Product Info, Support, Dev / ITPro Experience, Customer Intelligence, Profile Mgmt & Enterprise Downloads 2001 4M UUsers / day 2003 6.5M UUsers / day 1995 30k users / day 2006 17.1M UUsers / day
  • 5. Microsoft.com Quick Facts Infrastructure and Application Footprint 5 Internet Data Centers & 3 CDN Partnerships 110 Web Sites, 1000’s App's and 2138 Databases 80+ Gigabit/sec Bandwidth Solutions at High Scale www.Microsoft.com 13M UUsers/Day & 70M Page Views/Day 10K Req/Sec, 300K CC Conn’s on 80 Servers 350 Vroots, 190 IIS Web App’s & 12 App Pools Microsoft Update 250M UScans/Day, 12K ASP.NET Req/Sec, 1.1M ConCurrent 28.2 Billion Downloads for CY 2005 Egress – MS, Akamai & Savvis (30-80+ Gbit/Sec)
  • 6. MySpace Company Overview Launched Sept, 2003 Latest as of February 2006 64+ MM Registered Users 38 MM UUsers & 2.3M Concurrent 260K New Registered Users/Day 23 Billion Page* Views/Month Demographics 50.2% Female / 49.8% Male Primary Age Demo: 14-34 Site Trends 260K New Users/Day 430M Total Images Millions of Songs Streamed/Day 1000’s of New MP3’s/Day 20 Million Comments Posted Media Metrix February 2006 Audience Rankings Source comScore Media Metrix February - 2006 23,566 #2 MySpace Hotmail Google Ebay MSN Yahoo 9,632 #4 29,508 #1 14,695 #3 7,329 #5 6,812 #6 Pageviews in ‘000s Internet Rank
  • 7. MySpace.com Quick Facts Infrastructure and Application Footprint 3 Internet Data Centers Server Breakdown 2682 Web and 650 Database Servers 90 Cache Servers 16gb RAM 650 Dart servers 60 DB Servers 150 Media servers 3000 disks in SAN architecture Egress Management 17,000 mb/s bandwidth 15,000 mb/s on CDN
  • 8. MySpace.com Growing up in the Internet World
  • 9. 0 users The beginning Two tiered architecture Single Database Load balanced web servers Great for rapid development Less complexity means faster time to market and less operational costs Works for small to medium sized websites, not big ones 0 Users
  • 10. 500k Users A Single database is not enough Max out a single database Split reads and writes across separate databases Use transactional replication so multiple databases can service reads 500k Users
  • 11. 1 Million Vertical partitioning Transactional replication doesn’t work for all workloads and data types Use a combination of Vertical Partitioning and replication 1M Users
  • 12. 2 Million SAN Start to reconsider SCSI arrays for the long-term SCSI arrays have good performance but reliability issues SANS provide better performance, uptime, and redundancy Move to a clarion and enjoy better these benefits 2M Users
  • 13. 3 Million Horizontal partitioning Vertical Partitions see performance problems Decide we need to re-architect the database Horizontal partitioning is the answer but is difficult to do while in production 3M Users
  • 14. Horizontal Partitioning All features reside on a single database server Data is partitioned by user ID Some data cannot be partitioned especially on a social networking site 3M Users
  • 15. 5 Million Network bottlenecks Various areas of the network become saturated Gig uplinks are maxed out Switch to Autonomous network and BGP Get multiple gig links and 10G links Load balancer is maxed out “ Must load balance the load balancers” Use DNS 5M Users
  • 16. 7 Million Site dependencies Separating features on the front end isolates potential bottlenecks Using subdomains is easiest way 7M Users
  • 17. 10 Million Scalable storage Trying to partition storage on the backend is time consuming and inefficient Maxing out SANs is very costly We realize scalable storage is key 10M Users
  • 18. 15 Million DB’s versus Caching Databases still having perf issues Databases are expensive Have a lot of transactional overhead Caching tier High speed cache is perfect for reads LRU algorithm is self managing Drastically reduces database load
  • 19. MySpace Where we are today
  • 20. Upcoming Technology Enablers What’s Next for Microsoft.com and MySpace.com?
  • 21. SQL Server 2005 Product technology enablers Peer-To-Peer Replication System & Data Center Autonomy Zero “perceived” Application Downtime from Consumers Eliminates Single Point of Failure for R/W Databases Mirroring (SP1) Targeting Replacement of Log Shipping Fail-Over pairs 3 Systems in TAP Program (Technet, Learning & Genuine) Reduced Failover Downtime Log Shipping: 5-15min Avg Mirroring < 1min (planned) Table Partitioning Reduced Storage Costs Scale Up at Lower Costs
  • 22. MySpace Scaling SQL Server V1: Single Instance – < 1 Million Users Single SQL Server Instance Supports All Users and Features V2: Single Instance Replicating to Read Only Full Copies < 2 Million Users Single server handles all write transactions, read transactions spread across multiple transactional replication copies V3: Vertical Partitioning - < 4 Million Users Each Feature/Page of the site on its own SQL Server
  • 23. MySpace Scaling SQL Server V4: Horizontal Partitioning - < 8 Million Users All features/pages brought back to single database schema Standard schema across all databases User ranges partitioned across databases V5: Horizontally Partitioned Core with Replicated Content, Vertically Partitioned Features Databases, “Shared Content” Databases - > 8 Million Users Primary Myspace schema exists across large farm of servers Small amounts of content replicated to all horizontally partitioned servers to allow for features spanning all user ranges V6: Migration to SQL Server 2005 - >26 Million Users
  • 24. SQL Server 2005 64 bit Memory Pressure under 4GB 32 Limit Servers loaded with 32Gigs of RAM <4 Gig Addressable to the memory pools we were stressing Manifestations Connection Timeouts Servers going “dark”, requiring restart Rejected Connections Problem Eliminated on 64bit Arch Connection/Sort memory pools now able to address all 32Gigs of RAM
  • 25. Virtualizing Storage What is it? Software layer between your disks & hosts Advantages Provisioning is very simple, makes capacity planning more predictable Much better performance Can easily add more capacity to a LUN What do we use? 3par 14 week bake off
  • 26. Longhorn And IIS 7.0 Product technology enablers UNC Content Store Simplified Content Mgmt Reduced Disk Footprint File Replication (DC to DC) Latent/Long links improved 80X (10Mbps vs 850Mbps) Enabler of Geo-Hosting Options Centralized IIS Config’s Copy “Host-Host” capability Eliminate complex scripting of meta-base & config’s Dynamic Content Compression Further reduced Egress Improved Web Perf Delivery
  • 27. IIS 7.0 Failed Request Tracing
  • 28. Geo-Targeting Solutions Demographic management Objective – Enable Targeted Release of App’s and Content Avoid demographic support spikes and further align to marketing campaigns Microsoft Confidential. © 2006 Microsoft Corporation. All rights reserved. This presentation is for internal Microsoft use only. Sensitivity to Time/Frequency of customer online experiences Improve ability to reach last 30% of client population
  • 30. © 2006 Microsoft Corporation. All rights reserved. This presentation is for informational purposes only. Microsoft makes no warranties, express or implied, in this summary.
  • 31.