The experiences of migrating a large scale, high performance healthcare network
Larry Williams, Corporate Manager, Partners HealthCare
In the next half hour…
  - Partners Healthcare System overview
  - Caché platform architecture & metrics
  - The need to migrate
  - Phased migration approach
  - Benchmark testing and results
  - Discoveries and production enhancements
  - Post database migration results
  - Future milestones
Partners Healthcare System
  - Founded in 1994
      - Brigham & Women’s Hospital
      - Massachusetts General Hospital
  - Now includes:
      - Community physician network (1200 + 3500 MDs) PCHi
      - 3 community hospitals
      - 2 rehab hospitals
      - 3 specialty institutions
  - Enterprise-wide Information Systems
      - 1100 employees
      - Annual budget FY05 approximately $160 million
Anchor Hospitals & Airport (map: BWH, MGH, Logan Airport; distance labels 10 km and 6 km)
Acute Care Hospitals (map: MGH, BWH, Newton-Wellesley; Community Physician Practices)
Partners Domain Devices
  - 32,000 desktops
  - 12,000 printers
  - 1,450 servers
  - ~30,000 other devices
  - Diagram labels: Internet, Firewall, Closely Managed, Assumed Managed
Current Mixed-Mode Production Architecture
Enterprise Integration
  - Over 30% are to and from Caché database

  Year   Est. Annual Transactions   Daily Average   Change from prior year
  2007   1,330,962,017              4,659,035       37%
  2006   1,240,712,044              3,399,211       40%
  2005     887,649,802              2,431,917       45%
  2004     610,833,080              1,673,515

  # of Interfaces (in the order listed on the slide): 196, 170, 192, 167
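The "change from prior year" column can be checked against the daily averages on the slide. The short Python sketch below recomputes it; the function and variable names are illustrative only and do not come from the presentation.

```python
# Daily average transaction counts as listed on the Enterprise Integration slide.
daily_average = {
    2004: 1_673_515,
    2005: 2_431_917,
    2006: 3_399_211,
    2007: 4_659_035,
}


def yearly_change(series):
    """Return {year: percent change vs. the prior year}, rounded to whole percent."""
    years = sorted(series)
    return {
        year: round((series[year] / series[prev] - 1) * 100)
        for prev, year in zip(years, years[1:])
    }


changes = yearly_change(daily_average)
for year, pct in changes.items():
    print(f"{year}: {pct}% over {year - 1}")
```

Computed this way, the percentages on the slide follow from the daily averages rather than from the estimated annual totals.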
Integration Components
Annual Database Growth Rate
Database Utilization (chart: average database references per day, in billions)
The Need to Migrate - Availability (chart: monthly downtime, current state vs. business need)
Additional Business Requirements
  - Increase availability and reliability
      - Decrease database risk from 5 single points of failure
      - More robust hardware and OS
      - Far fewer servers and OS instances to manage
      - Clustering and automated failover
      - Reduce monthly maintenance needs; updates once or twice per year
  - Improve performance
      - 64-bit OS, more memory for cache
      - Caché 5.0.20 to Caché 2008.1, significantly improved ECP performance
  - Increase scalability
      - 91 terabytes available on EMC SAN DMX3
      - On-demand addition of processor cores
Caché Migration Decision Making Process
  - Only considered first-tier vendors and support (IBM, HP)
  - HP assumed much more risk with Professional Services
  - Existing HP business yields more leverage & visibility with regional office
  - More headroom in HP configuration
  - Price was not a distinguishing factor
Phased migration approach
  - Proof of Concept (benchmark testing): completed 10/15/07
  - Phase 1 – Database tier: completed 4/14/08
  - Phase 2 – Application tier: Big Bang migration 12/14/08 (includes Caché 2008.1 UNIX platform upgrade)
  - Phase 3 – Disaster Recovery: now implemented as part of Phase 2, leveraging HP Blade servers
UNIX Database Tier Benchmark Environment
UNIX Application Tier Benchmark Environment
Benchmark Load Testing Results
  - Goals
      - Simulate current Production user counts & transaction loads
      - Verify support for load increases up to 3x
  - Benchmark Environment
      - Isolated LAN, new DMX3 SAN
      - 20 new Windows blade servers (10 app servers, 10 script ‘players’)
      - Scripts for 8 apps (represent heaviest use, Web/Telnet/VB apps)
      - 2 batch jobs (screensaver simulation, NullGen LMR functions)
  - Conclusions
      - Able to simulate production load, 1.5x and 3x load
      - 2 HP rx8640 can handle growth projections

  Metric                             Production peak      Benchmark “paced”   Benchmark full
                                     (8/21, 11:20 am)     script load         script load
  Database Global Refs / sec.        35,000               30,000              135,000
  LMR transactions (5 min. period)   11,806               40,000              40,000
  LMR avg Caché app time (in sec.)   0.32                 0.15                0.66
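To relate the benchmark figures to the "up to 3x" goal, the full script load can be expressed as a multiple of the production peak. The sketch below does that arithmetic in Python with the numbers from the results slide; the dictionary keys and function name are made up for this illustration.

```python
# Figures from the benchmark results slide.
production_peak = {
    "global_refs_per_sec": 35_000,
    "lmr_transactions_5min": 11_806,
}
full_script_load = {
    "global_refs_per_sec": 135_000,
    "lmr_transactions_5min": 40_000,
}


def load_multiple(benchmark, baseline):
    """Return each benchmark metric as a multiple of the baseline metric."""
    return {key: round(benchmark[key] / baseline[key], 2) for key in baseline}


multiples = load_multiple(full_script_load, production_peak)
for metric, factor in multiples.items():
    print(f"{metric}: {factor}x production peak")
```

Both multiples come out above 3, which is consistent with the slide's conclusion that a 3x load could be simulated.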
Design and Configuration Considerations
  - Database configuration simulation testing
      - 1 to 5 Caché database instances were assessed
      - 1 vs. 5 ECP channels per Caché instance were assessed
      - Number of active cores was assessed (4 active, 2 reserved)
  - Results and unexpected discoveries
      - Identified 5 Caché database instances as the optimal design configuration
          - Maintains the same data distribution across 5 DB instances
      - Journal synch bottleneck was the biggest issue
          - High Transaction Journal daemon maintains ECP durability to guarantee transactions (1 per Caché instance)
      - Determined 1 ECP channel per instance to be optimal
          - Additional channels did not improve throughput; there is still only 1 Journal daemon
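The finding that extra ECP channels did not help while extra instances did can be pictured with a deliberately simple bottleneck model: each instance's throughput is capped by its single journal daemon, whatever the channel count. The Python sketch below is only an illustration of that reasoning. The rates are invented placeholders, not measurements from the benchmark, and the `min` model is an assumption rather than a description of how Caché works internally.

```python
def instance_throughput(ecp_channels, per_channel_rate, journal_sync_rate):
    """Toy model: channels add capacity until the single journal daemon saturates."""
    return min(ecp_channels * per_channel_rate, journal_sync_rate)


def total_throughput(instances, ecp_channels, per_channel_rate, journal_sync_rate):
    """Each instance has its own journal daemon, so instances scale independently."""
    return instances * instance_throughput(
        ecp_channels, per_channel_rate, journal_sync_rate
    )


# Hypothetical rates (operations per second), chosen only to show the shape.
PER_CHANNEL = 10_000
JOURNAL_SYNC = 8_000

one_channel = instance_throughput(1, PER_CHANNEL, JOURNAL_SYNC)
five_channels = instance_throughput(5, PER_CHANNEL, JOURNAL_SYNC)
five_instances = total_throughput(5, 1, PER_CHANNEL, JOURNAL_SYNC)

print(one_channel, five_channels, five_instances)
```

In this model, going from 1 to 5 channels leaves a single instance unchanged, while going from 1 to 5 instances multiplies the ceiling because each instance brings its own journal daemon.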
Benchmark Discoveries Led to Production Improvements (the rules have changed)
  - References to undefined globals using $Data and $Get
      - These commands require a network round trip
  - Use of $increment
      - Each call to $I requires a network round trip
  - Excessive use of Caché locks
      - Forces more than 1 round trip
  - Use of large strings
      - Strings that require more than 3900–4000 bytes to represent the string value are big strings and are never cached on the ECP client
  - Lesson learned: each trip to the database server results in overhead caused by a Journal Synch. Increasing the Journal Synch rate causes bottlenecks in the ECP channel, which increases the risk of long transactions.
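The cost of one round trip per $increment call can be illustrated with a small simulation. The Python sketch below is a toy model, not Caché or ECP code: `CounterServer` stands in for the database server and simply counts simulated round trips, and `BlockIdAllocator` shows one common mitigation, reserving a block of ids per trip. The presentation does not say which remedies Partners applied, so the block-reservation approach and all names here are assumptions.

```python
class CounterServer:
    """Stands in for the database server; counts simulated round trips."""

    def __init__(self):
        self.value = 0
        self.round_trips = 0

    def increment(self, by=1):
        # Every call models one network round trip to the server.
        self.round_trips += 1
        self.value += by
        return self.value


class BlockIdAllocator:
    """Client-side allocator that reserves ids from the server in blocks."""

    def __init__(self, server, block_size):
        self.server = server
        self.block_size = block_size
        self.next_id = 1
        self.last_id = 0  # Empty range forces a fetch on first use.

    def next(self):
        if self.next_id > self.last_id:
            # One round trip reserves block_size consecutive ids.
            top = self.server.increment(self.block_size)
            self.next_id = top - self.block_size + 1
            self.last_id = top
        value = self.next_id
        self.next_id += 1
        return value


# One round trip per id.
naive_server = CounterServer()
naive_ids = [naive_server.increment() for _ in range(1000)]

# One round trip per block of 100 ids.
batched_server = CounterServer()
allocator = BlockIdAllocator(batched_server, block_size=100)
batched_ids = [allocator.next() for _ in range(1000)]

print(naive_server.round_trips, batched_server.round_trips)
```

Both approaches hand out the same 1000 ids, but the batched version makes 10 simulated trips instead of 1000. The trade-off is that ids in a reserved but unused block are lost if the client stops, so the sequence can have gaps.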
Post UNIX DB migration - 50% Reduction in Outlier Transactions
Monthly Average Caché Web Transaction Time
Unprecedented Growth - The LMR
Phased/Parallel Migration Approach
Parallel Migration Testing and Certification Effort (diagram)
  - Caché environments: 5.0 (WIN) and 2008.1 (UNIX)
  - Environment labels: DEV, QA, PROD, VC/m
  - Workflow labels: check-out & check-in, auto copy on check-in, developer test, promote, auto copy on promote, analyst cert, promote, production release
Application Models (diagram: old vs. new)
  - Tiers shown: browser client, VB client, .Net client, web server, .Net server, Caché
  - Connectors shown: WebLink, Vism.ocx (MServices), Vism Managed Provider, Managed Provider, Caché Web Services
  - Noted concerns: scalability/connection pooling, robustness/error handling