SlideShare a Scribd company logo
What To Do When It All Goes So WrongDavid LevyAdventuresInSql.comSQL Saturday #67 Chicago
More than 11 years in ITSQL Server DBA for over 3 yearsPrevious Life as DeveloperBloggerhttp://adventuresinsql.comSyndicated on SQLServerCentral.comSyndicated on SQLServerPedia.com@dave_levy on TwitterAbout Me
Peak Time of Peak Sales DayTypical Hourly Sales $100K/HROrder Entry Screen is Locked UpUsers report Slowness InitiallyNow the “Sales Center” Application is Just “Clocking”EMERGENCY!
Let Everyone Know There is a ProblemPrevent Duplicated EffortsAllows Others to Speak UpRecent ChangesRelated IssuesCommunicatehttp://www.freedigitalphotos.net/images/view_photog.php?photogid=1983
Send Up a FlareSend to an IT Only Distribution GroupKeep the Subject Line GeneralProvide Broad Overview Including:Systems ImpactedMajor Symptoms Including Error MessagesNumber of People ImpactedAny Location Specific InformationCommunicate
What Resources Do You need?Subject Matter ExpertsSpecialized EquipmentCommunicate
Never Assign BlameOnly State FactsCommunicate
To:		IT EmergenciesSubject:	Sales Center IssuesSales Center Users are reporting that the Order Entry screen has quit responding. We are currently investigating the issue with the Sales Center Development Team. We will provide updates as we know more.Communicate
What To Do When It All Goes So Wrong
What Are the Symptoms?What Locations are Involved?Collect
What Systems are Involved?SQL ServerAS400MainframeWeb FarmMajor Network Components like Load BalancersCollect
What Has Changed?Look at Change Control CalendarTalk to Primary On-Calls for Related SystemsCollect
Anything in the Logs?Windows LogsApplication Specific LogsCustom Exception Handling SystemsCollect
What are Performance Indicators Showing?PerfmonSQL Wait StatsThird-party toolsCollect
Analyze Collected InformationAre There Any Obvious Signs of Trouble?Can the Problem be Linked to a Change?Can Any Patterns be Identified?Process
Prove It Is Your IssueShows HumilityShows Respect for Everyone Else’s TimeAvoid Appearing ArrogantProcess
Prove It Is Your IssueConstruct Tests to Prove Theories in Order of Likelihood Until Problem Proven or Theories ExhaustedFaster than arguing about what it is notHow can you know it is not your issue?Process
List Potential ActionsRank by effort, confidence, level of riskDevelop action plans for best options and re-rankEach potential action should have a rollback planProcess
Define MeasuresWhat will indicate things have gotten better?Adding this index will reduce Disk IO by 10 million reads per secondThe execution time of query x will drop from 6 minutes to 50 millisecondsProcess
Define MeasuresWhat will indicate things have gotten worse?Disk IO may go upThe execution time of query x may go upAdding this index may slow inserts from the order upload processProcess
Communicate Your IntentionsMake the ChangeFollow a written planMake a single changeA single person should make the changeDocument any additional steps takenStart Over by Collecting More DataRespond
Signs You Need to Convene A War RoomHaving Trouble Finding Anything Wrong30 Minutes Without ProgressAn Issue Appears to Span Multiple SystemsHaving Difficulty Getting People EngagedThe War Room
Get Everyone in a RoomNo Changes Made Outside the RoomNo HeroesWatch out for people doing a lot of typingAvoid changes that take more than a few minutesHave a Call in Number for Remote CoworkersThe War Room
Have a Technology KitOld SwitchPatch CordsMice + Mouse PadsPower StripsThe War Room
Monitor Your Guest List1-2 Representatives From Each TeamTry to Keep Management OutWatch for Disruptive PeopleThe War Room
To:		IT EmergenciesSubject:	Sales Center IssuesWe are convening a war room for the Sales Center issue. Everyone working on the issue please meet in the North Conference Room. Remote/WFH coworkers should dial into the conference bridge 888-888-1234, participant code:1234.Communicate
What To Do When It All Goes So Wrong
White Board the IssueEvery System Gets Own ColumnWrite All Facts on White BoardClosed Items Get Crossed Out Not ErasedInclude a Resolution for Each Closed ItemThe War Room
Share the FloorLikely Issue Owner Has the LeadMake Sure Everyone is HeardContributing Often Involves Staying Out of the WayDon’t Be Afraid to Fade Back and Run The WhiteboardThe War Room
Never Call “Not-It” and LeaveNot HelpfulYou May be WrongAppears ArrogantThe War Room
Keep an Eye On TimeProvide Regular Updates to ManagementBring in Food Around Meal TimesRaises SpiritsBrings in More People to HelpThe War Room
To:		IT EmergenciesSubject:	Sales Center Issues UpdateThe Sales Center war room is still going. We are currently looking into a driver issue with IBM. All necessary resources have been engaged.Communicate
Keep People in ReserveEach Team Should Divide up the DayRotate People In and OutSend Someone Home Early to Come in EarlyThe War Room
Closing OutCommunicate ResolutionCapture Contents of WhiteboardClean Up RoomThe War Room
To:		IT EmergenciesSubject:	Sales Center Issues ResolvedThe Sales Center issue has been resolved. The issue was caused by a patch that was applied over the weekend. Now that it has been backed out everything has returned to normal.Communicate
?Questions?
What To Do When It All Goes So Wrong

More Related Content

PDF
How To Run a 5 Whys (With Humans, Not Robots)
PPTX
Reporting principles for every QA manager
PDF
Lecture 3
PDF
SDS-CAS
PPT
Iceland Volcano 2010
PPT
التعامل الآمن مع المواد الكيميائية Safe handling of chemicals
PPSX
انواع و علامات المواد الخطرة
PPT
DBA Best Practices.ppt
How To Run a 5 Whys (With Humans, Not Robots)
Reporting principles for every QA manager
Lecture 3
SDS-CAS
Iceland Volcano 2010
التعامل الآمن مع المواد الكيميائية Safe handling of chemicals
انواع و علامات المواد الخطرة
DBA Best Practices.ppt

Similar to What To Do When It All Goes So Wrong (20)

PPTX
How to Immediately Become a Better Closer
PPTX
The Lean Startup fbFund Edition
PPTX
The Lean Startup EA edition
PDF
UX Design Heuristics, aka "what makes an interaction good"?
PPT
Bally chohan support (Bally Chohan Bally ) | Bally chohan | Bally chohan support
KEY
Five whys summary
PPT
7 Secrets To A Successful Social Networking Launch
PPTX
2010 02 19 the lean startup - webstock 2010
PPT
Teleconference #2: Getting Down to Business - Selling and Daily Activity 0715
PDF
Pitch the way VCs think
PPTX
2010 10 28 the lean startup at ucsd
PPTX
The Lean Startup at Code for America fellows
PDF
Communication and Testing: Why You Have Been Wrong All Along!
PPT
Confurrent SWOmaha
PPTX
How to Build an Attribution Solution in 1 Day
PPTX
How to Build an Attribution Solution in 1 Day
PPT
Get Faster - While You're Getting Better
PPT
Am Fam Telecon2 Getting Down To Business Selling And Daily Activity 071509 ...
PPS
Off-Hours Critical Issue Escalation
PDF
Group Presentation on Bussiness Intelligence
How to Immediately Become a Better Closer
The Lean Startup fbFund Edition
The Lean Startup EA edition
UX Design Heuristics, aka "what makes an interaction good"?
Bally chohan support (Bally Chohan Bally ) | Bally chohan | Bally chohan support
Five whys summary
7 Secrets To A Successful Social Networking Launch
2010 02 19 the lean startup - webstock 2010
Teleconference #2: Getting Down to Business - Selling and Daily Activity 0715
Pitch the way VCs think
2010 10 28 the lean startup at ucsd
The Lean Startup at Code for America fellows
Communication and Testing: Why You Have Been Wrong All Along!
Confurrent SWOmaha
How to Build an Attribution Solution in 1 Day
How to Build an Attribution Solution in 1 Day
Get Faster - While You're Getting Better
Am Fam Telecon2 Getting Down To Business Selling And Daily Activity 071509 ...
Off-Hours Critical Issue Escalation
Group Presentation on Bussiness Intelligence
Ad

Recently uploaded (20)

PDF
Unlocking AI with Model Context Protocol (MCP)
PDF
Spectral efficient network and resource selection model in 5G networks
PDF
Advanced methodologies resolving dimensionality complications for autism neur...
PDF
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
PDF
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
PDF
Dropbox Q2 2025 Financial Results & Investor Presentation
PPTX
Digital-Transformation-Roadmap-for-Companies.pptx
PDF
Encapsulation theory and applications.pdf
PPT
“AI and Expert System Decision Support & Business Intelligence Systems”
PDF
Mobile App Security Testing_ A Comprehensive Guide.pdf
PDF
KodekX | Application Modernization Development
PPTX
sap open course for s4hana steps from ECC to s4
PDF
cuic standard and advanced reporting.pdf
PDF
Reach Out and Touch Someone: Haptics and Empathic Computing
PDF
Encapsulation_ Review paper, used for researhc scholars
PDF
Chapter 3 Spatial Domain Image Processing.pdf
PPTX
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
PDF
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
PDF
NewMind AI Weekly Chronicles - August'25 Week I
Unlocking AI with Model Context Protocol (MCP)
Spectral efficient network and resource selection model in 5G networks
Advanced methodologies resolving dimensionality complications for autism neur...
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
Dropbox Q2 2025 Financial Results & Investor Presentation
Digital-Transformation-Roadmap-for-Companies.pptx
Encapsulation theory and applications.pdf
“AI and Expert System Decision Support & Business Intelligence Systems”
Mobile App Security Testing_ A Comprehensive Guide.pdf
KodekX | Application Modernization Development
sap open course for s4hana steps from ECC to s4
cuic standard and advanced reporting.pdf
Reach Out and Touch Someone: Haptics and Empathic Computing
Encapsulation_ Review paper, used for researhc scholars
Chapter 3 Spatial Domain Image Processing.pdf
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
NewMind AI Weekly Chronicles - August'25 Week I
Ad

What To Do When It All Goes So Wrong

  • 1. What To Do When It All Goes So WrongDavid LevyAdventuresInSql.comSQL Saturday #67 Chicago
  • 2. More than 11 years in ITSQL Server DBA for over 3 yearsPrevious Life as DeveloperBloggerhttp://adventuresinsql.comSyndicated on SQLServerCentral.comSyndicated on SQLServerPedia.com@dave_levy on TwitterAbout Me
  • 3. Peak Time of Peak Sales DayTypical Hourly Sales $100K/HROrder Entry Screen is Locked UpUsers report Slowness InitiallyNow the “Sales Center” Application is Just “Clocking”EMERGENCY!
  • 4. Let Everyone Know There is a ProblemPrevent Duplicated EffortsAllows Others to Speak UpRecent ChangesRelated IssuesCommunicatehttp://www.freedigitalphotos.net/images/view_photog.php?photogid=1983
  • 5. Send Up a FlareSend to an IT Only Distribution GroupKeep the Subject Line GeneralProvide Broad Overview Including:Systems ImpactedMajor Symptoms Including Error MessagesNumber of People ImpactedAny Location Specific InformationCommunicate
  • 6. What Resources Do You need?Subject Matter ExpertsSpecialized EquipmentCommunicate
  • 7. Never Assign BlameOnly State FactsCommunicate
  • 8. To: IT EmergenciesSubject: Sales Center IssuesSales Center Users are reporting that the Order Entry screen has quit responding. We are currently investigating the issue with the Sales Center Development Team. We will provide updates as we know more.Communicate
  • 10. What Are the Symptoms?What Locations are Involved?Collect
  • 11. What Systems are Involved?SQL ServerAS400MainframeWeb FarmMajor Network Components like Load BalancersCollect
  • 12. What Has Changed?Look at Change Control CalendarTalk to Primary On-Calls for Related SystemsCollect
  • 13. Anything in the Logs?Windows LogsApplication Specific LogsCustom Exception Handling SystemsCollect
  • 14. What are Performance Indicators Showing?PerfmonSQL Wait StatsThird-party toolsCollect
  • 15. Analyze Collected InformationAre There Any Obvious Signs of Trouble?Can the Problem be Linked to a Change?Can Any Patterns be Identified?Process
  • 16. Prove It Is Your IssueShows HumilityShows Respect for Everyone Else’s TimeAvoid Appearing ArrogantProcess
  • 17. Prove It Is Your IssueConstruct Tests to Prove Theories in Order of Likelihood Until Problem Proven or Theories ExhaustedFaster than arguing about what it is notHow can you know it is not your issue?Process
  • 18. List Potential ActionsRank by effort, confidence, level of riskDevelop action plans for best options and re-rankEach potential action should have a rollback planProcess
  • 19. Define MeasuresWhat will indicate things have gotten better?Adding this index will reduce Disk IO by 10 million reads per secondThe execution time of query x will drop from 6 minutes to 50 millisecondsProcess
  • 20. Define MeasuresWhat will indicate things have gotten worse?Disk IO may go upThe execution time of query x may go upAdding this index may slow inserts from the order upload processProcess
  • 21. Communicate Your IntentionsMake the ChangeFollow a written planMake a single changeA single person should make the changeDocument any additional steps takenStart Over by Collecting More DataRespond
  • 22. Signs You Need to Convene A War RoomHaving Trouble Finding Anything Wrong30 Minutes Without ProgressAn Issue Appears to Span Multiple SystemsHaving Difficulty Getting People EngagedThe War Room
  • 23. Get Everyone in a RoomNo Changes Made Outside the RoomNo HeroesWatch out for people doing a lot of typingAvoid changes that take more than a few minutesHave a Call in Number for Remote CoworkersThe War Room
  • 24. Have a Technology KitOld SwitchPatch CordsMice + Mouse PadsPower StripsThe War Room
  • 25. Monitor Your Guest List1-2 Representatives From Each TeamTry to Keep Management OutWatch for Disruptive PeopleThe War Room
  • 26. To: IT EmergenciesSubject: Sales Center IssuesWe are convening a war room for the Sales Center issue. Everyone working on the issue please meet in the North Conference Room. Remote/WFH coworkers should dial into the conference bridge 888-888-1234, participant code:1234.Communicate
  • 28. White Board the IssueEvery System Gets Own ColumnWrite All Facts on White BoardClosed Items Get Crossed Out Not ErasedInclude a Resolution for Each Closed ItemThe War Room
  • 29. Share the FloorLikely Issue Owner Has the LeadMake Sure Everyone is HeardContributing Often Involves Staying Out of the WayDon’t Be Afraid to Fade Back and Run The WhiteboardThe War Room
  • 30. Never Call “Not-It” and LeaveNot HelpfulYou May be WrongAppears ArrogantThe War Room
  • 31. Keep an Eye On TimeProvide Regular Updates to ManagementBring in Food Around Meal TimesRaises SpiritsBrings in More People to HelpThe War Room
  • 32. To: IT EmergenciesSubject: Sales Center Issues UpdateThe Sales Center war room is still going. We are currently looking into a driver issue with IBM. All necessary resources have been engaged.Communicate
  • 33. Keep People in ReserveEach Team Should Divide up the DayRotate People In and OutSend Someone Home Early to Come in EarlyThe War Room
  • 34. Closing OutCommunicate ResolutionCapture Contents of WhiteboardClean Up RoomThe War Room
  • 35. To: IT EmergenciesSubject: Sales Center Issues ResolvedThe Sales Center issue has been resolved. The issue was caused by a patch that was applied over the weekend. Now that it has been backed out everything has returned to normal.Communicate