Swarming: How a new approach to support can save
DevOps teams from 3rd-line ticket hell
Jon Hall
Principal Product Manager, BMC
@jonhall_
DevOps Summit Amsterdam 2018
The people that excel and create the most value
are the ones that step outside their box
Hank Barnes, Gartner: “Playing Outside Your Box” May 2018.
Classic “Tiered” Support Structure
[Diagram: one Level 1 Support box, three Level 2 Support boxes, four Level 3 Specialists boxes]
@jonhall_
Deconstructing the “Tiered” Support Structure
[Diagram: the same three tiers, with two “Escalation” labels]
@jonhall_
Issues may spend time here… …when the answer is here… …or here.
[Diagram: the same three tiers, annotated with the phrases above]
@jonhall_
When tickets eventually escalate… …they frequently bounce back for clarification
[Diagram: Level 1 Support, Level 2 Support, Level 3 Specialists]
@jonhall_
The system encourages “heroes” (not in a good way)
[Diagram: Level 1 Support, Level 2 Support and Level 3 Specialists, shown twice, plus a Subject Matter Expert]
@jonhall_
Swarming defined
Removing the tiers of support, and calling on the collective expertise of a “swarm” of analysts.

Tiered support            Swarming
Siloes and hierarchies    Network
Directed                  Collaborative
Linear, rigid             Dynamic, loopy
Measured on activity      Measured by value creation
@jonhall_
BMC Customer Support
24 hours, 365 days.
500 specialists, 3000 years of experience
200,000+ incidents addressed each year
10,000+ customers
Remedy, Control-M, TrueSight, etc…
Communication skills are a hiring focus
[Map legend: BMC Contact Centres; Support Centres; Support Centres co-located with R&D]
[Map locations: Pleasanton/Sunnyvale, Houston, Austin, McLean/Herndon, Lexington, Sao Paulo, Buenos Aires, Spain, Dublin, Winnersh, Amsterdam, Paris, Tel Hai, Pune, Singapore, Shanghai, Beijing, Seoul, Dalian, Tokyo, Melbourne]
[Further labels: Houston, TX, USA; Dublin, Ireland; Dalian, China]
@jonhall_
Swarming at BMC
Dispatch Swarm, Severity 1 Swarm, Backlog Swarm
@jonhall_
Swarming Process at BMC
[Diagram: Prioritise; Severity 1 Swarm]
@jonhall_
• Rapid responders
• Three agents, one week rotation
• Primary focus: Provide immediate response, resolve ASAP
Swarm lead
Communications
Other members
Research, coordinate, test
Severity 1 Swarm
@jonhall_
Swarming Process at BMC
[Diagram: Prioritise; Severity 1 Swarm; Local Dispatch Swarm, annotated “30% solved here”]
@jonhall_
• “Cherry pickers”
• Meet every 60-90 minutes
• Primary focus: Can new tickets be resolved immediately?
• Also: Validation of ticket details before assignment to specialists
Experienced analyst Less-experienced analyst
Dispatch Swarm
@jonhall_
Swarming Process at BMC
[Diagram: Prioritise; Severity 1 Swarm; Local Dispatch Swarm; Local Product-Line Support Teams]
@jonhall_
Swarming Process at BMC
[Diagram: the same elements shown twice: Prioritise; Severity 1 Swarm; Local Dispatch Swarm; Local Product Line Support Teams]
@jonhall_
Swarming Process at BMC
[Diagram: two groups of Local Product Line Support Teams and three Backlog Swarms]
@jonhall_
• Global fixers of troublesome tickets
• Meet regularly (often multiple times daily)
• Primary focus: Challenging tickets brought by local support teams
• Replaces inter-team and individual reassignments
Experienced analysts R&D Engineers
Backlog Swarms
@jonhall_
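The flow on the “Swarming Process at BMC” slides can be sketched in code. This is a minimal illustration, not BMC's tooling: the `Ticket` fields, the `Stage` names and the `route` function are all hypothetical. It assumes that Severity 1 tickets go straight to the Severity 1 swarm, that everything else is first seen by the local dispatch swarm, and that only tickets a product-line team cannot resolve are taken to a backlog swarm.

```python
from dataclasses import dataclass
from enum import Enum
from typing import List


class Stage(Enum):
    SEVERITY_1_SWARM = "Severity 1 swarm"
    DISPATCH_SWARM = "Local dispatch swarm"
    PRODUCT_LINE_TEAM = "Local product-line support team"
    BACKLOG_SWARM = "Backlog swarm"


@dataclass
class Ticket:
    summary: str
    severity: int                  # assumed scale: 1 is the most urgent
    quick_fix_known: bool = False  # hypothetical flag: dispatch swarm can resolve it at once
    stuck: bool = False            # hypothetical flag: the owning team needs wider help


def route(ticket: Ticket) -> List[Stage]:
    """Return the stages a ticket visits, in order (illustrative only)."""
    if ticket.severity == 1:
        # Rapid responders take the ticket and try to resolve it ASAP.
        return [Stage.SEVERITY_1_SWARM]

    path = [Stage.DISPATCH_SWARM]
    if ticket.quick_fix_known:
        # Resolved by the dispatch swarm; no onward assignment needed.
        return path

    # Details are validated by the dispatch swarm, then assigned to specialists.
    path.append(Stage.PRODUCT_LINE_TEAM)
    if ticket.stuck:
        # Brought to a backlog swarm instead of being reassigned between teams.
        path.append(Stage.BACKLOG_SWARM)
    return path


if __name__ == "__main__":
    examples = (
        Ticket("Production outage", severity=1),
        Ticket("Known configuration issue", severity=3, quick_fix_known=True),
        Ticket("Intermittent failure", severity=2, stuck=True),
    )
    for t in examples:
        print(t.summary, "->", " -> ".join(s.value for s in route(t)))
```

The point of the sketch is that a ticket only ever moves forward to a group, never back and forth between individuals, which is how the slides describe the ban on “ticket tennis”.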
• Guidelines, not rules
• Metrics had to change (Swarming breaks traditional ones!)
• Supported people who became newly customer facing
• Banned ticket tennis and direct escalations to experts
• New tooling practices, particularly mobile and chat
Making it work at BMC Customer Support
@jonhall_
• 25% median resolution time improvement
• Customer satisfaction up 8 points
• More issues closed in <2 days
• Significant reduction in backlogs
• Halved on-boarding time
• Freed resources for innovative offerings
Results at BMC
@jonhall_
“Swarming works better than conventional processes.
I am able to get multiple experiences from swarm attendees
of similar cases they have worked, and what they did.
If there are no experiences, then it’s perspectives: Decades
of experience, providing guidance of how to troubleshoot”
- Senior Support Analyst, BMC
@jonhall_
“I have probably doubled my knowledge of the
products in a year because of Swarming,
and I have been here a long time”
- Senior Support Analyst, BMC
@jonhall_
Ford Connected Vehicles Division
Challenge: how to scale support from 275,000 cars, to
180+ million new vehicles every year.
• “You’ve got to go where people are” – Chad Jolly, Developer
• Tiered support would mean 4-5 days to get to the right team
• First Responders instigate and coordinate ad-hoc swarms for big issues
• Other teams have 1 person on rotation for swarming
• Swarm may get bigger over time as necessary and might include
engineers from Amazon, Microsoft, etc.
@jonhall_
• Costs may increase even as other metrics improve
• Difficult to evaluate individual contribution
• Organizing across time zones may be a challenge
• A few individuals sometimes dominate
• Finding the right people for a swarm is difficult
It’s not all positive!
Problems reported by some Swarming adopters
@jonhall_
“IT organizations that have tried to custom-adjust
current tools to meet DevOps practices have a
failure rate of 80%”
DevOps and the Cost of Downtime: Fortune 1000 Best Practice Metrics Quantified (IDC, 2014)
So… what does this all have to do with DevOps?
@jonhall_
• New services and applications suddenly appear
• More home-grown software
• Developers work in different tools
• New kinds of customer, especially external
DevOps challenges ServiceDesk orthodoxies…
@jonhall_
• Provision of support at industrial scale
• Adaptation to life “on call”
• Multi-cloud; Blend of old and new systems
• Customer/business context
• What to prioritize? Fix or build?
…but enterprise realities challenge DevOps
DevOps challenges ServiceDesk orthodoxies…
@jonhall_
“The enterprise space doesn’t move slowly
because they’re stupid, or they hate technology.
It’s because they have users”
—Luke Kanies, Puppet Founder, Configuration Management Camp 2015, Belgium.
@jonhall_
• Work-in-progress queues
• Asynchronous communication
• Single role teams
• Individual over-exposure
• Lack of knowledge sharing
How to annoy a DevOps practitioner
@jonhall_
Uh oh…
[Diagram: the classic tiered support structure again, with one Level 1 Support box, three Level 2 Support boxes and four Level 3 Specialists boxes]
@jonhall_
Swarming aligns really well to DevOps
• Autonomy and self-organisation
• Knowledge transfer and skills development
• ChatOps, not email
• Prevention of accumulation of queued work
• Protection of individuals from burnout
@jonhall_
• Cynefin (Pronounced “kuh-nev-in”)
• Developed by Dave Snowden at IBM in 1999
• Taken independent in 2005
• “Signifies the multiple factors in our environment and
our experience that influence us in ways we can
never understand”
Exploring further: Swarming to deliver Cynefin
@jonhall_
@jonhall_
• Obvious and Complicated domains:
• Repeating relationship between cause and effect
• With Complicated you need to do analysis to find
that relationship
• Complex domain:
• Understanding the problem requires
experimentation and analysis.
• May, over time, be able to move to Complicated
• Chaotic domain:
• Dramatic and unconstrained
• Focus on damage limitation, try to move to
another domain
“Obvious” Domain
@jonhall_
• “Sense, Categorise, Respond”
• Template/knowledge-driven resolution
• Self service
“Complicated” Domain
@jonhall_
• “Sense, Analyse, Respond”
• Dispatch-type swarm – pair agents with varied experience
• Capture detailed knowledge for organizational learning
“Complex” Domain
• “Probe, Sense, Respond”
@jonhall_
“Chaotic” Domain
• “Act, Sense, Respond”
• Sub-swarms
• Deal with the acute situation
• Try to discover sufficient
information to move to complex
@jonhall_
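The four domain slides above amount to a lookup from Cynefin domain to a decision sequence and a set of support ideas. A minimal sketch of that lookup follows; the `Playbook` type, the `CYNEFIN_PLAYBOOKS` table and the `playbook_for` function are hypothetical names introduced here, and the entries only restate what the slides list. The “Complex” slide gives a sequence but no support approach, so that entry is left empty rather than filled in.

```python
from typing import Dict, NamedTuple, Tuple


class Playbook(NamedTuple):
    sequence: Tuple[str, str, str]  # decision sequence quoted on the slide
    support_ideas: Tuple[str, ...]  # support approach listed on the slide, if any


CYNEFIN_PLAYBOOKS: Dict[str, Playbook] = {
    "obvious": Playbook(
        ("Sense", "Categorise", "Respond"),
        ("Template/knowledge-driven resolution", "Self service"),
    ),
    "complicated": Playbook(
        ("Sense", "Analyse", "Respond"),
        (
            "Dispatch-type swarm pairing agents with varied experience",
            "Capture detailed knowledge for organizational learning",
        ),
    ),
    # The slide names the sequence only; no support approach is listed.
    "complex": Playbook(("Probe", "Sense", "Respond"), ()),
    "chaotic": Playbook(
        ("Act", "Sense", "Respond"),
        (
            "Sub-swarms",
            "Deal with the acute situation",
            "Try to discover sufficient information to move to complex",
        ),
    ),
}


def playbook_for(domain: str) -> Playbook:
    """Look up a domain by name, ignoring case and surrounding whitespace."""
    key = domain.strip().lower()
    try:
        return CYNEFIN_PLAYBOOKS[key]
    except KeyError:
        raise ValueError(f"Unknown Cynefin domain: {domain!r}") from None


if __name__ == "__main__":
    for name in ("Obvious", "Complicated", "Complex", "Chaotic"):
        print(name, "->", ", ".join(playbook_for(name).sequence))
```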
• Service Management needs to evolve its practices and
tooling to better position its value to DevOps teams. We
need your help to do this right.
• We’d like to listen to how support is affecting your role,
as your impact grows in your enterprise.
• You are agents of change in enterprises, with a good
opportunity to influence thinking.
What next?
@jonhall_
medium.com/@jonhall_
serviceinnovation.org/intelligent-swarming
Some more information


Editor's Notes

  • #28: “In a company of 4000 people, things can get out of hand really fast if you don't have customer context” “If you're dropped in the middle of something, how did you get here?” “The person who is on call at 4am needs to know who has been doing what”