SlideShare a Scribd company logo
StackStorm
If-This-Than-That for DevOps
automation
© 2016 BROCADE COMMUNICATIONS SYSTEMS, INC.
Event-driven automation, DevOps way ~IoT時代の自動化、そのリアリティとは?~
@StackStorm@Stack_Storm
Agenda
What is StackStorm
Why we made it
Use cases
Why is it better than …
StackStorm: What is it?
© 2016 BROCADE COMMUNICATIONS SYSTEMS, INC.
Event-driven automation, DevOps way ~IoT時代の自動化、そのリアリティとは?~
StackStorm is like …
7
for DevOps
IFTTT,
for DevOps
8
cat /opt/stackstorm/packs/st2-demos/rules/demo.yaml
---
name: "sensu_crit_to_slack"
pack: "st2-demos"
description: "Post all critical alerts to the demo
enabled: true
trigger:
type: "sensu.event_handler"
criteria:
trigger.check.status:
pattern: 2
type: "equals"
trigger.check.name:
pattern: "demo_.*"
type: "matchregex"
action:
ref: "slack.post_message"
parameters:
message: >
[ALERT]{{trigger.client.name}}
{{trigger.check.output}}
channel: "#demos"
9
10
Trigger Action
Rule
Ingredients
11
IT Domains
Config mgmtStorageNetworking ContainersCloud InfraMonitoring
ActionsSensors
WorkflowsRules
Ops Support
Trigger Call
Automation Example
12
Automation
EngineerService
Monitoring Incident
Management
Event: “low disk
on web301”
Web301 is
“low disk”
Resolve known
cases, fast. Is it
/var/log?
Clean up!
Unknown
problem, need
a human
Wake up, buddy.
Something real
is going on…
StackStorm: Why we made it?
© 2016 BROCADE COMMUNICATIONS SYSTEMS, INC.
Opalis, now Microsoft SC Orchestrator
2004-2008
© 2016 BROCADE COMMUNICATIONS SYSTEMS, INC.
Event-driven automation with workflows,
for Enterprise IT
VS
VMware, 2008 - 2013
VS
© 2016 BROCADE COMMUNICATIONS SYSTEMS, INC. 16
OpenStack VMWare
DevOps Tools
© 2016 BROCADE COMMUNICATIONS SYSTEMS, INC. 17
VS
Enterprise Suites
VS
18
Event-driven automation with workflows,
for Cloud and DevOps
DevOps Tools Enterprise Suites
19
What can be it automated?
20
Got StackStorm !
Every automation looks like a nail
What SHOULD be automated?
From: Practice of Cloud System Administration, by Thomas Limoncelli
What HAS BEEN automated with StackStorm
• Security checks
– On malware detection in a VM, isolate
network port on a switch
• App blue-green deployment
– On Jenkins tests passed, bring new vm
claster, deploy and configure app, set
loadbalancer to send % of traffic to new
app, monitor, roll forward, or back out
• Networking
– On BGP peer goes down: collect
troubleshooting data, post on slack &
create JIRA ticket
– On Link aggregation member error,
check load, if capacity of rest of LAG
bundle enough, disable link with error
• OpenStack
– orphan VM clean-up: On orphans
detected, shut down, email owner, keep for
few days, delete
– VM evacuation on HW failures: On host
RAID failure, get list of impacted VMs,
email VM owners, evacuate VMs, create
JIRA ticket for hardware replacement.
• Service remediation:
– Cassandra “node down” recovery: On ring
node dying, deploy new node, configure,
add to the ring.
– Remediating RabbitMQ, Galera cluster,
MySQL, and more…
22
23
Auto-Remediation
FB auto-remediates 98% alarms,
can you?
“Auto-Remediation & Automation at Facebook” @ Auto-Remediation Meetup SF
https://www.meetup.com/Auto-Remediation-and-Event-Driven-Automation/events/236704012/
Facebook FBAR:
43 % Problem Fixed
51 % False positives
94 % Automated
“Auto-Remediation & Automation at Facebook” @ Auto-Remediation Meetup SF
https://www.meetup.com/Auto-Remediation-and-Event-Driven-Automation/events/236704012/
FBAR & StackStorm are friends
StackStorm is inspired by FBAR
StackStorm and FB FBAR collaborating since 2014
“Sleep Better at Night: OpenStack Cloud Auto-Healing” @ OpenStack Summit Barcelona
Mirantis: Auto-remediating 2,000 node OpenStack cluster at Symantec with StackStorm
“Sleep Better at Night: OpenStack Cloud Auto-Healing” @ OpenStack Summit Barcelona
Mirantis: Auto-remediating 2,000 node OpenStack cluster at Symantec with StackStorm
User: Symantec (Mirantis)
Use case: OpenStack cluster remediation
Presented by Mirantis at OpenStack Barcelona
StackStorm at Symantec
Engineer
Wakes up
Logs in
and ACK
Checks
runbook
Studies
the alert
Fixes the
problem
Runs
diagnostics
PagerDuty
Alert
2:02 AM 2:07 AM 2:15 AM2:10 AM 2:30 AM2:20 AM2:00 AM
On-call, Without Automation
Source: “Winston: Helping Netflix Engineers Sleep at Night” @ Qcon ‘16 SF
https://goo.gl/lHzq4r
False
Positive
Winston
2:00
AM
2:05
AM
2:05
AM
2:15
AM
Assisted
Diagnostics
Fixed
the
problem
On-call With Winston
Source: “Winston: Helping Netflix Engineers Sleep at Night” @ Qcon ‘16 SF
https://goo.gl/lHzq4r
Benefits
• Reduce MTR (Mean Time to Resolution)
• Avoid failures (fixing on computer time, not human time)
• Reduce risk of human error (no fat fingers)
• Positive team impact
– Avoid pager fatigue and team burn-out
– Turn from reactive to proactive (break reactive vicious cycle)
– Capture operational knowledge – as code
© 2016 BROCADE COMMUNICATIONS SYSTEMS, INC. 30
31
Network Automation
Now supporting Multiple Vendors
Event-driven automation, DevOps way ~IoT時代の自動化、そのリアリティとは?~
Device support
Proven approach from cloud compute space
33
NAPALM
AUTOMATION
Packs
PYTHON
BINDINGS
INTEGRATION
PACKS
DEVICE / OS
INTERFACES
Some workflows [1] leverage unique device functions, so they
call actions of the device’s integration pack.
Other workflows [2] need to be abstracted and treat devices
alike (e.g. IXP provision on mixture of SLX and MLX). So they
use “unified” Napalm pack.
device drivers
Integration packs expose device configurations and operations
as st2 actions. VDX and SLX packs will expose most/all of
device functionality. MLX is “best effort”. Napalm integration
pack provide “unified” device actions, just like libcloud does in
“compute” domain.
While integration packs can call device interfaces directly,
python bindings provide reuse, and abstract device OS
versions. PyNOS binds VDX via netconf. SLX and MLX don’t
exist yet.
Napalm is Open source project that exposes a unified python
API to interact with different network devices via device drivers.
Devices expose various interfaces: RESTCONF, NETCONF,
CLI/TELNET, SNMP.
VDX MLX SLX
[1] [2]
Other vendors
34
Including legacy, business apps …
Integration
“Innovation at Dimension Data: Optimizing Operations with Event Driven Automation”
https://stackstorm.com/2016/12/15/dimension-data-devops-beyond-deployment/
Dimension Data (SP, part of NTT)
“Innovation at Dimension Data: Optimizing Operations with Event Driven Automation”
https://stackstorm.com/2016/12/15/dimension-data-devops-beyond-deployment/
• Integrate IT systems & tools
• Security automation
• Legacy Run-book replacement
• Automation-aaS to end-users
• Top st2 contributors
Dimension Data (SP, part of NTT)
Event-driven automation, DevOps way ~IoT時代の自動化、そのリアリティとは?~
38
IoT
Fun stuff
Event-driven automation, DevOps way ~IoT時代の自動化、そのリアリティとは?~
40
Serverless
Grab StackStorm & DIY
StackStorm is like …
41
AWS Lambda AWS Step Functions
OpenSource, for DIY Serverless
StackStorm is like …
42
ActionsSensors
WorkflowsRules
IT Domains
Config mgmtStorageNetworking ContainersCloud InfraMonitoring Ops Support
Step Functions
AWS Lambda
Serverless with Swarm
for Genomic Annotation Computing
Dmitri Zimine
http://github.com/dzimine/serverle
@dzimine
Image by Miki Yoshihito, Creative Commons license
Many Use Cases – One Platform
44
StackStorm automation platform
Network
Automation
Assisted
Troubleshooting
Auto
Remediation
ITprocess
integration
IoT
InternetofThings
Serverless
CI/CD
ContinuousDeployment
NFV
Security
Orchestration
ChatOps
Why not…
45
Why not Scripts?
46
Why not Scripts?
47
• Simple to define, reason, visualize
• Transparent
– state is clear, execution is trackable: running, complete, failed steps
•
–
–
–
© 2016 BROCADE COMMUNICATIONS SYSTEMS, INC.48
Workflows Better in Operations
• Simple to define, reason, visualize
• Transparent
– state is clear, execution is trackable: running, complete, failed steps
• Reliable
– Workflows are long-running
– Crash tolerance
– “Restart from point of failure”
© 2016 BROCADE COMMUNICATIONS SYSTEMS, INC. 49
50
Why not legacy Runbook Automation?
• Microsoft System Center Orchestrator
• HP Operation Orchestrator
• Cisco Process Orchestrator (CPO)
• VMWare vCO / vRealize
They do not DevOps
Infrastructure As Code
Leverage social coding and collaboration
OpenSource
Designed for Devops
Infrastructure As Code
Leverage social coding and collaboration
OpenSource
Designed for Devops
53
Infrastructure as code
Case Study
• Service Catalog backed up by workflow
• Automate provisioning on VMW/OpenStack, 4 Data centers
• Before: CPO, operator updates via GUI, click and pray, x4
• After: StackStorm, dev -> code review -> staging -> QA-> prod
Infrastructure As Code
Leverage social coding and collaboration
OpenSource
Designed for Devops
265 Contributors
Source: https://openhub.net/p/st2
256 = 100,000,0002
Contributors
StackStorm & BWC Usage
© 2016 BROCADE COMMUNICATIONS SYSTEMS, INC. 57
0
500
1000
1500
2000
2500
3000
Sep Oct Nov Dec Jan Feb
Installations/month
- StackStorm
- BWC
StackStorm Exchange
© 2016 BROCADE COMMUNICATIONS SYSTEMS, INC. 58
StackStorm Exchange
© 2016 BROCADE COMMUNICATIONS SYSTEMS, INC. 59
© 2016 BROCADE COMMUNICATIONS SYSTEMS, INC. 60
© 2016 BROCADE COMMUNICATIONS SYSTEMS, INC. 61
Take away:
* Try it
* Use it
* Contribute to it
62
63
• Use StackStorm.
Try it, find automation, nail POC. Let us know, good & bad.
curl -sSL https://stackstorm.com/packages/install.sh | bash -s
docs.stackstorm.com/install
• Commit code. Become a “community maintainer”
It is not hard (2 days?). We help & support.
• Spread the word
Blog. Tweet. Talk. Mention. Bug. Github Star!
64
Contribute! Everything counts
Thank You!
@Stack_Storm
http://github.com/StackStorm/st2 Star 1,869
Dmitri Zimine
@dzimine
OpenSource Apache 2.0
• Github: github.com/StackStorm/st2
• Twitter: Stack_Storm
• IRC: #stackstorm on FreeNode
• stackstorm.slack.com on Slack
• www.stackstorm.com
© 2016 BROCADE COMMUNICATIONS SYSTEMS, INC. 66
StackStorm Brocade Workflow Composer
Commercial Edition
• Enterprise features
• Priority support
• brocade.com/bwc
• docs: bwc-docs.brocade.com
• Network lifecycle automation suite

More Related Content

PDF
コンテナとimmutableとわたし。あとセキュリティ。(Kubernetes Novice Tokyo #15 発表資料)
PDF
うちではこうやっています UI構築のルールとPlaymakerを使った画面遷移
ODP
Goのサーバサイド実装におけるレイヤ設計とレイヤ内実装について考える
PDF
「龍が如くスタジオ」のQAエンジニアリング技術を結集した全自動バグ取りシステム
PPTX
SSRF対策としてAmazonから発表されたIMDSv2の効果と破り方
PDF
MQTTとAMQPと.NET
PDF
[Aurora事例祭り]Amazon Aurora を使いこなすためのベストプラクティス
PDF
Docker Swarm入門
コンテナとimmutableとわたし。あとセキュリティ。(Kubernetes Novice Tokyo #15 発表資料)
うちではこうやっています UI構築のルールとPlaymakerを使った画面遷移
Goのサーバサイド実装におけるレイヤ設計とレイヤ内実装について考える
「龍が如くスタジオ」のQAエンジニアリング技術を結集した全自動バグ取りシステム
SSRF対策としてAmazonから発表されたIMDSv2の効果と破り方
MQTTとAMQPと.NET
[Aurora事例祭り]Amazon Aurora を使いこなすためのベストプラクティス
Docker Swarm入門

What's hot (20)

PDF
マイクロサービス化に向けて
PDF
PostgreSQL Query Cache - "pqc"
PDF
コンテナセキュリティにおける権限制御(OCHaCafe5 #3 Kubernetes のセキュリティ 発表資料)
PPTX
イベント・ソーシングを知る
PDF
Wireshark だけに頼らない! パケット解析ツールの紹介
PDF
JJUG CCC 2018 Spring - I-7 (俺が)はじめての Netty
PPTX
「関心の分離」と「疎結合」 ソフトウェアアーキテクチャのひとかけら
PPTX
MongoDBが遅いときの切り分け方法
PDF
初心者向けWebinar AWSで開発環境を構築しよう
PPTX
Metaspace
PDF
CyberAgent における OSS の CI/CD 基盤開発 myshoes #CICD2021
PDF
Dalvik仮想マシンのアーキテクチャ 改訂版
PDF
ルータコンフィグのGit管理のススメ 〜Git管理以外を自動化してみた〜
PDF
Docker道場オンライン#1 Docker基礎概念と用語の理解
PDF
Docker Compose入門~今日から始めるComposeの初歩からswarm mode対応まで
PPTX
RLSを用いたマルチテナント実装 for Django
PDF
AWSのログ管理ベストプラクティス
PDF
【第26回Elasticsearch勉強会】Logstashとともに振り返る、やっちまった事例ごった煮
PDF
jenkinsで遊ぶ
マイクロサービス化に向けて
PostgreSQL Query Cache - "pqc"
コンテナセキュリティにおける権限制御(OCHaCafe5 #3 Kubernetes のセキュリティ 発表資料)
イベント・ソーシングを知る
Wireshark だけに頼らない! パケット解析ツールの紹介
JJUG CCC 2018 Spring - I-7 (俺が)はじめての Netty
「関心の分離」と「疎結合」 ソフトウェアアーキテクチャのひとかけら
MongoDBが遅いときの切り分け方法
初心者向けWebinar AWSで開発環境を構築しよう
Metaspace
CyberAgent における OSS の CI/CD 基盤開発 myshoes #CICD2021
Dalvik仮想マシンのアーキテクチャ 改訂版
ルータコンフィグのGit管理のススメ 〜Git管理以外を自動化してみた〜
Docker道場オンライン#1 Docker基礎概念と用語の理解
Docker Compose入門~今日から始めるComposeの初歩からswarm mode対応まで
RLSを用いたマルチテナント実装 for Django
AWSのログ管理ベストプラクティス
【第26回Elasticsearch勉強会】Logstashとともに振り返る、やっちまった事例ごった煮
jenkinsで遊ぶ
Ad

Similar to Event-driven automation, DevOps way ~IoT時代の自動化、そのリアリティとは?~ (20)

PDF
StackStrom: If-This-Than-That for Devops Automation
PPTX
Weave User Group Talk - DockerCon 2017 Recap
PDF
Masterless Puppet Using AWS S3 Buckets and IAM Roles
PDF
06 network automationwithansible
PDF
Microservices Application Tracing Standards and Simulators - Adrians at OSCON
PPTX
Oscon 2017: Build your own container-based system with the Moby project
PDF
VMworld 2013: Deploying vSphere with OpenStack: What It Means to Your Cloud E...
PPTX
Application Development on Metapod
PDF
How Zalando runs Kubernetes clusters at scale on AWS - AWS re:Invent
PDF
Accelerate Your OpenStack Deployment Presented by SolidFire and Red Hat
PPTX
Serverless on OpenStack with Docker Swarm, Mistral, and StackStorm
PDF
Introduction to Apache NiFi 1.11.4
PPTX
Integrating Splunk into your Spring Applications
PDF
CAPS: What's best for deploying and managing OpenStack? Chef vs. Ansible vs. ...
PDF
CAPS: What's best for deploying and managing OpenStack? Chef vs. Ansible vs. ...
PDF
A GitOps model for High Availability and Disaster Recovery on EKS
PDF
Cloud-native .NET-Microservices mit Kubernetes @BASTAcon
PDF
Cloud-native .NET Microservices mit Kubernetes
PDF
A DevOps guide to Kubernetes
KEY
Django deployment with PaaS
StackStrom: If-This-Than-That for Devops Automation
Weave User Group Talk - DockerCon 2017 Recap
Masterless Puppet Using AWS S3 Buckets and IAM Roles
06 network automationwithansible
Microservices Application Tracing Standards and Simulators - Adrians at OSCON
Oscon 2017: Build your own container-based system with the Moby project
VMworld 2013: Deploying vSphere with OpenStack: What It Means to Your Cloud E...
Application Development on Metapod
How Zalando runs Kubernetes clusters at scale on AWS - AWS re:Invent
Accelerate Your OpenStack Deployment Presented by SolidFire and Red Hat
Serverless on OpenStack with Docker Swarm, Mistral, and StackStorm
Introduction to Apache NiFi 1.11.4
Integrating Splunk into your Spring Applications
CAPS: What's best for deploying and managing OpenStack? Chef vs. Ansible vs. ...
CAPS: What's best for deploying and managing OpenStack? Chef vs. Ansible vs. ...
A GitOps model for High Availability and Disaster Recovery on EKS
Cloud-native .NET-Microservices mit Kubernetes @BASTAcon
Cloud-native .NET Microservices mit Kubernetes
A DevOps guide to Kubernetes
Django deployment with PaaS
Ad

More from Brocade (20)

PPTX
Network infrastructure visibility and analytics with data streaming
PDF
~ストレージの価値を最大化!~次世代ストレージの導入ベネフィットを無駄なく享受するために、“ネットワーク”視点で、知っておくべきこと
PDF
“見てわかる”ファイバーチャネルSAN基礎講座(第6弾: 最終回)~困った時もこれで安心(?)、FC SANにおけるトラブルシューティングのコツとは??~
PDF
FC SAN Fabric環境におけるパフォーマンストラブルの対処法
PDF
Brocade PartnerWebinar: Network Edge キャンパス・スイッチ新製品&新機能、および有線&無線統合ソリューションアップデー...
PDF
“見てわかる” ファイバーチャネルSAN基礎講座(第5弾)~さあ、いよいよ、運用です!~
PDF
“見てわかる” ファイバーチャネルSAN基礎講座(第4弾)~続・間違わない!FC SAN導入のヒントとコツ~
PPTX
Two Fast Paths to Docker Networking with Brocade VDX
PPTX
Five Networking Must - Haves For Containers
PPTX
Let the conversation flow with Brocade vADC
PPTX
Always-on performance for Always-on Business
PDF
“見てわかる” ファイバーチャネルSAN基礎講座(第3弾)~間違わない!FC SAN導入のヒントとコツ~
PPTX
Recent Advances in Machine Learning: Bringing a New Level of Intelligence to ...
PDF
“見てわかる” ファイバーチャネルSAN基礎講座(第2弾)~FC SAN設計における勘所とは?~
PPTX
VM Farms Thrive with Dedicated IP Storage Networks
PDF
仮想ロードバランサ最前線: 性能も柔軟さも。ソフトウェア生まれだから実現できること
PDF
自動化万歳!朝起きたらトラブルシューティングが完了している世界の実現方法
PDF
変わるモバイル・ネットワークの世界 ~ソフトウェアが可能にする破壊的進化のリアリティと将来~
PDF
NEW WORLD. NEW NETWORK.: 3ステップで考えるNew IPへの進化
PDF
“見てわかる” ファイバーチャネルSAN基礎講座(第1弾)~まず理解しよう! 基本の “キ”~
Network infrastructure visibility and analytics with data streaming
~ストレージの価値を最大化!~次世代ストレージの導入ベネフィットを無駄なく享受するために、“ネットワーク”視点で、知っておくべきこと
“見てわかる”ファイバーチャネルSAN基礎講座(第6弾: 最終回)~困った時もこれで安心(?)、FC SANにおけるトラブルシューティングのコツとは??~
FC SAN Fabric環境におけるパフォーマンストラブルの対処法
Brocade PartnerWebinar: Network Edge キャンパス・スイッチ新製品&新機能、および有線&無線統合ソリューションアップデー...
“見てわかる” ファイバーチャネルSAN基礎講座(第5弾)~さあ、いよいよ、運用です!~
“見てわかる” ファイバーチャネルSAN基礎講座(第4弾)~続・間違わない!FC SAN導入のヒントとコツ~
Two Fast Paths to Docker Networking with Brocade VDX
Five Networking Must - Haves For Containers
Let the conversation flow with Brocade vADC
Always-on performance for Always-on Business
“見てわかる” ファイバーチャネルSAN基礎講座(第3弾)~間違わない!FC SAN導入のヒントとコツ~
Recent Advances in Machine Learning: Bringing a New Level of Intelligence to ...
“見てわかる” ファイバーチャネルSAN基礎講座(第2弾)~FC SAN設計における勘所とは?~
VM Farms Thrive with Dedicated IP Storage Networks
仮想ロードバランサ最前線: 性能も柔軟さも。ソフトウェア生まれだから実現できること
自動化万歳!朝起きたらトラブルシューティングが完了している世界の実現方法
変わるモバイル・ネットワークの世界 ~ソフトウェアが可能にする破壊的進化のリアリティと将来~
NEW WORLD. NEW NETWORK.: 3ステップで考えるNew IPへの進化
“見てわかる” ファイバーチャネルSAN基礎講座(第1弾)~まず理解しよう! 基本の “キ”~

Recently uploaded (20)

PDF
Advanced methodologies resolving dimensionality complications for autism neur...
PPT
Teaching material agriculture food technology
PPTX
Digital-Transformation-Roadmap-for-Companies.pptx
PDF
Encapsulation_ Review paper, used for researhc scholars
PPTX
Big Data Technologies - Introduction.pptx
PDF
Agricultural_Statistics_at_a_Glance_2022_0.pdf
PDF
Modernizing your data center with Dell and AMD
PDF
Bridging biosciences and deep learning for revolutionary discoveries: a compr...
PDF
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
PPTX
A Presentation on Artificial Intelligence
PDF
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
PDF
Approach and Philosophy of On baking technology
PPTX
Cloud computing and distributed systems.
PDF
Mobile App Security Testing_ A Comprehensive Guide.pdf
PDF
Building Integrated photovoltaic BIPV_UPV.pdf
PPT
“AI and Expert System Decision Support & Business Intelligence Systems”
PDF
Electronic commerce courselecture one. Pdf
PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
PPTX
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
PDF
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
Advanced methodologies resolving dimensionality complications for autism neur...
Teaching material agriculture food technology
Digital-Transformation-Roadmap-for-Companies.pptx
Encapsulation_ Review paper, used for researhc scholars
Big Data Technologies - Introduction.pptx
Agricultural_Statistics_at_a_Glance_2022_0.pdf
Modernizing your data center with Dell and AMD
Bridging biosciences and deep learning for revolutionary discoveries: a compr...
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
A Presentation on Artificial Intelligence
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
Approach and Philosophy of On baking technology
Cloud computing and distributed systems.
Mobile App Security Testing_ A Comprehensive Guide.pdf
Building Integrated photovoltaic BIPV_UPV.pdf
“AI and Expert System Decision Support & Business Intelligence Systems”
Electronic commerce courselecture one. Pdf
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...

Event-driven automation, DevOps way ~IoT時代の自動化、そのリアリティとは?~

  • 1. StackStorm If-This-Than-That for DevOps automation © 2016 BROCADE COMMUNICATIONS SYSTEMS, INC.
  • 4. Agenda What is StackStorm Why we made it Use cases Why is it better than …
  • 5. StackStorm: What is it? © 2016 BROCADE COMMUNICATIONS SYSTEMS, INC.
  • 7. StackStorm is like … 7 for DevOps
  • 8. IFTTT, for DevOps 8 cat /opt/stackstorm/packs/st2-demos/rules/demo.yaml --- name: "sensu_crit_to_slack" pack: "st2-demos" description: "Post all critical alerts to the demo enabled: true trigger: type: "sensu.event_handler" criteria: trigger.check.status: pattern: 2 type: "equals" trigger.check.name: pattern: "demo_.*" type: "matchregex" action: ref: "slack.post_message" parameters: message: > [ALERT]{{trigger.client.name}} {{trigger.check.output}} channel: "#demos"
  • 9. 9
  • 11. Ingredients 11 IT Domains Config mgmtStorageNetworking ContainersCloud InfraMonitoring ActionsSensors WorkflowsRules Ops Support Trigger Call
  • 12. Automation Example 12 Automation EngineerService Monitoring Incident Management Event: “low disk on web301” Web301 is “low disk” Resolve known cases, fast. Is it /var/log? Clean up! Unknown problem, need a human Wake up, buddy. Something real is going on…
  • 13. StackStorm: Why we made it? © 2016 BROCADE COMMUNICATIONS SYSTEMS, INC.
  • 14. Opalis, now Microsoft SC Orchestrator 2004-2008 © 2016 BROCADE COMMUNICATIONS SYSTEMS, INC. Event-driven automation with workflows, for Enterprise IT
  • 16. VS © 2016 BROCADE COMMUNICATIONS SYSTEMS, INC. 16 OpenStack VMWare
  • 17. DevOps Tools © 2016 BROCADE COMMUNICATIONS SYSTEMS, INC. 17 VS Enterprise Suites
  • 18. VS 18 Event-driven automation with workflows, for Cloud and DevOps DevOps Tools Enterprise Suites
  • 19. 19 What can be it automated?
  • 20. 20 Got StackStorm ! Every automation looks like a nail
  • 21. What SHOULD be automated? From: Practice of Cloud System Administration, by Thomas Limoncelli
  • 22. What HAS BEEN automated with StackStorm • Security checks – On malware detection in a VM, isolate network port on a switch • App blue-green deployment – On Jenkins tests passed, bring new vm claster, deploy and configure app, set loadbalancer to send % of traffic to new app, monitor, roll forward, or back out • Networking – On BGP peer goes down: collect troubleshooting data, post on slack & create JIRA ticket – On Link aggregation member error, check load, if capacity of rest of LAG bundle enough, disable link with error • OpenStack – orphan VM clean-up: On orphans detected, shut down, email owner, keep for few days, delete – VM evacuation on HW failures: On host RAID failure, get list of impacted VMs, email VM owners, evacuate VMs, create JIRA ticket for hardware replacement. • Service remediation: – Cassandra “node down” recovery: On ring node dying, deploy new node, configure, add to the ring. – Remediating RabbitMQ, Galera cluster, MySQL, and more… 22
  • 24. “Auto-Remediation & Automation at Facebook” @ Auto-Remediation Meetup SF https://www.meetup.com/Auto-Remediation-and-Event-Driven-Automation/events/236704012/ Facebook FBAR: 43 % Problem Fixed 51 % False positives 94 % Automated
  • 25. “Auto-Remediation & Automation at Facebook” @ Auto-Remediation Meetup SF https://www.meetup.com/Auto-Remediation-and-Event-Driven-Automation/events/236704012/ FBAR & StackStorm are friends StackStorm is inspired by FBAR StackStorm and FB FBAR collaborating since 2014
  • 26. “Sleep Better at Night: OpenStack Cloud Auto-Healing” @ OpenStack Summit Barcelona Mirantis: Auto-remediating 2,000 node OpenStack cluster at Symantec with StackStorm
  • 27. “Sleep Better at Night: OpenStack Cloud Auto-Healing” @ OpenStack Summit Barcelona Mirantis: Auto-remediating 2,000 node OpenStack cluster at Symantec with StackStorm User: Symantec (Mirantis) Use case: OpenStack cluster remediation Presented by Mirantis at OpenStack Barcelona StackStorm at Symantec
  • 28. Engineer Wakes up Logs in and ACK Checks runbook Studies the alert Fixes the problem Runs diagnostics PagerDuty Alert 2:02 AM 2:07 AM 2:15 AM2:10 AM 2:30 AM2:20 AM2:00 AM On-call, Without Automation Source: “Winston: Helping Netflix Engineers Sleep at Night” @ Qcon ‘16 SF https://goo.gl/lHzq4r
  • 29. False Positive Winston 2:00 AM 2:05 AM 2:05 AM 2:15 AM Assisted Diagnostics Fixed the problem On-call With Winston Source: “Winston: Helping Netflix Engineers Sleep at Night” @ Qcon ‘16 SF https://goo.gl/lHzq4r
  • 30. Benefits • Reduce MTR (Mean Time to Resolution) • Avoid failures (fixing on computer time, not human time) • Reduce risk of human error (no fat fingers) • Positive team impact – Avoid pager fatigue and team burn-out – Turn from reactive to proactive (break reactive vicious cycle) – Capture operational knowledge – as code © 2016 BROCADE COMMUNICATIONS SYSTEMS, INC. 30
  • 33. Device support Proven approach from cloud compute space 33 NAPALM AUTOMATION Packs PYTHON BINDINGS INTEGRATION PACKS DEVICE / OS INTERFACES Some workflows [1] leverage unique device functions, so they call actions of the device’s integration pack. Other workflows [2] need to be abstracted and treat devices alike (e.g. IXP provision on mixture of SLX and MLX). So they use “unified” Napalm pack. device drivers Integration packs expose device configurations and operations as st2 actions. VDX and SLX packs will expose most/all of device functionality. MLX is “best effort”. Napalm integration pack provide “unified” device actions, just like libcloud does in “compute” domain. While integration packs can call device interfaces directly, python bindings provide reuse, and abstract device OS versions. PyNOS binds VDX via netconf. SLX and MLX don’t exist yet. Napalm is Open source project that exposes a unified python API to interact with different network devices via device drivers. Devices expose various interfaces: RESTCONF, NETCONF, CLI/TELNET, SNMP. VDX MLX SLX [1] [2] Other vendors
  • 34. 34 Including legacy, business apps … Integration
  • 35. “Innovation at Dimension Data: Optimizing Operations with Event Driven Automation” https://stackstorm.com/2016/12/15/dimension-data-devops-beyond-deployment/ Dimension Data (SP, part of NTT)
  • 36. “Innovation at Dimension Data: Optimizing Operations with Event Driven Automation” https://stackstorm.com/2016/12/15/dimension-data-devops-beyond-deployment/ • Integrate IT systems & tools • Security automation • Legacy Run-book replacement • Automation-aaS to end-users • Top st2 contributors Dimension Data (SP, part of NTT)
  • 41. StackStorm is like … 41 AWS Lambda AWS Step Functions OpenSource, for DIY Serverless
  • 42. StackStorm is like … 42 ActionsSensors WorkflowsRules IT Domains Config mgmtStorageNetworking ContainersCloud InfraMonitoring Ops Support Step Functions AWS Lambda
  • 43. Serverless with Swarm for Genomic Annotation Computing Dmitri Zimine http://github.com/dzimine/serverle @dzimine Image by Miki Yoshihito, Creative Commons license
  • 44. Many Use Cases – One Platform 44 StackStorm automation platform Network Automation Assisted Troubleshooting Auto Remediation ITprocess integration IoT InternetofThings Serverless CI/CD ContinuousDeployment NFV Security Orchestration ChatOps
  • 47. Why not Scripts? 47 • Simple to define, reason, visualize • Transparent – state is clear, execution is trackable: running, complete, failed steps • – – –
  • 48. © 2016 BROCADE COMMUNICATIONS SYSTEMS, INC.48
  • 49. Workflows Better in Operations • Simple to define, reason, visualize • Transparent – state is clear, execution is trackable: running, complete, failed steps • Reliable – Workflows are long-running – Crash tolerance – “Restart from point of failure” © 2016 BROCADE COMMUNICATIONS SYSTEMS, INC. 49
  • 50. 50 Why not legacy Runbook Automation? • Microsoft System Center Orchestrator • HP Operation Orchestrator • Cisco Process Orchestrator (CPO) • VMWare vCO / vRealize They do not DevOps
  • 51. Infrastructure As Code Leverage social coding and collaboration OpenSource Designed for Devops
  • 52. Infrastructure As Code Leverage social coding and collaboration OpenSource Designed for Devops
  • 53. 53 Infrastructure as code Case Study • Service Catalog backed up by workflow • Automate provisioning on VMW/OpenStack, 4 Data centers • Before: CPO, operator updates via GUI, click and pray, x4 • After: StackStorm, dev -> code review -> staging -> QA-> prod
  • 54. Infrastructure As Code Leverage social coding and collaboration OpenSource Designed for Devops
  • 57. StackStorm & BWC Usage © 2016 BROCADE COMMUNICATIONS SYSTEMS, INC. 57 0 500 1000 1500 2000 2500 3000 Sep Oct Nov Dec Jan Feb Installations/month - StackStorm - BWC
  • 58. StackStorm Exchange © 2016 BROCADE COMMUNICATIONS SYSTEMS, INC. 58
  • 59. StackStorm Exchange © 2016 BROCADE COMMUNICATIONS SYSTEMS, INC. 59
  • 60. © 2016 BROCADE COMMUNICATIONS SYSTEMS, INC. 60
  • 61. © 2016 BROCADE COMMUNICATIONS SYSTEMS, INC. 61
  • 62. Take away: * Try it * Use it * Contribute to it 62
  • 63. 63
  • 64. • Use StackStorm. Try it, find automation, nail POC. Let us know, good & bad. curl -sSL https://stackstorm.com/packages/install.sh | bash -s docs.stackstorm.com/install • Commit code. Become a “community maintainer” It is not hard (2 days?). We help & support. • Spread the word Blog. Tweet. Talk. Mention. Bug. Github Star! 64 Contribute! Everything counts
  • 66. OpenSource Apache 2.0 • Github: github.com/StackStorm/st2 • Twitter: Stack_Storm • IRC: #stackstorm on FreeNode • stackstorm.slack.com on Slack • www.stackstorm.com © 2016 BROCADE COMMUNICATIONS SYSTEMS, INC. 66 StackStorm Brocade Workflow Composer Commercial Edition • Enterprise features • Priority support • brocade.com/bwc • docs: bwc-docs.brocade.com • Network lifecycle automation suite