SlideShare a Scribd company logo
Production Grade Edge Computing
Using Kubernetes
Steve Wong
@cantbewong
Open Source Community Relations Engineer
VMware
Open Source Summit Europe
October 23, 2018
2
Abstract
​Some applications benefit from moving closer to data ingest, or the user. Edge reduces local
processing latency, and supports isolated operation. It also has challenges compared to the
pooled resources, and single point of management of centralized clouds.
You won’t achieve the Google Borg experience at an edge location - nor are you likely to need it.
But with planning it is possible to achieve edge deployments that are secure, with predictable
performance and “highly available enough” considering constraints on money, physical space,
power, etc.
Steve will provide specific recommendations related to architecture, networking, storage,
patching, logging, disaster recovery, and remote manageability - based on using Kubernetes,
and other open source tools and technology.
This is a rapidly changing space, and Steve will also touch on some interesting proposals and
work underway in the space.
Agenda
3
​Production Grade Kubernetes
​What Does It Mean?
​Critical Components in a Kubernetes Cluster
​Architecture of the control plane
​Impacts of Limited Resources at an Edge location
​Making the best of limited budget and facilities
​Kubernetes Configuration
​Defaults may not be appropriate for edge
​Security
​Considerations for edge
​Disaster Recovery & Backup
​Planning checklist
​Futures
4
What does it mean to be “Production Grade”?
When you deploy to edge, you own 100% of this
• The installation is secure
• The deployment is managed with a
repeatable and recorded process
• Performance is predictable and consistent
• Updates and configuration changes can be
safely applied
• Logging and monitoring is in place to
detect and diagnose failures and resource
shortages
• Service is “highly available enough”
considering available resources, including
constraints on money, physical space,
power, etc.
• A recovery process is available,
documented, and tested for use in the
event of failures
Photo attribution: By Tony Webster from Portland, Oregon, United States (212 days without recordable incident) [CC BY 2.0
(https://creativecommons.org/licenses/by/2.0)], via Wikimedia Commons
5
Kubernetes Cluster Architecture
Kubernetes = a distributed system with a control plane and clustered worker nodes 
6
Control Plane
Critical components
component role effect of loss
etcd Maintains state for all
Kubernetes objects
Loss of storage catastrophic. Loss of quorum = Kubernetes
loses control plane. Read only API calls might continue to
work. Existing workloads may continue to run.
API server Provides API used internally
and externally
Can’t start, stop, update pods, services, replication
controllers. Scheduler and Controller Manager down.
Workloads continue if not dependent on API calls
(operators, customer controllers, CRDs, etc.)
scheduler Places pods on nodes No pod placements, priority, preemption.
controller manager Runs many controllers Core control loops that regulate state cease
7
Keeping critical components available
Recommendations:
• Use redundancy
• Hardware
• Software
• Enable rapid recovery
• Backups
• DR plans
• Training and documentation
• Security
• Monitoring, Metrics, Logging
• Automate operations
• Installs
• Updates
The Risks:
• Hardware failures
• Software bugs
• Bad updates
• Human errors
• Network outages
• Intentional attacks
• Overloaded systems resulting in
resource exhaustion
• Power, Cooling losses
• Weather
8
On-premises deployments have finite resources
even in public clouds, budget may limit what you choose to consume
Dreams &
Goals
Resources
a tricky balance
Drawing attribution: By Б.Золзаяа [CC BY-SA 4.0 (https://creativecommons.org/licenses/by-sa/4.0)], from Wikimedia Commons
9
Kubernetes on a single hardware host
Operating on the edge in more ways than one
Minimum
• Hardware
• Dual disks – mirrored
• Dual fans
• UPS
Recommended
• Hypervisor
• 3 node Kubernetes control
plane
• Resource governance
• Hardware
• Dual Power Supplies
• ECC memory
• 3 disks
10
Kubernetes on dual hardware hosts
etcd quorum must be an odd number
• No real availability advantage to
splitting etcd across 2 nodes - put all
etcd instances on one node
Shared storage is advantageous for availability of
control plane and workloads
• External storage may be expensive
• Software Defined storage will generally require a
witness “2+1”
11
Kubernetes on 3 or more hardware hosts
recommendation Put a control plane instance with etcd on each of 3
nodes
• Loss of a node reduces capacity but does not
bring down Kubernetes, and is recoverable
12
Kubernetes configuration settings
Protect the system from resource overloads
Throttle things like :
• API call rates
• Pods per node
Reserve for system daemons
Recommended for predictable,
repeatable behavior:
• Explicitly state resources on
container specs
• Explicitly configure out of
resource behavior on nodes
• Use namespace quotas
13
Security
Recommendations
• Certificates
• Lock down worker nodes
• Use an image repository with
governance & security features
• Utilize Pod Security Policies
• Use RBAC to drive authorization
decisions and enforcement
• Consider physical security at
edge locations
• Storage encryption
• Protection from attachment
of malicious devices
• Avoid use of plain text credentials
(access key, token, passwords)
• Consider security features when
you choose your network solution
• Logging and monitoring/metrics
can contribute to security
14
Disaster Recovery
Redundancy can help reduce outages but
failures can still occur
DR plan elements:
• Backups
• Availability of
replacements
• A planned process
• People to carry it out
• Training
• Documentation of
the procedure
(runbooks)
• Automation can help
Backup concerns
• etcd
• Stateful workload
storage
• Certificates and keypairs
• DNS records
• IP/subnet assignments
• Config files
• Service accounts and
creds
Photo attribution: US Air Force photo, B-24 in cloud with flak, No. 2 engine smoking
15
Final Thoughts + Roadmap
Kubernetes can be applied to edge – but doing
so requires some care
Join the Kubernetes Edge IoT working group to get involved
• https://groups.google.com/forum/#!forum/kubernetes-wg-iot-edge
Photo attribution: By Matti Blume [CC BY-SA 4.0 (https://creativecommons.org/licenses/by-sa/4.0)], from Wikimedia Commons
There is opportunity to improve this
• Issues with scale
• Issues with control plane to worker node connectivity
Thank You
Questions
Deck is here:
More detailed coverage can be found in this blog post:
https://kubernetes.io/blog/2018/08/03/out-of-the-clouds-onto-the-ground-how-to-make-kubernetes-pro
duction-grade-anywhere/
Contact me later:
Twitter: @cantbewong
Kubernetes Slack: steve-wong
​References:

More Related Content

PDF
Why you need a private container image registry SCALE 2019
PDF
OSS Japan 2019 service mesh bridging Kubernetes and legacy
PDF
Learn how to use Harbor
PPTX
Kubernetes Disaster Recovery - Los Angeles K8s meetup Dec 10 2019
PPTX
Docker Meetup 08 03-2016
PDF
DockerCon EU 2015: The Latest in Docker Engine
PDF
Configuration Management and Transforming Legacy Applications in the Enterpri...
PDF
DCSF 19 Kubernetes and Container Storage Interface Update
Why you need a private container image registry SCALE 2019
OSS Japan 2019 service mesh bridging Kubernetes and legacy
Learn how to use Harbor
Kubernetes Disaster Recovery - Los Angeles K8s meetup Dec 10 2019
Docker Meetup 08 03-2016
DockerCon EU 2015: The Latest in Docker Engine
Configuration Management and Transforming Legacy Applications in the Enterpri...
DCSF 19 Kubernetes and Container Storage Interface Update

What's hot (20)

PDF
Effective Data Pipelines with Docker & Jenkins - Brian Donaldson
PDF
Docker for any type of workload and any IT Infrastructure
PPTX
How to be successful running Docker in Production
PDF
Kubernetes - Sailing a Sea of Containers
PDF
Production Ready Containers from IBM and Docker
PDF
Net core, mssql, container und kubernetes
PDF
Continuous delivery of microservices with kubernetes - Quintor 27-2-2017
PDF
DCSF19 How Docker Simplifies Kubernetes for the Masses
PPTX
Compare Docker deployment options in the public cloud
PDF
In-Cluster Continuous Testing Framework for Docker Containers
PPTX
CI, CD with Docker, Jenkins and Tutum
PDF
What’s New in Docker - Victor Vieux, Docker
PPTX
Security best practices for kubernetes deployment
PPTX
Platform as a Service with Kubernetes and Mesos
PPTX
containerd the universal container runtime
PPTX
Docker 101 Checonf 2016
PDF
DCSF19 CMD and Conquer: Containerizing the Monolith
PDF
Kubernetes laravel and kubernetes
PDF
Sharding Containers: Make Go Apps Computer-Friendly Again by Andrey Sibiryov
PPTX
Introducing LinuxKit
Effective Data Pipelines with Docker & Jenkins - Brian Donaldson
Docker for any type of workload and any IT Infrastructure
How to be successful running Docker in Production
Kubernetes - Sailing a Sea of Containers
Production Ready Containers from IBM and Docker
Net core, mssql, container und kubernetes
Continuous delivery of microservices with kubernetes - Quintor 27-2-2017
DCSF19 How Docker Simplifies Kubernetes for the Masses
Compare Docker deployment options in the public cloud
In-Cluster Continuous Testing Framework for Docker Containers
CI, CD with Docker, Jenkins and Tutum
What’s New in Docker - Victor Vieux, Docker
Security best practices for kubernetes deployment
Platform as a Service with Kubernetes and Mesos
containerd the universal container runtime
Docker 101 Checonf 2016
DCSF19 CMD and Conquer: Containerizing the Monolith
Kubernetes laravel and kubernetes
Sharding Containers: Make Go Apps Computer-Friendly Again by Andrey Sibiryov
Introducing LinuxKit
Ad

Similar to Production grade edge computing on Kubernetes OSS EU 2018 (20)

PPTX
TRAINING_ABOUT_KUBERNETES_Nguyen_Si_Nhan.pptx
PDF
Kubernetes and CoreOS @ Athens Docker meetup
PDF
Kubecon seattle 2018 workshop slides
PPTX
Introduction to Kubernetes
PPTX
Kubernetes Presentation
PDF
Successful K8S Platforms in Airgapped Environments
PDF
Women Who Code Connect 2018 Conference
PPTX
Aks: k8s e azure
PPTX
Kubernetes
PDF
Future of Kubernetes and its Impact on Technology Industry.pdf
PDF
Kubernetes Basics - ICP Workshop Batch II
PDF
From Containerized Application to Secure and Scaling With Kubernetes
PPTX
Container orchestration and microservices world
PDF
KubeCon China June 2019 - Survey of Kubernetes related solutions for IoT and ...
PDF
Five Strategies for Enterprises to Secure their Kubernetes Clusters.pdf
PDF
Docker Madison, Introduction to Kubernetes
PPTX
Simplify Your Way To Expert Kubernetes Management
PPTX
Containing the cloud
PDF
Kubernetes From Scratch .pdf
PDF
Build your own private Cloud environment
TRAINING_ABOUT_KUBERNETES_Nguyen_Si_Nhan.pptx
Kubernetes and CoreOS @ Athens Docker meetup
Kubecon seattle 2018 workshop slides
Introduction to Kubernetes
Kubernetes Presentation
Successful K8S Platforms in Airgapped Environments
Women Who Code Connect 2018 Conference
Aks: k8s e azure
Kubernetes
Future of Kubernetes and its Impact on Technology Industry.pdf
Kubernetes Basics - ICP Workshop Batch II
From Containerized Application to Secure and Scaling With Kubernetes
Container orchestration and microservices world
KubeCon China June 2019 - Survey of Kubernetes related solutions for IoT and ...
Five Strategies for Enterprises to Secure their Kubernetes Clusters.pdf
Docker Madison, Introduction to Kubernetes
Simplify Your Way To Expert Kubernetes Management
Containing the cloud
Kubernetes From Scratch .pdf
Build your own private Cloud environment
Ad

More from Steve Wong (7)

PDF
Kubernetes for IoT and Edge - Instrument Society of America Meeting
PDF
SCALE 16x on-prem container orchestrator deployment
PDF
Introduction to Apache Mesos and DC/OS
PDF
Open Source Summit NorthAmerica 2017 Stateful Apps in Containers
PDF
Smart Cities and IOT: with opportunity comes risk
PDF
Open Source Storage Summit Tokyo 2016 - Storage for Containers
PDF
Heresey in the church of 12 factors
Kubernetes for IoT and Edge - Instrument Society of America Meeting
SCALE 16x on-prem container orchestrator deployment
Introduction to Apache Mesos and DC/OS
Open Source Summit NorthAmerica 2017 Stateful Apps in Containers
Smart Cities and IOT: with opportunity comes risk
Open Source Storage Summit Tokyo 2016 - Storage for Containers
Heresey in the church of 12 factors

Recently uploaded (20)

PPTX
Digital-Transformation-Roadmap-for-Companies.pptx
PPTX
Cloud computing and distributed systems.
PPTX
MYSQL Presentation for SQL database connectivity
PDF
Chapter 3 Spatial Domain Image Processing.pdf
PPTX
Spectroscopy.pptx food analysis technology
PDF
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
PDF
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
PPTX
20250228 LYD VKU AI Blended-Learning.pptx
PPT
Teaching material agriculture food technology
PDF
Building Integrated photovoltaic BIPV_UPV.pdf
PDF
Approach and Philosophy of On baking technology
DOCX
The AUB Centre for AI in Media Proposal.docx
PDF
Electronic commerce courselecture one. Pdf
PDF
Dropbox Q2 2025 Financial Results & Investor Presentation
PDF
cuic standard and advanced reporting.pdf
PPTX
Programs and apps: productivity, graphics, security and other tools
PDF
Spectral efficient network and resource selection model in 5G networks
PDF
Machine learning based COVID-19 study performance prediction
PPTX
Machine Learning_overview_presentation.pptx
PDF
Per capita expenditure prediction using model stacking based on satellite ima...
Digital-Transformation-Roadmap-for-Companies.pptx
Cloud computing and distributed systems.
MYSQL Presentation for SQL database connectivity
Chapter 3 Spatial Domain Image Processing.pdf
Spectroscopy.pptx food analysis technology
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
20250228 LYD VKU AI Blended-Learning.pptx
Teaching material agriculture food technology
Building Integrated photovoltaic BIPV_UPV.pdf
Approach and Philosophy of On baking technology
The AUB Centre for AI in Media Proposal.docx
Electronic commerce courselecture one. Pdf
Dropbox Q2 2025 Financial Results & Investor Presentation
cuic standard and advanced reporting.pdf
Programs and apps: productivity, graphics, security and other tools
Spectral efficient network and resource selection model in 5G networks
Machine learning based COVID-19 study performance prediction
Machine Learning_overview_presentation.pptx
Per capita expenditure prediction using model stacking based on satellite ima...

Production grade edge computing on Kubernetes OSS EU 2018

  • 1. Production Grade Edge Computing Using Kubernetes Steve Wong @cantbewong Open Source Community Relations Engineer VMware Open Source Summit Europe October 23, 2018
  • 2. 2 Abstract ​Some applications benefit from moving closer to data ingest, or the user. Edge reduces local processing latency, and supports isolated operation. It also has challenges compared to the pooled resources, and single point of management of centralized clouds. You won’t achieve the Google Borg experience at an edge location - nor are you likely to need it. But with planning it is possible to achieve edge deployments that are secure, with predictable performance and “highly available enough” considering constraints on money, physical space, power, etc. Steve will provide specific recommendations related to architecture, networking, storage, patching, logging, disaster recovery, and remote manageability - based on using Kubernetes, and other open source tools and technology. This is a rapidly changing space, and Steve will also touch on some interesting proposals and work underway in the space.
  • 3. Agenda 3 ​Production Grade Kubernetes ​What Does It Mean? ​Critical Components in a Kubernetes Cluster ​Architecture of the control plane ​Impacts of Limited Resources at an Edge location ​Making the best of limited budget and facilities ​Kubernetes Configuration ​Defaults may not be appropriate for edge ​Security ​Considerations for edge ​Disaster Recovery & Backup ​Planning checklist ​Futures
  • 4. 4 What does it mean to be “Production Grade”? When you deploy to edge, you own 100% of this • The installation is secure • The deployment is managed with a repeatable and recorded process • Performance is predictable and consistent • Updates and configuration changes can be safely applied • Logging and monitoring is in place to detect and diagnose failures and resource shortages • Service is “highly available enough” considering available resources, including constraints on money, physical space, power, etc. • A recovery process is available, documented, and tested for use in the event of failures Photo attribution: By Tony Webster from Portland, Oregon, United States (212 days without recordable incident) [CC BY 2.0 (https://creativecommons.org/licenses/by/2.0)], via Wikimedia Commons
  • 5. 5 Kubernetes Cluster Architecture Kubernetes = a distributed system with a control plane and clustered worker nodes 
  • 6. 6 Control Plane Critical components component role effect of loss etcd Maintains state for all Kubernetes objects Loss of storage catastrophic. Loss of quorum = Kubernetes loses control plane. Read only API calls might continue to work. Existing workloads may continue to run. API server Provides API used internally and externally Can’t start, stop, update pods, services, replication controllers. Scheduler and Controller Manager down. Workloads continue if not dependent on API calls (operators, customer controllers, CRDs, etc.) scheduler Places pods on nodes No pod placements, priority, preemption. controller manager Runs many controllers Core control loops that regulate state cease
  • 7. 7 Keeping critical components available Recommendations: • Use redundancy • Hardware • Software • Enable rapid recovery • Backups • DR plans • Training and documentation • Security • Monitoring, Metrics, Logging • Automate operations • Installs • Updates The Risks: • Hardware failures • Software bugs • Bad updates • Human errors • Network outages • Intentional attacks • Overloaded systems resulting in resource exhaustion • Power, Cooling losses • Weather
  • 8. 8 On-premises deployments have finite resources even in public clouds, budget may limit what you choose to consume Dreams & Goals Resources a tricky balance Drawing attribution: By Б.Золзаяа [CC BY-SA 4.0 (https://creativecommons.org/licenses/by-sa/4.0)], from Wikimedia Commons
  • 9. 9 Kubernetes on a single hardware host Operating on the edge in more ways than one Minimum • Hardware • Dual disks – mirrored • Dual fans • UPS Recommended • Hypervisor • 3 node Kubernetes control plane • Resource governance • Hardware • Dual Power Supplies • ECC memory • 3 disks
  • 10. 10 Kubernetes on dual hardware hosts etcd quorum must be an odd number • No real availability advantage to splitting etcd across 2 nodes - put all etcd instances on one node Shared storage is advantageous for availability of control plane and workloads • External storage may be expensive • Software Defined storage will generally require a witness “2+1”
  • 11. 11 Kubernetes on 3 or more hardware hosts recommendation Put a control plane instance with etcd on each of 3 nodes • Loss of a node reduces capacity but does not bring down Kubernetes, and is recoverable
  • 12. 12 Kubernetes configuration settings Protect the system from resource overloads Throttle things like : • API call rates • Pods per node Reserve for system daemons Recommended for predictable, repeatable behavior: • Explicitly state resources on container specs • Explicitly configure out of resource behavior on nodes • Use namespace quotas
  • 13. 13 Security Recommendations • Certificates • Lock down worker nodes • Use an image repository with governance & security features • Utilize Pod Security Policies • Use RBAC to drive authorization decisions and enforcement • Consider physical security at edge locations • Storage encryption • Protection from attachment of malicious devices • Avoid use of plain text credentials (access key, token, passwords) • Consider security features when you choose your network solution • Logging and monitoring/metrics can contribute to security
  • 14. 14 Disaster Recovery Redundancy can help reduce outages but failures can still occur DR plan elements: • Backups • Availability of replacements • A planned process • People to carry it out • Training • Documentation of the procedure (runbooks) • Automation can help Backup concerns • etcd • Stateful workload storage • Certificates and keypairs • DNS records • IP/subnet assignments • Config files • Service accounts and creds Photo attribution: US Air Force photo, B-24 in cloud with flak, No. 2 engine smoking
  • 15. 15 Final Thoughts + Roadmap Kubernetes can be applied to edge – but doing so requires some care Join the Kubernetes Edge IoT working group to get involved • https://groups.google.com/forum/#!forum/kubernetes-wg-iot-edge Photo attribution: By Matti Blume [CC BY-SA 4.0 (https://creativecommons.org/licenses/by-sa/4.0)], from Wikimedia Commons There is opportunity to improve this • Issues with scale • Issues with control plane to worker node connectivity
  • 17. Questions Deck is here: More detailed coverage can be found in this blog post: https://kubernetes.io/blog/2018/08/03/out-of-the-clouds-onto-the-ground-how-to-make-kubernetes-pro duction-grade-anywhere/ Contact me later: Twitter: @cantbewong Kubernetes Slack: steve-wong ​References: