SlideShare a Scribd company logo
Container Federation Use Cases
Ricardo Rocha <ricardo.rocha@cern.ch>
CERN IT-CM-RPS
Motivation for Federation
Periodic Load Spikes
International Conferences, Reconstruction Campaigns
Simplification
Monitoring, Lifecycle, Alarms
Deployment
Uniform API, Replication, Load Balancing
Sched Collector
Negotiator
StartD
AcctGroup = "ATLAS"
JobPrio = 0
RequestCpus = 2
RequestMemory = 4260
...
CERNEnvironment = “production”
Datacenter = “meyrin”
HasMPI = true
TotalCpus = 8
TotalMemory = 22500
...
Matchmaking with ClassAds
Fair Share
Preemption
Running Virtualized
Extensive Experience in HEP
External Storage and Networking
Sched Collector
Negotiator
StartD
AcctGroup = "ATLAS"
JobPrio = 0
RequestCpus = 2
RequestMemory = 4260
...
CERNEnvironment = “production”
Datacenter = “meyrin”
HasMPI = true
TotalCpus = 8
TotalMemory = 22500
...
Matchmaking with ClassAds
Fair Share
Preemption
Running Virtualized
Extensive Experience in HEP
External Storage and Networking
Sched
Negotiator
Collector
Host
kubefed init fed --host-cluster-context=condor-host ...
kind: DaemonSet
...
hostNetwork: true
containers:
- name: condor-startd
image: .../cloud/condor-startd
command: ["/usr/sbin/condor_startd", "-f"]
securityContext:
privileged: true
livenessProbe:
exec:
command:
- condor_who
Sched
Negotiator
Collector
Host
StartD
...
StartD
...
StartD
...
kubefed init fed --host-cluster-context=condor-host ...
kubefed join --context fed tsystems 
--host-cluster-context condor-host --cluster-context tsystems
REANA / RECAST
Reusable Analysis Platform
Workflow Engine (Yadage)
Each step a Kubernetes Job
Integrated Monitoring & Logging
Centralized Log Collection
https://github.com/reanahubhttps://github.com/recast-hep https://github.com/diana-hep/yadage
Summary
• Federation support in Kubernetes is ready
• Ongoing development for the v2 API, with significant changes
• Work with the community to have our use case taken into account
• Initial use cases
• Started with a legacy application, limited integration
• Expanded to a cloud native implementation, with great results
• Ongoing work to expand its usage
• Use built-in “Kubernetes As a Service” capabilities
• Investigations on how to best handle data distribution
https://www.youtube.com/watch?v=jNyd97LiTXk

More Related Content

PDF
Integrating vert.x v2
PPTX
Company Presentation - ClusterVision
PDF
Pimp Your Pipeline - Central Configuration Management - Jens Saade
PDF
Codecamp2015 pimp yourpipeline-saade-jens-1.1
PDF
Camel on Cloud by Christina Lin
PPTX
Nike tech talk.2
PPTX
SnappyData overview NikeTechTalk 11/19/15
PDF
OpenStack Training | OpenStack Tutorial For Beginners | OpenStack Certificati...
Integrating vert.x v2
Company Presentation - ClusterVision
Pimp Your Pipeline - Central Configuration Management - Jens Saade
Codecamp2015 pimp yourpipeline-saade-jens-1.1
Camel on Cloud by Christina Lin
Nike tech talk.2
SnappyData overview NikeTechTalk 11/19/15
OpenStack Training | OpenStack Tutorial For Beginners | OpenStack Certificati...

Similar to Container Federation Use Cases (20)

PDF
The value of reactive
PDF
The Value of Reactive
PDF
Breaking a monolith: In-place refactoring with service-oriented architecture ...
PDF
LF_APIStrat17_Breaking a Monolith: In-Place Refactoring with Service-Oriented...
PDF
What the CRaC - Superfast JVM startup
PDF
OpenStack Tutorial For Beginners | OpenStack Tutorial | OpenStack Training | ...
PPTX
OpenACC Monthly Highlights - February 2018
PDF
OpenStack Magnum
PDF
F5 Automation and service discovery
PPTX
OpenACC Monthly Highlights September 2019
PDF
Microservices Application Tracing Standards and Simulators - Adrians at OSCON
PPTX
OpenACC and Open Hackathons Monthly Highlights: April 2022
PDF
Ampere Altra + LINBIT SDS + CloudStack
PDF
Web Scale Reasoning and the LarKC Project
PDF
AdaEurope 2013 - Berlin - Industrial Experience - Feedback on improving perfo...
PPTX
Eco4Cloud - Company Presentation
PPTX
Developer Intro to OpenShift
PDF
Mist - Serverless proxy to Apache Spark
PDF
Data Summer Conf 2018, “Mist – Serverless proxy for Apache Spark (RUS)” — Vad...
PDF
Apache Spark - Intro to Large-scale recommendations with Apache Spark and Python
The value of reactive
The Value of Reactive
Breaking a monolith: In-place refactoring with service-oriented architecture ...
LF_APIStrat17_Breaking a Monolith: In-Place Refactoring with Service-Oriented...
What the CRaC - Superfast JVM startup
OpenStack Tutorial For Beginners | OpenStack Tutorial | OpenStack Training | ...
OpenACC Monthly Highlights - February 2018
OpenStack Magnum
F5 Automation and service discovery
OpenACC Monthly Highlights September 2019
Microservices Application Tracing Standards and Simulators - Adrians at OSCON
OpenACC and Open Hackathons Monthly Highlights: April 2022
Ampere Altra + LINBIT SDS + CloudStack
Web Scale Reasoning and the LarKC Project
AdaEurope 2013 - Berlin - Industrial Experience - Feedback on improving perfo...
Eco4Cloud - Company Presentation
Developer Intro to OpenShift
Mist - Serverless proxy to Apache Spark
Data Summer Conf 2018, “Mist – Serverless proxy for Apache Spark (RUS)” — Vad...
Apache Spark - Intro to Large-scale recommendations with Apache Spark and Python
Ad

More from Helix Nebula The Science Cloud (20)

PDF
M-PIL-3.2 Public Session
PDF
Deep Learning for Fast Simulation
PDF
Interactive Data Analysis for End Users on HN Science Cloud
PDF
CERN Batch in the HNSciCloud
PDF
LHCb on RHEA and T-Systems
PDF
HNSciCloud CMS status-report
PDF
Helix Nebula Science Cloud usage by ALICE
PDF
Hybrid cloud for science
PDF
HNSciCloud PILOT PLATFORM OVERVIEW
PDF
HNSciCloud Overview
PDF
This Helix Nebula Science Cloud Pilot Phase Open Session
PDF
Cloud Services for Education - HNSciCloud applied to the UP2U project
PDF
Network experiences with Public Cloud Services @ TNC2017
PDF
EOSC in practice - Silvana Muscella (chair EOSC HLEG)
PDF
Helix Nebula Science Cloud Pilot Phase, 6 February 2018, Bologna, Italy
PDF
Pilot phase Award Ceremony - INFN Introduction and welcome
PDF
Early adopter group and closing of webinar - João Fernandes (CERN)
PDF
HNSciCloud pilot phase - Andrea Chierici (INFN)
PDF
Pilot phase Award Ceremony - T-Systems
PDF
Pilot phase Award Ceremony - RHEA
M-PIL-3.2 Public Session
Deep Learning for Fast Simulation
Interactive Data Analysis for End Users on HN Science Cloud
CERN Batch in the HNSciCloud
LHCb on RHEA and T-Systems
HNSciCloud CMS status-report
Helix Nebula Science Cloud usage by ALICE
Hybrid cloud for science
HNSciCloud PILOT PLATFORM OVERVIEW
HNSciCloud Overview
This Helix Nebula Science Cloud Pilot Phase Open Session
Cloud Services for Education - HNSciCloud applied to the UP2U project
Network experiences with Public Cloud Services @ TNC2017
EOSC in practice - Silvana Muscella (chair EOSC HLEG)
Helix Nebula Science Cloud Pilot Phase, 6 February 2018, Bologna, Italy
Pilot phase Award Ceremony - INFN Introduction and welcome
Early adopter group and closing of webinar - João Fernandes (CERN)
HNSciCloud pilot phase - Andrea Chierici (INFN)
Pilot phase Award Ceremony - T-Systems
Pilot phase Award Ceremony - RHEA
Ad

Recently uploaded (20)

PPTX
Reimagine Home Health with the Power of Agentic AI​
PDF
Adobe Illustrator 28.6 Crack My Vision of Vector Design
PDF
How to Migrate SBCGlobal Email to Yahoo Easily
PPTX
Agentic AI : A Practical Guide. Undersating, Implementing and Scaling Autono...
PPTX
Oracle E-Business Suite: A Comprehensive Guide for Modern Enterprises
PPTX
Odoo POS Development Services by CandidRoot Solutions
PPTX
Agentic AI Use Case- Contract Lifecycle Management (CLM).pptx
PPTX
history of c programming in notes for students .pptx
PDF
wealthsignaloriginal-com-DS-text-... (1).pdf
PDF
Odoo Companies in India – Driving Business Transformation.pdf
PPTX
Transform Your Business with a Software ERP System
PDF
Softaken Excel to vCard Converter Software.pdf
PDF
Design an Analysis of Algorithms II-SECS-1021-03
PPTX
Introduction to Artificial Intelligence
PPTX
Operating system designcfffgfgggggggvggggggggg
PDF
Audit Checklist Design Aligning with ISO, IATF, and Industry Standards — Omne...
PDF
Understanding Forklifts - TECH EHS Solution
PDF
Raksha Bandhan Grocery Pricing Trends in India 2025.pdf
PDF
Flood Susceptibility Mapping Using Image-Based 2D-CNN Deep Learnin. Overview ...
PDF
System and Network Administration Chapter 2
Reimagine Home Health with the Power of Agentic AI​
Adobe Illustrator 28.6 Crack My Vision of Vector Design
How to Migrate SBCGlobal Email to Yahoo Easily
Agentic AI : A Practical Guide. Undersating, Implementing and Scaling Autono...
Oracle E-Business Suite: A Comprehensive Guide for Modern Enterprises
Odoo POS Development Services by CandidRoot Solutions
Agentic AI Use Case- Contract Lifecycle Management (CLM).pptx
history of c programming in notes for students .pptx
wealthsignaloriginal-com-DS-text-... (1).pdf
Odoo Companies in India – Driving Business Transformation.pdf
Transform Your Business with a Software ERP System
Softaken Excel to vCard Converter Software.pdf
Design an Analysis of Algorithms II-SECS-1021-03
Introduction to Artificial Intelligence
Operating system designcfffgfgggggggvggggggggg
Audit Checklist Design Aligning with ISO, IATF, and Industry Standards — Omne...
Understanding Forklifts - TECH EHS Solution
Raksha Bandhan Grocery Pricing Trends in India 2025.pdf
Flood Susceptibility Mapping Using Image-Based 2D-CNN Deep Learnin. Overview ...
System and Network Administration Chapter 2

Container Federation Use Cases

  • 1. Container Federation Use Cases Ricardo Rocha <ricardo.rocha@cern.ch> CERN IT-CM-RPS
  • 2. Motivation for Federation Periodic Load Spikes International Conferences, Reconstruction Campaigns Simplification Monitoring, Lifecycle, Alarms Deployment Uniform API, Replication, Load Balancing
  • 3. Sched Collector Negotiator StartD AcctGroup = "ATLAS" JobPrio = 0 RequestCpus = 2 RequestMemory = 4260 ... CERNEnvironment = “production” Datacenter = “meyrin” HasMPI = true TotalCpus = 8 TotalMemory = 22500 ... Matchmaking with ClassAds Fair Share Preemption Running Virtualized Extensive Experience in HEP External Storage and Networking
  • 4. Sched Collector Negotiator StartD AcctGroup = "ATLAS" JobPrio = 0 RequestCpus = 2 RequestMemory = 4260 ... CERNEnvironment = “production” Datacenter = “meyrin” HasMPI = true TotalCpus = 8 TotalMemory = 22500 ... Matchmaking with ClassAds Fair Share Preemption Running Virtualized Extensive Experience in HEP External Storage and Networking
  • 5. Sched Negotiator Collector Host kubefed init fed --host-cluster-context=condor-host ...
  • 6. kind: DaemonSet ... hostNetwork: true containers: - name: condor-startd image: .../cloud/condor-startd command: ["/usr/sbin/condor_startd", "-f"] securityContext: privileged: true livenessProbe: exec: command: - condor_who Sched Negotiator Collector Host StartD ... StartD ... StartD ... kubefed init fed --host-cluster-context=condor-host ... kubefed join --context fed tsystems --host-cluster-context condor-host --cluster-context tsystems
  • 7. REANA / RECAST Reusable Analysis Platform Workflow Engine (Yadage) Each step a Kubernetes Job Integrated Monitoring & Logging Centralized Log Collection https://github.com/reanahubhttps://github.com/recast-hep https://github.com/diana-hep/yadage
  • 8. Summary • Federation support in Kubernetes is ready • Ongoing development for the v2 API, with significant changes • Work with the community to have our use case taken into account • Initial use cases • Started with a legacy application, limited integration • Expanded to a cloud native implementation, with great results • Ongoing work to expand its usage • Use built-in “Kubernetes As a Service” capabilities • Investigations on how to best handle data distribution