Jonathan Churchill, Campus network engineering workshop
19/10/2016 100G networking within JASMIN
100G networking within JASMIN
Campus network engineering for data-intensive science workshop
October 19th 2016
Jonathan Churchill
JASMIN Infrastructure Manager
(STFC Scientific Computing Dept.)
JASMIN is a world-leading, unique hybrid of:
• 16PB high performance storage (~250GByte/s)
• High-performance computing (~4,000 cores)
• 35PB Archive and Elastic Tape
• Non-blocking networking (> 3 Tbit/s) and Optical Private Network WANs
• Coupled with cloud hosting capabilities
Cloud is here!
17PB
40PB
5,000
JASMIN 1
JASMIN 2
JASMIN 3
DMZ
Cloud Lives here
JASMIN 4, 5 (2016–20 …)
Storage and servers distributed over the fabric network.
JASMIN 3.5
LOTUS
100G lives here
JASMIN “Fabric” Networking
The need for speed
347Gbps
347Gbps = 34,700 Broadband connections
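The broadband comparison works out if each connection is taken as 10 Mbit/s. That per-connection rate is not stated on the slide; it is inferred from the two figures. A minimal sketch of the arithmetic (the function name is illustrative):

```python
def equivalent_connections(aggregate_gbps: float, per_connection_mbps: float) -> int:
    """Number of connections of a given speed that add up to the aggregate rate."""
    return round(aggregate_gbps * 1000 / per_connection_mbps)


# 347 Gbit/s from the slide; 10 Mbit/s per connection is an inferred assumption.
connections = equivalent_connections(347, 10)
print(f"{connections:,} connections")
```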
[Diagram: switches labelled JC2-LSW1 and six switches labelled JC2-SP1]
16 x MSX1024B-1BFS, each 48 x 10GbE + 12 x 40GbE
48 x 16 = 768 10GbE ports, non-blocking
16 x 12 x 40GbE = 192 40GbE ports
S1036 = 32 x 40GbE
192 ports / 32 = 6
Total 192 40GbE cables
1,900 @ 10GbE ports
• Non-blocking. Zero contention (48 x 10Gb = 12 x 40Gb uplinks)
• Low latency (250 ns L3 per switch/router)
• Cheap(er)
• But it's all Layer 3 routed (ECMP OSPF)
954 Routes
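The port arithmetic on this slide can be checked with a short sketch. The port counts and speeds are the slide's own. The class and function names are illustrative, and reading "192 ports / 32 = 6" as the number of 32-port switches needed to terminate every uplink is an interpretation.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class LeafSwitch:
    """Port counts for one leaf switch/router (values taken from the slide)."""
    access_ports: int   # host-facing ports
    access_gbps: int    # speed of each host-facing port
    uplink_ports: int   # fabric-facing ports
    uplink_gbps: int    # speed of each uplink

    def contention_ratio(self) -> float:
        """Host-facing capacity divided by uplink capacity; 1.0 means non-blocking."""
        return (self.access_ports * self.access_gbps) / (self.uplink_ports * self.uplink_gbps)


def fabric_totals(leaf: LeafSwitch, leaf_count: int, spine_ports: int) -> dict:
    """Total ports across the fabric and the switches needed to terminate all uplinks."""
    total_uplinks = leaf.uplink_ports * leaf_count
    return {
        "access_ports": leaf.access_ports * leaf_count,
        "uplinks": total_uplinks,
        "spines": -(-total_uplinks // spine_ports),  # ceiling division
    }


leaf = LeafSwitch(access_ports=48, access_gbps=10, uplink_ports=12, uplink_gbps=40)
totals = fabric_totals(leaf, leaf_count=16, spine_ports=32)
print(leaf.contention_ratio(), totals)
```

A contention ratio of 1.0 is what the slide calls "zero contention": 480 Gbit/s of host-facing capacity per switch is matched by 480 Gbit/s of uplinks.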
Bandwidth?
Data via the DTZ
Through the IaaS firewall
DTZ bandwidth 1:1 match to IaaS hypervisors.
Data rates inside PaaS = IaaS?
• How can we give IaaS cloud tenants data access at rates similar to those “inside” JASMIN (PaaS)?
– i.e. 100 Gbit/s
Non-blocking data access inside JASMIN
SP1 SP2 SP3 SP4
LSW1
host001
host002
host024
iSCSI
Underlay networks
LSW2
host025
host026
host027
LSW3
172.26.66.64/26 172.26.66.0/26
LSWn
~10x 10-12Gbps per “Bladeset”
24x 10Gbps
172.16.136.0/24 172.16.137.0/24
12x 40Gb ECMP uplinks
per switch/router
• Non-blocking access to the IaaS cloud needs to duplicate or fit into this fabric.
• And still be 1:1 using 10Gb servers.
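The /26 underlay subnets on this slide can be sanity-checked against the 24 hosts shown per leaf switch. This is a minimal sketch using Python's standard `ipaddress` module. It assumes one subnet per leaf switch, which the diagram suggests but does not state.

```python
import ipaddress

# Subnets as printed on the slide; one subnet per leaf switch is an assumption.
underlay = [ipaddress.ip_network(cidr) for cidr in ("172.26.66.0/26", "172.26.66.64/26")]

HOSTS_PER_LEAF = 24  # host001 .. host024 on the slide


def usable_hosts(network: ipaddress.IPv4Network) -> int:
    """Addresses left after removing the network and broadcast addresses."""
    return network.num_addresses - 2


for net in underlay:
    print(net, usable_hosts(net), usable_hosts(net) >= HOSTS_PER_LEAF)

# The two /26 blocks are adjacent halves of a single /25.
summary = list(ipaddress.collapse_addresses(underlay))
print(summary)
```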
Non-blocking data access to JASMIN IaaS via 100G?
LSW21
SP1 SP2 SP3 SP4
LSW1
host001
host002
host024
iSCSI
Underlay networks
LSW2
host025
host026
host027
LSW3
172.26.66.64/26 172.26.66.0/26
LSW20
1:10 server to client
LSWn
~10x 10-12Gbps per “Bladeset”
24x 10Gbps
172.16.136.0/24 172.16.137.0/24
host-100G-1
host-100G-2
12x 40Gb ECMP uplinks
per switch/router
vmhost1
vmhost2
vmhost24
24x 10Gbps
“Blessed” private subnet
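The "1:10 server to client" ratio can be turned into a rough sizing check. This sketch assumes every hypervisor drives its 10 Gbit/s link at full rate and that each 100G server has two usable ports. Both are illustrative assumptions, as are the function names.

```python
def ports_needed(client_count: int, client_gbps: int, port_gbps: int) -> int:
    """Server ports required to match the aggregate client bandwidth 1:1."""
    aggregate = client_count * client_gbps
    return -(-aggregate // port_gbps)  # ceiling division


def servers_needed(port_count: int, ports_per_server: int) -> int:
    """Servers required to provide the given number of ports."""
    return -(-port_count // ports_per_server)


# 24 hypervisors at 10 Gbit/s each (vmhost1 .. vmhost24), served over 100 Gbit/s ports.
ports = ports_needed(client_count=24, client_gbps=10, port_gbps=100)
servers = servers_needed(ports, ports_per_server=2)  # dual-port NICs assumed
print(ports, servers)
```

Under these assumptions the result is consistent with the two hosts in the diagram (host-100G-1 and host-100G-2), although the slide does not say that this is how the number was chosen.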
Hardware
• Mellanox ConnectX-4 dual-port 100Gb QSFP28 DA
– Dell R730XD servers.
– VXLAN/NVGRE and erasure coding offload in h/w
• Dual Mellanox MSN2100 16-port 100G switch/routers
Potential Issues
• Blocking “backdoor” access across the fabric
– Port ingress/egress ACLs?
• … but trunked VLANs or VXLANs at hypervisor port(s)
• Performance impact of VXLAN terminations
• Achieving 100G on the server at all
– cf. the 1 -> 10Gb kernel tuning transition
– 2x 100Gb ports > PCIe 3.0 bandwidth (limited to 120Gb)
• Can the software keep up?
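The PCIe ceiling can be estimated from the PCIe 3.0 line rate (8 GT/s per lane with 128b/130b encoding). This sketch assumes the NIC sits in a x16 slot, which the slide does not state. The result is a theoretical maximum before protocol overhead, so the slide's lower 120Gb figure is plausibly the usable rate.

```python
def pcie_gbps(gt_per_s: float, lanes: int, payload_bits: int = 128, line_bits: int = 130) -> float:
    """Theoretical one-direction bandwidth after line encoding, in Gbit/s."""
    return gt_per_s * lanes * payload_bits / line_bits


# PCIe 3.0: 8 GT/s per lane, 128b/130b encoding; x16 slot is an assumption.
pcie3_x16 = pcie_gbps(gt_per_s=8.0, lanes=16)
nic_line_rate = 2 * 100  # two 100 Gbit/s ports

print(f"PCIe 3.0 x16: {pcie3_x16:.1f} Gbit/s, NIC line rate: {nic_line_rate} Gbit/s")
print("slot is the bottleneck:", nic_line_rate > pcie3_x16)
```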
100G server software ?
host-100G-1
tomcat-1 tomcat-2 tomcat-3 tomcat-4
apache/nginx Load Balancing
• OPeNDAP
– Parallel servers and threads?
– CPU and RAM implications
– JVM memory issues?
Summary
• Target:
Provide “non-blocking” data access to JASMIN IaaS.
• Use of 100Gb networking:
– Reduces server count
– Scalable for growing infrastructure
• Experimental. Many potential issues to resolve:
– Fabric routing egress/ingress ACLs
– 100G kernel tuning?
– Can the software keep up?
Questions


Editor's Notes

  • #4: JASMIN = Joint Analysis System Meeting Infrastructure Needs. A “Super Data Cluster”, not a “Super Computer”: the data analysis “missing link” between environmental data sources (satellites, aircraft, in-situ etc.) and supercomputer simulations.
    Funded by NERC for all of NERC sciences. User and science management and operations by the STFC RAL Space ‘CEDA’ team (Centre for Environmental Data Analysis). Physically hosted, procured and operated at STFC RAL by the Research Infrastructure group of the SCD.
    Built in multiple phases since late 2011. Plans for expansion to 50PB on disc, and matching on tape, to 2020.
    One of NERC's 3 major compute infrastructures (along with the supercomputers Archer @ EPCC and MonSoon @ MetOffice Exeter).
    Uses:
    * Satellite data and weather data, e.g. JASMIN is the UK data hub for the ESA ‘Sentinel’ missions (2-10 TBytes per day).
    * Climate simulation output from the biggest supercomputers (“Archer”, MetOffice “Monsoon”, DKRZ). Primary use is dataset intercomparison, e.g. JASMIN holds >60% of the data used by the latest IPCC report on climate change. Largest data set is 600TB (10-year climate simulation from DKRZ).
    * Oceanography, landslip, forestry fires … plus other environmental research, e.g. genetic data from bugs in the environment (Environmental Genomics).
    * Supports 5,000 worldwide users (download services) and ~1,000 direct login users.
    Stats:
    * 16 PetaBytes useable high performance spinning disc (20PB raw) ~ 3,200,000 DVDs = ~3.8 km high tower of DVDs or > 30,000 years of MP3.
    * Two largest Panasas ‘realms’ in the world (109 and 125 shelves). Largest single-site Panasas installation in the world and Panasas’ largest customer.
    * 900TB useable (1.44PB raw) NetApp iSCSI/NFS for virtualisation + Dell EqualLogic PS6210XS for high-IOPS, low-latency iSCSI.
    * 4,500 CPU cores split dynamically between batch cluster and cloud/virtualisation (VMware vCloud Director and vCenter/vSphere).
    * 35PB tape archive and Elastic Tape.
    * 40 racks.
    * >3 Tbit/s bandwidth (~3,500 DVDs per minute). IO capability of ~250 GBytes/sec, arguably in the top 10 in the world for IO performance.
    * “Hyper” converged network infrastructure: 10GbE + MPI low latency (~8 µs) + iSCSI over the same network fabric (i.e. no separate SAN or InfiniBand).
    Finalist for BCS UK Industry Awards “Big Data Project of the Year” 2012 and 2014. (Won the 2012 SVC industry Public Sector Storage Project of the Year award.)
    Designed throughout to be installed, operated and upgraded by a very small (SCD) team, currently 3 FTE. The team designs, procures and integrates all components separately for best-of-breed solutions. You cannot buy JASMIN off the shelf (although several have tried, including Crick and MetOffice).
    Capital spending to date of ~£14M on a tick-tock model (~£3-5M / ~£1M), assumed to continue into the future.
  • #8: There are no traditional “network problems” (i.e. congestion) in JASMIN!