SlideShare a Scribd company logo
Upgrade of the CMS Event Builder
Andrea Petrucci - CERN (PH/CMD) on behalf of the CMS DAQ group
19th International Conference on Computing in High Energy and Nuclear Physics
CHEP 2012, 21-25 May 2012, New York, United States
CHEP 2012, 21-25 May 2012, Ney York, USA - Upgrade of the CMS Event Builder - Andrea Petrucci - CERN (PH/CMD) 2
 Introduction to CMS upgrade
 Current DAQ system
 Architecture of the upgrade DAQ system
 Feasibility studies on advanced networking
technologies for event builder
 Network technologies
 CMS online software
 Test setups
 Preliminary measurements
 Summary
Outline
CHEP 2012, 21-25 May 2012, Ney York, USA - Upgrade of the CMS Event Builder - Andrea Petrucci - CERN (PH/CMD) 3
CMS Upgrade
CHEP 2012, 21-25 May 2012, Ney York, USA - Upgrade of the CMS Event Builder - Andrea Petrucci - CERN (PH/CMD) 4
CMS Upgrade timeline
First beam in CMS September 10th
First collisions in CMS November 23st
First 7 TeV collisions in CMS March 30th
Magnet
Test at surface
5fb -1
collected
14Tev7Tev LS1 LS2
Higher pileup
(depending on LHC operation 2x 1034 at 25 ns or 50 ns bunch
spacing)
Pileup could reach 100 underlying interactions
More data will be generated due to higher occupancy
Install Pixel 4th barrel
layer and 3rd endcap
layer
Upgrade of detector:
 HCAL introduces uTCA based readout system
 Trigger system will be upgraded: the new system
will be first operated in parallel to the existing
system. Readout will be based in uTCA.
DAQ: new optical link to readout uTCA based
systems
8Tev LS1.5
CHEP 2012, 21-25 May 2012, Ney York, USA - Upgrade of the CMS Event Builder - Andrea Petrucci - CERN (PH/CMD) 5
 Aging of existing hardware (PCs and Networks at least
5 years old)
 Accommodate sub-detectors with upgraded off-detector
electronics
 HCAL upgrade front-end electronics and readout system
 New Pixel detector with 4 barrel layers and 3 endcap layers
(LS1.5), estimated requirements: ~80 readout channels with
640MB/s
 Trigger system will be upgraded: the new system will be first
operated in parallel to the existing system.
Motivations for upgrade of CMS DAQ
CHEP 2012, 21-25 May 2012, Ney York, USA - Upgrade of the CMS Event Builder - Andrea Petrucci - CERN (PH/CMD) 6
Current CMS Data Acquisition
System
CHEP 2012, 21-25 May 2012, Ney York, USA - Upgrade of the CMS Event Builder - Andrea Petrucci - CERN (PH/CMD) 7
Current DAQ system
Detector Front-Ends
Computing Services
Readout
Systems
Filter
Systems
Event
Manager
Level 1
Trigger
Control
and
Monitor
Builder Network
40 MHz
100 kHz
100 GB/s
~700
100’s Hz
13000’s cores
LV1
HLT
µs
sec
 Read out 700 detector front-ends (fragment size ~2 kB)
 Build complete events at 100 kHz ( L1 trigger rate)
 Make them available to a filter farm of O(13000) cores
 Store 100’s of Hz to disk (10’s of TB/day)
 Scalable system employing commercial components wherever
possible
 Proprietary / Commercial: Front-Ends, VME, PCI, PC servers, networks, Protocols,
CHEP 2012, 21-25 May 2012, Ney York, USA - Upgrade of the CMS Event Builder - Andrea Petrucci - CERN (PH/CMD) 8
Current DAQ system (II)
72 FED builders:
assemble FED
(~2 kB) into
super-fragment
(~16 kB)
Eight RU builders (EVB DAQ slices):
Assemble super-fragments (~16 kB) into full events (~1 MB) at 12.5
kHz each
12.5 kHz +12.5 kHz
100 kHz
CHEP 2012, 21-25 May 2012, Ney York, USA - Upgrade of the CMS Event Builder - Andrea Petrucci - CERN (PH/CMD) 9
 Full readout of all sub-detectors at L1 rate
 SLINK64 operates on 400 MB/s
 Fed builder operates on ~200-350 MB/s, depending on event
size distribution
 8 DAQ Slices, 100 GB/s event builder
 Event Filter
 ~1300 dual-CPU (multi-core) PC nodes, 13 k-cores, 220 kHEPSpec06
 running about ~20 k HLT processes, CPU budget is ~175 ms/evt
@100 kHz.
 Storage Manager
 Event data to disk
 pp: ~200 MB/s, design for 600 MB/s
 Heavy Ions: ~1,4 GB/s (up to 2.8 GB/s w/o transfer)
Current DAQ system (III)
CHEP 2012, 21-25 May 2012, Ney York, USA - Upgrade of the CMS Event Builder - Andrea Petrucci - CERN (PH/CMD) 10
Upgrade of CMS Data Acquisition
System for long shutdown 1
CHEP 2012, 21-25 May 2012, Ney York, USA - Upgrade of the CMS Event Builder - Andrea Petrucci - CERN (PH/CMD) 11
Requirements
 L1 rate of 100 KHz
 Detector Front End Drivers (FED)
 ~552 Legacy FEDs (fragment size increases from 2 kB to 4 kB due to pile-up)
 37 (TRG, HCAL, HF) + 40 (Pixel - 2 x 10 GbE links) new readout links from new
FEDs (expected maximal fragment size 8kB)
 Frontend Readout Links (FRLs)
 ~360 FRLs (Legacy FEDs, ~400 MB/s)
 ~120 FRLs (new FEDs, ~640 MB/s)
 High availability (redundancy, load balancing, failover, etc.)
Upgrade of DAQ system
DAQ plans for upgrade
 Replace myrinet-based fedbuilder
 Replace event builder network
 two options: conservative (10 GbE) and aggressive (40 GbE or
Infiniband)
 New architecture between Event Builder and Filter Farm
 Replace of the Storage manager hardware
CHEP 2012, 21-25 May 2012, Ney York, USA - Upgrade of the CMS Event Builder - Andrea Petrucci - CERN (PH/CMD) 12
 New readout system must be compatible with old data sources
(FEDs – SLINK) and with the newly designed FEDs.
Read out link
Sender
Mezzanine
SLINK-64
Cable
up to 10m,
400 MB/s
Frontend Readout Link
reads 1 or 2 links
Interfaces to Myrinet NIC
Build a new PCI card for the existing FRL
 Will implement new 10GB link to the
RU
 Will be able to receive data via a
fiber from newly built uTCA FEDs
 Pros: Small change wrt existing
system. Rely on stable existing
hardware
 Cons: bottle neck of internal PCI bus
(520MB/s per FRL if reads SLINK)
link to RU
link from uTCA FED
CHEP 2012, 21-25 May 2012, Ney York, USA - Upgrade of the CMS Event Builder - Andrea Petrucci - CERN (PH/CMD) 13
New Data to surface (D2S)
 Moving from myrinet 2x
2Gb/s to 10 Gb Ethernet
 Point to point protocol used
between FRL and RU
 Concentrate FRLs sources in
USC switch or/and RU
CHEP 2012, 21-25 May 2012, Ney York, USA - Upgrade of the CMS Event Builder - Andrea Petrucci - CERN (PH/CMD) 14
Event builder network: conservative option
 Sources (in total 480)
 At 100 kHz 2 to 4 KB fragments (legacy
FEDs)
 At 100 kHz 4 to 8 KB fragments (new FEDs)
 Input NIC 2 - 10 GbE ports
 Required throughput per link100 kHz, 2 to 4
kB per fragment
 PC concentrates by factor 2 for legacy
FEDs
 360 FRLs  180 RUs
 Output NIC – 10 GbE port
 Required throughput per link 100 kHz, 4 to
8 kB
 EVB configuration
 300 RU PCs
 300 x 300 switch 10 GbE
~360 FRLs (legacy FEDs, ~400 MB/s)
~120 FRLs (new FEDs, ~640 MB/s)
I/O switches
200m fiber
2-ports NIC - 10 GbE
RU-PC
port NIC - 10 GbE
port NIC - 10 GbE
BU-PC
I/O switches
CHEP 2012, 21-25 May 2012, Ney York, USA - Upgrade of the CMS Event Builder - Andrea Petrucci - CERN (PH/CMD) 15
 Sources (in total 480)
 At 100 kHz 2 to 4 KB fragments (legacy
FEDs)
 At 100 kHz 4 to 8 KB fragments (new FEDs)
 Front Switch
 Concentrate by factor of 2 for legacy FEDs
 Input NIC 4 – 10 GbE ports
 Required throughput per link 200 kHz, 2 to 4
kB
 Required throughput per link 100 kHz, 4 to 8
kB
 PC concentrates by factor 4
 Output NIC – 40 GbE/ Infiniband port
 Required throughput per link 100 kHz, 16 to
32 kB
 EVB configuration
~360 FRLs (legacy FEDs, ~400 MB/s)
~120 FRLs (new FEDs, ~640 MB/s)
I/O switches
200m fiber
4-ports NIC - 10 GbE
RU-PC
port NIC - 40 GbE / Infiniband
port NIC - 40 GbE / Infiniband
BU-PC
Event builder network: aggressive option
I/O switches
CHEP 2012, 21-25 May 2012, Ney York, USA - Upgrade of the CMS Event Builder - Andrea Petrucci - CERN (PH/CMD) 16
New Builder Units, Filter Unites and Storage Managers
configuration
 Builder Units/Filter Units
 Add 10 GbE switch to
connect BUs to Fus
 2 links 10 GbE from BU and
1 link 10 GbE for each FU
 Storage Managers
 1 link 10 GbE from BU-FU
switch to SM switch
 3 links 10 GbE for each
Storage Manager
CHEP 2012, 21-25 May 2012, Ney York, USA - Upgrade of the CMS Event Builder - Andrea Petrucci - CERN (PH/CMD) 17
Feasibility studies on advanced
networking technologies for event
builder
CHEP 2012, 21-25 May 2012, Ney York, USA - Upgrade of the CMS Event Builder - Andrea Petrucci - CERN (PH/CMD) 18
Our feasibility studies are focused in two network
technologies:
 Ethernet
 10/40 Gigabit Ethernet (different vendors)
 iWARP (RDMA) – TCP/IP full offload (Chelsio T4 Unified Wire
Adapters)
 performance measurements using TCP/IP and DAPL (Direct
Access Programming Library- OpenFabrics)
 Infiniband
 4x quad data rate (QDR)
 40 Gb/s - 8B/10B encoding -32 Gb/s data rate
 4x fourteen data rate (FDR)
 56 Gb/s – 64B/66B encoding – 54.54 Gb/s data rate
 performance measurements using DAPL (Direct Access
Programming Library- OpenFabrics) and IPoIB (IP over InfiniBand)
Feasibility studies on advanced networking
technologies
CHEP 2012, 21-25 May 2012, Ney York, USA - Upgrade of the CMS Event Builder - Andrea Petrucci - CERN (PH/CMD) 19
A unified, cross-platform, transport-independent software stack for
RDMA and kernel bypass
 http://www.openfabrics.org/
The OFED Stack (source: OpenFabrics Alliance)
CHEP 2012, 21-25 May 2012, Ney York, USA - Upgrade of the CMS Event Builder - Andrea Petrucci - CERN (PH/CMD) 20
 Developed by DAT collaborative
 http://www.datcollaborative.org/
 Transport and platform (OS) independent
 Define user (uDAPL) and kernel (kDAPL) APIs
 DAT supports reliable connection
 Data Transfer Operations send, receive, rdma_read, rdma_write
 uDAPL Version 2.x, January, 2007
DAT Model (source DAT Collaborative)
CHEP 2012, 21-25 May 2012, Ney York, USA - Upgrade of the CMS Event Builder - Andrea Petrucci - CERN (PH/CMD) 21
 The protocol is defined as a very thin set of zero copy functions
when compared to thicker protocol implementations such as
TCP/IP
Comparison of the Stacks (Infiniband vs TCP/IP)
CHEP 2012, 21-25 May 2012, Ney York, USA - Upgrade of the CMS Event Builder - Andrea Petrucci - CERN (PH/CMD) 22
CMS online software for data
acquisition system
CHEP 2012, 21-25 May 2012, Ney York, USA - Upgrade of the CMS Event Builder - Andrea Petrucci - CERN (PH/CMD) 23
The CMS online application are based on XDAQ framework.
What is the XDAQ framework?
 XDAQ is a software platform designed specifically for the
development of distributed data acquisition systems.
 XDAQ is the definitive across the board middleware that eases
the tasks of designing, programming and managing data
acquisition applications by providing a simple, consistent and
integrated distributed programming environment.
 The framework builds upon industrial standards, open protocols
and libraries.
CMS online software
CHEP 2012, 21-25 May 2012, Ney York, USA - Upgrade of the CMS Event Builder - Andrea Petrucci - CERN (PH/CMD) 24
Architecture Foundation
Executive
ConfigurationXML
HTTP
SOAP
Control
and GUI
I2O/
B2IN
Fast control
and data
Application
Components
Service
Plug-ins
Devices
Custom device
access
 Uniform building block
One or more executives per
computer contain application
and service components
CHEP 2012, 21-25 May 2012, Ney York, USA - Upgrade of the CMS Event Builder - Andrea Petrucci - CERN (PH/CMD) 25
Architecture Foundation (II)
pluggable Peer-Transport:
 SOAP/HTML: HTTP
 I2O: TCP, Myrinet, FIFO,
etc.
Application code does not change when
moving from Ethernet to Myrinet or
Infiniband
Peer-Transport a pluggable layer of various networking medium
 The XDAQ framework provides peers with a mechanism for determine a route to an
endpoint, allowing the peer to send data to the remote endpoint
 Peer transports are the entity responsible for conducting the actual exchange of
information over a network
 Peer transport encapsulate a set of network interfaces, allowing a peer to send and
receiver data independently of the type of network being employed
CHEP 2012, 21-25 May 2012, Ney York, USA - Upgrade of the CMS Event Builder - Andrea Petrucci - CERN (PH/CMD) 26
 ptuDAPL is a new peer-transport for DAT library (Infiniband and iWarp) using
I2O messaging
 Use of smart memory pool based on uDAPL memory region allocator (random
access to memory with no intermediate management by using cookies)
 Profiting for inherent non blocking and queuing of uDAPL API for minimizing latency
 All I/O operations centered on dedicated uDAPL memory pool (full zero-copy
between XDAQ applications and DAPL driver)
 Based on DAT Spec 2.x
Pluggable Peer-Transport for DAT library
CHEP 2012, 21-25 May 2012, Ney York, USA - Upgrade of the CMS Event Builder - Andrea Petrucci - CERN (PH/CMD) 27
Test setups
CHEP 2012, 21-25 May 2012, Ney York, USA - Upgrade of the CMS Event Builder - Andrea Petrucci - CERN (PH/CMD) 28
Setup 1
(LHCb)
Setup 2
(daqval 2)
Setup 3
(daqval 2)
Nodes 8 16 32
Type DELL R710 DELL R310 DELL C6220
CPU Xeon E5530 2x
4-core at 2.27 GHz
Xeon X3323 2x
4-core at 2.50
GHz
Xeon E5-2670 2x
8-core at 2.6 GHz
Memory 3 GB 4 GB 32 GB
Networ
k
Ethernet Infiniband Ethernet Ethernet Infiniband
Adapter Chelsio T420-
CR 10GBASE-
SFP RNIC
(iWarp)
Qlogic HCA,
qle7340 4x
QDR PCIe
Silicom
PE210G2SPI9-
SR
Mellanox - ConnectX-3
VPI (single FDR or 40GbE) -
MCX353A-FCBT
Switch Voltaire
Vantage 6048,
48 ports, 10
GbE
Qlogic 12300-
BS01, 36
ports, 4x QDR
Brocade MLXE16,
16 ports, 10 GbE
Mellanox 36 -
port 40 GbE
Switch -
MSX1036B-
1SFR
Mellanox - IB
switch (36
ports FDR
QFSP) -
MSX6025F-1
Test setups
CHEP 2012, 21-25 May 2012, Ney York, USA - Upgrade of the CMS Event Builder - Andrea Petrucci - CERN (PH/CMD) 29
Preliminary Measurements
CHEP 2012, 21-25 May 2012, Ney York, USA - Upgrade of the CMS Event Builder - Andrea Petrucci - CERN (PH/CMD) 30
 Simple XDAQ application to compute the One-way delay
 Time packet to travel from a specific source to a specific destination and back
again
 One-way latency is measured by timing a round-trip message and dividing the
obtained result by two
Roundtrip
OS
OS Stack
Network
Driver
App Processing
OS Overhead
Network Processing
I/O Processing
Wire Latency Port to port Latency
OS Stack
Network
Driver
Application Application
CHEP 2012, 21-25 May 2012, Ney York, USA - Upgrade of the CMS Event Builder - Andrea Petrucci - CERN (PH/CMD) 31
Latency measurements for ptuDAPL with Infiniband
and iWarp
8.87 us for 32
Bytes
6.33 us for 32
Bytes
only ~ 1 us added by XDAQ framework
Setup 1
CHEP 2012, 21-25 May 2012, Ney York, USA - Upgrade of the CMS Event Builder - Andrea Petrucci - CERN (PH/CMD) 32
Stream I/O
 Unidirectional throughput is measured using a unidirectional send of N
messages. Time sampling is done at the receiver side and starts with the
first incoming message
Throughput per node
CHEP 2012, 21-25 May 2012, Ney York, USA - Upgrade of the CMS Event Builder - Andrea Petrucci - CERN (PH/CMD) 33
Stream I/O (TCP/IP vs uDAPL) – Chelsio T4 – 10
GbE
10 GbE
1.25 GB/s 100%
96%
80%
64%
48%
32%
16%
drop at 4352
bytes
SIZE MB/
S
MB/
S
1024 663 272
4096 1208 547
8192 1034 682
Setup 1
CHEP 2012, 21-25 May 2012, Ney York, USA - Upgrade of the CMS Event Builder - Andrea Petrucci - CERN (PH/CMD) 34
Multi-Stream I/O
 Unidirectional throughput (bandwidth) is measured using a unidirectional
send of N messages to N receivers. Time sampling is done at the
receivers side and starts with the first incoming message.
Throughput per node
CHEP 2012, 21-25 May 2012, Ney York, USA - Upgrade of the CMS Event Builder - Andrea Petrucci - CERN (PH/CMD) 35
Multi-stream I/O 1 to 4 (TCP/IP vs uDAPL) – Chelsio T4 –
10 GbE
96%
80%
64%
48%
32%
16%
10 GbE
1.25 GB/s 100%
Setup 1
SIZE MB/
S
MB/
S
1024 930 399
4096 1168 1130
8192 1228 1168
CHEP 2012, 21-25 May 2012, Ney York, USA - Upgrade of the CMS Event Builder - Andrea Petrucci - CERN (PH/CMD) 36
Event Builder
 CMS RU-builder
 Currently used in CMS DAQ for data taking
Event
Manager
Trigger
Detector Frontend
Readout Units
Builder Units
Filter Units
1
2
2
3
Trigger
Detector Frontend
Readout Units
Event
Manager
Builder Units
Filter Units
4 4
5 5
1: I have n free resources
2: Broadcast identifier association
3: Trigger information for
events id1, id2, … idn
4: Send me fragments for
events id1, id2, … idn
5: Superfragment data
6: Allocate events to Filter Units
7: Discard events
8: Release n identifiers
Trigger
Detector Frontend
Readout Units
Event
Manager
6
7
8
Builder Units
Filter Units
RU-builder protocol – Token passing
4
4
4
4
1
4
4
3
3
3
3
3
3
2
2
2
2
2
4
1
1
1
1
123
23
2
1
1
1
Event building traffic
CHEP 2012, 21-25 May 2012, Ney York, USA - Upgrade of the CMS Event Builder - Andrea Petrucci - CERN (PH/CMD) 37
Event builder 1+7x8 vs Multi-stream 1x8 (TCP/IP -
10GbE)
10 GbE
1.25 GB/s 100%
96%
80%
64%
48%
32%
16%
Setup 2
SIZE MB/
S
MB/
S
1024 635 94
4096 1223 375
8192 1231 743
Targe
t
800
CHEP 2012, 21-25 May 2012, Ney York, USA - Upgrade of the CMS Event Builder - Andrea Petrucci - CERN (PH/CMD) 38
Event builder 1x7x8 vs 100 KHz (TCP/IP - 10GbE)
10 GbE
1.25 GB/s 100%
96%
80%
64%
48%
32%
16%
Setup 2
SIZE RATE
kHz
1024 91.85
4096 91.63
8192 90.71
CHEP 2012, 21-25 May 2012, Ney York, USA - Upgrade of the CMS Event Builder - Andrea Petrucci - CERN (PH/CMD) 39
Event builder 1x7x8 vs current event builder (TCP/IP -
10GbE)
10 GbE
1.25 GB/s 100%
96%
80%
64%
48%
32%
16%
Setup 2
SIZE MB/
S
MB/
S
1024 94 20
4096 375 83
8192 743 161
CHEP 2012, 21-25 May 2012, Ney York, USA - Upgrade of the CMS Event Builder - Andrea Petrucci - CERN (PH/CMD) 40
Infiniband Measurements
CHEP 2012, 21-25 May 2012, Ney York, USA - Upgrade of the CMS Event Builder - Andrea Petrucci - CERN (PH/CMD) 41
MStream 1x4 uDAPL - IB vs TCP/IP - IPoIB
100%
87.5%
75%
62.5%
50%
37.5%
25%
12.5%
Setup 1
SIZE MB/
S
MB/
S
1638
4
2160
2041
3276
8
3256
1538
CHEP 2012, 21-25 May 2012, Ney York, USA - Upgrade of the CMS Event Builder - Andrea Petrucci - CERN (PH/CMD) 42
Event builder 1+3x3 uDAPL - IB vs TCP/IP - IPoIB
100%
87.5%
75%
62.5%
50%
37.5%
25%
12.5%
Setup 1
Target
1638
3276
SIZE MB/
S
MB/
S
1638
4
372
1805
3276
8
400
2387
CHEP 2012, 21-25 May 2012, Ney York, USA - Upgrade of the CMS Event Builder - Andrea Petrucci - CERN (PH/CMD) 43
Summary
 New readout system must be compatible with old data sources
and new data sources
 New DAQ column architecture replaces aging existing hardware
and improves readout performance
 Feasibility studies on advanced networking technologies for
event builder
 Different network technologies under investigation
 Possibly Quality of Service will give high throughput and
scalability
CHEP 2012, 21-25 May 2012, Ney York, USA - Upgrade of the CMS Event Builder - Andrea Petrucci - CERN (PH/CMD) 44
Thank you for you attention!
CHEP 2012, 21-25 May 2012, Ney York, USA - Upgrade of the CMS Event Builder - Andrea Petrucci - CERN (PH/CMD) 45
Backup
CHEP 2012, 21-25 May 2012, Ney York, USA - Upgrade of the CMS Event Builder - Andrea Petrucci - CERN (PH/CMD) 46
 Replicated building
blocks
Scalable cluster system
architecture
Architecture Foundation (II)
Executive Executive
Executive Executive
CHEP 2012, 21-25 May 2012, Ney York, USA - Upgrade of the CMS Event Builder - Andrea Petrucci - CERN (PH/CMD) 47
Software Distribution
Core framework Reusable applications CMS specific applications
coretools powerpack worksuite
CHEP 2012, 21-25 May 2012, Ney York, USA - Upgrade of the CMS Event Builder - Andrea Petrucci - CERN (PH/CMD) 48
Layered View
Online Software
Coretools
Powerpack
Worksuite
Platforms
Event
Builders
Front-end
Controllers
Data
Monitoring
Error and
Alarming
Job Control
External System
Interfaces
Hardware
Access
OS Abstraction
Communication
Subsystems
Executive
Framework
Operating
Systems
Networking
Infrastructures
Hardware
Device
Interfaces
ConfigurationManagementSupport
User
Interfaces
Detector
Specific
Applications
CHEP 2012, 21-25 May 2012, Ney York, USA - Upgrade of the CMS Event Builder - Andrea Petrucci - CERN (PH/CMD) 49
Timeline
2012
2008
2006
2004
2002
2000
First version of XDAQ - I2O communication kernel
XDAQ 2 – Web enters DAQ (SOAP)
XDAQ 3 – experiment wide adoption
XMAS – monitoring, orthogonal to applications
Commissioning and first beam event successfully achieved
Well consolidated after twelve years
of development and use
Successefully used in the first tree years of LHC
CHEP 2012, 21-25 May 2012, Ney York, USA - Upgrade of the CMS Event Builder - Andrea Petrucci - CERN (PH/CMD) 50
Outlook
0.5 - 2 KB/msg 128 KB - 4 MB/msg
Tuples Tables
O(15000) msg/s O(100) msg/s O(10) msg/s
Visualization
 Integrated web technologies
 Monitoring
 Errors and Alarms
 Reusable Event Builder

More Related Content

PDF
Network Programming: Data Plane Development Kit (DPDK)
PPTX
XDAQ_AP_LNL-INFN_27112014
PDF
draft-georgescu-bmwg-ipv6-tran-tech-benchmarking-00
PDF
Linac Coherent Light Source (LCLS) Data Transfer Requirements
PPTX
RT2014_TCPLA_Nara_27052014-V1
PPTX
Packet Framework - Cristian Dumitrescu
PDF
LinuxCon 2015 Linux Kernel Networking Walkthrough
PDF
Taking Security Groups to Ludicrous Speed with OVS (OpenStack Summit 2015)
Network Programming: Data Plane Development Kit (DPDK)
XDAQ_AP_LNL-INFN_27112014
draft-georgescu-bmwg-ipv6-tran-tech-benchmarking-00
Linac Coherent Light Source (LCLS) Data Transfer Requirements
RT2014_TCPLA_Nara_27052014-V1
Packet Framework - Cristian Dumitrescu
LinuxCon 2015 Linux Kernel Networking Walkthrough
Taking Security Groups to Ludicrous Speed with OVS (OpenStack Summit 2015)

What's hot (20)

PDF
DPDK in Containers Hands-on Lab
PDF
A Comparative Analysis of Additional Overhead Imposed by Internet Protocol Se...
PDF
DPDK Summit - 08 Sept 2014 - 6WIND - High Perf Networking Leveraging the DPDK...
PPTX
Dynamic user trace
PDF
100 M pps on PC.
PDF
Performance challenges in software networking
PDF
DevConf 2014 Kernel Networking Walkthrough
PDF
A Low-Power CoAP for Contiki
PPTX
The n00bs guide to ovs dpdk
PDF
BPF & Cilium - Turning Linux into a Microservices-aware Operating System
PPTX
Opensample: A Low-latency, Sampling-based Measurement Platform for Software D...
ODP
Dpdk performance
DOCX
Poll mode driver integration into dpdk
PPTX
Scalable Web Technology for the Internet of Things
PDF
[Webinar Slides] Programming the Network Dataplane in P4
PDF
Networking Protocols for Internet of Things
PDF
Cilium - API-aware Networking and Security for Containers based on BPF
PDF
Virtual IoT Meetup: Connecting Sensor Networks
PDF
The Next Generation Firewall for Red Hat Enterprise Linux 7 RC
PDF
Introduction of IPv6NET in Tridentcom 2014
DPDK in Containers Hands-on Lab
A Comparative Analysis of Additional Overhead Imposed by Internet Protocol Se...
DPDK Summit - 08 Sept 2014 - 6WIND - High Perf Networking Leveraging the DPDK...
Dynamic user trace
100 M pps on PC.
Performance challenges in software networking
DevConf 2014 Kernel Networking Walkthrough
A Low-Power CoAP for Contiki
The n00bs guide to ovs dpdk
BPF & Cilium - Turning Linux into a Microservices-aware Operating System
Opensample: A Low-latency, Sampling-based Measurement Platform for Software D...
Dpdk performance
Poll mode driver integration into dpdk
Scalable Web Technology for the Internet of Things
[Webinar Slides] Programming the Network Dataplane in P4
Networking Protocols for Internet of Things
Cilium - API-aware Networking and Security for Containers based on BPF
Virtual IoT Meetup: Connecting Sensor Networks
The Next Generation Firewall for Red Hat Enterprise Linux 7 RC
Introduction of IPv6NET in Tridentcom 2014
Ad

Similar to CHEP2012_Upgrade_CMS_Event_Builder_AP_21052011 (20)

PDF
PETRUCCI_Andrea_Research_Projects_and_Publications
PDF
Scalability20140226
PPTX
Big Data for Big Discoveries
PDF
Big Data Management at CERN: The CMS Example
PDF
Computing Challenges at the Large Hadron Collider
PDF
the FC7 AMC for DAQ & control applications in CMS
PDF
Summer @CERN
PDF
The World Wide Distributed Computing Architecture of the LHC Datagrid
PPTX
CERN_Big_Data_BI_Presentation_Decis.pptx
PPTX
ESCAPE Kick-off meeting - HL-LHC ESFRI Landmark (Feb 2019)
PDF
Designing and building the world largest machine.
PPTX
SiemensOpenLabMajorReview_February_2014_v9.pptx
PDF
awards competences talks
PDF
The size and complexity of the CERN network
PDF
OSMC 2012 | Monitoring at CERN by Christophe Haen
PPTX
20190314 cern register v3
PDF
CRVBreakoutGlenzinski090807
PPTX
DA-JPL-final
PDF
Jarp big data_sydney_v7
PETRUCCI_Andrea_Research_Projects_and_Publications
Scalability20140226
Big Data for Big Discoveries
Big Data Management at CERN: The CMS Example
Computing Challenges at the Large Hadron Collider
the FC7 AMC for DAQ & control applications in CMS
Summer @CERN
The World Wide Distributed Computing Architecture of the LHC Datagrid
CERN_Big_Data_BI_Presentation_Decis.pptx
ESCAPE Kick-off meeting - HL-LHC ESFRI Landmark (Feb 2019)
Designing and building the world largest machine.
SiemensOpenLabMajorReview_February_2014_v9.pptx
awards competences talks
The size and complexity of the CERN network
OSMC 2012 | Monitoring at CERN by Christophe Haen
20190314 cern register v3
CRVBreakoutGlenzinski090807
DA-JPL-final
Jarp big data_sydney_v7
Ad

CHEP2012_Upgrade_CMS_Event_Builder_AP_21052011

  • 1. Upgrade of the CMS Event Builder Andrea Petrucci - CERN (PH/CMD) on behalf of the CMS DAQ group 19th International Conference on Computing in High Energy and Nuclear Physics CHEP 2012, 21-25 May 2012, New York, United States
  • 2. CHEP 2012, 21-25 May 2012, Ney York, USA - Upgrade of the CMS Event Builder - Andrea Petrucci - CERN (PH/CMD) 2  Introduction to CMS upgrade  Current DAQ system  Architecture of the upgrade DAQ system  Feasibility studies on advanced networking technologies for event builder  Network technologies  CMS online software  Test setups  Preliminary measurements  Summary Outline
  • 3. CHEP 2012, 21-25 May 2012, Ney York, USA - Upgrade of the CMS Event Builder - Andrea Petrucci - CERN (PH/CMD) 3 CMS Upgrade
  • 4. CHEP 2012, 21-25 May 2012, Ney York, USA - Upgrade of the CMS Event Builder - Andrea Petrucci - CERN (PH/CMD) 4 CMS Upgrade timeline First beam in CMS September 10th First collisions in CMS November 23st First 7 TeV collisions in CMS March 30th Magnet Test at surface 5fb -1 collected 14Tev7Tev LS1 LS2 Higher pileup (depending on LHC operation 2x 1034 at 25 ns or 50 ns bunch spacing) Pileup could reach 100 underlying interactions More data will be generated due to higher occupancy Install Pixel 4th barrel layer and 3rd endcap layer Upgrade of detector:  HCAL introduces uTCA based readout system  Trigger system will be upgraded: the new system will be first operated in parallel to the existing system. Readout will be based in uTCA. DAQ: new optical link to readout uTCA based systems 8Tev LS1.5
  • 5. CHEP 2012, 21-25 May 2012, Ney York, USA - Upgrade of the CMS Event Builder - Andrea Petrucci - CERN (PH/CMD) 5  Aging of existing hardware (PCs and Networks at least 5 years old)  Accommodate sub-detectors with upgraded off-detector electronics  HCAL upgrade front-end electronics and readout system  New Pixel detector with 4 barrel layers and 3 endcap layers (LS1.5), estimated requirements: ~80 readout channels with 640MB/s  Trigger system will be upgraded: the new system will be first operated in parallel to the existing system. Motivations for upgrade of CMS DAQ
  • 6. CHEP 2012, 21-25 May 2012, Ney York, USA - Upgrade of the CMS Event Builder - Andrea Petrucci - CERN (PH/CMD) 6 Current CMS Data Acquisition System
  • 7. CHEP 2012, 21-25 May 2012, Ney York, USA - Upgrade of the CMS Event Builder - Andrea Petrucci - CERN (PH/CMD) 7 Current DAQ system Detector Front-Ends Computing Services Readout Systems Filter Systems Event Manager Level 1 Trigger Control and Monitor Builder Network 40 MHz 100 kHz 100 GB/s ~700 100’s Hz 13000’s cores LV1 HLT µs sec  Read out 700 detector front-ends (fragment size ~2 kB)  Build complete events at 100 kHz ( L1 trigger rate)  Make them available to a filter farm of O(13000) cores  Store 100’s of Hz to disk (10’s of TB/day)  Scalable system employing commercial components wherever possible  Proprietary / Commercial: Front-Ends, VME, PCI, PC servers, networks, Protocols,
  • 8. CHEP 2012, 21-25 May 2012, Ney York, USA - Upgrade of the CMS Event Builder - Andrea Petrucci - CERN (PH/CMD) 8 Current DAQ system (II) 72 FED builders: assemble FED (~2 kB) into super-fragment (~16 kB) Eight RU builders (EVB DAQ slices): Assemble super-fragments (~16 kB) into full events (~1 MB) at 12.5 kHz each 12.5 kHz +12.5 kHz 100 kHz
  • 9. CHEP 2012, 21-25 May 2012, Ney York, USA - Upgrade of the CMS Event Builder - Andrea Petrucci - CERN (PH/CMD) 9  Full readout of all sub-detectors at L1 rate  SLINK64 operates on 400 MB/s  Fed builder operates on ~200-350 MB/s, depending on event size distribution  8 DAQ Slices, 100 GB/s event builder  Event Filter  ~1300 dual-CPU (multi-core) PC nodes, 13 k-cores, 220 kHEPSpec06  running about ~20 k HLT processes, CPU budget is ~175 ms/evt @100 kHz.  Storage Manager  Event data to disk  pp: ~200 MB/s, design for 600 MB/s  Heavy Ions: ~1,4 GB/s (up to 2.8 GB/s w/o transfer) Current DAQ system (III)
  • 10. CHEP 2012, 21-25 May 2012, Ney York, USA - Upgrade of the CMS Event Builder - Andrea Petrucci - CERN (PH/CMD) 10 Upgrade of CMS Data Acquisition System for long shutdown 1
  • 11. CHEP 2012, 21-25 May 2012, Ney York, USA - Upgrade of the CMS Event Builder - Andrea Petrucci - CERN (PH/CMD) 11 Requirements  L1 rate of 100 KHz  Detector Front End Drivers (FED)  ~552 Legacy FEDs (fragment size increases from 2 kB to 4 kB due to pile-up)  37 (TRG, HCAL, HF) + 40 (Pixel - 2 x 10 GbE links) new readout links from new FEDs (expected maximal fragment size 8kB)  Frontend Readout Links (FRLs)  ~360 FRLs (Legacy FEDs, ~400 MB/s)  ~120 FRLs (new FEDs, ~640 MB/s)  High availability (redundancy, load balancing, failover, etc.) Upgrade of DAQ system DAQ plans for upgrade  Replace myrinet-based fedbuilder  Replace event builder network  two options: conservative (10 GbE) and aggressive (40 GbE or Infiniband)  New architecture between Event Builder and Filter Farm  Replace of the Storage manager hardware
  • 12. CHEP 2012, 21-25 May 2012, Ney York, USA - Upgrade of the CMS Event Builder - Andrea Petrucci - CERN (PH/CMD) 12  New readout system must be compatible with old data sources (FEDs – SLINK) and with the newly designed FEDs. Read out link Sender Mezzanine SLINK-64 Cable up to 10m, 400 MB/s Frontend Readout Link reads 1 or 2 links Interfaces to Myrinet NIC Build a new PCI card for the existing FRL  Will implement new 10GB link to the RU  Will be able to receive data via a fiber from newly built uTCA FEDs  Pros: Small change wrt existing system. Rely on stable existing hardware  Cons: bottle neck of internal PCI bus (520MB/s per FRL if reads SLINK) link to RU link from uTCA FED
  • 13. CHEP 2012, 21-25 May 2012, Ney York, USA - Upgrade of the CMS Event Builder - Andrea Petrucci - CERN (PH/CMD) 13 New Data to surface (D2S)  Moving from myrinet 2x 2Gb/s to 10 Gb Ethernet  Point to point protocol used between FRL and RU  Concentrate FRLs sources in USC switch or/and RU
  • 14. CHEP 2012, 21-25 May 2012, Ney York, USA - Upgrade of the CMS Event Builder - Andrea Petrucci - CERN (PH/CMD) 14 Event builder network: conservative option  Sources (in total 480)  At 100 kHz 2 to 4 KB fragments (legacy FEDs)  At 100 kHz 4 to 8 KB fragments (new FEDs)  Input NIC 2 - 10 GbE ports  Required throughput per link100 kHz, 2 to 4 kB per fragment  PC concentrates by factor 2 for legacy FEDs  360 FRLs  180 RUs  Output NIC – 10 GbE port  Required throughput per link 100 kHz, 4 to 8 kB  EVB configuration  300 RU PCs  300 x 300 switch 10 GbE ~360 FRLs (legacy FEDs, ~400 MB/s) ~120 FRLs (new FEDs, ~640 MB/s) I/O switches 200m fiber 2-ports NIC - 10 GbE RU-PC port NIC - 10 GbE port NIC - 10 GbE BU-PC I/O switches
  • 15. CHEP 2012, 21-25 May 2012, Ney York, USA - Upgrade of the CMS Event Builder - Andrea Petrucci - CERN (PH/CMD) 15  Sources (in total 480)  At 100 kHz 2 to 4 KB fragments (legacy FEDs)  At 100 kHz 4 to 8 KB fragments (new FEDs)  Front Switch  Concentrate by factor of 2 for legacy FEDs  Input NIC 4 – 10 GbE ports  Required throughput per link 200 kHz, 2 to 4 kB  Required throughput per link 100 kHz, 4 to 8 kB  PC concentrates by factor 4  Output NIC – 40 GbE/ Infiniband port  Required throughput per link 100 kHz, 16 to 32 kB  EVB configuration ~360 FRLs (legacy FEDs, ~400 MB/s) ~120 FRLs (new FEDs, ~640 MB/s) I/O switches 200m fiber 4-ports NIC - 10 GbE RU-PC port NIC - 40 GbE / Infiniband port NIC - 40 GbE / Infiniband BU-PC Event builder network: aggressive option I/O switches
  • 16. CHEP 2012, 21-25 May 2012, Ney York, USA - Upgrade of the CMS Event Builder - Andrea Petrucci - CERN (PH/CMD) 16 New Builder Units, Filter Unites and Storage Managers configuration  Builder Units/Filter Units  Add 10 GbE switch to connect BUs to Fus  2 links 10 GbE from BU and 1 link 10 GbE for each FU  Storage Managers  1 link 10 GbE from BU-FU switch to SM switch  3 links 10 GbE for each Storage Manager
  • 17. CHEP 2012, 21-25 May 2012, Ney York, USA - Upgrade of the CMS Event Builder - Andrea Petrucci - CERN (PH/CMD) 17 Feasibility studies on advanced networking technologies for event builder
  • 18. CHEP 2012, 21-25 May 2012, Ney York, USA - Upgrade of the CMS Event Builder - Andrea Petrucci - CERN (PH/CMD) 18 Our feasibility studies are focused in two network technologies:  Ethernet  10/40 Gigabit Ethernet (different vendors)  iWARP (RDMA) – TCP/IP full offload (Chelsio T4 Unified Wire Adapters)  performance measurements using TCP/IP and DAPL (Direct Access Programming Library- OpenFabrics)  Infiniband  4x quad data rate (QDR)  40 Gb/s - 8B/10B encoding -32 Gb/s data rate  4x fourteen data rate (FDR)  56 Gb/s – 64B/66B encoding – 54.54 Gb/s data rate  performance measurements using DAPL (Direct Access Programming Library- OpenFabrics) and IPoIB (IP over InfiniBand) Feasibility studies on advanced networking technologies
  • 19. CHEP 2012, 21-25 May 2012, Ney York, USA - Upgrade of the CMS Event Builder - Andrea Petrucci - CERN (PH/CMD) 19 A unified, cross-platform, transport-independent software stack for RDMA and kernel bypass  http://www.openfabrics.org/ The OFED Stack (source: OpenFabrics Alliance)
  • 20. CHEP 2012, 21-25 May 2012, Ney York, USA - Upgrade of the CMS Event Builder - Andrea Petrucci - CERN (PH/CMD) 20  Developed by DAT collaborative  http://www.datcollaborative.org/  Transport and platform (OS) independent  Define user (uDAPL) and kernel (kDAPL) APIs  DAT supports reliable connection  Data Transfer Operations send, receive, rdma_read, rdma_write  uDAPL Version 2.x, January, 2007 DAT Model (source DAT Collaborative)
  • 21. CHEP 2012, 21-25 May 2012, Ney York, USA - Upgrade of the CMS Event Builder - Andrea Petrucci - CERN (PH/CMD) 21  The protocol is defined as a very thin set of zero copy functions when compared to thicker protocol implementations such as TCP/IP Comparison of the Stacks (Infiniband vs TCP/IP)
  • 22. CHEP 2012, 21-25 May 2012, Ney York, USA - Upgrade of the CMS Event Builder - Andrea Petrucci - CERN (PH/CMD) 22 CMS online software for data acquisition system
  • 23. CHEP 2012, 21-25 May 2012, Ney York, USA - Upgrade of the CMS Event Builder - Andrea Petrucci - CERN (PH/CMD) 23 The CMS online application are based on XDAQ framework. What is the XDAQ framework?  XDAQ is a software platform designed specifically for the development of distributed data acquisition systems.  XDAQ is the definitive across the board middleware that eases the tasks of designing, programming and managing data acquisition applications by providing a simple, consistent and integrated distributed programming environment.  The framework builds upon industrial standards, open protocols and libraries. CMS online software
  • 24. CHEP 2012, 21-25 May 2012, Ney York, USA - Upgrade of the CMS Event Builder - Andrea Petrucci - CERN (PH/CMD) 24 Architecture Foundation Executive ConfigurationXML HTTP SOAP Control and GUI I2O/ B2IN Fast control and data Application Components Service Plug-ins Devices Custom device access  Uniform building block One or more executives per computer contain application and service components
  • 25. CHEP 2012, 21-25 May 2012, Ney York, USA - Upgrade of the CMS Event Builder - Andrea Petrucci - CERN (PH/CMD) 25 Architecture Foundation (II) pluggable Peer-Transport:  SOAP/HTML: HTTP  I2O: TCP, Myrinet, FIFO, etc. Application code does not change when moving from Ethernet to Myrinet or Infiniband Peer-Transport a pluggable layer of various networking medium  The XDAQ framework provides peers with a mechanism for determine a route to an endpoint, allowing the peer to send data to the remote endpoint  Peer transports are the entity responsible for conducting the actual exchange of information over a network  Peer transport encapsulate a set of network interfaces, allowing a peer to send and receiver data independently of the type of network being employed
  • 26. CHEP 2012, 21-25 May 2012, Ney York, USA - Upgrade of the CMS Event Builder - Andrea Petrucci - CERN (PH/CMD) 26  ptuDAPL is a new peer-transport for DAT library (Infiniband and iWarp) using I2O messaging  Use of smart memory pool based on uDAPL memory region allocator (random access to memory with no intermediate management by using cookies)  Profiting for inherent non blocking and queuing of uDAPL API for minimizing latency  All I/O operations centered on dedicated uDAPL memory pool (full zero-copy between XDAQ applications and DAPL driver)  Based on DAT Spec 2.x Pluggable Peer-Transport for DAT library
  • 27. CHEP 2012, 21-25 May 2012, Ney York, USA - Upgrade of the CMS Event Builder - Andrea Petrucci - CERN (PH/CMD) 27 Test setups
  • 28. CHEP 2012, 21-25 May 2012, Ney York, USA - Upgrade of the CMS Event Builder - Andrea Petrucci - CERN (PH/CMD) 28 Setup 1 (LHCb) Setup 2 (daqval 2) Setup 3 (daqval 2) Nodes 8 16 32 Type DELL R710 DELL R310 DELL C6220 CPU Xeon E5530 2x 4-core at 2.27 GHz Xeon X3323 2x 4-core at 2.50 GHz Xeon E5-2670 2x 8-core at 2.6 GHz Memory 3 GB 4 GB 32 GB Networ k Ethernet Infiniband Ethernet Ethernet Infiniband Adapter Chelsio T420- CR 10GBASE- SFP RNIC (iWarp) Qlogic HCA, qle7340 4x QDR PCIe Silicom PE210G2SPI9- SR Mellanox - ConnectX-3 VPI (single FDR or 40GbE) - MCX353A-FCBT Switch Voltaire Vantage 6048, 48 ports, 10 GbE Qlogic 12300- BS01, 36 ports, 4x QDR Brocade MLXE16, 16 ports, 10 GbE Mellanox 36 - port 40 GbE Switch - MSX1036B- 1SFR Mellanox - IB switch (36 ports FDR QFSP) - MSX6025F-1 Test setups
  • 29. CHEP 2012, 21-25 May 2012, Ney York, USA - Upgrade of the CMS Event Builder - Andrea Petrucci - CERN (PH/CMD) 29 Preliminary Measurements
  • 30. CHEP 2012, 21-25 May 2012, Ney York, USA - Upgrade of the CMS Event Builder - Andrea Petrucci - CERN (PH/CMD) 30  Simple XDAQ application to compute the One-way delay  Time packet to travel from a specific source to a specific destination and back again  One-way latency is measured by timing a round-trip message and dividing the obtained result by two Roundtrip OS OS Stack Network Driver App Processing OS Overhead Network Processing I/O Processing Wire Latency Port to port Latency OS Stack Network Driver Application Application
  • 31. CHEP 2012, 21-25 May 2012, Ney York, USA - Upgrade of the CMS Event Builder - Andrea Petrucci - CERN (PH/CMD) 31 Latency measurements for ptuDAPL with Infiniband and iWarp 8.87 us for 32 Bytes 6.33 us for 32 Bytes only ~ 1 us added by XDAQ framework Setup 1
  • 32. CHEP 2012, 21-25 May 2012, Ney York, USA - Upgrade of the CMS Event Builder - Andrea Petrucci - CERN (PH/CMD) 32 Stream I/O  Unidirectional throughput is measured using a unidirectional send of N messages. Time sampling is done at the receiver side and starts with the first incoming message Throughput per node
  • 33. CHEP 2012, 21-25 May 2012, Ney York, USA - Upgrade of the CMS Event Builder - Andrea Petrucci - CERN (PH/CMD) 33 Stream I/O (TCP/IP vs uDAPL) – Chelsio T4 – 10 GbE 10 GbE 1.25 GB/s 100% 96% 80% 64% 48% 32% 16% drop at 4352 bytes SIZE MB/ S MB/ S 1024 663 272 4096 1208 547 8192 1034 682 Setup 1
  • 34. CHEP 2012, 21-25 May 2012, Ney York, USA - Upgrade of the CMS Event Builder - Andrea Petrucci - CERN (PH/CMD) 34 Multi-Stream I/O  Unidirectional throughput (bandwidth) is measured using a unidirectional send of N messages to N receivers. Time sampling is done at the receivers side and starts with the first incoming message. Throughput per node
  • 35. CHEP 2012, 21-25 May 2012, Ney York, USA - Upgrade of the CMS Event Builder - Andrea Petrucci - CERN (PH/CMD) 35 Multi-stream I/O 1 to 4 (TCP/IP vs uDAPL) – Chelsio T4 – 10 GbE 96% 80% 64% 48% 32% 16% 10 GbE 1.25 GB/s 100% Setup 1 SIZE MB/ S MB/ S 1024 930 399 4096 1168 1130 8192 1228 1168
  • 36. CHEP 2012, 21-25 May 2012, Ney York, USA - Upgrade of the CMS Event Builder - Andrea Petrucci - CERN (PH/CMD) 36 Event Builder  CMS RU-builder  Currently used in CMS DAQ for data taking Event Manager Trigger Detector Frontend Readout Units Builder Units Filter Units 1 2 2 3 Trigger Detector Frontend Readout Units Event Manager Builder Units Filter Units 4 4 5 5 1: I have n free resources 2: Broadcast identifier association 3: Trigger information for events id1, id2, … idn 4: Send me fragments for events id1, id2, … idn 5: Superfragment data 6: Allocate events to Filter Units 7: Discard events 8: Release n identifiers Trigger Detector Frontend Readout Units Event Manager 6 7 8 Builder Units Filter Units RU-builder protocol – Token passing 4 4 4 4 1 4 4 3 3 3 3 3 3 2 2 2 2 2 4 1 1 1 1 123 23 2 1 1 1 Event building traffic
  • 37. CHEP 2012, 21-25 May 2012, Ney York, USA - Upgrade of the CMS Event Builder - Andrea Petrucci - CERN (PH/CMD) 37 Event builder 1+7x8 vs Multi-stream 1x8 (TCP/IP - 10GbE) 10 GbE 1.25 GB/s 100% 96% 80% 64% 48% 32% 16% Setup 2 SIZE MB/ S MB/ S 1024 635 94 4096 1223 375 8192 1231 743 Targe t 800
  • 38. CHEP 2012, 21-25 May 2012, Ney York, USA - Upgrade of the CMS Event Builder - Andrea Petrucci - CERN (PH/CMD) 38 Event builder 1x7x8 vs 100 KHz (TCP/IP - 10GbE) 10 GbE 1.25 GB/s 100% 96% 80% 64% 48% 32% 16% Setup 2 SIZE RATE kHz 1024 91.85 4096 91.63 8192 90.71
  • 39. CHEP 2012, 21-25 May 2012, Ney York, USA - Upgrade of the CMS Event Builder - Andrea Petrucci - CERN (PH/CMD) 39 Event builder 1x7x8 vs current event builder (TCP/IP - 10GbE) 10 GbE 1.25 GB/s 100% 96% 80% 64% 48% 32% 16% Setup 2 SIZE MB/ S MB/ S 1024 94 20 4096 375 83 8192 743 161
  • 40. CHEP 2012, 21-25 May 2012, Ney York, USA - Upgrade of the CMS Event Builder - Andrea Petrucci - CERN (PH/CMD) 40 Infiniband Measurements
  • 41. CHEP 2012, 21-25 May 2012, Ney York, USA - Upgrade of the CMS Event Builder - Andrea Petrucci - CERN (PH/CMD) 41 MStream 1x4 uDAPL - IB vs TCP/IP - IPoIB 100% 87.5% 75% 62.5% 50% 37.5% 25% 12.5% Setup 1 SIZE MB/ S MB/ S 1638 4 2160 2041 3276 8 3256 1538
  • 42. CHEP 2012, 21-25 May 2012, Ney York, USA - Upgrade of the CMS Event Builder - Andrea Petrucci - CERN (PH/CMD) 42 Event builder 1+3x3 uDAPL - IB vs TCP/IP - IPoIB 100% 87.5% 75% 62.5% 50% 37.5% 25% 12.5% Setup 1 Target 1638 3276 SIZE MB/ S MB/ S 1638 4 372 1805 3276 8 400 2387
  • 43. CHEP 2012, 21-25 May 2012, Ney York, USA - Upgrade of the CMS Event Builder - Andrea Petrucci - CERN (PH/CMD) 43 Summary  New readout system must be compatible with old data sources and new data sources  New DAQ column architecture replaces aging existing hardware and improves readout performance  Feasibility studies on advanced networking technologies for event builder  Different network technologies under investigation  Possibly Quality of Service will give high throughput and scalability
  • 44. CHEP 2012, 21-25 May 2012, Ney York, USA - Upgrade of the CMS Event Builder - Andrea Petrucci - CERN (PH/CMD) 44 Thank you for you attention!
  • 45. CHEP 2012, 21-25 May 2012, Ney York, USA - Upgrade of the CMS Event Builder - Andrea Petrucci - CERN (PH/CMD) 45 Backup
  • 46. CHEP 2012, 21-25 May 2012, Ney York, USA - Upgrade of the CMS Event Builder - Andrea Petrucci - CERN (PH/CMD) 46  Replicated building blocks Scalable cluster system architecture Architecture Foundation (II) Executive Executive Executive Executive
  • 47. CHEP 2012, 21-25 May 2012, Ney York, USA - Upgrade of the CMS Event Builder - Andrea Petrucci - CERN (PH/CMD) 47 Software Distribution Core framework Reusable applications CMS specific applications coretools powerpack worksuite
  • 48. CHEP 2012, 21-25 May 2012, Ney York, USA - Upgrade of the CMS Event Builder - Andrea Petrucci - CERN (PH/CMD) 48 Layered View Online Software Coretools Powerpack Worksuite Platforms Event Builders Front-end Controllers Data Monitoring Error and Alarming Job Control External System Interfaces Hardware Access OS Abstraction Communication Subsystems Executive Framework Operating Systems Networking Infrastructures Hardware Device Interfaces ConfigurationManagementSupport User Interfaces Detector Specific Applications
  • 49. CHEP 2012, 21-25 May 2012, Ney York, USA - Upgrade of the CMS Event Builder - Andrea Petrucci - CERN (PH/CMD) 49 Timeline 2012 2008 2006 2004 2002 2000 First version of XDAQ - I2O communication kernel XDAQ 2 – Web enters DAQ (SOAP) XDAQ 3 – experiment wide adoption XMAS – monitoring, orthogonal to applications Commissioning and first beam event successfully achieved Well consolidated after twelve years of development and use Successefully used in the first tree years of LHC
  • 50. CHEP 2012, 21-25 May 2012, Ney York, USA - Upgrade of the CMS Event Builder - Andrea Petrucci - CERN (PH/CMD) 50 Outlook 0.5 - 2 KB/msg 128 KB - 4 MB/msg Tuples Tables O(15000) msg/s O(100) msg/s O(10) msg/s Visualization  Integrated web technologies  Monitoring  Errors and Alarms  Reusable Event Builder