What is CERNBox ?
IT Technical Forum - 12.06.2015
Luca Mascetti & Hugo Gonzalez Labrador
Data & Storage Services (IT-DSS)
Outline
• Description and Architecture
• User Community and Service Numbers
• Technical Aspects & Integration
• Success Stories & Future Use-cases
• Overview and Summary
Description and Architecture
What is CERNBox ?
CERNBox provides a cloud synchronisation service
• Available for all CERN users
• Synchronise files (data at CERN)
• Offline data access
• Easy way to share with other users
• All major platforms supported
• Based on ownCloud
• Uses EOS as storage backend
• EOS is the disk storage for physics data
• 70 PB of usable capacity installed
• 50% in Wigner
CERNBox Architecture
[Diagram: Sync Client and Web Access connect over https and webdav through HTTPS load balancers (HTTPS LB); OC Shares; EOSUSER storage backend; Direct Data Access via FUSE, xroot, gridftp, http, S3]
Synchronisation from other EOS instances with Sync Client is also possible
Available Access Methods
• Web Access
• Sync Client
• Mobile App
• WebDAV
• Directly from the storage backend EOSUSER (xroot, http, s3, …)
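The WebDAV access method can be exercised with any HTTP client. The following is a minimal sketch, assuming the ownCloud-style `remote.php/webdav` path is exposed under the service URL; the user name, password and file names are hypothetical, and the requests are only constructed here, not sent.

```python
import base64
import urllib.request
from urllib.parse import quote

# Assumption: the WebDAV path below is illustrative and not confirmed by the slides.
BASE_URL = "https://cernbox.cern.ch/remote.php/webdav"


def build_webdav_request(method, remote_path, user, password, data=None, depth=None):
    """Build (but do not send) an authenticated WebDAV request."""
    url = BASE_URL + "/" + quote(remote_path.lstrip("/"))
    req = urllib.request.Request(url, data=data, method=method)
    token = base64.b64encode(f"{user}:{password}".encode()).decode()
    req.add_header("Authorization", "Basic " + token)
    if depth is not None:
        # PROPFIND uses the Depth header to limit how far a listing recurses.
        req.add_header("Depth", str(depth))
    return req


# Upload a file (PUT) and list a folder (PROPFIND, direct children only).
put_req = build_webdav_request("PUT", "notes/plan.txt", "jdoe", "secret", data=b"hello")
list_req = build_webdav_request("PROPFIND", "notes", "jdoe", "secret", depth=1)
```

Passing either request to `urllib.request.urlopen` would perform the call; in practice a Kerberos or SSO-based login may be required instead of basic authentication.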
User Community and Service Numbers
CERNBox User Community
User community very active
• Very positive feedback
• Several useful suggestions
• Important contributions
• Users happy to help testing new features
Service numbers
• ~2350 users
• ~3000 shares
• ~20 million files stored
[Pie chart: Physicists, Engineers, Services & Administration; slices of 20%, 60% and 20%]
CERNBox Service Numbers
[Figure: Deployed Space and Used Space, March 2014 to May 2015, logarithmic axis from 10 GB to 10 PB; annotation: Migration from NFS to EOS]
Users: 2350
Files: 20 million
Directories: 1.8 million
Quota: 1 TB/user
Used space: 55 TB
Deployed space: 1.3 PB
EOS offers “virtually unlimited”
cloud-storage for our end-users
The total EOS installation at
CERN is around 70 PB usable
with the primary role of storing
physics data
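The service numbers above can be turned into a few derived figures. This is a back-of-the-envelope sketch that assumes decimal units (1 TB = 10^12 bytes) and treats the rounded slide values as exact; it gives roughly 23 GB used per user and under 3 MB per file on average.

```python
# Figures quoted on the slide (decimal units assumed: 1 TB = 1e12 bytes).
users = 2350
files = 20e6
used_bytes = 55e12        # 55 TB used
deployed_bytes = 1.3e15   # 1.3 PB deployed
quota_bytes = 1e12        # 1 TB quota per user

avg_per_user_gb = used_bytes / users / 1e9      # average space used per user
avg_file_mb = used_bytes / files / 1e6          # average file size
files_per_user = files / users                  # average number of files per user
fill_fraction = used_bytes / deployed_bytes     # share of deployed space in use
quota_total_pb = users * quota_bytes / 1e15     # space needed if every quota were full

print(f"average used per user: {avg_per_user_gb:.1f} GB")
print(f"average file size:     {avg_file_mb:.2f} MB")
print(f"files per user:        {files_per_user:.0f}")
print(f"deployed space in use: {fill_fraction:.1%}")
print(f"sum of all quotas:     {quota_total_pb:.2f} PB")
```

The sum of all quotas (2.35 PB) exceeds the deployed 1.3 PB, which works because average usage is a small fraction of the 1 TB quota.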
[Figure: Users, March 2014 to May 2015; axis from 0 to 2500]
Current System Usage
[Figure: Daily User Access Pattern, request rate in Hz (0 to 35) by hour of day from 4:00 to 4:00; annotations: Lunch Break, Dinner, Late night work?]
[Figure: CERNBox Weekly User Access Pattern, request rate in Hz (0 to 50), Sunday to Sunday]
[Figure: request rate between 7:42:40 and 7:43:05, axis from 0 Hz to 1200 Hz]
Peak requests at 1.1 kHz, and the system can sustain much more
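A request rate in Hz, as plotted above, is simply the number of requests counted per second. The sketch below shows one way to find a peak rate from a list of request timestamps; the function names and the sample data are made up for illustration and are not taken from the CERNBox monitoring.

```python
from collections import Counter


def requests_per_second(timestamps):
    """Count requests in each whole-second bucket; the count is the rate in Hz."""
    return Counter(int(t) for t in timestamps)


def peak_rate(timestamps):
    """Return (second, rate_hz) for the busiest one-second bucket."""
    buckets = requests_per_second(timestamps)
    second, count = max(buckets.items(), key=lambda item: item[1])
    return second, count


# Synthetic timestamps in seconds: 3 requests in second 0, 5 in second 1, 2 in second 2.
sample = [0.1, 0.5, 0.9, 1.0, 1.2, 1.4, 1.6, 1.8, 2.3, 2.7]
busiest_second, rate_hz = peak_rate(sample)
```

A peak of 1.1 kHz corresponds to a bucket holding about 1100 requests.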
Technical aspects and integration
Technical aspects
• Innovative integration of user environments & huge data repositories
• Integration of CERNBox with
• CERN SSO (in testing)
• E-Groups (in testing)
• ROOT Viewer (in testing)
• Architecture: ownCloud vs. CERNBox
• Cool features: Trash & Versions
• Testing CERNBox: SmashBox
Integration with CERN SSO
Nested e-groups
Embedded ROOT Viewer
The viewer is based on the ROOT data analysis framework developed at CERN by PH-SFT.
Integration done by the CERNBox team.
DEMO
Vanilla ownCloud
[Diagram: Web Application Server, Database and Storage; labels NAMESPACE, ACLs and DATA; arrows for metadata ops and data ops; notes: background scan and population, ACL consistency; caption: ownCloud Namespace]
CERNBox
[Diagram: Web Application Server with the CERNBox metadata plugin and a Primary Object Store; labels ACLs, DATA and NAMESPACE; arrows for metadata ops and data ops; notes: ACLs consistency, real owner (no apache/www-data)]
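One reading of the two diagrams is that vanilla ownCloud keeps the namespace in a database that has to be kept in step with the storage, while CERNBox serves namespace, ACLs and data from the storage itself. The toy model below illustrates that difference only; every class name is hypothetical and none of it reflects the real ownCloud or EOS code.

```python
class Storage:
    """Toy storage backend mapping path -> bytes."""

    def __init__(self):
        self.files = {}

    def write(self, path, data):
        self.files[path] = data

    def listing(self):
        return sorted(self.files)


class CachedNamespaceServer:
    """Vanilla-style layout: the namespace lives in a separate database."""

    def __init__(self, storage):
        self.storage = storage
        self.db = set()

    def upload(self, path, data):
        self.storage.write(path, data)   # data op goes to storage
        self.db.add(path)                # metadata op goes to the database

    def listing(self):
        return sorted(self.db)           # answers come from the database copy

    def background_scan(self):
        self.db = set(self.storage.files)  # repopulate the database from storage


class StorageNamespaceServer:
    """CERNBox-style layout: the namespace is read from the storage itself."""

    def __init__(self, storage):
        self.storage = storage

    def listing(self):
        return self.storage.listing()


storage = Storage()
cached = CachedNamespaceServer(storage)
direct = StorageNamespaceServer(storage)

cached.upload("a.txt", b"1")
storage.write("b.root", b"2")    # direct write that bypasses the web layer

stale = cached.listing()         # the database has not seen b.root yet
fresh = direct.listing()         # the storage namespace already includes it
cached.background_scan()
after_scan = cached.listing()    # consistent again only after the scan
```

In this model, files written directly to the backend are visible immediately only when the namespace is served from the storage.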
Cool features
Versions feature
Trash bin feature
SmashBox
• Extensive Test Framework
• Developed by CERNBox team
• Validate integration + operational state
• Avoid regression
• Successful outside CERN
• External contributions
• Other sites use it (e.g. SWITCH)
• Part of QA cycle of ownCloud
https://github.com/cernbox
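The kind of check a synchronisation test framework performs can be sketched as follows: create files on one side, run a sync, and verify that both sides hold identical content. This is a generic illustration and does not use the SmashBox API; `copy_sync` is a hypothetical stand-in for a real sync client.

```python
import hashlib
import shutil
import tempfile
from pathlib import Path


def tree_checksums(root):
    """Map each file's relative path to the SHA-256 of its content."""
    root = Path(root)
    return {
        str(p.relative_to(root)): hashlib.sha256(p.read_bytes()).hexdigest()
        for p in sorted(root.rglob("*"))
        if p.is_file()
    }


def check_sync(sync, files):
    """Create `files` in one folder, run `sync`, and compare both folders."""
    with tempfile.TemporaryDirectory() as src, tempfile.TemporaryDirectory() as dst:
        for name, data in files.items():
            target = Path(src) / name
            target.parent.mkdir(parents=True, exist_ok=True)
            target.write_bytes(data)
        sync(src, dst)
        return tree_checksums(src) == tree_checksums(dst)


def copy_sync(src, dst):
    """Stand-in for a real sync client: a plain recursive copy (Python 3.8+)."""
    shutil.copytree(src, dst, dirs_exist_ok=True)


ok = check_sync(copy_sync, {"a.txt": b"x", "d/b.bin": b"y"})
```

Swapping `copy_sync` for a function that drives an actual client turns the same comparison into a regression test.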
Success Stories
E-Science
Thanks to Mauro Arcorace, members of UNITAR/UNOSAT and CIMA foundation for the material provided
Problem:
- how to get data at CERN easily?
- how to easily use CERN resources? (e.g. non-physicists)
CERNBox is an easy way to integrate our storage resources for non-expert end-users who may use different OSs
Using EOS as the backend allows access to the data from batch nodes or from other locations via https or xroot
…and it’s very simple to share results with collaborators…
Run2 Event: Photos Sharing
[Diagram labels: ALICE, ATLAS, CMS, CCC, LHCb, DG-CO]
CERNBox was used to synchronise photos between photographers and the communication team
After the selection the photos were uploaded to CDS
Videos: Pre-Release to Press Office
Big video files uploaded to EOS with xrdcp
CERNBox used by the Press Office to share videos immediately with the media for download
After encoding, published on CDS and archived
CERNBox was tweaked to support large files (~30 GB) and redirect downloads directly to our storage nodes on EOS (with replication 3) to sustain peak requests
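Redirecting downloads to the storage nodes means the web front end only answers with a location, and the bytes flow from a node holding a replica. The sketch below is a toy model, assuming a simple round-robin choice among three replica nodes; the slides do not describe how EOS actually selects a replica, and the host names and path are hypothetical.

```python
import itertools
from collections import Counter


class Redirector:
    """Toy front end: answers each download with the URL of a storage node."""

    def __init__(self, replica_nodes):
        # Assumption: replicas are chosen in round-robin order.
        self._next_node = itertools.cycle(replica_nodes)

    def redirect(self, path):
        node = next(self._next_node)
        return f"https://{node}{path}"


# With replication 3, three nodes hold a copy of the same file.
nodes = ["fst1.example.org", "fst2.example.org", "fst3.example.org"]
front_end = Redirector(nodes)

targets = [front_end.redirect("/press/video.mp4") for _ in range(6)]
load = Counter(url.split("/")[2] for url in targets)  # downloads served per node
```

Spreading six downloads over three replicas leaves two per node, which is the idea behind using replication to sustain peak requests for large files.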
Future use-cases: lxplus and lxbatch integration
WORK IN PROGRESS
[Diagram: batch farm JOB with input synchronisation, Processing and lazy output synchronisation; “Kerberised” FUSE mount]
Choose What to Sync
DEMO
Overview and Summary
Features Overview
• Direct access to EOSUSER (and not only…)
• not only Sync Client & Web
• xroot, fuse, http/WebDAV
• e-group and SSO integration
• Access to Physics Data
• synchronise experiment’s data
• ROOT files viewer
• Shared kerberised fuse access from lxplus & lxbatch
Summary
• New service
• Fast growing
• Very good feedback
• Full integration with petabyte storage
• Integration with existing workflows
• Bring data closer to our users
• New way to interact with your data
• We believe CERNBox is an innovative platform for scientific computing
What is CERNBox ?
Not yet a user? Try it out!
Log in with your NICE account
https://cernbox.cern.ch
Download the Client or the App
https://cern.ch/cernbox-resources
