www.egi.eu
@EGI_eInfra
The work of the EGI Foundation
is partly funded by the European Commission
under H2020 Framework Programme
EGI: Advanced Computing for Research
eResearch Africa 2019
Distributed scientific
computing for open science
Technical Director, EGI Foundation
Tiziana Ferrari
@EGI_eInfrawww.egi.eu 17/04/2019 2
Outline
• EGI Federation
• Federation in action: Cloud, AAI and Data Management
• Towards Open Science
• Trends and Outlooks
About EGI
Advanced Computing for Research
EGI Federation (April 2019)
• ZA-UCT-ICTS (University of Cape Town, ICTS): HPC site
• ZA-UFS (University of the Free State Computing Centre): HPC site of the University of Bloemfontein
• ZA-WITS-CORE (University of the Witwatersrand CORE): Core research cluster of the University of the Witwatersrand
• 4.4 Billion CPU core wall time (2018)
• > 1 Million computing cores in 2019
• > 740 PB disk & tape
• 2,915 service end-points
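To put the wall-time figure in perspective, the following is a rough back-of-the-envelope calculation. It assumes the 4.4 billion figure is measured in CPU core hours (the slide gives no unit) and a 365-day year; both are assumptions, not statements from the slide.

```python
# Assumption: the 2018 figure is in CPU core hours (the slide states no unit).
CORE_HOURS_2018 = 4.4e9
HOURS_PER_YEAR = 365 * 24  # 8760 hours in a non-leap year


def average_busy_cores(core_hours: float, hours: float = HOURS_PER_YEAR) -> float:
    """Average number of cores that must run non-stop to accumulate core_hours."""
    return core_hours / hours


if __name__ == "__main__":
    print(f"{average_busy_cores(CORE_HOURS_2018):,.0f} cores busy on average")
```

Under these assumptions the result is on the order of half a million cores kept busy around the clock throughout 2018.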
A global system of e-Infrastructures worldwide
European Participants
EGI Service Portfolio
EGI Internal Service Portfolio
For the members of the federation
Benefits of Federation
• Leverages national e-Infrastructure investments
• Opens access to part of the nationally funded capacity
• Supports international user groups
• Integrates community, private and/or public infrastructures into a scalable data/computing platform for research
• Uses federated identities, authentication and authorization
• Ensures interoperability of scientific applications and data across multiple providers, bringing distributed computing to data
Federated Operations
Federation in action
Cloud, AAI and Data Management
EGI Federated Cloud
• Multi-cloud IaaS infrastructure with Single Sign-On
• Federation features:
  - Common VM image catalogue
  - Discovery, accounting, monitoring
  - Unified GUI dashboard
Services: Cloud Compute, Cloud Container Compute (BETA), Training Infrastructure, Online Storage, Applications on Demand (BETA), Notebooks (BETA)
Infrastructure
Architecture and Interfaces
• Tools to deal with heterogeneity:
  - IaaS orchestration tools with support for multiple APIs:
    o Infrastructure Manager, Terraform, OCCOPUS, …
    o https://wiki.egi.eu/wiki/Federated_Cloud_IaaS_Orchestration
  - IaaS libraries with support for multiple APIs:
    o libcloud, jclouds, …
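Libraries of this kind hide provider differences behind one common interface. The sketch below illustrates that general pattern only. It does not use the libcloud or jclouds APIs, makes no network calls, and every class, site and image name in it is hypothetical.

```python
from abc import ABC, abstractmethod
from dataclasses import dataclass


@dataclass(frozen=True)
class VM:
    site: str
    name: str
    image: str


class CloudDriver(ABC):
    """Common interface; each subclass would wrap one provider-specific API."""

    @abstractmethod
    def create_vm(self, name: str, image: str) -> VM:
        ...


class SiteADriver(CloudDriver):  # hypothetical stand-in for one provider API
    def create_vm(self, name: str, image: str) -> VM:
        # A real driver would call site A's native API here.
        return VM(site="site-a", name=name, image=image)


class SiteBDriver(CloudDriver):  # hypothetical stand-in for a different API
    def create_vm(self, name: str, image: str) -> VM:
        # A real driver would call site B's native API here.
        return VM(site="site-b", name=name, image=image)


def deploy_everywhere(drivers, name: str, image: str):
    """Start the same VM on every site through the shared interface."""
    return [driver.create_vm(name, image) for driver in drivers]


if __name__ == "__main__":
    for vm in deploy_everywhere([SiteADriver(), SiteBDriver()], "worker-1", "example-image"):
        print(vm)
```

The calling code never branches on the provider type, which is what lets one deployment recipe target several heterogeneous sites.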
Architecture (Cloud provider view)
Federated AAI: Check-in
• Identity and Access Management solution
  - Single sign-on to services through eduGAIN, social media and other institutional or community-managed identity providers
  - Only one account needed for federated access to multiple heterogeneous (web and non-web) service providers using different technologies (SAML, OpenID Connect, OAuth 2.0, X509)
  - Identity linking enables access to resources using different login credentials (institutional/social)
  - Assurance information associated with each authenticated identity
  - Aggregation and harmonisation of authorisation information (VOs/groups, roles, assurance) from multiple sources
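Identity linking and the aggregation of authorisation information can be pictured with a small in-memory model. This is a minimal sketch under stated assumptions, not how Check-in is implemented: the `Account` and `AccountRegistry` classes, the identity strings and the entitlement values are all hypothetical.

```python
from dataclasses import dataclass, field


@dataclass
class Account:
    """One federated account that several login identities can be linked to."""
    account_id: str
    linked_identities: set = field(default_factory=set)
    entitlements: set = field(default_factory=set)


class AccountRegistry:
    def __init__(self):
        self._by_identity = {}

    def link(self, account: Account, identity: str) -> None:
        """Attach a further login identity (institutional, social, ...) to an account."""
        account.linked_identities.add(identity)
        self._by_identity[identity] = account

    def resolve(self, identity: str) -> Account:
        """Return the single account behind whichever identity was used to log in."""
        return self._by_identity[identity]


def aggregate(account: Account, *sources: dict) -> None:
    """Merge entitlements for this account from several attribute sources."""
    for source in sources:
        account.entitlements |= set(source.get(account.account_id, ()))


if __name__ == "__main__":
    registry = AccountRegistry()
    alice = Account("acct-1")
    registry.link(alice, "institutional:alice@example.org")
    registry.link(alice, "social:alice-1234")

    vo_source = {"acct-1": ["vo:example-vo:member"]}        # hypothetical values
    role_source = {"acct-1": ["vo:example-vo:role=admin"]}  # hypothetical values
    aggregate(alice, vo_source, role_source)

    print(registry.resolve("social:alice-1234").entitlements)
```

Whichever credential is used at login, the service sees the same account and the same merged set of entitlements.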
Check-in: Identity Provider and Service Provider Proxy
• Conforms to the AARC blueprint architecture
• Registered in eduGAIN as an SP complying with REFEDS Research & Scholarship and Sirtfi
• All community SPs can have one statically configured IdP
• No need to run an IdP Discovery Service on each SP
• Connected SPs get consistent/harmonised user identifiers and accompanying attribute sets from different IdPs/AAs that can be interpreted in a uniform way for authorisation purposes
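The harmonisation step performed by a proxy can be sketched as a per-protocol attribute mapping. The source attribute names below (`eduPersonUniqueId`, `mail`, `displayName` for SAML; `sub`, `email`, `name` for OpenID Connect) are standard, but the mapping table, the target field names and the sample values are illustrative assumptions and not Check-in's actual configuration.

```python
# Illustrative mapping only: target field names are hypothetical.
ATTRIBUTE_MAPS = {
    "saml": {"eduPersonUniqueId": "user_id", "mail": "email", "displayName": "name"},
    "oidc": {"sub": "user_id", "email": "email", "name": "name"},
}


def harmonise(protocol: str, attributes: dict) -> dict:
    """Translate IdP-specific attribute names into one uniform schema."""
    mapping = ATTRIBUTE_MAPS[protocol]
    # Attributes without a mapping are dropped rather than passed through.
    return {mapping[key]: value for key, value in attributes.items() if key in mapping}


if __name__ == "__main__":
    from_saml = harmonise("saml", {
        "eduPersonUniqueId": "abc123@example.org",
        "mail": "alice@example.org",
        "displayName": "Alice",
    })
    from_oidc = harmonise("oidc", {
        "sub": "abc123@example.org",
        "email": "alice@example.org",
        "name": "Alice",
        "locale": "en",  # unmapped claim, dropped
    })
    print(from_saml == from_oidc)
```

A connected service provider then only has to understand the uniform schema, regardless of which identity provider authenticated the user.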
Check-in: IdP Discovery
Federated accounting
Federated Monitoring
Application Database 1/3
Cloud Marketplace
Application Database 2/3
Cloud Marketplace
Application Database 3/3
VM distribution and management
Federation of Data Repositories
• Heterogeneous backend storage
• Common interfaces (Web, REST, POSIX, CDMI)
• Common AAI with Check-in
• Discovery of Datasets in the EGI DataHub
Transparent Data Access
• Clients use one or more providers to access data
• Data can be accessed over multiple protocols
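From the client's point of view, using one or more providers can be as simple as trying each in turn until one serves the data. The sketch below shows that fallback idea only. The provider callables, the path and the `ProviderUnavailable` exception are hypothetical and do not represent an EGI DataHub API.

```python
class ProviderUnavailable(Exception):
    """Raised by a provider callable that cannot serve the request."""


def read_from_any(path: str, providers) -> bytes:
    """Return the data from the first provider that can serve the path."""
    errors = []
    for provider in providers:
        try:
            return provider(path)
        except ProviderUnavailable as exc:
            errors.append(str(exc))
    raise ProviderUnavailable("; ".join(errors) or "no providers configured")


if __name__ == "__main__":
    def provider_a(path):  # hypothetical provider that is offline
        raise ProviderUnavailable("provider A is offline")

    def provider_b(path):  # hypothetical provider that holds a replica
        return b"contents of " + path.encode()

    print(read_from_any("/space/dataset.csv", [provider_a, provider_b]))
```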
Data Caching
• Cloud provider A hosts data & computing resources
• Provider B only hosts data
• Provider X can use data from A and B:
  - Without pre-staging
  - Via pre-staging using APIs
  - Local data access “à la” POSIX with FUSE
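One way to access remote data without pre-staging is to fetch only the blocks a read actually touches and keep them in a local cache. The sketch below assumes such a block-wise, read-through cache. The slide does not say which mechanism is used, so the technique, the `BlockCache` class and the `fetch_block` callback are illustrative assumptions.

```python
class BlockCache:
    """Read-through cache: remote blocks are fetched on first access only."""

    def __init__(self, fetch_block, block_size: int = 4):
        self._fetch_block = fetch_block  # callable(index, size) -> bytes
        self._block_size = block_size
        self._blocks = {}
        self.remote_fetches = 0

    def read(self, offset: int, length: int) -> bytes:
        first = offset // self._block_size
        last = (offset + length - 1) // self._block_size
        data = bytearray()
        for index in range(first, last + 1):
            if index not in self._blocks:
                # Cache miss: pull just this block from the remote provider.
                self._blocks[index] = self._fetch_block(index, self._block_size)
                self.remote_fetches += 1
            data += self._blocks[index]
        start = offset - first * self._block_size
        return bytes(data[start:start + length])


if __name__ == "__main__":
    REMOTE = b"0123456789abcdef"  # stands in for a file held by provider A or B

    def fetch(index, size):
        return REMOTE[index * size:(index + 1) * size]

    cache = BlockCache(fetch)
    print(cache.read(2, 5), cache.remote_fetches)
    print(cache.read(4, 4), cache.remote_fetches)
```

Pre-staging corresponds to filling the cache ahead of time, whereas the on-demand path only transfers the parts of a dataset that the computation reads.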
Multi-cloud Analytics with Jupyter
Towards Open Science
Sharing of data, applications and research
outputs for scientific reproducibility
Chipster: sharing community applications
NBIS: Scalable bioinformatics web-servers
powered by cloud computing
ESA Geohazards Thematic Exploitation Platform
Cloud Containers on the EGI Federated Cloud
EGI Notebooks
Reproducible Open Science with EGI Notebooks, Binder, Zenodo
• The computational tools to solve a problem → Python, R, Julia, and a wide ecosystem of libraries and tools for science
• An interface to facilitate coding / creating → Jupyter
• A way to communicate work → Notebooks
• A way to share work → GitHub and other similar repositories
• A way to pack it all for replication → Docker
• A way to persistently identify it → DOIs (Digital Object Identifiers)
https://documents.egi.eu/document/3442
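The sharing and identification steps come down to two kinds of link: a Binder launch link for a notebook repository and a resolvable DOI link for its archived snapshot. The sketch below builds both. It assumes the public mybinder.org service and its `/v2/gh/<org>/<repo>/<ref>` URL scheme; the organisation, repository and DOI values are placeholders, and a community-run BinderHub would use a different host.

```python
from urllib.parse import quote


def binder_url(org: str, repo: str, ref: str = "HEAD",
               host: str = "https://mybinder.org") -> str:
    """Launch link for a GitHub repository on a BinderHub deployment."""
    return f"{host}/v2/gh/{quote(org)}/{quote(repo)}/{quote(ref, safe='')}"


def doi_url(doi: str) -> str:
    """Resolvable link for a DOI, for example one minted by Zenodo."""
    return f"https://doi.org/{quote(doi)}"


if __name__ == "__main__":
    # Placeholder repository and DOI, for illustration only.
    print(binder_url("example-org", "example-notebooks", "main"))
    print(doi_url("10.5281/zenodo.0000000"))
```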
Open Science in action
Usage Trends and Outlooks
Towards increasing integration of e-infrastructures
Installed Compute Capacity 2011-2019
CPU wall time consumed 2009-2018
Africa’s contribution to the federation
https://accounting.egi.eu/
Scientific Disciplines/HTC
65.6 Million hours for CTA/LOFAR/SKA
Scientific Disciplines/Cloud
Today’s scenario
• Difficult cross-border access due to different funding models and different access and provisioning policies
  - Data and service provisioning to international user communities is possible only when supported by sound business models or existing collaboration agreements. Today only a few structured international research groups have achieved this.
• Need for large investments in the creation, processing, preservation, access and reuse of research data → will the funding match the anticipated needs of future data-intensive science?
  - Opportunities for economies of scale and aggregation of demand can arise from joint provisioning of common infrastructure components
• Major separation between data preservation and data exploitation infrastructures in many disciplines
  - Research Infrastructures (RIs) and e-Infrastructures should collaborate to support the entire research workflow of an experiment
Tomorrow’s scenario
The International Data Commons
A federation of research data, computing, applications
and other open science resources, responding to the
problem of scalable access to research data through a
new data provisioning service approach that is
complementary to the traditional data download
model.
The Data Commons Should…
• Allow users to discover, access and analyze major research datasets and information for third-party exploitation
• Provide access to the data & data products close to processing facilities while avoiding duplication of local data storage & compute infrastructures across research performing organizations in Europe
The Data Commons Should…
• Offer a hybrid distributed compute platform (HTC, HPC,
cloud) and integrated rich portfolio of scientific application
tools supporting self-service provisioning
• Offer tools for scalable data movement across data
preservation infrastructures and distributed interconnected
network of “data hubs”
• Provide integrated capabilities for publishing and sharing
scientific outputs from experiments to support open science
• Support federated authentication and authorization for use
of existing personal credentials and easy to use access
channels
About the European Open Science Cloud
The federated infrastructure and supporting initiative providing all researchers, innovators, companies and citizens with seamless access to an open-by-default, efficient and cross-disciplinary environment for storing, accessing, reusing data, tools, publications and other scientific outputs for research, innovation and educational purposes
This work by the EGI Foundation
is licensed under a Creative Commons
Attribution 4.0 International License.
Questions?
Thank you
for your attention.
