SlideShare a Scribd company logo
Enabling Grids for E-sciencE




                           Distributed Data and gLite

                           Steven Newhouse
                           Technical Director
                           CERN




www.eu-egee.org


EGEE-III INFSO-RI-222667                               EGEE and gLite are registered trademarks
The Data Deluge
                           Enabling Grids for E-sciencE



    •   Astronomy
    •   Genomics
    •   Earth Observation
    •   Digitisation




                                                                   Crab Nebula




                                                          X-ray               Optical
EGEE-III INFSO-RI-222667                                             Data Day - Grid School 2009   2
... And the LHC
                           Enabling Grids for E-sciencE




                                                   X      X

EGEE-III INFSO-RI-222667                                      Data Day - Grid School 2009   3
High throughput data analysis
                           Enabling Grids for E-sciencE



    • Analysing the data
          – Large ensemble calculations (100’s10,000’s jobs)
          – Complex workflows – dependent on previous steps
    • High Throughput
          – Exploit distributed computing and storage resources
                 Data replicated (multiple locations)
                 Resources selected through a broker
                     • WMS: Workload Management System
                     • Higher level tools: GANGA, DIANE, ...
          – Information system records the available resources
          – File catalogue records the location of replicated files
    • Data stored in files
          – Growing interest in relational data access
          – Stored on tape (long-term) or disk (immediate access)
EGEE-III INFSO-RI-222667                                       Data Day - Grid School 2009   4
Project Overview
                           Enabling Grids for E-sciencE


17000 users
136000 LCPUs (cores)
25Pb disk
39Pb tape

12 million jobs/month
    +45% in a year
268 sites
    +5% in a year
48 countries
    +10% in a year
162 VOs
    +29% in a year



EGEE-III INFSO-RI-222667                                     Data Day - Grid School 2009   5
So what does EGEE actually do?
                           Enabling Grids for E-sciencE



    • Builds and supports user communities on the grid

                                                          Application             User
                           Training
                                                           Porting               Support


    • Integrates and provides a worldwide infrastructure

      Software                       Integration,
                                        Test &                      Deployment                   Operations
     Development                     Certification

    • Collaboration and Technical Leadership worldwide

                     Collaborating
                                                             Standards                      Policy
                       Projects


EGEE-III INFSO-RI-222667             Technical Status - Steven Newhouse - EGEE-III First Review 24-25 June 2009   6
Supporting Science
                           Enabling Grids for E-sciencE


•    Archeology
          End-user activity              Resource Utilisation
•    Astronomy
••    13,000 end-users in 112 VOs Computational
     Astrophysics                        Chemistry
•      • Protection
     Civil +44% users in a year         Life Sciences

•    Comp. Chemistry                  Multidisciplinary
•    Earth Sciences                      Astronomy &

•    Finance                             Astrophysics

                                        Earth Science
•    Fusion
•    Geophysics                                 Fusion


•    High Energy Physics                 Other Areas

•    Life Sciences                                      0    1   2     3      4     5      6     7

•    Multimedia                  March 2008 to February 2009 (%) March 2007 to February 2008 (%)

•    Material Sciences                Proportion of HEP usage ~77%




EGEE-III INFSO-RI-222667                                              Data Day - Grid School 2009    7
Connecting Users to Resources
                           Enabling Grids for E-sciencE




                                                          Applications


                                                          Middleware

                                                Physical Resources




               Computers                                     Disks              Tape
EGEE-III INFSO-RI-222667                                                 Data Day - Grid School 2009   8
gLite Middleware
                                    Enabling Grids for E-sciencE


          EGEE Maintained Components Access
                                  User                                           External Components
                                                                                 User Interface
                                                                                 User Interface

                                            General Services                                      Virtual
                                   Workload       Logging &                                    Organisation
          BDII                    Management    Book keeping                       Hydra       Membership
                                    Service         Service                                      Service
           Information Services




                                  File Transfer                    LHC File                    Proxy Server
                                                                                  AMGA
                                     Service                       Catalogue
                                                                                                  Security
                                    Compute Element                            Storage            Services
                                                                               Element
                                                                                                    SCAS
                                  CREAM                LCG-CE                  Disk Pool
                                                                                              Authz. Service
                                                                               Manager
          MON                                 BLAH                                                 LCAS &
                                                                               dCache             LCMAPS
                                  gLExec          Worker Node


                                                         Physical Resources
EGEE-III INFSO-RI-222667                                                                   Data Day - Grid School 2009   9
Enabling Grids for E-sciencE



    • Contact:
          – steven.newhouse@cern.ch




EGEE-III INFSO-RI-222667                                  Data Day - Grid School 2009   10

More Related Content

PPTX
General Introduction to technologies that will be seen in the school
PPTX
Session 33 - Production Grids
PDF
OGF Standards Overview - ITU-T JCA Cloud
PPT
[.ppt]
PDF
OGF Introductory Overview - FAS* 2014
PPT
Gridforum Juergen Knobloch Grids For Science 20080402
PPT
Cyberinfrastructure and its Role in Science
PDF
OGF standards for cloud computing
General Introduction to technologies that will be seen in the school
Session 33 - Production Grids
OGF Standards Overview - ITU-T JCA Cloud
[.ppt]
OGF Introductory Overview - FAS* 2014
Gridforum Juergen Knobloch Grids For Science 20080402
Cyberinfrastructure and its Role in Science
OGF standards for cloud computing

What's hot (20)

PDF
Ben Evans SPEDDEXES 2014
PDF
OGF Introductory Overview - OGF 44 at EGI Conference 2015
PDF
Pathways for EOSC-hub and MaX collaboration
PPT
Calit2-a Persistent UCSD/UCI Framework for Collaboration
PDF
Cloud Testbeds for Standards Development and Innovation
PPTX
Using a Widely Distributed Federated Cloud System to Support Multiple Dispara...
PDF
OCCI - The Open Cloud Computing Interface – flexible, portable, interoperable...
PDF
GlobusWorld 2021: Arecibo Observatory Data Movement
PDF
Tutorial on Hybrid Data Infrastructures: D4Science as a case study
PDF
Using e-Infrastructures for Biodiversity Conservation
PDF
Big Data, Beyond the Data Center
PPT
111018 geo sif_aq_interop
PPTX
Cloud for Research and Innovation - UK USA HPC workshop, Oxford, July 205
PDF
Mateo Valero - Big data: de la investigación científica a la gestión empresarial
PDF
Using a Widely Distributed Federated Cloud System to Support Multiple Dispara...
PDF
SCAPE - Building Digital Preservation Infrastructure
PDF
NSF CAC Cloud Interoperability Testbed Projects
PPTX
Enabling efficient movement of data into & out of a high-performance analysis...
PDF
Ppt5 exp lonodn - kevin cope & alex yakimov ( imperial college ) data cent...
PDF
Bridging Environmental Data Providers and SeaDataNet DIVA Service within a Co...
Ben Evans SPEDDEXES 2014
OGF Introductory Overview - OGF 44 at EGI Conference 2015
Pathways for EOSC-hub and MaX collaboration
Calit2-a Persistent UCSD/UCI Framework for Collaboration
Cloud Testbeds for Standards Development and Innovation
Using a Widely Distributed Federated Cloud System to Support Multiple Dispara...
OCCI - The Open Cloud Computing Interface – flexible, portable, interoperable...
GlobusWorld 2021: Arecibo Observatory Data Movement
Tutorial on Hybrid Data Infrastructures: D4Science as a case study
Using e-Infrastructures for Biodiversity Conservation
Big Data, Beyond the Data Center
111018 geo sif_aq_interop
Cloud for Research and Innovation - UK USA HPC workshop, Oxford, July 205
Mateo Valero - Big data: de la investigación científica a la gestión empresarial
Using a Widely Distributed Federated Cloud System to Support Multiple Dispara...
SCAPE - Building Digital Preservation Infrastructure
NSF CAC Cloud Interoperability Testbed Projects
Enabling efficient movement of data into & out of a high-performance analysis...
Ppt5 exp lonodn - kevin cope & alex yakimov ( imperial college ) data cent...
Bridging Environmental Data Providers and SeaDataNet DIVA Service within a Co...
Ad

Viewers also liked (7)

DOC
Application Form
PDF
Session10part1 Server Intro
PDF
Session5 T Infr Access Emidio
PDF
Session 40 : SAGA Overview and Introduction
PDF
Issgc Welcome
PDF
Session10part2 Servers Detailed
PPTX
Session 50 - High Performance Computing Ecosystem in Europe
Application Form
Session10part1 Server Intro
Session5 T Infr Access Emidio
Session 40 : SAGA Overview and Introduction
Issgc Welcome
Session10part2 Servers Detailed
Session 50 - High Performance Computing Ecosystem in Europe
Ad

Similar to Session 23 - Intro to EGEE-III (20)

PDF
EGEE 3 Project
PDF
Grid07 2 Kranzlmuller
PPT
A View on eScience
PDF
Session 23 - gLite Overview
PPTX
Deroure Repo3
PPTX
Deroure Repo3
PDF
Grid07 7 Gagliardi
PPT
Prtesentation17 12 09 2
PDF
Computing - Delivering Innovative Research
PPTX
g-Social - Enhancing e-Science Tools with Social Networking Functionality
PPT
Ticer summer school_24_aug06
PDF
"Parallel and Distributed Computing: BOINC Grid Implementation" por Rodrigo N...
PDF
Parallel and Distributed Computing: BOINC Grid Implementation Paper
PPTX
PDF
Datos enlazados BNE and MARiMbA
PDF
Understanding the Big Picture of e-Science
PPT
If we build it will they come? BOSC2012 Keynote Goble
PPT
UK e-Infrastructure: Widening Access, Increasing Participation
PPT
How we understand research practices: The example of the semantic spider
PPTX
Open data and Collaborative Governance (the UW lecture)
EGEE 3 Project
Grid07 2 Kranzlmuller
A View on eScience
Session 23 - gLite Overview
Deroure Repo3
Deroure Repo3
Grid07 7 Gagliardi
Prtesentation17 12 09 2
Computing - Delivering Innovative Research
g-Social - Enhancing e-Science Tools with Social Networking Functionality
Ticer summer school_24_aug06
"Parallel and Distributed Computing: BOINC Grid Implementation" por Rodrigo N...
Parallel and Distributed Computing: BOINC Grid Implementation Paper
Datos enlazados BNE and MARiMbA
Understanding the Big Picture of e-Science
If we build it will they come? BOSC2012 Keynote Goble
UK e-Infrastructure: Widening Access, Increasing Participation
How we understand research practices: The example of the semantic spider
Open data and Collaborative Governance (the UW lecture)

More from ISSGC Summer School (20)

PDF
Session 58 - Cloud computing, virtualisation and the future
PDF
Session 58 :: Cloud computing, virtualisation and the future Speaker: Ake Edlund
PPT
Integrating Practical2009
PPT
Session 49 Practical Semantic Sticky Note
PDF
PPT
Session 48 - Principles of Semantic metadata management
PPT
Session 49 - Semantic metadata management practical
PPT
Session 46 - Principles of workflow management and execution
PPT
Session 42 - GridSAM
PPT
Session 37 - Intro to Workflows, API's and semantics
PPT
Session 43 :: Accessing data using a common interface: OGSA-DAI as an example
PPT
Session 36 - Engage Results
PDF
Social Program
PPT
Session29 Arc
PDF
Session 24 - Distribute Data and Metadata Management with gLite
PPT
Session 3-Distributed System Principals
PPT
Session18 Madduri
PDF
Session6 Security Emidio
PDF
Session9part1
PDF
Session19 Globus
Session 58 - Cloud computing, virtualisation and the future
Session 58 :: Cloud computing, virtualisation and the future Speaker: Ake Edlund
Integrating Practical2009
Session 49 Practical Semantic Sticky Note
Session 48 - Principles of Semantic metadata management
Session 49 - Semantic metadata management practical
Session 46 - Principles of workflow management and execution
Session 42 - GridSAM
Session 37 - Intro to Workflows, API's and semantics
Session 43 :: Accessing data using a common interface: OGSA-DAI as an example
Session 36 - Engage Results
Social Program
Session29 Arc
Session 24 - Distribute Data and Metadata Management with gLite
Session 3-Distributed System Principals
Session18 Madduri
Session6 Security Emidio
Session9part1
Session19 Globus

Recently uploaded (20)

PPTX
GDM (1) (1).pptx small presentation for students
PDF
Saundersa Comprehensive Review for the NCLEX-RN Examination.pdf
PPTX
Cell Structure & Organelles in detailed.
PPTX
PPH.pptx obstetrics and gynecology in nursing
PPTX
BOWEL ELIMINATION FACTORS AFFECTING AND TYPES
PDF
Basic Mud Logging Guide for educational purpose
PPTX
Open Quiz Monsoon Mind Game Final Set.pptx
PDF
Abdominal Access Techniques with Prof. Dr. R K Mishra
PDF
2.FourierTransform-ShortQuestionswithAnswers.pdf
PDF
Origin of periodic table-Mendeleev’s Periodic-Modern Periodic table
PPTX
IMMUNITY IMMUNITY refers to protection against infection, and the immune syst...
PPTX
Pharmacology of Heart Failure /Pharmacotherapy of CHF
PDF
Pre independence Education in Inndia.pdf
PPTX
Open Quiz Monsoon Mind Game Prelims.pptx
PPTX
Introduction to Child Health Nursing – Unit I | Child Health Nursing I | B.Sc...
PDF
Anesthesia in Laparoscopic Surgery in India
PDF
BÀI TẬP TEST BỔ TRỢ THEO TỪNG CHỦ ĐỀ CỦA TỪNG UNIT KÈM BÀI TẬP NGHE - TIẾNG A...
PDF
STATICS OF THE RIGID BODIES Hibbelers.pdf
PPTX
Week 4 Term 3 Study Techniques revisited.pptx
PPTX
Pharma ospi slides which help in ospi learning
GDM (1) (1).pptx small presentation for students
Saundersa Comprehensive Review for the NCLEX-RN Examination.pdf
Cell Structure & Organelles in detailed.
PPH.pptx obstetrics and gynecology in nursing
BOWEL ELIMINATION FACTORS AFFECTING AND TYPES
Basic Mud Logging Guide for educational purpose
Open Quiz Monsoon Mind Game Final Set.pptx
Abdominal Access Techniques with Prof. Dr. R K Mishra
2.FourierTransform-ShortQuestionswithAnswers.pdf
Origin of periodic table-Mendeleev’s Periodic-Modern Periodic table
IMMUNITY IMMUNITY refers to protection against infection, and the immune syst...
Pharmacology of Heart Failure /Pharmacotherapy of CHF
Pre independence Education in Inndia.pdf
Open Quiz Monsoon Mind Game Prelims.pptx
Introduction to Child Health Nursing – Unit I | Child Health Nursing I | B.Sc...
Anesthesia in Laparoscopic Surgery in India
BÀI TẬP TEST BỔ TRỢ THEO TỪNG CHỦ ĐỀ CỦA TỪNG UNIT KÈM BÀI TẬP NGHE - TIẾNG A...
STATICS OF THE RIGID BODIES Hibbelers.pdf
Week 4 Term 3 Study Techniques revisited.pptx
Pharma ospi slides which help in ospi learning

Session 23 - Intro to EGEE-III

  • 1. Enabling Grids for E-sciencE Distributed Data and gLite Steven Newhouse Technical Director CERN www.eu-egee.org EGEE-III INFSO-RI-222667 EGEE and gLite are registered trademarks
  • 2. The Data Deluge Enabling Grids for E-sciencE • Astronomy • Genomics • Earth Observation • Digitisation Crab Nebula X-ray Optical EGEE-III INFSO-RI-222667 Data Day - Grid School 2009 2
  • 3. ... And the LHC Enabling Grids for E-sciencE X X EGEE-III INFSO-RI-222667 Data Day - Grid School 2009 3
  • 4. High throughput data analysis Enabling Grids for E-sciencE • Analysing the data – Large ensemble calculations (100’s10,000’s jobs) – Complex workflows – dependent on previous steps • High Throughput – Exploit distributed computing and storage resources  Data replicated (multiple locations)  Resources selected through a broker • WMS: Workload Management System • Higher level tools: GANGA, DIANE, ... – Information system records the available resources – File catalogue records the location of replicated files • Data stored in files – Growing interest in relational data access – Stored on tape (long-term) or disk (immediate access) EGEE-III INFSO-RI-222667 Data Day - Grid School 2009 4
  • 5. Project Overview Enabling Grids for E-sciencE 17000 users 136000 LCPUs (cores) 25Pb disk 39Pb tape 12 million jobs/month +45% in a year 268 sites +5% in a year 48 countries +10% in a year 162 VOs +29% in a year EGEE-III INFSO-RI-222667 Data Day - Grid School 2009 5
  • 6. So what does EGEE actually do? Enabling Grids for E-sciencE • Builds and supports user communities on the grid Application User Training Porting Support • Integrates and provides a worldwide infrastructure Software Integration, Test & Deployment Operations Development Certification • Collaboration and Technical Leadership worldwide Collaborating Standards Policy Projects EGEE-III INFSO-RI-222667 Technical Status - Steven Newhouse - EGEE-III First Review 24-25 June 2009 6
  • 7. Supporting Science Enabling Grids for E-sciencE • Archeology End-user activity Resource Utilisation • Astronomy •• 13,000 end-users in 112 VOs Computational Astrophysics Chemistry • • Protection Civil +44% users in a year Life Sciences • Comp. Chemistry Multidisciplinary • Earth Sciences Astronomy & • Finance Astrophysics Earth Science • Fusion • Geophysics Fusion • High Energy Physics Other Areas • Life Sciences 0 1 2 3 4 5 6 7 • Multimedia March 2008 to February 2009 (%) March 2007 to February 2008 (%) • Material Sciences Proportion of HEP usage ~77% EGEE-III INFSO-RI-222667 Data Day - Grid School 2009 7
  • 8. Connecting Users to Resources Enabling Grids for E-sciencE Applications Middleware Physical Resources Computers Disks Tape EGEE-III INFSO-RI-222667 Data Day - Grid School 2009 8
  • 9. gLite Middleware Enabling Grids for E-sciencE EGEE Maintained Components Access User External Components User Interface User Interface General Services Virtual Workload Logging & Organisation BDII Management Book keeping Hydra Membership Service Service Service Information Services File Transfer LHC File Proxy Server AMGA Service Catalogue Security Compute Element Storage Services Element SCAS CREAM LCG-CE Disk Pool Authz. Service Manager MON BLAH LCAS & dCache LCMAPS gLExec Worker Node Physical Resources EGEE-III INFSO-RI-222667 Data Day - Grid School 2009 9
  • 10. Enabling Grids for E-sciencE • Contact: – steven.newhouse@cern.ch EGEE-III INFSO-RI-222667 Data Day - Grid School 2009 10