SlideShare a Scribd company logo
Grid	
  Computing	
  Overview
Federating	
  Compute	
  and	
  Storage	
  Resources	
  to	
  
  Accelerate	
  Research	
  and	
  Aid	
  Collaboration




              Ian	
  Stokes-­‐Rees,	
  PhD
             Harvard,	
  Cambridge,	
  USA

      http://ac.seas.harvard.edu
       ijstokes@seas.harvard.edu
Slides	
  and	
  Contact
         ijstokes@seas.harvard.edu

         http://linkedin.com/in/ijstokes
         http://slidesha.re/ijstokes-hbs2012_pt1
         http://slidesha.re/ijstokes-hbs2012_pt2


        http://ac.seas.harvard.edu
        http://www.opensciencegrid.org
        http://www.xsede.org


Grid Overview - Ian Stokes-Rees         ijstokes@seas.harvard.edu
Today
                    Grid	
  Computing	
  Origins
                        High	
  Energy	
  Physics
                    Life	
  Science	
  Computing	
  Survey
                    US	
  Cyberinfrastructure
                        XSEDE
                        Open	
  Science	
  Grid
                    Science	
  Portals
                    Data	
  Access
                    User	
  Credentials
                    Security


Grid Overview - Ian Stokes-Rees                       ijstokes@seas.harvard.edu
Things	
  ISummary About
                        ’m	
  Excited	
  




Grid Overview - Ian Stokes-Rees   ijstokes@seas.harvard.edu
About	
  Me




Grid Overview - Ian Stokes-Rees             ijstokes@seas.harvard.edu
High	
  Energy	
  Physics




Grid Overview - Ian Stokes-Rees       ijstokes@seas.harvard.edu
Grid Overview - Ian Stokes-Rees   ijstokes@seas.harvard.edu
Grid Overview - Ian Stokes-Rees   ijstokes@seas.harvard.edu
Grid Overview - Ian Stokes-Rees   ijstokes@seas.harvard.edu
40	
  MHz	
  bunch	
  crossing	
  rate
      10	
  million	
  data	
  channels
      1	
  KHz	
  level	
  1	
  event	
  recording	
  rate
      1-­10	
  MB	
  per	
  event
      14	
  hours	
  per	
  day,	
  7+	
  months	
  /	
  year
      4	
  detectors
      6	
  PB	
  of	
  data	
  /	
  year
      globally	
  distribute	
  data	
  for	
  analysis	
  (x2)



Grid Overview - Ian Stokes-Rees                                   ijstokes@seas.harvard.edu
Data	
  and	
  Compute	
  Intensive	
  
          Life	
  Sciences
Study	
  of	
  Protein	
  Structure	
  
                     and	
  Function




                                                                    400m
                            1mm




                                                                           10nm
                                  • Shared	
  scientiLic	
  data	
  collection	
  facility
                                  • Data	
  intensive	
  (10-­‐100	
  GB/day)
Grid Overview - Ian Stokes-Rees                                       ijstokes@seas.harvard.edu
Cryo	
  Electron	
  Microscopy




      • Previously,	
  1-­10,000	
  images,	
  managed	
  by	
  hand
      • Now,	
  robotic	
  systems	
  collect	
  millions	
  of	
  hi-­res	
  images
      • estimate	
  250,000	
  CPU-­hours	
  to	
  reconstruct	
  model
Grid Overview - Ian Stokes-Rees                              ijstokes@seas.harvard.edu
Molecular	
  Dynamics	
  Simulations
                                      1	
  fs	
  time	
  step
                                      1ns	
  snapshot
                                      1	
  us	
  simulation
                                      1e6	
  steps
                                      1000	
  frames
                                      10	
  MB	
  /	
  frame
                                      10	
  GB	
  /	
  sim
                                      20	
  CPU-­years
                                      3	
  months	
  (wall-­
                                      clock)

Grid Overview - Ian Stokes-Rees   ijstokes@seas.harvard.edu

More Related Content

PDF
Making Data Analytics Awesome
PDF
2012 02 pre_hbs_grid_overview_ianstokesrees_pt2
PDF
2011 11 pre_cs50_accelerating_sciencegrid_ianstokesrees
PDF
2011 10 pre_broad_grid_overview_ianstokesrees
PDF
SBGrid Science Portal - eScience 2012
PDF
Adapting federated cyberinfrastructure for shared data collection facilities ...
PDF
Python Blaze Overview
KEY
Grid Computing Overview
Making Data Analytics Awesome
2012 02 pre_hbs_grid_overview_ianstokesrees_pt2
2011 11 pre_cs50_accelerating_sciencegrid_ianstokesrees
2011 10 pre_broad_grid_overview_ianstokesrees
SBGrid Science Portal - eScience 2012
Adapting federated cyberinfrastructure for shared data collection facilities ...
Python Blaze Overview
Grid Computing Overview

Similar to 2012 02 pre_hbs_grid_overview_ianstokesrees_pt1 (20)

KEY
Big Data: tools and techniques for working with large data sets
PDF
Introduction to Next Generation Sequencing
PPTX
The Pacific Research Platform
 Two Years In
PPT
A National Big Data Cyberinfrastructure Supporting Computational Biomedical R...
PPTX
Science Engagement: A Non-Technical Approach to the Technical Divide
PPT
Health Sciences Driving UCSD Research Cyberinfrastructure
KEY
Data-Intensive Research
PPT
The eCrystals Federation
PDF
Data Capacitor II at Indiana University
PPT
UC-Wide Cyberinfrastructure for Data-Intensive Research
PPTX
Pacific Research Platform Application Drivers
PDF
PhDprofiles
PPT
Set My Data Free: High-Performance CI for Data-Intensive Research
PPT
Toward Real-Time Analysis of Large Data Volumes for Diffraction Studies by Ma...
PDF
December 9, 2015 NISO Webinar: Two-Part Webinar: Emerging Resource Types - Pa...
PPT
Building a Community Cyberinfrastructure to Support Marine Microbial Ecology ...
PDF
Program on Mathematical and Statistical Methods for Climate and the Earth Sys...
PPT
High Performance Geographic Information Systems
PDF
Database Systems Introduction Powerpoint
PDF
IEEE_BigData2014-Lee.pdf
Big Data: tools and techniques for working with large data sets
Introduction to Next Generation Sequencing
The Pacific Research Platform
 Two Years In
A National Big Data Cyberinfrastructure Supporting Computational Biomedical R...
Science Engagement: A Non-Technical Approach to the Technical Divide
Health Sciences Driving UCSD Research Cyberinfrastructure
Data-Intensive Research
The eCrystals Federation
Data Capacitor II at Indiana University
UC-Wide Cyberinfrastructure for Data-Intensive Research
Pacific Research Platform Application Drivers
PhDprofiles
Set My Data Free: High-Performance CI for Data-Intensive Research
Toward Real-Time Analysis of Large Data Volumes for Diffraction Studies by Ma...
December 9, 2015 NISO Webinar: Two-Part Webinar: Emerging Resource Types - Pa...
Building a Community Cyberinfrastructure to Support Marine Microbial Ecology ...
Program on Mathematical and Statistical Methods for Climate and the Earth Sys...
High Performance Geographic Information Systems
Database Systems Introduction Powerpoint
IEEE_BigData2014-Lee.pdf
Ad

More from Boston Consulting Group (7)

PPTX
Cloud-native Enterprise Data Science Teams
PPTX
Cloud-native Enterprise Data Science Teams
PPTX
Beyond the Science Gateway
PPTX
Anaconda Data Science Collaboration
PDF
Wide Search Molecular Replacement and the NEBioGrid portal interface
PDF
2010 06 pre_show_computing_lifesciences_stokesrees
PDF
To Infiniband and Beyond
Cloud-native Enterprise Data Science Teams
Cloud-native Enterprise Data Science Teams
Beyond the Science Gateway
Anaconda Data Science Collaboration
Wide Search Molecular Replacement and the NEBioGrid portal interface
2010 06 pre_show_computing_lifesciences_stokesrees
To Infiniband and Beyond
Ad

Recently uploaded (20)

PPTX
202450812 BayCHI UCSC-SV 20250812 v17.pptx
PDF
Chinmaya Tiranga quiz Grand Finale.pdf
PPTX
Digestion and Absorption of Carbohydrates, Proteina and Fats
PDF
Black Hat USA 2025 - Micro ICS Summit - ICS/OT Threat Landscape
PDF
OBE - B.A.(HON'S) IN INTERIOR ARCHITECTURE -Ar.MOHIUDDIN.pdf
PPTX
Cell Types and Its function , kingdom of life
PPTX
Final Presentation General Medicine 03-08-2024.pptx
PPTX
Introduction-to-Literarature-and-Literary-Studies-week-Prelim-coverage.pptx
PPTX
Introduction to Building Materials
PDF
GENETICS IN BIOLOGY IN SECONDARY LEVEL FORM 3
PPTX
Tissue processing ( HISTOPATHOLOGICAL TECHNIQUE
PDF
Classroom Observation Tools for Teachers
PPTX
Orientation - ARALprogram of Deped to the Parents.pptx
PDF
advance database management system book.pdf
PPTX
Final Presentation General Medicine 03-08-2024.pptx
PDF
Empowerment Technology for Senior High School Guide
PDF
What if we spent less time fighting change, and more time building what’s rig...
PDF
RMMM.pdf make it easy to upload and study
PPTX
CHAPTER IV. MAN AND BIOSPHERE AND ITS TOTALITY.pptx
PPTX
Radiologic_Anatomy_of_the_Brachial_plexus [final].pptx
202450812 BayCHI UCSC-SV 20250812 v17.pptx
Chinmaya Tiranga quiz Grand Finale.pdf
Digestion and Absorption of Carbohydrates, Proteina and Fats
Black Hat USA 2025 - Micro ICS Summit - ICS/OT Threat Landscape
OBE - B.A.(HON'S) IN INTERIOR ARCHITECTURE -Ar.MOHIUDDIN.pdf
Cell Types and Its function , kingdom of life
Final Presentation General Medicine 03-08-2024.pptx
Introduction-to-Literarature-and-Literary-Studies-week-Prelim-coverage.pptx
Introduction to Building Materials
GENETICS IN BIOLOGY IN SECONDARY LEVEL FORM 3
Tissue processing ( HISTOPATHOLOGICAL TECHNIQUE
Classroom Observation Tools for Teachers
Orientation - ARALprogram of Deped to the Parents.pptx
advance database management system book.pdf
Final Presentation General Medicine 03-08-2024.pptx
Empowerment Technology for Senior High School Guide
What if we spent less time fighting change, and more time building what’s rig...
RMMM.pdf make it easy to upload and study
CHAPTER IV. MAN AND BIOSPHERE AND ITS TOTALITY.pptx
Radiologic_Anatomy_of_the_Brachial_plexus [final].pptx

2012 02 pre_hbs_grid_overview_ianstokesrees_pt1

  • 1. Grid  Computing  Overview Federating  Compute  and  Storage  Resources  to   Accelerate  Research  and  Aid  Collaboration Ian  Stokes-­‐Rees,  PhD Harvard,  Cambridge,  USA http://ac.seas.harvard.edu ijstokes@seas.harvard.edu
  • 2. Slides  and  Contact ijstokes@seas.harvard.edu http://linkedin.com/in/ijstokes http://slidesha.re/ijstokes-hbs2012_pt1 http://slidesha.re/ijstokes-hbs2012_pt2 http://ac.seas.harvard.edu http://www.opensciencegrid.org http://www.xsede.org Grid Overview - Ian Stokes-Rees ijstokes@seas.harvard.edu
  • 3. Today Grid  Computing  Origins High  Energy  Physics Life  Science  Computing  Survey US  Cyberinfrastructure XSEDE Open  Science  Grid Science  Portals Data  Access User  Credentials Security Grid Overview - Ian Stokes-Rees ijstokes@seas.harvard.edu
  • 4. Things  ISummary About ’m  Excited   Grid Overview - Ian Stokes-Rees ijstokes@seas.harvard.edu
  • 5. About  Me Grid Overview - Ian Stokes-Rees ijstokes@seas.harvard.edu
  • 6. High  Energy  Physics Grid Overview - Ian Stokes-Rees ijstokes@seas.harvard.edu
  • 7. Grid Overview - Ian Stokes-Rees ijstokes@seas.harvard.edu
  • 8. Grid Overview - Ian Stokes-Rees ijstokes@seas.harvard.edu
  • 9. Grid Overview - Ian Stokes-Rees ijstokes@seas.harvard.edu
  • 10. 40  MHz  bunch  crossing  rate 10  million  data  channels 1  KHz  level  1  event  recording  rate 1-­10  MB  per  event 14  hours  per  day,  7+  months  /  year 4  detectors 6  PB  of  data  /  year globally  distribute  data  for  analysis  (x2) Grid Overview - Ian Stokes-Rees ijstokes@seas.harvard.edu
  • 11. Data  and  Compute  Intensive   Life  Sciences
  • 12. Study  of  Protein  Structure   and  Function 400m 1mm 10nm • Shared  scientiLic  data  collection  facility • Data  intensive  (10-­‐100  GB/day) Grid Overview - Ian Stokes-Rees ijstokes@seas.harvard.edu
  • 13. Cryo  Electron  Microscopy • Previously,  1-­10,000  images,  managed  by  hand • Now,  robotic  systems  collect  millions  of  hi-­res  images • estimate  250,000  CPU-­hours  to  reconstruct  model Grid Overview - Ian Stokes-Rees ijstokes@seas.harvard.edu
  • 14. Molecular  Dynamics  Simulations 1  fs  time  step 1ns  snapshot 1  us  simulation 1e6  steps 1000  frames 10  MB  /  frame 10  GB  /  sim 20  CPU-­years 3  months  (wall-­ clock) Grid Overview - Ian Stokes-Rees ijstokes@seas.harvard.edu