SlideShare a Scribd company logo
Cyberinfrastructure and its Role in Science Cameron Kiddle Research Fellow, Grid Research Centre Adjunct Assistant Professor, Department of Computer Science, University of Calgary Distributed Systems Architect, WestGrid
Outline Challenges Cyberinfrastructure Cyberinfrastructure Technologies Examples ICE Force Project Molecular Dynamics Simulations GT4-based Grid for Canada Fire Dynamics Simulator Rendering on the Cloud GeoChronos IAI Summer School  July 6, 2009 Cyberinfrastructure  -
Collaboration Challenges Familiarity/awareness of collaboration tools Keeping all interested parties in the loop Finding related work and researchers Keeping up to date with current research Collaboration while working in the field IAI Summer School  July 6, 2009 Cyberinfrastructure  -
Data Challenges Acquisition of data Many different data sources Large quantities of data Different regulations/mechanisms for accessing data Lack of automation Finding the right data Bandwidth constraints Managing data Scattered and unorganized data Inadequate tools for recording/maintaining metadata Data without metadata is meaningless Lack of suitable metadata standards Validation of metadata Tracking provenance of data Pre-processing of data Raw data typically cannot be directly analyzed Significant amount of time spent preparing data for analysis Lack of automation  IAI Summer School  July 6, 2009 Cyberinfrastructure  -
Application Challenges Limited availability of computing resources Access to and familiarity of heterogeneous computing resources Fault tolerance and reliability Access to software available in research lab while in field or other locations Installing, configuring and updating software System dependencies of software Awareness and suitability of available software Sharing applications and results IAI Summer School  July 6, 2009 Cyberinfrastructure  -
Cyberinfrastructure “ Like the physical infrastructure of roads, bridges, power grids, telephone lines, and water systems that support modern society, "cyberinfrastructure" refers to the distributed computer, information and communication technologies combined with the personnel and integrating components that provide a long-term platform to empower the modern scientific research endeavor.” Report of the National Science Foundation Blue-Ribbon  Advisory Panel on Cyberinfrastructure, 2003. IAI Summer School  July 6, 2009 Cyberinfrastructure  -
Cyberinfrastructure Technologies Grid Computing Cloud Computing Virtualization Web 2.0 / Social Networking Web Portals / Scientific Gateways Semantic Web … IAI Summer School  July 6, 2009 Cyberinfrastructure  -
Grid Computing Many different definitions/uses computational grids, data grids, desktop grids, campus grids, sensor grids, access grids Coordinated sharing of heterogeneous resources across administrative domains IAI Summer School  July 6, 2009 Cyberinfrastructure  -  Resources Shared by  Virtual Organization X Resources Shared by Virtual Organization Y Domain A Domain B Domain C
Grid Middleware The layer between users/applications and grid resources that glues everything together Example grid middleware Globus Toolkit  GT2 – pre-standards GT4 – Web Services based UNICORE  gLite  ARC  NAREGI  IAI Summer School  July 6, 2009 Cyberinfrastructure  -
Key Grid Middleware Services Security Services Concerned with authentication, authorization, secure communication, … Information Services Provide information about resources, policy, services and applications to tools and users Data Management Services Manage movement and replication of data as well as metadata about data Execution Management Services Handle placement, provisioning and lifetime management of jobs and workflows IAI Summer School  July 6, 2009 Cyberinfrastructure  -
Benefits of Grid Computing Easier access to more resources Users/organizations can share resources Single sign-on Common interface (hide heterogeneity) Improved data management Efficient file transfers Abstraction of physical location of data Automated execution of jobs and workflows IAI Summer School  July 6, 2009 Cyberinfrastructure  -
Example Grid Projects IAI Summer School  July 6, 2009 Cyberinfrastructure  -  Name Description LHC Computing Grid  http://lcg.web.cern.ch/ data storage and analysis infrastructure for the high energy physics community using the Large Hadron Collider (LHC) at CERN (ATLAS Tier-1 site at TRIUMF in British Columbia) Network for Earthquake Engineering Simulation (NEES)  http://www.nees.org/ a US national network of 15 facilities to study the impact of earthquakes on buildings, bridges, etc. Expanding GEOsciences on DEmand (EGEODE) http://www.egeode.org/ a virtual organization (VO) associated with EGEE that is dedicated to research in geoscience for both public and private industrial R&D and academic laboratories International Virtual Observatory Alliance (IVOA)  http://www.ivoa.net/ development of standards and infrastructure to share and analyze astronomical archives from around the world
Cloud Computing Transparent access to scalable and dynamic services over the Internet Key features: Everything as a Service (EaaS) Utility/On-demand Accessibility/Transparency Scalability Virtualization IAI Summer School  July 6, 2009 Cyberinfrastructure  -
Cloud Computing Solutions IAI Summer School  July 6, 2009 Cyberinfrastructure  -
Benefits of Cloud Computing Reduce capital, support and maintenance costs Pay only for what you use Get access to more/fewer resources when needed Ready to use for users No more downloads, installations or updates Simplify and speed up software development Don’t have to support multiple platforms Application popularity and lifespan difficult to predict Scale applications according to user demand IAI Summer School  July 6, 2009 Cyberinfrastructure  -
Cloud Computing Case Study: Application Popularity on Facebook Difficult to predict popularity and lifespan of applications Facebook Application Growth Sep. 2007: ~ 3700 Sep. 2008: ~39000 Facebook Application Popularity (Sep. 12, 2008) 39181 applications Active user data for 37155 apps 3 apps > 10 million active users 80% apps < 1000 active users IAI Summer School  July 6, 2009 Cyberinfrastructure  -  Monthly Active Users  vs.  Rank of Facebook Applications (September 12, 2008)
Cloud Computing Case Study: Shrek (Dreamworks) Shrek (2001) – 5 million CPU render hours  Shrek 2 (2004) – 10 million CPU render hours  Shrek 3 (2007) – 20 million CPU render hours IAI Summer School  July 6, 2009 Cyberinfrastructure  -  (Source: R. Rowe. DreamWorks Animation &quot;Shrek the Third&quot;: Linux Feeds an Ogre. Linux Journal . June 5, 2007. (http://www.linuxjournal.com/article/9653)) Time to Render 1 CPU 100 CPUs 10000 CPUs Shrek 571 years 5.7 years 21 days Shrek 2 1142 years 11.4 years 42 days Shrek 3 2283 years 22.8 years 83 days
Cloud Computing Case Study: Animoto Animoto ( http://animoto.com )  Produces professional quality videos from images Runs on Amazon EC2 Popularity soared when promoted on Facebook During the course of 4 days: Jumped from 8 to 450 renderings per minute ~20000 new users per hour 3500 instances running on Amazon EC2 at peak IAI Summer School  July 6, 2009 Cyberinfrastructure  -  (Source: D. Barker. You Need 3,500 Servers by When?!  On-demand Enterprise . 2008.07.07)
Virtualization Can transform a single physical machine into multiple virtual machines (VMs) each with their own OS and software stack Virtualization software Xen, KVM, VMWare Support allocation, deallocation, checkpointing and migration of VMs Benefits Custom environments (root access) More efficient use of resources (consolidation) System maintenance without disruption IAI Summer School  July 6, 2009 Cyberinfrastructure  -
Web 2.0 – The “Social Web” Aimed at: Providing feature rich user environments Making it easier for users to generate Web content Improving online social connectivity Example Web 2.0 technologies Blogs (WordPress, TypePad) Wikis (Wikipedia) Mashups (HousingMaps, ChicagoCrime) Widgets/Gadgets (iGoogle, Netvibes) Social networks (Facebook, MySpace, YouTube) IAI Summer School  July 6, 2009 Cyberinfrastructure  -
Social Networking Sites/Platforms IAI Summer School  July 6, 2009 Cyberinfrastructure  -
Web Portals / Scientific Gateways Aimed at providing a community of users access to computing resources through a common Web-based interface Web portal development tools GridSphere (portlet based) Web 2.0/Social Networking Examples TeraGrid Scientific Gateways (over 30 of them) nanoHUB IAI Summer School  July 6, 2009 Cyberinfrastructure  -
Semantic Web Aimed at representing knowledge, not just information Connecting and relating data in a way understandable by machines Semantic Web standards Resource Description Framework (RDF) Web Ontology Language (OWL) IAI Summer School  July 6, 2009 Cyberinfrastructure  -
Confederation Bridge ICE Force Monitoring Project Monitoring of forces on the Confederation Bridge Data analyzed by civil engineering groups at University of Calgary and Carleton University GRC developed solution to automate data management as part of a CANARIE AAP project IAI Summer School  July 6, 2009 Cyberinfrastructure  -  ( http://www.confederationbridge.com )  ( http://www.confederationbridge.com )
ICE Force - Technologies Used Grid Middleware GT4 Data Management Proactive Data Management Service (PDMS) Data Transfer - GridFTP, RFT Replication Management – RLS Metadata Management - MCS IAI Summer School  July 6, 2009 Cyberinfrastructure  -
Molecular Dynamics Simulations (GROMACS) GROMACS  Parallel molecular dynamics simulation application Can simulate hundreds to millions of particles Simulation runs can take days, weeks or months Issues with long running jobs Fault tolerance Scheduler policy constraints IAI Summer School  July 6, 2009 Cyberinfrastructure  -  ( http://moose.bio.ucalgary.ca/ )
GROMACS - Grid Enabled Solution Automated grid enabled solution developed by GRC to manage GROMACS simulations as part of a CANARIE AAP project  Long jobs split into a series of shorter jobs Automates checkpointing, migration and reconfiguration of jobs IAI Summer School  July 6, 2009 Cyberinfrastructure  -
GROMACS - Portal IAI Summer School  July 6, 2009 Cyberinfrastructure  -
GROMACS - Technologies Used Grid Middleware GT4 Information Services WS MDS Data Management PDMS (GridFTP, RFT, RLS, MCS) Execution Management Custom system (Condor-G, WS GRAM) Portal GridSphere IAI Summer School  July 6, 2009 Cyberinfrastructure  -
Web Service based Grid Environment for Canada IAI Summer School  July 6, 2009 Cyberinfrastructure  -  Established a GT4-based grid environment from resources across Canada (CANARIE CIIP)
GT4-based Grid - Model Schemas Models developed to describe systems, applications and scheduler policy (GRC Model Schema)  IAI Summer School  July 6, 2009 Cyberinfrastructure  -  System Model Class Diagram
GT4-based Grid – Viewing Resource Information Used WebMDS, a customizable Web based interface for viewing resource information published by WS MDS  IAI Summer School  July 6, 2009 Cyberinfrastructure  -
GT4-based Grid - Technologies Used Grid Middleware GT4 Data Management GridFTP, RFT Information Services GRC Model Schema, WS MDS, WebMDS Execution Management Condor-G, WS GRAM IAI Summer School  July 6, 2009 Cyberinfrastructure  -
Example: Fire Simulation Developed a comprehensive environment for the Fire Dynamics Simulator (FDS) as part of a collaborative project between GRC and HP Labs Deployed on HP Labs Data Centre at University of Calgary Initial focus of project Leverage Web 2.0 technologies Explore use of virtualization in a utility/cloud computing environment IAI Summer School  July 6, 2009 Cyberinfrastructure  -
Fire Simulation - Technologies Used User level Web 2.0/social networking technology (Facebook) Service provider level LAMP environment (Linux, Apache, MySQL, Perl/Python/PHP) Simulation (FDS, Condor) Visualization (Smokeview, VNC) Resource (utility) provider level Cloud computing technology (ASPEN) Virtual machine technology (Xen) IAI Summer School  July 6, 2009 Cyberinfrastructure  -
Example: Rendering on the Cloud  GRC created an on-demand cloud rendering service for EDM Studio Cybera Pilot Project Technologies used: Cloud computing technology (ASPEN) Virtual machine technology (Xen) Social networking technology (Ning/Elgg) IAI Summer School  July 6, 2009 Cyberinfrastructure  -
An on-line platform For: Earth Observation Scientists Facilitating: Collaboration  between scientists Data  access, management and sharing Application  access, management and sharing Leveraging: Web 2.0 / social networking technologies (Elgg) Semantic Web technologies (RDF, OWL) Cloud computing and virtualization technologies (ASPEN, Xen) IAI Summer School  July 6, 2009 Cyberinfrastructure  -
GeoChronos - Collaboration Social networking portal Elgg-based  (elgg.org) Social networking services Blogs Tags Media/document sharing  Wikis  Friends/contacts Groups Discussions Message boards Calendars Status News Feeds IAI Summer School  July 6, 2009 Cyberinfrastructure  -  http://geochronos.org/
GeoChronos - Data Data Acquisition Automated acquisition of data from sensors (ground, airborne, satellite) or third party Data Storage Store, share, browse and search data i.e., spectral library Data Processing Automated data workflows i.e., mosaic, reproject and subset MODIS data IAI Summer School  July 6, 2009 Cyberinfrastructure  -
GeoChronos - Applications Interactive Application Service (IAS) On-line, on-demand access to scientific applications Share application sessions and data with other users Access control to applications Batch Processing Service Batch processing environment for longer running data processing tasks or simulations For use directly by individual users or as part of automated data workflows IAI Summer School  July 6, 2009 Cyberinfrastructure  -
GeoChronos - Project Team IAI Summer School  July 6, 2009 Cyberinfrastructure  -  Dr. Arturo Sanchez-Azofeifa University of Alberta Dr. John Gamon University of Alberta Dr. Benoit Rivard University of Victoria Dr. Rob Simmonds University of Calgary Prinicipal Investigators Project Coordination Platform Development Domain Scientists
GeoChronos - Virtual Organization IAI Summer School  July 6, 2009 Cyberinfrastructure  -
Contact Information IAI Summer School  July 6, 2009 Cyberinfrastructure  -  Cameron Kiddle [email_address] http://pages.cspc.ucalgary.ca/~kiddlec/ http://grid.ucalgary.ca/

More Related Content

PPTX
WebEng_202010
PPT
UK e-Infrastructure: Widening Access, Increasing Participation
PPTX
Session 33 - Production Grids
PDF
Session 23 - Intro to EGEE-III
PPT
CI-Team MSI-CIEC High Performance Computing and CyberInfrastructure (CI) Camp...
PPTX
Cloud computing: how will it affect my everday work life?
PPT
云计算及其应用
PDF
OntoFrame기반 시맨틱 서비스와 서비스 매쉬업
WebEng_202010
UK e-Infrastructure: Widening Access, Increasing Participation
Session 33 - Production Grids
Session 23 - Intro to EGEE-III
CI-Team MSI-CIEC High Performance Computing and CyberInfrastructure (CI) Camp...
Cloud computing: how will it affect my everday work life?
云计算及其应用
OntoFrame기반 시맨틱 서비스와 서비스 매쉬업

What's hot (20)

PDF
Design of an IT Capstone Subject - Cloud Robotics
PPT
Cloud Computing And Android Apps For The Indian
PDF
APPLICATION OF DIGITAL CLOUD LIBRARIES FOR ETHIOPIAN PUBLIC HIGHER LEARNING I...
PPTX
GENI Engineering Conference -- Ian Foster
PDF
Le nuove tecnologie come vantaggio competitivo della piccola impresa
PPTX
Information Systems - Lecture A
PPT
Update on engineering information sources
PDF
Science DMZ as a Service: Creating Science Super- Facilities with GENI
PDF
06 e science-bio diversity@ pacc 18.07.2014
PDF
Cost Benefits of Cloud vs. In-house IT for Higher Education
PDF
IRJET- Advanced Cloud in E-Libraries
PPT
Grid computing by vaishali sahare [katkar]
PPTX
Grid computing
PPTX
Cloud Computing - Funadamental
PDF
SuperComputingDelivered_16-Dec-2014_FINAL
PPTX
Slides 530 a2
PPT
Grid Computing
PDF
What's all the data about? - Linking and Profiling of Linked Datasets
PDF
KnowEscape workshop, OKCon 2013
PPTX
Design of an IT Capstone Subject - Cloud Robotics
Cloud Computing And Android Apps For The Indian
APPLICATION OF DIGITAL CLOUD LIBRARIES FOR ETHIOPIAN PUBLIC HIGHER LEARNING I...
GENI Engineering Conference -- Ian Foster
Le nuove tecnologie come vantaggio competitivo della piccola impresa
Information Systems - Lecture A
Update on engineering information sources
Science DMZ as a Service: Creating Science Super- Facilities with GENI
06 e science-bio diversity@ pacc 18.07.2014
Cost Benefits of Cloud vs. In-house IT for Higher Education
IRJET- Advanced Cloud in E-Libraries
Grid computing by vaishali sahare [katkar]
Grid computing
Cloud Computing - Funadamental
SuperComputingDelivered_16-Dec-2014_FINAL
Slides 530 a2
Grid Computing
What's all the data about? - Linking and Profiling of Linked Datasets
KnowEscape workshop, OKCon 2013
Ad

Similar to Cyberinfrastructure and its Role in Science (20)

PPT
Cyberinfrastructure and Applications Overview: Howard University June22
PPT
Cloud-Computing
PDF
UNIT 1.pdf
PDF
Cloud Computing Tutorial - Jens Nimis
PDF
Architectural Implications of Cloud Computing
PPTX
Clould Computing and its application in Libraries
PPSX
Cloud Computing
PDF
Situation Normal - Presentation at NottTuesday
PDF
Situation Normal - UKUUG Mar'10
PPTX
History of Cloud Computing.pptx
PDF
Simon Wardley
PDF
Cloud Computing - Challenges and Opportunities - Jens Nimis
PDF
Inside Cybera
PPT
Cyberinfrastructure and the Research Process in Canada
PPT
Cloud basics
PPT
Exploring Cloud Computing Technologies For GIS (Location Based) Applications
PDF
Cloud Security and Privacy:An Enterprise Perspective on Risks and Compliance
PPT
Stuart Wakefield Cloud Computing
KEY
Cloud computing in academic libraries
PPT
Grid and Cloud Computing Intro
Cyberinfrastructure and Applications Overview: Howard University June22
Cloud-Computing
UNIT 1.pdf
Cloud Computing Tutorial - Jens Nimis
Architectural Implications of Cloud Computing
Clould Computing and its application in Libraries
Cloud Computing
Situation Normal - Presentation at NottTuesday
Situation Normal - UKUUG Mar'10
History of Cloud Computing.pptx
Simon Wardley
Cloud Computing - Challenges and Opportunities - Jens Nimis
Inside Cybera
Cyberinfrastructure and the Research Process in Canada
Cloud basics
Exploring Cloud Computing Technologies For GIS (Location Based) Applications
Cloud Security and Privacy:An Enterprise Perspective on Risks and Compliance
Stuart Wakefield Cloud Computing
Cloud computing in academic libraries
Grid and Cloud Computing Intro
Ad

More from Cameron Kiddle (13)

PPT
An On-line Collaborative Data Management System
PPT
GeoChronos - CANARIE NEP Showcase 2010 Presentation
PPT
Collaborative Science: Technologies & Examples
PPT
Enabling Collaboration Among Earth Observation Scientists with the GeoChronos...
PPT
GeoChronos: Challenges and Achievements
PPT
Social Networking and Scientific Gateways
PPT
GeoChronos - SC09 Demo Presentation
PPT
GeoChronos - CANARIE NEP Showcase 2009 Presentation
PPT
Integration and Management of Diverse Environmental Data Sets
PPT
Dynamic Provisioning of Service Environments with ASPEN
PPT
GeoChronos - SpecNet Workshop 2009 Presentation
PPT
GeoChronos: An On-line Collaborative Platform for Earth Observation Scientists
PPT
GRC - CANARIE Green IT 2009
An On-line Collaborative Data Management System
GeoChronos - CANARIE NEP Showcase 2010 Presentation
Collaborative Science: Technologies & Examples
Enabling Collaboration Among Earth Observation Scientists with the GeoChronos...
GeoChronos: Challenges and Achievements
Social Networking and Scientific Gateways
GeoChronos - SC09 Demo Presentation
GeoChronos - CANARIE NEP Showcase 2009 Presentation
Integration and Management of Diverse Environmental Data Sets
Dynamic Provisioning of Service Environments with ASPEN
GeoChronos - SpecNet Workshop 2009 Presentation
GeoChronos: An On-line Collaborative Platform for Earth Observation Scientists
GRC - CANARIE Green IT 2009

Recently uploaded (20)

PDF
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
PDF
Bridging biosciences and deep learning for revolutionary discoveries: a compr...
PDF
Empathic Computing: Creating Shared Understanding
PDF
NewMind AI Weekly Chronicles - August'25 Week I
PDF
GamePlan Trading System Review: Professional Trader's Honest Take
PDF
Reach Out and Touch Someone: Haptics and Empathic Computing
PDF
CIFDAQ's Market Insight: SEC Turns Pro Crypto
PDF
Spectral efficient network and resource selection model in 5G networks
PPTX
Cloud computing and distributed systems.
PDF
[발표본] 너의 과제는 클라우드에 있어_KTDS_김동현_20250524.pdf
PDF
Dropbox Q2 2025 Financial Results & Investor Presentation
PDF
Per capita expenditure prediction using model stacking based on satellite ima...
PDF
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
PPT
“AI and Expert System Decision Support & Business Intelligence Systems”
PDF
Review of recent advances in non-invasive hemoglobin estimation
PDF
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
PPT
Teaching material agriculture food technology
PDF
Network Security Unit 5.pdf for BCA BBA.
PDF
Machine learning based COVID-19 study performance prediction
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
Bridging biosciences and deep learning for revolutionary discoveries: a compr...
Empathic Computing: Creating Shared Understanding
NewMind AI Weekly Chronicles - August'25 Week I
GamePlan Trading System Review: Professional Trader's Honest Take
Reach Out and Touch Someone: Haptics and Empathic Computing
CIFDAQ's Market Insight: SEC Turns Pro Crypto
Spectral efficient network and resource selection model in 5G networks
Cloud computing and distributed systems.
[발표본] 너의 과제는 클라우드에 있어_KTDS_김동현_20250524.pdf
Dropbox Q2 2025 Financial Results & Investor Presentation
Per capita expenditure prediction using model stacking based on satellite ima...
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
“AI and Expert System Decision Support & Business Intelligence Systems”
Review of recent advances in non-invasive hemoglobin estimation
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
Teaching material agriculture food technology
Network Security Unit 5.pdf for BCA BBA.
Machine learning based COVID-19 study performance prediction

Cyberinfrastructure and its Role in Science

  • 1. Cyberinfrastructure and its Role in Science Cameron Kiddle Research Fellow, Grid Research Centre Adjunct Assistant Professor, Department of Computer Science, University of Calgary Distributed Systems Architect, WestGrid
  • 2. Outline Challenges Cyberinfrastructure Cyberinfrastructure Technologies Examples ICE Force Project Molecular Dynamics Simulations GT4-based Grid for Canada Fire Dynamics Simulator Rendering on the Cloud GeoChronos IAI Summer School July 6, 2009 Cyberinfrastructure -
  • 3. Collaboration Challenges Familiarity/awareness of collaboration tools Keeping all interested parties in the loop Finding related work and researchers Keeping up to date with current research Collaboration while working in the field IAI Summer School July 6, 2009 Cyberinfrastructure -
  • 4. Data Challenges Acquisition of data Many different data sources Large quantities of data Different regulations/mechanisms for accessing data Lack of automation Finding the right data Bandwidth constraints Managing data Scattered and unorganized data Inadequate tools for recording/maintaining metadata Data without metadata is meaningless Lack of suitable metadata standards Validation of metadata Tracking provenance of data Pre-processing of data Raw data typically cannot be directly analyzed Significant amount of time spent preparing data for analysis Lack of automation IAI Summer School July 6, 2009 Cyberinfrastructure -
  • 5. Application Challenges Limited availability of computing resources Access to and familiarity of heterogeneous computing resources Fault tolerance and reliability Access to software available in research lab while in field or other locations Installing, configuring and updating software System dependencies of software Awareness and suitability of available software Sharing applications and results IAI Summer School July 6, 2009 Cyberinfrastructure -
  • 6. Cyberinfrastructure “ Like the physical infrastructure of roads, bridges, power grids, telephone lines, and water systems that support modern society, &quot;cyberinfrastructure&quot; refers to the distributed computer, information and communication technologies combined with the personnel and integrating components that provide a long-term platform to empower the modern scientific research endeavor.” Report of the National Science Foundation Blue-Ribbon Advisory Panel on Cyberinfrastructure, 2003. IAI Summer School July 6, 2009 Cyberinfrastructure -
  • 7. Cyberinfrastructure Technologies Grid Computing Cloud Computing Virtualization Web 2.0 / Social Networking Web Portals / Scientific Gateways Semantic Web … IAI Summer School July 6, 2009 Cyberinfrastructure -
  • 8. Grid Computing Many different definitions/uses computational grids, data grids, desktop grids, campus grids, sensor grids, access grids Coordinated sharing of heterogeneous resources across administrative domains IAI Summer School July 6, 2009 Cyberinfrastructure - Resources Shared by Virtual Organization X Resources Shared by Virtual Organization Y Domain A Domain B Domain C
  • 9. Grid Middleware The layer between users/applications and grid resources that glues everything together Example grid middleware Globus Toolkit GT2 – pre-standards GT4 – Web Services based UNICORE gLite ARC NAREGI IAI Summer School July 6, 2009 Cyberinfrastructure -
  • 10. Key Grid Middleware Services Security Services Concerned with authentication, authorization, secure communication, … Information Services Provide information about resources, policy, services and applications to tools and users Data Management Services Manage movement and replication of data as well as metadata about data Execution Management Services Handle placement, provisioning and lifetime management of jobs and workflows IAI Summer School July 6, 2009 Cyberinfrastructure -
  • 11. Benefits of Grid Computing Easier access to more resources Users/organizations can share resources Single sign-on Common interface (hide heterogeneity) Improved data management Efficient file transfers Abstraction of physical location of data Automated execution of jobs and workflows IAI Summer School July 6, 2009 Cyberinfrastructure -
  • 12. Example Grid Projects IAI Summer School July 6, 2009 Cyberinfrastructure - Name Description LHC Computing Grid http://lcg.web.cern.ch/ data storage and analysis infrastructure for the high energy physics community using the Large Hadron Collider (LHC) at CERN (ATLAS Tier-1 site at TRIUMF in British Columbia) Network for Earthquake Engineering Simulation (NEES) http://www.nees.org/ a US national network of 15 facilities to study the impact of earthquakes on buildings, bridges, etc. Expanding GEOsciences on DEmand (EGEODE) http://www.egeode.org/ a virtual organization (VO) associated with EGEE that is dedicated to research in geoscience for both public and private industrial R&D and academic laboratories International Virtual Observatory Alliance (IVOA) http://www.ivoa.net/ development of standards and infrastructure to share and analyze astronomical archives from around the world
  • 13. Cloud Computing Transparent access to scalable and dynamic services over the Internet Key features: Everything as a Service (EaaS) Utility/On-demand Accessibility/Transparency Scalability Virtualization IAI Summer School July 6, 2009 Cyberinfrastructure -
  • 14. Cloud Computing Solutions IAI Summer School July 6, 2009 Cyberinfrastructure -
  • 15. Benefits of Cloud Computing Reduce capital, support and maintenance costs Pay only for what you use Get access to more/fewer resources when needed Ready to use for users No more downloads, installations or updates Simplify and speed up software development Don’t have to support multiple platforms Application popularity and lifespan difficult to predict Scale applications according to user demand IAI Summer School July 6, 2009 Cyberinfrastructure -
  • 16. Cloud Computing Case Study: Application Popularity on Facebook Difficult to predict popularity and lifespan of applications Facebook Application Growth Sep. 2007: ~ 3700 Sep. 2008: ~39000 Facebook Application Popularity (Sep. 12, 2008) 39181 applications Active user data for 37155 apps 3 apps > 10 million active users 80% apps < 1000 active users IAI Summer School July 6, 2009 Cyberinfrastructure - Monthly Active Users vs. Rank of Facebook Applications (September 12, 2008)
  • 17. Cloud Computing Case Study: Shrek (Dreamworks) Shrek (2001) – 5 million CPU render hours Shrek 2 (2004) – 10 million CPU render hours Shrek 3 (2007) – 20 million CPU render hours IAI Summer School July 6, 2009 Cyberinfrastructure - (Source: R. Rowe. DreamWorks Animation &quot;Shrek the Third&quot;: Linux Feeds an Ogre. Linux Journal . June 5, 2007. (http://www.linuxjournal.com/article/9653)) Time to Render 1 CPU 100 CPUs 10000 CPUs Shrek 571 years 5.7 years 21 days Shrek 2 1142 years 11.4 years 42 days Shrek 3 2283 years 22.8 years 83 days
  • 18. Cloud Computing Case Study: Animoto Animoto ( http://animoto.com ) Produces professional quality videos from images Runs on Amazon EC2 Popularity soared when promoted on Facebook During the course of 4 days: Jumped from 8 to 450 renderings per minute ~20000 new users per hour 3500 instances running on Amazon EC2 at peak IAI Summer School July 6, 2009 Cyberinfrastructure - (Source: D. Barker. You Need 3,500 Servers by When?! On-demand Enterprise . 2008.07.07)
  • 19. Virtualization Can transform a single physical machine into multiple virtual machines (VMs) each with their own OS and software stack Virtualization software Xen, KVM, VMWare Support allocation, deallocation, checkpointing and migration of VMs Benefits Custom environments (root access) More efficient use of resources (consolidation) System maintenance without disruption IAI Summer School July 6, 2009 Cyberinfrastructure -
  • 20. Web 2.0 – The “Social Web” Aimed at: Providing feature rich user environments Making it easier for users to generate Web content Improving online social connectivity Example Web 2.0 technologies Blogs (WordPress, TypePad) Wikis (Wikipedia) Mashups (HousingMaps, ChicagoCrime) Widgets/Gadgets (iGoogle, Netvibes) Social networks (Facebook, MySpace, YouTube) IAI Summer School July 6, 2009 Cyberinfrastructure -
  • 21. Social Networking Sites/Platforms IAI Summer School July 6, 2009 Cyberinfrastructure -
  • 22. Web Portals / Scientific Gateways Aimed at providing a community of users access to computing resources through a common Web-based interface Web portal development tools GridSphere (portlet based) Web 2.0/Social Networking Examples TeraGrid Scientific Gateways (over 30 of them) nanoHUB IAI Summer School July 6, 2009 Cyberinfrastructure -
  • 23. Semantic Web Aimed at representing knowledge, not just information Connecting and relating data in a way understandable by machines Semantic Web standards Resource Description Framework (RDF) Web Ontology Language (OWL) IAI Summer School July 6, 2009 Cyberinfrastructure -
  • 24. Confederation Bridge ICE Force Monitoring Project Monitoring of forces on the Confederation Bridge Data analyzed by civil engineering groups at University of Calgary and Carleton University GRC developed solution to automate data management as part of a CANARIE AAP project IAI Summer School July 6, 2009 Cyberinfrastructure - ( http://www.confederationbridge.com ) ( http://www.confederationbridge.com )
  • 25. ICE Force - Technologies Used Grid Middleware GT4 Data Management Proactive Data Management Service (PDMS) Data Transfer - GridFTP, RFT Replication Management – RLS Metadata Management - MCS IAI Summer School July 6, 2009 Cyberinfrastructure -
  • 26. Molecular Dynamics Simulations (GROMACS) GROMACS Parallel molecular dynamics simulation application Can simulate hundreds to millions of particles Simulation runs can take days, weeks or months Issues with long running jobs Fault tolerance Scheduler policy constraints IAI Summer School July 6, 2009 Cyberinfrastructure - ( http://moose.bio.ucalgary.ca/ )
  • 27. GROMACS - Grid Enabled Solution Automated grid enabled solution developed by GRC to manage GROMACS simulations as part of a CANARIE AAP project Long jobs split into a series of shorter jobs Automates checkpointing, migration and reconfiguration of jobs IAI Summer School July 6, 2009 Cyberinfrastructure -
  • 28. GROMACS - Portal IAI Summer School July 6, 2009 Cyberinfrastructure -
  • 29. GROMACS - Technologies Used Grid Middleware GT4 Information Services WS MDS Data Management PDMS (GridFTP, RFT, RLS, MCS) Execution Management Custom system (Condor-G, WS GRAM) Portal GridSphere IAI Summer School July 6, 2009 Cyberinfrastructure -
  • 30. Web Service based Grid Environment for Canada IAI Summer School July 6, 2009 Cyberinfrastructure - Established a GT4-based grid environment from resources across Canada (CANARIE CIIP)
  • 31. GT4-based Grid - Model Schemas Models developed to describe systems, applications and scheduler policy (GRC Model Schema) IAI Summer School July 6, 2009 Cyberinfrastructure - System Model Class Diagram
  • 32. GT4-based Grid – Viewing Resource Information Used WebMDS, a customizable Web based interface for viewing resource information published by WS MDS IAI Summer School July 6, 2009 Cyberinfrastructure -
  • 33. GT4-based Grid - Technologies Used Grid Middleware GT4 Data Management GridFTP, RFT Information Services GRC Model Schema, WS MDS, WebMDS Execution Management Condor-G, WS GRAM IAI Summer School July 6, 2009 Cyberinfrastructure -
  • 34. Example: Fire Simulation Developed a comprehensive environment for the Fire Dynamics Simulator (FDS) as part of a collaborative project between GRC and HP Labs Deployed on HP Labs Data Centre at University of Calgary Initial focus of project Leverage Web 2.0 technologies Explore use of virtualization in a utility/cloud computing environment IAI Summer School July 6, 2009 Cyberinfrastructure -
  • 35. Fire Simulation - Technologies Used User level Web 2.0/social networking technology (Facebook) Service provider level LAMP environment (Linux, Apache, MySQL, Perl/Python/PHP) Simulation (FDS, Condor) Visualization (Smokeview, VNC) Resource (utility) provider level Cloud computing technology (ASPEN) Virtual machine technology (Xen) IAI Summer School July 6, 2009 Cyberinfrastructure -
  • 36. Example: Rendering on the Cloud GRC created an on-demand cloud rendering service for EDM Studio Cybera Pilot Project Technologies used: Cloud computing technology (ASPEN) Virtual machine technology (Xen) Social networking technology (Ning/Elgg) IAI Summer School July 6, 2009 Cyberinfrastructure -
  • 37. An on-line platform For: Earth Observation Scientists Facilitating: Collaboration between scientists Data access, management and sharing Application access, management and sharing Leveraging: Web 2.0 / social networking technologies (Elgg) Semantic Web technologies (RDF, OWL) Cloud computing and virtualization technologies (ASPEN, Xen) IAI Summer School July 6, 2009 Cyberinfrastructure -
  • 38. GeoChronos - Collaboration Social networking portal Elgg-based (elgg.org) Social networking services Blogs Tags Media/document sharing Wikis Friends/contacts Groups Discussions Message boards Calendars Status News Feeds IAI Summer School July 6, 2009 Cyberinfrastructure - http://geochronos.org/
  • 39. GeoChronos - Data Data Acquisition Automated acquisition of data from sensors (ground, airborne, satellite) or third party Data Storage Store, share, browse and search data i.e., spectral library Data Processing Automated data workflows i.e., mosaic, reproject and subset MODIS data IAI Summer School July 6, 2009 Cyberinfrastructure -
  • 40. GeoChronos - Applications Interactive Application Service (IAS) On-line, on-demand access to scientific applications Share application sessions and data with other users Access control to applications Batch Processing Service Batch processing environment for longer running data processing tasks or simulations For use directly by individual users or as part of automated data workflows IAI Summer School July 6, 2009 Cyberinfrastructure -
  • 41. GeoChronos - Project Team IAI Summer School July 6, 2009 Cyberinfrastructure - Dr. Arturo Sanchez-Azofeifa University of Alberta Dr. John Gamon University of Alberta Dr. Benoit Rivard University of Victoria Dr. Rob Simmonds University of Calgary Prinicipal Investigators Project Coordination Platform Development Domain Scientists
  • 42. GeoChronos - Virtual Organization IAI Summer School July 6, 2009 Cyberinfrastructure -
  • 43. Contact Information IAI Summer School July 6, 2009 Cyberinfrastructure - Cameron Kiddle [email_address] http://pages.cspc.ucalgary.ca/~kiddlec/ http://grid.ucalgary.ca/