Cyberenvironments @ NCSA: Supporting Community-scale Science
Jim Myers, Associate Director, Collaborative Technologies, NCSA
Beyond Cyberinfrastructure
  • CyberInfrastructure commonly refers to infrastructure (networks, compute, and data resources) plus the middleware (grid) that links those resources together and presents them in a uniform, standard way.
  • CyberEnvironments is a term NCSA has coined to describe the complete end-to-end solution. It integrates shared and custom cyberinfrastructure into a process-oriented framework that allows the community and researchers to focus on their research, not on accessing and managing the CI.
  • A CyberCommunity is a distributed group of people (a virtual organization) with common goals and shared knowledge. Size ranges from a few individuals to interdisciplinary or international groups. These groups can include researchers, policy makers, responders, educators, and citizens, and often have a long-term identity and purpose.
Cyberenvironments: enable researchers to tackle more, and more complex, challenges, leading to
  • Enhanced production of knowledge, and
  • Enhanced application of that knowledge
to understanding our world, developing solutions, and making informed decisions.
The Systems Science Revolution
  • Research spans multiple disciplines/sub-disciplines
  • Coordination through community resources
  • Bi-directional flow/feedback of information
  • Partial results being combined to produce new knowledge
  • Experiment/theory/model comparisons
  • Multiscale optimizations
  • Rapid evolution
  • High complexity
  • Resources will be distributed, with multiple curators
End to end
Scientific progress is limited by the manual processes:
  • Data discovery
  • Translation
  • Experiment setup
  • Group coordination
  • Tool integration
  • Training
  • Feature extraction
  • Data interpretation
  • Acceptance of new models/tools
  • Dissemination of best practices
  • Interdisciplinary communication
  • Data production
  • Processing power
  • Data transfer/storage
Round-Trip Information Logistics
  • Desktop applications accessing remote resources
  • Individuals publishing to communities and accessing reference information, best practices, etc.
  • Unique capabilities linked into end-to-end community processes
  • Inter-community connectivity
  • Evolving at the speed of science
[Diagram labels: Individual; Unique capabilities; High Performance Resources; Desktop; Community; End-to-end processes]
Key Issues
  • How do we build a system before the parts are done?
  • How do we evolve the system to keep it current?
  • How do we convey knowledge as well as tools to end users?
  • How do we coordinate without centralizing?
Technology Responses:
  • Workflow
      • Ability to integrate independent web services
      • Ability to hide workflow behind applications
  • Rich metadata
      • Tracking provenance
      • Context-based data discovery
      • Distributed data stores
      • Data translation/data virtualization
  • Cyberenvironments
      • Engineering view of cutting-edge science
      • Collaboration capabilities
      • ‘Publication’: exposing work to groups & the public
      • Streams/Events/Feature Management
      • Core Domain Services, e.g. GIS
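The "hide workflow behind applications" response can be illustrated with a small sketch. This is not NCSA code: the service functions (`convert_units`, `flag_hot`), the threshold, and the `is_sample_hot` application call are all hypothetical stand-ins for independent web services, assumed here only to show the shape of the idea.

```python
from typing import Any, Callable, Dict, List

# A "service" is modeled as any callable that reads the shared state
# and returns new named values. Real web services would sit behind these.
Service = Callable[[Dict[str, Any]], Dict[str, Any]]


def run_workflow(steps: List[Service], inputs: Dict[str, Any]) -> Dict[str, Any]:
    """Run each step in order, merging its outputs into the shared state."""
    state = dict(inputs)
    for step in steps:
        state.update(step(state))
    return state


# Hypothetical stand-ins for two independently developed services.
def convert_units(state: Dict[str, Any]) -> Dict[str, Any]:
    return {"temperature_k": state["temperature_c"] + 273.15}


def flag_hot(state: Dict[str, Any]) -> Dict[str, Any]:
    # The 300 K threshold is arbitrary and only for illustration.
    return {"is_hot": state["temperature_k"] > 300.0}


def is_sample_hot(temperature_c: float) -> bool:
    """Application-level call: the caller never sees the workflow behind it."""
    result = run_workflow([convert_units, flag_hot], {"temperature_c": temperature_c})
    return bool(result["is_hot"])
```

A desktop application would expose only `is_sample_hot`; the list of steps can change as services evolve without the user-facing call changing.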
NCSA Processes
  • Analysis of science and engineering processes across many disciplines
      • Identification of challenges and appropriate design responses
      • Research/technology roadmaps
  • Integrated project teams (IPTs) taking leadership roles within specific communities, with strong partners, to develop Cyberenvironments/CI
      • Producing pilot/production capabilities
      • Advancing technologies along roadmaps
  • Backed by:
      • 20 years of experience in user/community engagement
      • Leadership roles in cutting-edge Cyberenvironment projects in many disciplines
      • Strong R&D efforts in Environments/Grid/Viz/Knowledge Discovery, ...
      • Central role in national/global cyberinfrastructure definition/development
Combustion: a Multi-scale Chemical Science Challenge
Want a systems-science approach to address complex problems:
  • New knowledge is assimilated from different data, tools, and disciplines at each scale
  • Real-time bi-directional information flow
  • Multiple applications for the same information
But:
  • Normal publication is slow and lossy
  • Data has different formats and hidden dependencies
  • Standardization is hard to do up-front
  • Multi-scale information is complex, and its pedigree and context matter
Need lighter-weight, flexible, adaptive mechanisms for sharing data across groups and communities.
CMCS Portal
  • CHEF (Sakai precursor)
  • SAM
      • Basic data/metadata management
      • Metadata extraction
      • Data translations
  • Additional portlets
      • Metadata view/search
      • Provenance graph
      • E-notebook
      • Chemistry apps
      • Email notifications
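A provenance graph of the kind listed above can be sketched as a mapping from each data item to the items it was derived from. The class below is a minimal illustration under that assumption; it is not the CMCS or SAM API, and the item names and metadata keys used with it are hypothetical.

```python
from collections import defaultdict
from typing import Any, Dict, Iterable, Set


class ProvenanceGraph:
    """Records which items each item was derived from, plus free-form metadata."""

    def __init__(self) -> None:
        # item -> set of items it was directly derived from
        self._sources: Dict[str, Set[str]] = defaultdict(set)
        self.metadata: Dict[str, Dict[str, Any]] = {}

    def add(self, item: str, derived_from: Iterable[str] = (), **metadata: Any) -> None:
        """Register an item, its direct sources, and any metadata about it."""
        self._sources[item].update(derived_from)
        self.metadata.setdefault(item, {}).update(metadata)

    def lineage(self, item: str) -> Set[str]:
        """Return every item that `item` depends on, directly or indirectly."""
        seen: Set[str] = set()
        stack = list(self._sources.get(item, ()))
        while stack:
            current = stack.pop()
            if current in seen:
                continue  # already visited; also guards against cycles
            seen.add(current)
            stack.extend(self._sources.get(current, ()))
        return seen
```

For example, after `g.add("rates.xml", derived_from=["raw.dat"])` and `g.add("model.inp", derived_from=["rates.xml"])`, calling `g.lineage("model.inp")` returns both upstream items, which is the pedigree question a provenance portlet would answer.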
CMCS Pilot Science Groups
  • DNS (Jackie Chen, David Leahy): feature detection & tracking in DNS data
  • HCCI University Consortium (Bill Pitz): Homogeneous Charge Compression Ignition
  • PrIMe (led by Michael Frenklach): development and publishing of chemical reaction models
  • Real Fuels Project (Wing Tsang, Tom Allison): lead real fuels chemistry at NIST
  • IUPAC (led by Branko Ruscic): develop and publish validated thermochemical data
  • Quantum Chemistry (Theresa Windus): QM reference data
Community Curation of Data:  Quantum Chemistry Basis Sets
MAEViz Cyberenvironment
Consequence-Based Risk Management, Mid-America Earthquake Center
  • Engineering view of MAE Center research
  • Portal-based collaboration environment
  • Distributed data/metadata sources
  • Builds on NEESgrid technologies
[Diagram labels: Hazard Definition; Inventory Selection; Fragility Models; Damage Prediction; Decision Support]
NEESgrid UIUC
[Screenshot: NEESgrid UIUC portal, http://neespop.ce.uiuc.edu:9271/chef/portal/group/NEESgridUIUC/page/default.psml/js_pane/P-f16a0kkk; Project Name: UIUC_ShakeTableExperiment; Narutoshi Nakata]
Environmental Observatories
NCSA, including CAC, is involved in the development of CI for a number of environmental communities:
  • CUAHSI (Consortium of Universities for the Advancement of Hydrologic Sciences Inc.) for hydrology
  • NEON (National Ecological Observatory Network) for ecology
  • LOOKING (Laboratory for the Ocean Observatory Knowledge Integration Grid)
  • CLEANER (Collaborative Large Scale Engineering Analysis Network for Environmental Research) for environmental engineering
  • LTER (U.S. Long-Term Ecological Research Network), investigating ecological processes over long temporal and broad spatial scales
Long Term Ecological Research (LTER)
  • Established 1980 (25 years)
  • 26 research sites & 1 support site (LNO)
      • North America
      • Arctic/Antarctica
      • Puerto Rico/Tahiti
  • Five core areas of study
      • Primary plant production
      • Organism population studies
      • Movement of organic matter
      • Movement of inorganic matter
      • Disturbance patterns
  • Questions are being asked at the regional, national, and global scale
LTER Pilot Study
  • Portal user interface
  • Single sign-on
  • Data discovery
  • Secure data staging
  • Data audit trail
  • Data analysis via HPC system
Large Synoptic Survey Telescope (LSST)
  • A new telescope located in Chile
      • 8.4 m diameter mirror, 10 sq. degrees FOV
      • 3 GPixel camera
      • Images the available sky every 3 days
      • First light: January 2012
  • Science mission: observe the time-varying sky
      • Dark energy and the accelerating universe
      • Comprehensive census of Solar System objects
      • Study optical transients
      • Create a galactic map
  • The LSST collaboration
      • Currently about a dozen institutions, including 3 DOE labs
  • Schedule
      • D&D phase: 2004-2007 (funded by NSF grant, private money, in-kind contributions)
      • Construction: 2007-2012 (funded by NSF & DOE)
      • Operation: 2012-
  • NCSA team headed by Ray Plante: 4 FTEs from NCSA, 2 FTEs from UIUC, 3 FTEs from NSF
  • Data generation rate: 30 TB/night, 6 PB/year
  • Total disk storage: 18 PB
  • Nominal computing required: 20+ Tflops
  • Site-to-archive network bandwidth: 2.5 Gbits/s
  • Processing latency for real-time alerts: ~60 secs
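The LSST figures above can be related to one another with some back-of-envelope arithmetic. The sketch below assumes decimal units (1 TB = 10^12 bytes, 1 PB = 10^15 bytes) and assumes the link runs at its full nominal rate; the slide does not say which data actually crosses the site-to-archive link, so the transfer time is illustrative only.

```python
TB = 10**12  # bytes, decimal units (an assumption; the slide does not specify)
PB = 10**15

NIGHTLY_BYTES = 30 * TB       # data generation rate per night
YEARLY_BYTES = 6 * PB         # data generation rate per year
DISK_BYTES = 18 * PB          # total disk storage
LINK_BITS_PER_SECOND = 2.5e9  # site-to-archive bandwidth


def observing_nights_per_year() -> float:
    """Number of nights implied by the nightly and yearly data rates."""
    return YEARLY_BYTES / NIGHTLY_BYTES


def hours_to_transfer_one_night() -> float:
    """Hours needed to move one night's data over the link at full rate."""
    seconds = NIGHTLY_BYTES * 8 / LINK_BITS_PER_SECOND
    return seconds / 3600


def years_of_data_on_disk() -> float:
    """Years of yearly output that fit in the stated disk storage."""
    return DISK_BYTES / YEARLY_BYTES


if __name__ == "__main__":
    print(f"Implied observing nights per year: {observing_nights_per_year():.0f}")
    print(f"Hours to ship one night of data:   {hours_to_transfer_one_night():.1f}")
    print(f"Years of data held on disk:        {years_of_data_on_disk():.0f}")
```

Under these assumptions the yearly rate corresponds to 200 nights of observing, the disk holds three years of output, and shipping a full 30 TB over a 2.5 Gbit/s link would take a little under 27 hours, which suggests that the nightly figure and the link bandwidth may not describe the same data stream.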
LEAD
  • Mesoscale weather is VERY DYNAMIC, but our tools, cyber environments, research methodologies and learning modalities are VERY STATIC
  • Getting even static capability is an enormous challenge due to the complexity of the tools and the primitive information technology infrastructures used to link them
NCSA Processes
  • Analysis of science and engineering processes across many disciplines
      • Identification of challenges and appropriate design responses
      • Research/technology roadmaps
  • Integrated project teams (IPTs) taking leadership roles within specific communities, with strong partners, to develop Cyberenvironments/CI
      • Producing pilot/production capabilities
      • Advancing technologies along roadmaps
  • Backed by:
      • 20 years of experience in user/community engagement
      • Leadership roles in cutting-edge Cyberenvironment projects in many disciplines
      • Strong R&D efforts in Environments/Grid/Viz/Knowledge Discovery, ...
      • Central role in national/global cyberinfrastructure definition/development
Cyberenvironments Architecture Perspective
[Architecture diagram labels: Community; CyberEnvironments; Security; Applications; Services (HPC, Instrument, Analysis, …); Core Services; Orchestration; Scientific Content/Process Mgmt Services; Collaborative Services; E-Science Services; Data Mgmt; Analytics; Visualization; Stream Mgmt; Community Knowledge Services; instruments; Sensor nets]
Key concepts
  • Lightweight environment frameworks
      • Portlet/plug-in models
      • Contextualized collaboration capabilities
  • Distributed Scientific Content & Process Mgmt / Semantics
      • Tracking provenance
      • Metadata
      • Context-based data discovery, translation, virtualization
      • Base for knowledge services
  • Workflow/Services
      • Ability to integrate independent web services, manage complexities of CI
      • Application/process-oriented interface (schema/ontology-driven)
  • Visual Analytics
      • Identification of features/patterns from one domain in terms of another…
  • Streaming/steering/event-driven science
      • Marshaling additional sensors for interesting phenomena
      • On-demand simulation
  • Living Cyberenvironments
      • End-to-end, e.g. engineering view of cutting-edge science
      • Community managed/evolved
      • Science lifecycle support: research, publication, curation, …
Mosaic and Cyberenvironments
  • Mosaic
      • By the early 1990s, the internet had a wealth of resources, but they were inaccessible to most scientists
      • Hyperlinking and document formatting did nothing new except lower the barriers to information access
  • Cyberenvironments
      • By the early 2000s, the internet and grid had a wealth of interactive resources, but they were inaccessible to most scientists
      • Cyberenvironments will lower barriers to orchestrating these resources
SNAC: My Position Statement
Cyberenvironments have unsolved issues:
  • How do we discover data, services, best practices without hierarchical management?
      • Organization → virtual organizations
      • Disciplines → system science
  • How do we structure large systems projects so they succeed?
      • Can we identify communities who are ‘cyber-ready’?
      • Can we suggest technologies based on community structure?
SNAC: My Position Statement (2)
Cyberenvironments will be a rich resource for network research:
  • Computer-mediated communication
  • Workflow
  • E-notebooks/annotation services
  • Computer-mediated model translation
