SlideShare a Scribd company logo
Deep Learning
for
Extreme Weather Detection
- 1 -
Prabhat
Lawrence Berkeley National Laboratory
SAMSI Climate Workshop
8/23/2017
Outline
•Scientific Motivation
•Deep Learning in Computer Vision
– Trends, Successes
•Deep Learning in Climate Science
– Results, Challenges
•Future of AI in Climate Science
- 2 -
Extreme Weather
- 3 -
- 4 -
- 5 -
- 6 -
- 7 -
How will extreme weather change in the future?
- 8 -
How will extreme weather change in the future?
•Look back in time (Paleoclimate records)
•Look forward in time (Climate simulations)
– Internal climate system variability
– External forcings
– Anthropogenic influence
•Need an objective tool for detecting extremes
– Pattern detection task
– Can Deep Learning come to the rescue?
- 9 -
Climate Simulations
- 10 -
Outline
•
•Deep Learning in Computer Vision
– Trends, Successes
•
–
•
- 11 -
Computer Vision tasks
- 12 -
Deep Learning for Computer Vision
- 13 -
ImageNet Dataset
•1000 object classes
•1.2M training examples
- 14 -
AlexNet Architecture
- 15 -
ImageNet Performance
- 16 -
Can Deep Learning Work for Climate Science?
- 17 -
• Similarities
– Tasks:
• Classification, Localization, Detection, Segmentation
• Clustering
• Feature Learning
• Differences
– Unique attributes of Climate Data
• Multi-channel / Multi-variate
• Different Spatio-temporal scales
• Double precision floating point
• Underlying statistics are likely different
Challenge: Multi-Variate Data
- 18 -
Outline
•
•
–
•Deep Learning in Climate Science
– Results, Challenges
•
- 19 -
- 20 -
Task: Find Extreme Weather Patterns
Supervised Binary Classification
•Training Input: Cropped, Centered, Multi-variate
patches with Labels
–Tropical Cyclone (TC)
–Atmospheric River (AR)
–Weather Front (WF)
•Output: Binary (Yes/No) on Test patches
– Is there a TC in the patch?
– Is there an AR in the patch?
– Is there a WF in the patch?
- 21 -
CLASSIFICATION Image
Dimension
Variables Total Examples
(+ve) (-ve)
Tropical Cyclone 32x32 PSL,UBOT,VBOT,TMQ,
U850,V850,T200,T500
10000 10000
Atmospheric
Rivers
148x224 TMQ, Land Sea mask 6500 6800
Weather Fronts 27x60 T2m, Precip, PSL 5600 6500
Training Data
CLASSIFICATION Conv1 Pool1 Conv2 Pool2 Full Full
Tropical Cyclone 5x5-8 2x2 5x5-16 2x2 50 2
Atmospheric River 12x12-8 3x3 12x12-16 2x2 200 2
Weather Fronts 5x5-16 2x2 5x5-16 2x2 400 2
Supervised Convolutional Architecture
Logistic
Regression
K-Nearest
Neighbor
Support Vector
Machine
Random
Forest
ConvNet
Train Test Train Test Train Test Train Test Train Test
Tropical
Cyclone
96.8 95.85 98.1 97.85 97.0 95.85 99.2 99.4 99.3 99.1
Atmospheric
Rivers
81.97 82.65 79.7 81.7 81.6 83.0 87.9 88.4 90.5 90.0
Weather
Fronts
84.9 89.8 72.46 76.45 84.35 90.2 80.97 87.5 88.7 89.4
Hyper-parameter optimization applied with Spearmint for all methods
Supervised Classification Accuracy
Semi-Supervised Detection
•Objectives:
– Create unified architecture for all weather patterns
– Want to predict bounding box location for weather pattern
– Might have few/no labels for several weather patterns; want
to discover new patterns
- 25 -
Semi-Supervised Convolutional
Architecture
Encoder Decoder
Classification + Bounding Box Regression
Contributors: Evan Racah, Chris Beckham, Tegan Maharaj, Chris
Pal
Reconstruction Results
- 27 -
Original Reconstruction
Classification + Regression Results
- 28 -
Ground Truth
Prediction
Program on Mathematical and Statistical Methods for Climate and the Earth System Opening Workshop, Deep Learning for Extreme Weather Detection - Prabhat, Aug 23, 2017
Weather Front Detection
- 30 -
Contributors: Jim Biard, Ken Kunkel, Evan Racah
- 31 -
- 32 -
- 33 -
Current Status
• SC’17 paper on scaling semi-supervised
architecture to 600,000 cores, 15PF performance
•2 papers under review
–BAMS article: “Deep Learning for Detecting Extreme
Weather Patterns”
– NIPS paper: “ExtremeWeather: A large-scale climate
dataset for semi-supervised detection, localization, and
understanding of extreme weather events”
•Upcoming papers
– Front detection
– Atmospheric river detection
- 34 -
Next Steps
• Goal: Create unified architecture for finding
extreme weather in all climate datasets
• Create schema and reference dataset for climate
community; release reference architecture
• Hybrid LSTM + CNN architectures for storm tracking
• Classification -> Localization -> Detection ->
Segmentation
- 35 -
Open Challenges
• Hyper-Parameter Optimization
– Tuning #layers, #filters, learning rates, schedule is a black art
• Performance and Scaling
– Current networks take days to train on O(10) GB datasets, we
have O(10TB) datasets on hand
• Scarcity of Labeled Data
– Community needs to self-organize and run labeling campaigns
– Need some theory around limits of semi-supervised learning
• Interpretability and Visualization
– ‘Black Box’ classifier
– Need to incorporate domain science principles (physical
consistency, etc)
- 36 -
Outline
•
•
–
•
–
•Future of AI in Climate Science
- 37 -
Timeline of 2 communities
- 38 -
1970 1980 1990 2000 2010
• Neural Nets • Multi-layer
perceptrons,
Backpropagation
• Eigenface
• First principles
Physics,
Geometry
• Random Forests
• Hand-tuned
features (SIFT,
HOG, …)
• Edge, Corner
detectors
• SVMs, Machine
Learning
• PASCAL VOC
• ImageNet
• ILSVRC
• AlexNet
• ResNet
• …
• Hand-tuned features, first principles physics,…
• IMILAST, ARTMIP
Pattern Detection in Computer Vision
Pattern Detection in Climate Science
Disclaimer…
- 39 -
2018-2020
•Climate Science community will self-organize and
conduct labeling campaigns
– Datasets and reference architectures will become easily
available
•Researchers will exploit low-hanging fruit
– Classification, Clustering, Regression problems will be
(nearly) completely solved
•Broad applicability for analyzing observational
(satellite, weather station) and model (reanalysis,
CMIP-6) output
- 40 -
2020+
•Broad Deployment of tools at HPC centers, Cloud
and universities
– Interactive analytics on 100TB-PB datasets
•Entire climate archives are segmented and
classified
– Anomaly detection; Correlation; Causal Analysis
•Hard questions are formulated and addressed
– Interpretability, incorporating domain science principles
• What is the ‘value add’ of the climate data analyst?
- 41 -
2020+ Climate Analyst Workflow
- 42 -
CMIP
archives
Observational
Data
• Interactive Exploration
• Semantic Labels
• Patterns
• Clusters
• Anomalies
• Mechanisms
• Hypothesis
Conclusions
• Characterizing Extreme Weather in future climate
regimes is an important goal
• Deep Learning can be applied effectively to solve pattern
recognition problem
– Supervised and semi-supervised CNNs
– Segmentation and LSTM architectures in development
• Unique challenges
– Performance and Scaling, Interpretability, Hyper-parameter
Optimization, Lack of labeled data
• Potential for innovation and impact, open to
collaboration!
- 43 -
Acknowledgements
• Intel Research: Nadathur Satish, Narayanan Sundaram, Mostofa
Patwary, Amir Khosrowshahi, Pradeep Dubey, Joe Curley
• Stanford: Ioannis Mitliagkas, Jian Yang, Yunjie Liu, Chris Re
• UC Berkeley: Mayur Mudigonda
• MILA: Evan Racah, Chris Beckham, Tegan Maharaj, Chris Pal
• LBNL: Thorsten Kurth, Wahid Bhimji, Michael Wehner, William
Collins
• NOAA: Jim Biard, Ken Kunkel
- 44 -
Questions?
prabhat@lbl.gov

More Related Content

PDF
Program on Mathematical and Statistical Methods for Climate and the Earth Sys...
PDF
Program on Mathematical and Statistical Methods for Climate and the Earth Sys...
PDF
Program on Mathematical and Statistical Methods for Climate and the Earth Sys...
PDF
Program on Mathematical and Statistical Methods for Climate and the Earth Sys...
PDF
Program on Mathematical and Statistical Methods for Climate and the Earth Sys...
PDF
Program on Mathematical and Statistical Methods for Climate and the Earth Sys...
PDF
Program on Mathematical and Statistical Methods for Climate and the Earth Sys...
PDF
Program on Mathematical and Statistical Methods for Climate and the Earth Sys...
Program on Mathematical and Statistical Methods for Climate and the Earth Sys...
Program on Mathematical and Statistical Methods for Climate and the Earth Sys...
Program on Mathematical and Statistical Methods for Climate and the Earth Sys...
Program on Mathematical and Statistical Methods for Climate and the Earth Sys...
Program on Mathematical and Statistical Methods for Climate and the Earth Sys...
Program on Mathematical and Statistical Methods for Climate and the Earth Sys...
Program on Mathematical and Statistical Methods for Climate and the Earth Sys...
Program on Mathematical and Statistical Methods for Climate and the Earth Sys...

What's hot (20)

PDF
CLIM: Transition Workshop - Optimization Methods in Remote Sensing - Jessica...
PPTX
Improving Physical Parametrizations in Climate Models using Machine Learning
PDF
CLIM Program: Remote Sensing Workshop, Optimization Methods in Remote Sensing...
PDF
IUKWC Workshop Nov16: Developing Hydro-climatic Services for Water Security –...
PDF
IUKWC Workshop Nov16: Developing Hydro-climatic Services for Water Security –...
PDF
DSD-INT 2016 Integrating information sources for inland waters modelling - Ba...
PPT
IUKWC Workshop Nov16: Developing Hydro-climatic Services for Water Security –...
PDF
DSD-INT 2016 Data assimilation to improve volcanic ash forecasts using LOTOS-...
PDF
Time integration of evapotranspiration using a two source surface energy bala...
PDF
DSD-INT 2016 The eWaterCyle global Hydrological forecasting system - Drost
PPTX
Land Cover and Land use Classifiction from Satellite Image Time Series Data u...
PPTX
Testing the global grid of master events for waveform cross correlation with ...
PDF
15 sengupta next_generation_satellite_modelling
PPTX
TU2.T10.1.pptx
PDF
Short-term solar forecasting based on sky images
PPTX
Datta-Barua, URSI AT-RASC, 2015, Canary Islands, Ionospheric-Thermospheric St...
PPT
FR1.L09.2 - ONBOARD RADAR PROCESSING CONCEPTS FOR THE DESDYNI MISSION
PDF
Improved Kalman Filtered Neuro-Fuzzy Wind Speed Predictor For Real Data Set ...
PPT
TU2.L10 - NEXT-GENERATION GLOBAL PRECIPITATION PRODUCTS AND THEIR APPLICATIONS
PPTX
IGARSS2011_radarvolcanology.pptx
CLIM: Transition Workshop - Optimization Methods in Remote Sensing - Jessica...
Improving Physical Parametrizations in Climate Models using Machine Learning
CLIM Program: Remote Sensing Workshop, Optimization Methods in Remote Sensing...
IUKWC Workshop Nov16: Developing Hydro-climatic Services for Water Security –...
IUKWC Workshop Nov16: Developing Hydro-climatic Services for Water Security –...
DSD-INT 2016 Integrating information sources for inland waters modelling - Ba...
IUKWC Workshop Nov16: Developing Hydro-climatic Services for Water Security –...
DSD-INT 2016 Data assimilation to improve volcanic ash forecasts using LOTOS-...
Time integration of evapotranspiration using a two source surface energy bala...
DSD-INT 2016 The eWaterCyle global Hydrological forecasting system - Drost
Land Cover and Land use Classifiction from Satellite Image Time Series Data u...
Testing the global grid of master events for waveform cross correlation with ...
15 sengupta next_generation_satellite_modelling
TU2.T10.1.pptx
Short-term solar forecasting based on sky images
Datta-Barua, URSI AT-RASC, 2015, Canary Islands, Ionospheric-Thermospheric St...
FR1.L09.2 - ONBOARD RADAR PROCESSING CONCEPTS FOR THE DESDYNI MISSION
Improved Kalman Filtered Neuro-Fuzzy Wind Speed Predictor For Real Data Set ...
TU2.L10 - NEXT-GENERATION GLOBAL PRECIPITATION PRODUCTS AND THEIR APPLICATIONS
IGARSS2011_radarvolcanology.pptx
Ad

Similar to Program on Mathematical and Statistical Methods for Climate and the Earth System Opening Workshop, Deep Learning for Extreme Weather Detection - Prabhat, Aug 23, 2017 (20)

PDF
Processing Real-Time Volcano Seismic Measurements Through Redis: David Chaves
PDF
NASA Advanced Computing Environment for Science & Engineering
PDF
How HPC and large-scale data analytics are transforming experimental science
PPT
Data Intensive Applications at UCSD: Driving a Campus Research Cyberinfrastru...
PDF
Program on Mathematical and Statistical Methods for Climate and the Earth Sys...
PPTX
ESCAPE Kick-off meeting - KM3Net, Opening a new window on our universe (Feb 2...
PPT
Environmental Remote Sensing
PDF
Comparison of Fine-tuning and Extension Strategies for Deep Convolutional Neu...
PDF
FUTURE TRENDS OF SEISMIC ANALYSIS
PDF
cv_lukas_mandrake_jpl_update2016_formatted
PDF
HPC Cluster Computing from 64 to 156,000 Cores 
PDF
The U.S. Exascale Computing Project: Status and Plans
PPT
big_data_casestudies_2.ppt
PPTX
Scalable Deep Learning in ExtremeEarth-phiweek19
PDF
Combining remote sensing earth observations and in situ networks: detection o...
PDF
01-10 Exploring new high potential 2D materials - Angioni.pdf
PPT
Supercomputers: Directions in Technology, Architecture, and Applications
PPTX
Developing Computational Skills in the Sciences with Matlab Webinar 2017
PDF
Grid is Dead ? Nimrod on the Cloud
PDF
Numerical inflation: simulation of observational parameters
Processing Real-Time Volcano Seismic Measurements Through Redis: David Chaves
NASA Advanced Computing Environment for Science & Engineering
How HPC and large-scale data analytics are transforming experimental science
Data Intensive Applications at UCSD: Driving a Campus Research Cyberinfrastru...
Program on Mathematical and Statistical Methods for Climate and the Earth Sys...
ESCAPE Kick-off meeting - KM3Net, Opening a new window on our universe (Feb 2...
Environmental Remote Sensing
Comparison of Fine-tuning and Extension Strategies for Deep Convolutional Neu...
FUTURE TRENDS OF SEISMIC ANALYSIS
cv_lukas_mandrake_jpl_update2016_formatted
HPC Cluster Computing from 64 to 156,000 Cores 
The U.S. Exascale Computing Project: Status and Plans
big_data_casestudies_2.ppt
Scalable Deep Learning in ExtremeEarth-phiweek19
Combining remote sensing earth observations and in situ networks: detection o...
01-10 Exploring new high potential 2D materials - Angioni.pdf
Supercomputers: Directions in Technology, Architecture, and Applications
Developing Computational Skills in the Sciences with Matlab Webinar 2017
Grid is Dead ? Nimrod on the Cloud
Numerical inflation: simulation of observational parameters
Ad

More from The Statistical and Applied Mathematical Sciences Institute (20)

PDF
Causal Inference Opening Workshop - Latent Variable Models, Causal Inference,...
PDF
2019 Fall Series: Special Guest Lecture - 0-1 Phase Transitions in High Dimen...
PDF
Causal Inference Opening Workshop - Causal Discovery in Neuroimaging Data - F...
PDF
Causal Inference Opening Workshop - Smooth Extensions to BART for Heterogeneo...
PDF
Causal Inference Opening Workshop - A Bracketing Relationship between Differe...
PDF
Causal Inference Opening Workshop - Testing Weak Nulls in Matched Observation...
PPTX
Causal Inference Opening Workshop - Difference-in-differences: more than meet...
PDF
Causal Inference Opening Workshop - New Statistical Learning Methods for Esti...
PDF
Causal Inference Opening Workshop - Bipartite Causal Inference with Interfere...
PPTX
Causal Inference Opening Workshop - Bridging the Gap Between Causal Literatur...
PDF
Causal Inference Opening Workshop - Some Applications of Reinforcement Learni...
PDF
Causal Inference Opening Workshop - Bracketing Bounds for Differences-in-Diff...
PDF
Causal Inference Opening Workshop - Assisting the Impact of State Polcies: Br...
PDF
Causal Inference Opening Workshop - Experimenting in Equilibrium - Stefan Wag...
PDF
Causal Inference Opening Workshop - Targeted Learning for Causal Inference Ba...
PDF
Causal Inference Opening Workshop - Bayesian Nonparametric Models for Treatme...
PPTX
2019 Fall Series: Special Guest Lecture - Adversarial Risk Analysis of the Ge...
PPTX
2019 Fall Series: Professional Development, Writing Academic Papers…What Work...
PDF
2019 GDRR: Blockchain Data Analytics - Machine Learning in/for Blockchain: Fu...
PDF
2019 GDRR: Blockchain Data Analytics - QuTrack: Model Life Cycle Management f...
Causal Inference Opening Workshop - Latent Variable Models, Causal Inference,...
2019 Fall Series: Special Guest Lecture - 0-1 Phase Transitions in High Dimen...
Causal Inference Opening Workshop - Causal Discovery in Neuroimaging Data - F...
Causal Inference Opening Workshop - Smooth Extensions to BART for Heterogeneo...
Causal Inference Opening Workshop - A Bracketing Relationship between Differe...
Causal Inference Opening Workshop - Testing Weak Nulls in Matched Observation...
Causal Inference Opening Workshop - Difference-in-differences: more than meet...
Causal Inference Opening Workshop - New Statistical Learning Methods for Esti...
Causal Inference Opening Workshop - Bipartite Causal Inference with Interfere...
Causal Inference Opening Workshop - Bridging the Gap Between Causal Literatur...
Causal Inference Opening Workshop - Some Applications of Reinforcement Learni...
Causal Inference Opening Workshop - Bracketing Bounds for Differences-in-Diff...
Causal Inference Opening Workshop - Assisting the Impact of State Polcies: Br...
Causal Inference Opening Workshop - Experimenting in Equilibrium - Stefan Wag...
Causal Inference Opening Workshop - Targeted Learning for Causal Inference Ba...
Causal Inference Opening Workshop - Bayesian Nonparametric Models for Treatme...
2019 Fall Series: Special Guest Lecture - Adversarial Risk Analysis of the Ge...
2019 Fall Series: Professional Development, Writing Academic Papers…What Work...
2019 GDRR: Blockchain Data Analytics - Machine Learning in/for Blockchain: Fu...
2019 GDRR: Blockchain Data Analytics - QuTrack: Model Life Cycle Management f...

Recently uploaded (20)

PPTX
Microbial diseases, their pathogenesis and prophylaxis
PDF
Microbial disease of the cardiovascular and lymphatic systems
PPTX
human mycosis Human fungal infections are called human mycosis..pptx
PDF
Classroom Observation Tools for Teachers
PDF
Physiotherapy_for_Respiratory_and_Cardiac_Problems WEBBER.pdf
PDF
2.FourierTransform-ShortQuestionswithAnswers.pdf
PPTX
school management -TNTEU- B.Ed., Semester II Unit 1.pptx
PDF
BÀI TẬP BỔ TRỢ 4 KỸ NĂNG TIẾNG ANH 9 GLOBAL SUCCESS - CẢ NĂM - BÁM SÁT FORM Đ...
PDF
Basic Mud Logging Guide for educational purpose
PPTX
BOWEL ELIMINATION FACTORS AFFECTING AND TYPES
PPTX
Cell Structure & Organelles in detailed.
PDF
grade 11-chemistry_fetena_net_5883.pdf teacher guide for all student
PDF
Black Hat USA 2025 - Micro ICS Summit - ICS/OT Threat Landscape
PDF
Complications of Minimal Access Surgery at WLH
PDF
Insiders guide to clinical Medicine.pdf
PDF
01-Introduction-to-Information-Management.pdf
PDF
TR - Agricultural Crops Production NC III.pdf
PPTX
1st Inaugural Professorial Lecture held on 19th February 2020 (Governance and...
PDF
Anesthesia in Laparoscopic Surgery in India
PDF
Saundersa Comprehensive Review for the NCLEX-RN Examination.pdf
Microbial diseases, their pathogenesis and prophylaxis
Microbial disease of the cardiovascular and lymphatic systems
human mycosis Human fungal infections are called human mycosis..pptx
Classroom Observation Tools for Teachers
Physiotherapy_for_Respiratory_and_Cardiac_Problems WEBBER.pdf
2.FourierTransform-ShortQuestionswithAnswers.pdf
school management -TNTEU- B.Ed., Semester II Unit 1.pptx
BÀI TẬP BỔ TRỢ 4 KỸ NĂNG TIẾNG ANH 9 GLOBAL SUCCESS - CẢ NĂM - BÁM SÁT FORM Đ...
Basic Mud Logging Guide for educational purpose
BOWEL ELIMINATION FACTORS AFFECTING AND TYPES
Cell Structure & Organelles in detailed.
grade 11-chemistry_fetena_net_5883.pdf teacher guide for all student
Black Hat USA 2025 - Micro ICS Summit - ICS/OT Threat Landscape
Complications of Minimal Access Surgery at WLH
Insiders guide to clinical Medicine.pdf
01-Introduction-to-Information-Management.pdf
TR - Agricultural Crops Production NC III.pdf
1st Inaugural Professorial Lecture held on 19th February 2020 (Governance and...
Anesthesia in Laparoscopic Surgery in India
Saundersa Comprehensive Review for the NCLEX-RN Examination.pdf

Program on Mathematical and Statistical Methods for Climate and the Earth System Opening Workshop, Deep Learning for Extreme Weather Detection - Prabhat, Aug 23, 2017

  • 1. Deep Learning for Extreme Weather Detection - 1 - Prabhat Lawrence Berkeley National Laboratory SAMSI Climate Workshop 8/23/2017
  • 2. Outline •Scientific Motivation •Deep Learning in Computer Vision – Trends, Successes •Deep Learning in Climate Science – Results, Challenges •Future of AI in Climate Science - 2 -
  • 8. How will extreme weather change in the future? - 8 -
  • 9. How will extreme weather change in the future? •Look back in time (Paleoclimate records) •Look forward in time (Climate simulations) – Internal climate system variability – External forcings – Anthropogenic influence •Need an objective tool for detecting extremes – Pattern detection task – Can Deep Learning come to the rescue? - 9 -
  • 11. Outline • •Deep Learning in Computer Vision – Trends, Successes • – • - 11 -
  • 13. Deep Learning for Computer Vision - 13 -
  • 14. ImageNet Dataset •1000 object classes •1.2M training examples - 14 -
  • 17. Can Deep Learning Work for Climate Science? - 17 - • Similarities – Tasks: • Classification, Localization, Detection, Segmentation • Clustering • Feature Learning • Differences – Unique attributes of Climate Data • Multi-channel / Multi-variate • Different Spatio-temporal scales • Double precision floating point • Underlying statistics are likely different
  • 19. Outline • • – •Deep Learning in Climate Science – Results, Challenges • - 19 -
  • 20. - 20 - Task: Find Extreme Weather Patterns
  • 21. Supervised Binary Classification •Training Input: Cropped, Centered, Multi-variate patches with Labels –Tropical Cyclone (TC) –Atmospheric River (AR) –Weather Front (WF) •Output: Binary (Yes/No) on Test patches – Is there a TC in the patch? – Is there an AR in the patch? – Is there a WF in the patch? - 21 -
  • 22. CLASSIFICATION Image Dimension Variables Total Examples (+ve) (-ve) Tropical Cyclone 32x32 PSL,UBOT,VBOT,TMQ, U850,V850,T200,T500 10000 10000 Atmospheric Rivers 148x224 TMQ, Land Sea mask 6500 6800 Weather Fronts 27x60 T2m, Precip, PSL 5600 6500 Training Data
  • 23. CLASSIFICATION Conv1 Pool1 Conv2 Pool2 Full Full Tropical Cyclone 5x5-8 2x2 5x5-16 2x2 50 2 Atmospheric River 12x12-8 3x3 12x12-16 2x2 200 2 Weather Fronts 5x5-16 2x2 5x5-16 2x2 400 2 Supervised Convolutional Architecture
  • 24. Logistic Regression K-Nearest Neighbor Support Vector Machine Random Forest ConvNet Train Test Train Test Train Test Train Test Train Test Tropical Cyclone 96.8 95.85 98.1 97.85 97.0 95.85 99.2 99.4 99.3 99.1 Atmospheric Rivers 81.97 82.65 79.7 81.7 81.6 83.0 87.9 88.4 90.5 90.0 Weather Fronts 84.9 89.8 72.46 76.45 84.35 90.2 80.97 87.5 88.7 89.4 Hyper-parameter optimization applied with Spearmint for all methods Supervised Classification Accuracy
  • 25. Semi-Supervised Detection •Objectives: – Create unified architecture for all weather patterns – Want to predict bounding box location for weather pattern – Might have few/no labels for several weather patterns; want to discover new patterns - 25 -
  • 26. Semi-Supervised Convolutional Architecture Encoder Decoder Classification + Bounding Box Regression Contributors: Evan Racah, Chris Beckham, Tegan Maharaj, Chris Pal
  • 27. Reconstruction Results - 27 - Original Reconstruction
  • 28. Classification + Regression Results - 28 - Ground Truth Prediction
  • 30. Weather Front Detection - 30 - Contributors: Jim Biard, Ken Kunkel, Evan Racah
  • 34. Current Status • SC’17 paper on scaling semi-supervised architecture to 600,000 cores, 15PF performance •2 papers under review –BAMS article: “Deep Learning for Detecting Extreme Weather Patterns” – NIPS paper: “ExtremeWeather: A large-scale climate dataset for semi-supervised detection, localization, and understanding of extreme weather events” •Upcoming papers – Front detection – Atmospheric river detection - 34 -
  • 35. Next Steps • Goal: Create unified architecture for finding extreme weather in all climate datasets • Create schema and reference dataset for climate community; release reference architecture • Hybrid LSTM + CNN architectures for storm tracking • Classification -> Localization -> Detection -> Segmentation - 35 -
  • 36. Open Challenges • Hyper-Parameter Optimization – Tuning #layers, #filters, learning rates, schedule is a black art • Performance and Scaling – Current networks take days to train on O(10) GB datasets, we have O(10TB) datasets on hand • Scarcity of Labeled Data – Community needs to self-organize and run labeling campaigns – Need some theory around limits of semi-supervised learning • Interpretability and Visualization – ‘Black Box’ classifier – Need to incorporate domain science principles (physical consistency, etc) - 36 -
  • 37. Outline • • – • – •Future of AI in Climate Science - 37 -
  • 38. Timeline of 2 communities - 38 - 1970 1980 1990 2000 2010 • Neural Nets • Multi-layer perceptrons, Backpropagation • Eigenface • First principles Physics, Geometry • Random Forests • Hand-tuned features (SIFT, HOG, …) • Edge, Corner detectors • SVMs, Machine Learning • PASCAL VOC • ImageNet • ILSVRC • AlexNet • ResNet • … • Hand-tuned features, first principles physics,… • IMILAST, ARTMIP Pattern Detection in Computer Vision Pattern Detection in Climate Science
  • 40. 2018-2020 •Climate Science community will self-organize and conduct labeling campaigns – Datasets and reference architectures will become easily available •Researchers will exploit low-hanging fruit – Classification, Clustering, Regression problems will be (nearly) completely solved •Broad applicability for analyzing observational (satellite, weather station) and model (reanalysis, CMIP-6) output - 40 -
  • 41. 2020+ •Broad Deployment of tools at HPC centers, Cloud and universities – Interactive analytics on 100TB-PB datasets •Entire climate archives are segmented and classified – Anomaly detection; Correlation; Causal Analysis •Hard questions are formulated and addressed – Interpretability, incorporating domain science principles • What is the ‘value add’ of the climate data analyst? - 41 -
  • 42. 2020+ Climate Analyst Workflow - 42 - CMIP archives Observational Data • Interactive Exploration • Semantic Labels • Patterns • Clusters • Anomalies • Mechanisms • Hypothesis
  • 43. Conclusions • Characterizing Extreme Weather in future climate regimes is an important goal • Deep Learning can be applied effectively to solve pattern recognition problem – Supervised and semi-supervised CNNs – Segmentation and LSTM architectures in development • Unique challenges – Performance and Scaling, Interpretability, Hyper-parameter Optimization, Lack of labeled data • Potential for innovation and impact, open to collaboration! - 43 -
  • 44. Acknowledgements • Intel Research: Nadathur Satish, Narayanan Sundaram, Mostofa Patwary, Amir Khosrowshahi, Pradeep Dubey, Joe Curley • Stanford: Ioannis Mitliagkas, Jian Yang, Yunjie Liu, Chris Re • UC Berkeley: Mayur Mudigonda • MILA: Evan Racah, Chris Beckham, Tegan Maharaj, Chris Pal • LBNL: Thorsten Kurth, Wahid Bhimji, Michael Wehner, William Collins • NOAA: Jim Biard, Ken Kunkel - 44 -