Opportunities with very large high resolution
climate model datasets
Extreme event attribution
Projections
Machine learning
Michael F. Wehner
Lawrence Berkeley National Laboratory
mfwehner@lbl.gov
US DOE Policy 411.2A
SUBJECT: SCIENTIFIC INTEGRITY
When expressing opinions on policy matters to the public and media,
research personnel must make it clear when they are expressing their
personal views, rather than those of the Department, the U.S.
Government, or their respective institutions. Public representation of
Government or DOE positions or policies must be cleared through
their program management to include DOE headquarters.
Program on Mathematical and Statistical Methods for Climate and the Earth System Opening Workshop, Computational and Mathematical Challenges in Climate Modeling - Michael Wehner, Aug 21, 2017
Icosahedral High Resolution
More modern discretizations
Resolution
1km: Cloud system resolving models are a transformational change
25km: Upper limit of climate models with cloud parameterizations
200km: Typical resolution of IPCC AR4 models
[Figure color scale: Surface Altitude (feet)]
Technology
Moore’s law is alive and well.
The largest computers continually get faster, and so do models.
1990 AMIP1: Many modeling groups required a calendar year to complete a 10-year integration of a stand-alone atmospheric general circulation model. Typical grid resolution was T21, or about 600km (64x32x10).
2017: I get ~1 simulated year per wall-clock day for the same calculation, except at 25km (1152x768x30).
This calculation used only 7680 processors on a 120,000 processor machine
• 5 million processor hours.
• 25 km grid cell
• Took about 3 months to complete in 2012. Typically, I get better throughput now.
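The figures on this slide can be checked with a few lines of arithmetic. The sketch below is illustrative only: it takes the grid dimensions and throughput numbers quoted above at face value, and all variable names are made up for this example. It ignores the shorter time step that a finer grid requires, so the cell-count ratio understates the real growth in work per simulated year.

```python
# Grid sizes quoted on the slide (longitude x latitude x levels).
GRID_1990 = (64, 32, 10)       # T21, roughly 600 km
GRID_2017 = (1152, 768, 30)    # roughly 25 km


def cell_count(grid):
    """Total number of grid cells in a (nlon, nlat, nlev) grid."""
    nlon, nlat, nlev = grid
    return nlon * nlat * nlev


cells_1990 = cell_count(GRID_1990)
cells_2017 = cell_count(GRID_2017)
cell_ratio = cells_2017 / cells_1990

# Throughput in simulated years per wall-clock day (SYPD).
sypd_1990 = 10 / 365.0   # 10 simulated years in one calendar year
sypd_2017 = 1.0          # "~1 simulated year / wall clock day"
speedup = sypd_2017 / sypd_1990

# Processor-hours consumed per simulated year on 7680 processors at 1 SYPD.
proc_hours_per_sim_year = 7680 * 24 / sypd_2017

print(cells_1990, cells_2017, cell_ratio)
print(speedup, proc_hours_per_sim_year)
```

Under these assumptions the 25km grid has 1296 times as many cells as the T21 grid, yet runs about 36 times faster in simulated years per day.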
[Three figure slides showing storms by category: Tropical Storm and Cat1-Cat5; Cat1-Cat5; Cat4-Cat5. Figures by Prabhat]
Tropical Cyclone min pressure vs max wind speed
Total # TC / year
observations 87±8
cam5.1 84±9
Total # hurricanes / year
observations 49±7
cam5.1 52
Figures by Cheng-Ta Chen
The strongest hurricanes get more intense.
[Figure: average annual most intense tropical cyclone wind speed (m/s); panels +0.85°C, +1.5°C, +2°C, +4.0°C]
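The category labels in the preceding figures (Tropical Storm, Cat1 to Cat5) presumably follow the Saffir-Simpson scale, which is defined on maximum sustained wind speed. The sketch below shows how a wind speed in m/s might be binned that way. The function name is hypothetical, and the slides do not say which wind definition or averaging period the model diagnostics use, so treat the thresholds as the standard published ones rather than as the author's exact procedure.

```python
MS_TO_KNOTS = 1.943844  # 1 m/s expressed in knots

# Lower bounds in knots for each class, highest first so the first match wins.
_THRESHOLDS_KT = [
    (137, "Cat5"),
    (113, "Cat4"),
    (96, "Cat3"),
    (83, "Cat2"),
    (64, "Cat1"),
    (34, "Tropical Storm"),
]


def saffir_simpson(wind_ms):
    """Return the storm class for a maximum sustained wind given in m/s."""
    knots = round(wind_ms * MS_TO_KNOTS)
    for lower, label in _THRESHOLDS_KT:
        if knots >= lower:
            return label
    # Below tropical-storm strength; this label does not appear on the slides.
    return "Tropical Depression"


for speed in (20, 35, 45, 55, 65, 75):
    print(speed, saffir_simpson(speed))
```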
Real storms can be tracked by hand. They happen in real time!
Tracking of simulated storms must be automated. There are too many to count.
Two approaches.
Traditional, “parametric” feature tracking based on conditions.
• Hurricanes: co-located vorticity maxima, pressure minima, warm cores.
• Extratropical cyclones: co-located vorticity maxima, pressure minima.
• Atmospheric rivers: precipitable water, integrated water transport, etc. (ARTMIP)
• Blocking, fronts, meso-scale convective systems.
Supervised Machine-learning
• Convolutional neural networks.
• Need to have a training data set.
Tracking storms
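A minimal sketch of the parametric idea for one storm type, assuming gridded sea level pressure and vorticity on a small regular grid. The function name, thresholds, and toy data are hypothetical. This is not TECA's algorithm: it omits the warm-core test and any handling of longitude wrap-around, and only illustrates what "co-located pressure minima and vorticity maxima" can mean in code.

```python
def find_candidates(psl, vort, psl_max, vort_min):
    """Return (row, col) of interior grid points that are strict local
    pressure minima at or below psl_max with vorticity of at least vort_min."""
    candidates = []
    nrows, ncols = len(psl), len(psl[0])
    for i in range(1, nrows - 1):
        for j in range(1, ncols - 1):
            centre = psl[i][j]
            neighbours = [
                psl[i + di][j + dj]
                for di in (-1, 0, 1)
                for dj in (-1, 0, 1)
                if (di, dj) != (0, 0)
            ]
            is_local_min = all(centre < n for n in neighbours)
            if is_local_min and centre <= psl_max and vort[i][j] >= vort_min:
                candidates.append((i, j))
    return candidates


# Toy 7x7 fields (hPa and 1/s): a deep low with strong vorticity at (2, 2)
# and a shallow dip with no vorticity at (4, 4).
psl = [[1010.0] * 7 for _ in range(7)]
vort = [[0.0] * 7 for _ in range(7)]
psl[2][2], vort[2][2] = 960.0, 5e-4
psl[4][4] = 1005.0

print(find_candidates(psl, vort, psl_max=1000.0, vort_min=1e-4))
```

Only the deep low passes both conditions. The shallow dip is a local pressure minimum but fails the pressure and vorticity thresholds.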
Two steps:
1. Candidate detection
2. Continuity in time & space. (Stitching/tracking)
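The second step can be sketched as a greedy nearest-neighbour link between consecutive time steps. Everything here is an assumption made for illustration: the function name, the use of plain Euclidean distance in grid units (a real tracker would more likely use great-circle distance), and the rule that a track ends as soon as no candidate lies within `max_step`. It is not a description of how TECA stitches tracks.

```python
import math


def stitch_tracks(detections, max_step):
    """Greedy nearest-neighbour stitching.

    detections: list over time steps; each entry is a list of (x, y) points.
    max_step:   largest distance a storm centre may move between steps.
    Returns a list of tracks, each a list of (t, x, y) tuples.
    """
    tracks = []   # every track found so far
    active = []   # tracks that were extended at the previous time step
    for t, points in enumerate(detections):
        unused = list(points)
        still_active = []
        for track in active:
            _, px, py = track[-1]
            best, best_dist = None, max_step
            for p in unused:
                d = math.hypot(p[0] - px, p[1] - py)
                if d <= best_dist:
                    best, best_dist = p, d
            if best is not None:
                unused.remove(best)
                track.append((t, best[0], best[1]))
                still_active.append(track)
        # Any candidate not claimed by an existing track starts a new one.
        for p in unused:
            new_track = [(t, p[0], p[1])]
            tracks.append(new_track)
            still_active.append(new_track)
        active = still_active
    return tracks


detections = [[(0, 0), (10, 10)], [(1, 0), (10, 11)], [(2, 1)]]
tracks = stitch_tracks(detections, max_step=2)
for track in tracks:
    print(track)
```

In this toy input the first storm is followed through all three time steps, while the second ends after two steps because no candidate remains near it.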
Toolkit for Extreme Climate Analysis
https://github.com/LBL-EESA/TECA
Highly parallel (I routinely use 29200 processors for TC tracking)
TECA2: parallel parametric feature tracking
Tropical cyclone detection.
Movie courtesy of Burlen Loring
Can Deep Learning Work for Climate Science?
Similarities to Computer Vision
• Tasks:
– Pattern Classification
– Clustering
– Feature Learning
– Anomaly Detection
Differences
• Unique attributes of Climate Data
– Multi-channel / Multi-variate
– Spatio-temporal
– Statistics are likely different
Challenge: Multi-Variate Data
Task: Find Extreme Weather Patterns in a box
Supervised Learning
Training Input: Cropped, Centered, Multi-variate patches with Labels*
• Tropical Cyclone (TC)
• Atmospheric River (AR)
• Weather Front (WF)
• TC & AR labels are provided by TECA, which
implemented human-specified criteria
• WF is a hand crafted data set (5 FTE-years)
Output: Binary (Yes/No) on Test patches
• Is there a TC in the patch?
• Is there an AR in the patch?
• Is there a WF in the patch?
Currently, we have separate convolutional neural nets for these 3 storm types.
– Our goal is to have just one machine learning algorithm for all storms.
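A sketch of how a cropped, centered, multi-variate training patch with a binary label might be assembled. The helper names and the dictionary layout are hypothetical; the slides only say that patches are cropped and centered and that TC and AR labels come from TECA. Plain nested lists are used so the example has no dependencies.

```python
def crop_patch(fields, centre, size):
    """Crop a size x size patch centred on `centre` from every variable.

    fields: dict mapping variable name -> 2-D list (rows x cols).
    centre: (row, col) of the patch centre.
    Returns a dict of cropped 2-D lists, or None if the patch would
    fall off the edge of the grid.
    """
    half = size // 2
    r0, c0 = centre[0] - half, centre[1] - half
    any_field = next(iter(fields.values()))
    nrows, ncols = len(any_field), len(any_field[0])
    if r0 < 0 or c0 < 0 or r0 + size > nrows or c0 + size > ncols:
        return None
    return {
        name: [row[c0:c0 + size] for row in grid[r0:r0 + size]]
        for name, grid in fields.items()
    }


def make_example(fields, centre, size, has_storm):
    """Pair a patch with a binary label (1 = storm present, 0 = absent)."""
    patch = crop_patch(fields, centre, size)
    if patch is None:
        return None
    return {"patch": patch, "label": 1 if has_storm else 0}


# Toy 64x64 fields for two of the variables named on the slides.
fields = {
    "PSL": [[i * 100 + j for j in range(64)] for i in range(64)],
    "TMQ": [[0.0] * 64 for _ in range(64)],
}
example = make_example(fields, centre=(32, 32), size=32, has_storm=True)
print(len(example["patch"]["PSL"]), len(example["patch"]["PSL"][0]), example["label"])
```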
Machine learning Training Data
Classification      Image Dimension  Variables                                     Total Examples (+ve)  Total Examples (-ve)
Tropical Cyclone    32x32            PSL, UBOT, VBOT, TMQ, U850, V850, T200, T500  10000                 10000
Atmospheric Rivers  148x224          TMQ, Land Sea mask                            6500                  6800
Weather Fronts      27x60            T2m, Precip, PSL                              5600                  6500
Supervised Classification Accuracy (%)
                    Logistic Regression     K-Nearest Neighbor      Support Vector Machine  Random Forest           ConvNet
                    Train   Test            Train   Test            Train   Test            Train   Test            Train   Test
Tropical Cyclone    96.8    95.85           98.1    97.85           97.0    95.85           99.2    99.4            99.3    99.1
Atmospheric Rivers  81.97   82.65           79.7    81.7            81.6    83.0            87.9    88.4            90.5    90.0
Weather Fronts      84.9    89.8            72.46   76.45           84.35   90.2            80.97   87.5            88.7    89.4
Hyper-parameter optimization applied with Spearmint for all methods
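To make the comparison in the accuracy table easier to read, the sketch below copies the reported train and test numbers into a dictionary and picks the method with the highest test accuracy for each storm type. The data structure and function name are made up for this example. It only re-reads the reported values and says nothing about statistical significance.

```python
# (train, test) accuracy in percent, copied from the accuracy table on this slide.
ACCURACY = {
    "Tropical Cyclone": {
        "Logistic Regression": (96.8, 95.85),
        "K-Nearest Neighbor": (98.1, 97.85),
        "Support Vector Machine": (97.0, 95.85),
        "Random Forest": (99.2, 99.4),
        "ConvNet": (99.3, 99.1),
    },
    "Atmospheric Rivers": {
        "Logistic Regression": (81.97, 82.65),
        "K-Nearest Neighbor": (79.7, 81.7),
        "Support Vector Machine": (81.6, 83.0),
        "Random Forest": (87.9, 88.4),
        "ConvNet": (90.5, 90.0),
    },
    "Weather Fronts": {
        "Logistic Regression": (84.9, 89.8),
        "K-Nearest Neighbor": (72.46, 76.45),
        "Support Vector Machine": (84.35, 90.2),
        "Random Forest": (80.97, 87.5),
        "ConvNet": (88.7, 89.4),
    },
}


def best_on_test(scores):
    """Name of the method with the highest test accuracy."""
    return max(scores, key=lambda method: scores[method][1])


winners = {storm: best_on_test(scores) for storm, scores in ACCURACY.items()}
for storm, method in winners.items():
    print(storm, method, ACCURACY[storm][method][1])
```

On these numbers the ConvNet has the best test accuracy for atmospheric rivers, while Random Forest (tropical cyclones) and the Support Vector Machine (weather fronts) edge it out by a few tenths of a percent.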
Weather Front Detection
Contributors: Jim Biard, Ken Kunkel, Evan Racah
Current status
Contact Prabhat about Machine Learning details
prabhat@lbl.gov
Hyper-Parameter Optimization
• Tuning #layers, #filters, learning rates, schedule is a black art
Performance and Scaling
• Current networks take days to train on O(10 GB) datasets; we have O(10 TB) datasets on hand
Scarcity of Labeled Data
• Community needs to self-organize and run labeling campaigns
Interpretability and Visualization
• ‘Black Box’ classifier
Deep Learning is viable for Pattern Detection in Climate Data
• Supervised architectures can match TECA performance
• Open challenges in semi-supervised, unsupervised learning and
interpretability
• Need more ground truth catalogs and training data!
• When extreme weather happens, the public wants to know
– “Is this climate change?”
Extreme Event Attribution
• Not quite the correct question, better to ask:
– “How has the risk of this event changed because of climate change?”
Or
– “How did climate change affect the magnitude of this event?”
Extreme Event Attribution
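The "change in risk" framing is often summarized with two numbers: a risk ratio and a fraction of attributable risk. The sketch below assumes the probability of exceeding some event threshold has already been estimated for the factual world and for a counterfactual world without human influence. The function names and the example probabilities are illustrative and do not come from the talk.

```python
def risk_ratio(p_factual, p_counterfactual):
    """How many times more likely the event is in the factual world."""
    if p_counterfactual <= 0:
        raise ValueError("counterfactual probability must be positive")
    return p_factual / p_counterfactual


def fraction_attributable_risk(p_factual, p_counterfactual):
    """Share of the factual risk attributable to the change in climate."""
    if p_factual <= 0:
        raise ValueError("factual probability must be positive")
    return 1.0 - p_counterfactual / p_factual


# Hypothetical probabilities: a 1-in-100 event that became a 1-in-50 event.
rr = risk_ratio(0.02, 0.01)
far = fraction_attributable_risk(0.02, 0.01)
print(rr, far)
```

A risk ratio of 2 corresponds to a fraction of attributable risk of 0.5: half of the present-day risk would be attributed to the change in climate.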
Severe floods occurred along the Colorado Front Range during the second week of September 2013, impacting several thousand people and many homes, roads, and businesses.
• At least 10 deaths; 11,000 evacuated
• Nearly 19,000 homes damaged, and over 1,500 destroyed, costing $2 bn
• Several highway bridges damaged/destroyed, and rail lines affected
[Photos: Lyons, CO (usatoday.com); South Platte River, CO (nytimes.com)]
The 2013 Colorado Floods
P Pall, C Patricola, M Wehner, D Stone, C Paciorek, W Collins. In press.
Colorado Floods September 2013
A more constrained numerical experiment
Step 1 (NCEP RE-ANALYSIS): Use Sep 2013 weather from NCEP re-analysis, both under human and adjusted natural conditions (adjusted T, u, v, RH, etc.)...
Step 2 (WRF MODEL): ...to drive an ensemble of 100 regional model simulations (WRF 12km)...
Step 3 (SOUTH PLATTE BASIN (CO)): ...then extract rain over South Platte basin.
Step 4 (DISTRIBUTIONS OF ENSEMBLE RAINFALL, Human vs. Natural): Simulations suggest a substantial human-induced influence on South Platte rainfall...
Step 5 (INCREASE IN ODDS OF HEAVY RAINFALL): ...with a best estimate of about a doubling in odds of heavy rainfall occurrence.
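The last step of this experiment, turning two rainfall ensembles into a change in odds, can be sketched as follows. The ensembles below are synthetic placeholders chosen so the arithmetic is easy to follow, and they are not results from the study. The function names and the 100 mm threshold are likewise assumptions for illustration.

```python
def exceedance_probability(values, threshold):
    """Fraction of ensemble members whose rainfall exceeds the threshold."""
    return sum(1 for v in values if v > threshold) / len(values)


def odds(p):
    """Convert a probability strictly between 0 and 1 into odds."""
    if not 0.0 < p < 1.0:
        raise ValueError("odds are undefined for p <= 0 or p >= 1")
    return p / (1.0 - p)


def odds_ratio(factual, counterfactual, threshold):
    """Odds of exceeding the threshold, factual relative to counterfactual."""
    p1 = exceedance_probability(factual, threshold)
    p0 = exceedance_probability(counterfactual, threshold)
    return odds(p1) / odds(p0)


# Synthetic 100-member ensembles of basin rainfall (mm), NOT study data:
# 20 "human" members and 10 "natural" members exceed 100 mm.
human = [50.0] * 80 + [150.0] * 20
natural = [50.0] * 90 + [150.0] * 10

ratio = odds_ratio(human, natural, threshold=100.0)
print(ratio)
```

With these placeholder numbers the probability of heavy rainfall doubles (0.1 to 0.2) while the odds ratio is 2.25, which shows that a "doubling in odds" and a "doubling in probability" are close but not identical statements when the event is not rare.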
Colorado Sep 2013 floods: Mechanistic approach
• We find a substantial shift in our rainfall distributions over the South Platte basin
(increase in mean of ~30%)
-> beyond a thermodynamic (~7-14%/K) induced increase, given ΔT = ~1.5-2K
• But increase in precipitable water (~15%) appears broadly consistent with C-C
• The 30% increase is a result of increased cumulus convective energy
• Not a result of changes in larger scale dynamics or uplifting.
• The “storm that was” was more violent than the “storm that might have been”.
7-DAY RAINFALL
P. Pall, et al. (2016) Diagnosing Anthropogenic Contributions to Heavy Colorado Rainfall in
September 2013. to appear in Weather and Climate Extremes
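The comparison with Clausius-Clapeyron (C-C) scaling can be made concrete with a short calculation. The sketch below assumes the commonly quoted C-C rate of about 7% per kelvin, compounded over the warming range ΔT = 1.5 to 2 K given on the slide. The constant and function name are illustrative, and this is a back-of-the-envelope check, not the analysis in the paper.

```python
CC_RATE_PER_K = 0.07  # assumed C-C rate: ~7% more water vapour per kelvin


def cc_increase(delta_t_k, rate_per_k=CC_RATE_PER_K):
    """Fractional increase from compounding rate_per_k over delta_t_k kelvin."""
    return (1.0 + rate_per_k) ** delta_t_k - 1.0


low = cc_increase(1.5)
high = cc_increase(2.0)
print(round(100 * low, 1), round(100 * high, 1))
```

Under this assumption the expected thermodynamic increase is roughly 11% to 14%, which is in line with the ~15% increase in precipitable water and well short of the ~30% increase in mean rainfall.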
zarzycki@ucar.edu - University of Colorado, Boulder, CO, April 2016
Typhoon Haiyan
• Use VR-CESM in “forecast mode”
• ATM: GFS analysis
• OCN: NOAA OI
• Ensembles of 120 hr forecasts
Init: 12Z 11-04-2013
NOAA IR Obs: 11-07 21Z
111km: 11-07 21Z
8km: 11-07 21Z
Typhoon Haiyan
• Forecast pretty good!
• Little overall change in forecast track
Obs.
All-Hist
Nat-Hist
Present day storm (red) was slightly weaker than the counterfactual storm (blue)
Colder counterfactual SST alone (green) weakened the storm.
Counterfactual initial conditions alone (yellow) intensified the storm.
Changes in winds and shear had little effect.
Colder upper air temperature changes alone intensified the storm.
Lots of unanswered questions. CAM5 vs MIROC5?
Typhoon Haiyan
Video courtesy of Andreas Prein NCAR
Convective outbreak in May 2010
• Objective-based analysis makes it possible to evaluate the model on the storm scale
Observation WRF 4 km
Hurricane Katrina
• Hindcast that was (red)
• Hindcast that might have been (blue)
• 3km WRF
• No detectable anthropogenic effect on cyclone intensity in 2005
• Accumulated precipitation increases
at Clausius-Clapeyron rates.
• 3km WRF
Max wind speed
• End of 21st century (RCP8.5)
• But intensity increases in a much
warmer world
• 9 & 27km WRF
Max wind speed
• Not an ideal candidate
• Track is not as stable
to perturbations and
simulation start date
00 UTC 25 Oct 2005
18 UTC 24 Oct 2005
Superstorm Sandy
Factual Counterfactual
Superstorm Sandy
No discernible change in intensity
But storm surge was worse because of sea level rise
(GFDL ran detailed storm surge calculations)
• Christina finds little anthropogenic effect on Hurricane Katrina in 2005
but an intensification if a similar storm occurs in 2100.
• Andreas finds more MCS events and that they move slower in a
warmer world. Maximum rain rates up to 40% more in 2100.
• Our project at LBNL estimates that 28 sustained petaflops is required
for a global 2km climate model.
• We provided a technology path forward based on consumer
electronics design practices (Eliminate waste with a reduced
instruction set.)
• Each hourly 2D variable would require 6TB/year and would need to
be written at 200MB/sec.
– But many variables are of interest so the total is a lot more than
this.
– Some but not all tasks would be better suited for in-line calculations.
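The output figures quoted above can be unpacked with simple arithmetic. The sketch below takes the 6TB/year and 200MB/sec numbers from the slide and adds one assumption that is not in the source: that each value is stored as a 4-byte float. Variable names are illustrative.

```python
TB = 1e12
MB = 1e6
HOURS_PER_YEAR = 365 * 24

bytes_per_sim_year = 6 * TB   # one hourly 2D variable, from the slide
write_rate = 200 * MB         # bytes per second, from the slide

# Wall-clock time needed to write one simulated year of a single variable.
seconds_to_write = bytes_per_sim_year / write_rate
hours_to_write = seconds_to_write / 3600

# Assumption (not from the slide): single-precision, 4 bytes per value.
BYTES_PER_VALUE = 4
values_per_snapshot = bytes_per_sim_year / (HOURS_PER_YEAR * BYTES_PER_VALUE)

print(seconds_to_write, round(hours_to_write, 2), round(values_per_snapshot))
```

At 200MB/sec, one simulated year of a single hourly 2D variable takes 30,000 seconds (about 8.3 hours) to write. Under the 4-byte assumption each hourly snapshot holds roughly 1.7e8 values, which is the right order of magnitude for a global grid with cells about 2km across.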
Cloud system resolving models
• Over 4PB of a single hi-resolution global model is available now.
• Community Atmospheric Model (CAM5.1)
• 25km
• Done now
– 5 realizations of a world that was (1996-2015)
– 5 realizations of a world that Paris COP21 wanted (2105-2115), 1.5K over preindustrial
– 5 realizations of a world that is also not very likely (2105-2115), 2.0K over preindustrial
• Done soon
– 5 realizations of a world that might have been (1996-2005)
– 5 realizations of a world that we currently are headed towards (2080-2100): RCP8.5 (3.5K over preindustrial)
Available data.
http://portal.nersc.gov/c20c/
C20C+ detection and attribution subproject
• As climate models get to finer resolution, higher frequency
data becomes more interesting, causing dataset sizes to
increase yet more.
• Better simulated storms.
• More realistic extreme weather.
• New questions can be asked. And answered!
• Supervised machine learning works great for finding things
we already know something about (i.e., storms).
• Can unsupervised machine learning reveal other climate
features?
• New classes of storms?
• New modes of variability?
Conclusions
New Journal!
Intended as a bridge between the Statistics and climate/weather/ocean communities
http://advances-statistical-climatology-meteorology-oceanography.net/index.html
Contact me if you want some data!
Thank you!
mfwehner@lbl.gov

Program on Mathematical and Statistical Methods for Climate and the Earth System Opening Workshop, Computational and Mathematical Challenges in Climate Modeling - Michael Wehner, Aug 21, 2017

  • 1. Opportunities with very large high resolution climate model datasets Extreme event attribution Projections Machine learning Michael F. Wehner Lawrence Berkeley National Laboratory mfwehner@lbl.gov
  • 2. US DOE Policy 411.2A SUBJECT: SCIENTIFIC INTEGRITY When expressing opinions on policy matters to the public and media, research personnel must make it clear when they are expressing their personal views, rather than those of the Department, the U.S. Government, or their respective institutions. Public representation of Government or DOE positions or policies must be cleared through their program management to include DOE headquarters.
  • 6. Icosahedral High Resolution More modern discretizations
  • 7. Resolution 1km Cloud system resolving models are a transformational change 25km Upper limit of climate models with cloud parameterizations 200km Typical resolution of IPCC AR4 models Surface Altitude (feet)
  • 8. Technology Moore’s law is alive and well. The largest computers continually get faster. And so do models 1990 AMIP1: Many modeling groups required a calendar year to complete a 10 year integration of a stand alone atmospheric general circulation model. Typical grid resolution was T21 or about 600km (64X32x10) 2017: I get ~1 simulated year/ wall clock day for the same calculation except at 25km (1152x768x30) This calculation used only 7680 processors on a 120,000 processor machine • 5 million processor hours. • 25 km grid cell • Took about 3 months to complete in 2012. Typically, I get better throughput now.
  • 13. Tropical Cyclone min pressure vs max wind speed Total # TC / year observations 87±8 cam5.1 84±9 Total # hurricanes / year observations 49±7 cam5.1 52 Figures by Cheng-Ta Chen
  • 14. The strongest hurricanes get more intense. +0.85oC +1.5oC +2oC +4.0oC m/s Average annual most intense tropical cyclone wind speed (m/s)
  • 17. Real storms can be tracked by hand. They happen in real time! Tracking of simulated storms must be automated. There are too many to count. Two approaches. Traditional, “parametric” feature tracking based on conditions. • Hurricanes: co-located vorticity maxima, pressure minima, warm cores. • Extratropical cyclones: co-located vorticity maxima, pressure minima. • Atmospheric rivers: precipitable water, integrated water transport, etc. (ARTMIP) • Blocking, fronts, meso-scale convective systems. Supervised Machine-learning • Convolutional neural networks. • Need to have a training data set. Tracking storms
  • 18. Two steps: 1. Candidate detection 2. Continuity in time & space. (Stitching/tracking) Toolkit for Extreme Climate Analysis https://github.com/LBL-EESA/TECA Highly parallel (I routinely use 29200 processors for TC tracking) TECA2: parallel parametric feature tracking
  • 19. Tropical cyclone detection. Movie courtesy of Burlen Loring
  • 20. Can Deep Learning Work for Climate Science? - 20 - Similarities to Computer Vision • Tasks: – Pattern Classification – Clustering – Feature Learning – Anomaly Detection Differences • Unique attributes of Climate Data – Multi-channel / Multi-variate – Spatio-temporal – Statistics are likely different
  • 22. - 22 - Task: Find Extreme Weather Patterns in a box
  • 23. Supervised Learning Training Input: Cropped, Centered, Multi-variate patches with Labels* • Tropical Cyclone (TC) • Atmospheric River (AR) • Weather Front (WF) • TC & AR labels are provided by TECA, which implemented human-specified criteria • WF is a hand crafted data set (5 FTE-years) Output: Binary (Yes/No) on Test patches • Is there a TC in the patch? • Is there an AR in the patch? • Is there a WF in the patch? Currently, we have separate convolutional neural nets for these 3 storm types. – Our goal is to have just one machine learning algorithm for all storms. - 23 -
  • 24. Machine learning Training Data
      Classification | Image Dimension | Variables | Total Examples (+ve) | Total Examples (-ve)
      Tropical Cyclone | 32x32 | PSL, UBOT, VBOT, TMQ, U850, V850, T200, T500 | 10000 | 10000
      Atmospheric Rivers | 148x224 | TMQ, Land Sea mask | 6500 | 6800
      Weather Fronts | 27x60 | T2m, Precip, PSL | 5600 | 6500
  • 25. Supervised Classification Accuracy (each cell: Train / Test)
      Classification | Logistic Regression | K-Nearest Neighbor | Support Vector Machine | Random Forest | ConvNet
      Tropical Cyclone | 96.8 / 95.85 | 98.1 / 97.85 | 97.0 / 95.85 | 99.2 / 99.4 | 99.3 / 99.1
      Atmospheric Rivers | 81.97 / 82.65 | 79.7 / 81.7 | 81.6 / 83.0 | 87.9 / 88.4 | 90.5 / 90.0
      Weather Fronts | 84.9 / 89.8 | 72.46 / 76.45 | 84.35 / 90.2 | 80.97 / 87.5 | 88.7 / 89.4
      Hyper-parameter optimization applied with Spearmint for all methods
  • 26. Weather Front Detection Contributors: Jim Biard, Ken Kunkel, Evan Racah
  • 30. Current status Contact Prabhat about Machine Learning details prabhat@lbl.gov Hyper-Parameter Optimization • Tuning #layers, #filters, learning rates, schedule is a black art Performance and Scaling • Current networks take days to train on O(10GB) datasets; we have O(10TB) datasets on hand Scarcity of Labeled Data • Community needs to self-organize and run labeling campaigns Interpretability and Visualization • ‘Black Box’ classifier Deep Learning is viable for Pattern Detection in Climate Data • Supervised architectures can match TECA performance • Open challenges in semi-supervised, unsupervised learning and interpretability • Need more ground truth catalogs and training data!
  • 31. • When extreme weather happens, the public wants to know – “Is this climate change?” Extreme Event Attribution
  • 32. • Not quite the correct question, better to ask: – “How has the risk of this event changed because of climate change?” Or – “How did climate change affect the magnitude of this event?” Extreme Event Attribution
  • 33. Severe floods occurred along the Colorado Front Range during the second week of September 2013, affecting several thousand people and many homes, roads, and businesses. Lyons, CO usatoday.com • At least 10 deaths; 11,000 evacuated • Nearly 19,000 homes damaged, and over 1,500 destroyed, costing $2 bn • Several highway bridges damaged/destroyed, and rail lines affected South Platte River, CO nytimes.com The 2013 Colorado Floods P Pall, C Patricola, M Wehner, D Stone, C Paciorek, W Collins. In press.
  • 34. Colorado Floods September 2013: A more constrained numerical experiment. Step 1 (NCEP RE-ANALYSIS): Use Sep 2013 weather from NCEP re-analysis, both under human and adjusted natural conditions (adjusted T, u, v, RH, etc.) … Step 2 (WRF MODEL): … to drive an ensemble of 100 regional model simulations (WRF 12km) … Step 3 (SOUTH PLATTE BASIN (CO)): … then extract rain over South Platte basin. Step 4 (DISTRIBUTIONS OF ENSEMBLE RAINFALL): Simulations suggest a substantial human-induced influence on South Platte rainfall … Step 5 (INCREASE IN ODDS OF HEAVY RAINFALL): … with a best estimate of about a doubling in odds of heavy rainfall occurrence.
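A change in the chance of heavy rainfall can be estimated from two ensembles by counting how often each one exceeds a chosen threshold. The sketch below uses a simple ratio of exceedance probabilities as a stand-in for the "odds" on the slide. The rainfall totals, the threshold and the function names are made up for illustration and are not results from the study.

```python
def exceedance_probability(values, threshold):
    """Fraction of ensemble members at or above the threshold."""
    count = sum(1 for v in values if v >= threshold)
    return count / len(values)

def probability_ratio(factual, counterfactual, threshold):
    """Ratio of exceedance probabilities, factual over counterfactual."""
    p_factual = exceedance_probability(factual, threshold)
    p_counterfactual = exceedance_probability(counterfactual, threshold)
    if p_counterfactual == 0:
        return float("inf")  # event never occurs in the counterfactual ensemble
    return p_factual / p_counterfactual

# Synthetic 10-member ensembles of basin rainfall totals (mm), illustration only.
human = [40, 55, 62, 70, 81, 95, 100, 120, 130, 150]
natural = [35, 42, 50, 58, 66, 72, 80, 90, 101, 115]
print(probability_ratio(human, natural, threshold=100))
```

With small ensembles the ratio is noisy, so a real analysis would also report an uncertainty range, for example from bootstrap resampling of the members.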
  • 35. Colorado Sep 2013 floods: Mechanistic approach • We find a substantial shift in our rainfall distributions over the South Platte basin (increase in mean of ~30%) -> beyond a thermodynamic (~7-14%/K) induced increase, given ΔT = ~1.5-2K • But increase in precipitable water (~15%) appears broadly consistent with Clausius-Clapeyron (C-C) • The 30% increase is a result of increased cumulus convective energy • Not a result of changes in larger scale dynamics or uplifting. • The “storm that was” was more violent than the “storm that might have been”. 7-DAY RAINFALL P. Pall, et al. (2016) Diagnosing Anthropogenic Contributions to Heavy Colorado Rainfall in September 2013. To appear in Weather and Climate Extremes
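The comparison with Clausius-Clapeyron scaling is simple arithmetic. The sketch below assumes the commonly quoted rate of about 7% more atmospheric moisture per kelvin of warming, compounded over the temperature change. That rate and the function name are assumptions for illustration, not values from the paper.

```python
def cc_increase(delta_t_kelvin, rate_per_kelvin=0.07):
    """Fractional moisture increase for a warming of delta_t_kelvin,
    compounding an assumed Clausius-Clapeyron rate per kelvin."""
    return (1.0 + rate_per_kelvin) ** delta_t_kelvin - 1.0

for delta_t in (1.5, 2.0):
    print(f"dT = {delta_t} K -> {100 * cc_increase(delta_t):.1f}% increase")
```

Under this assumption, a warming of 1.5 to 2 K gives roughly an 11 to 14% increase. That is close to the ~15% increase in precipitable water quoted on the slide and well below the ~30% increase in mean rainfall.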
  • 36. zarzycki@ucar.edu - University of Colorado, Boulder, CO, April 2016 Typhoon Haiyan • Use VR-CESM in “forecast mode” • ATM: GFS analysis • OCN: NOAA OI • Ensembles of 120 hr forecasts Init: 12Z 11-04-2013 NOAA IR Obs: 11-07 21Z 111km: 11-07 21Z 8km: 11-07 21Z
  • 37. Typhoon Haiyan • Forecast pretty good! • Little overall change in forecast track Obs. All-Hist Nat-Hist
  • 38. Present day storm (red) was slightly weaker than the counterfactual storm (blue). Colder counterfactual SST alone (green) weakened the storm. Counterfactual initial conditions alone (yellow) intensified the storm. Changes in winds and shear had little effect. Colder upper air temperature changes alone intensified the storm. Lots of unanswered questions. CAM5 vs MIROC5? Typhoon Haiyan
  • 40. Video courtesy of Andreas Prein NCAR Convective outbreak in May 2010 • Objective-based analysis allows the model to be evaluated on the storm scale Observation WRF 4 km
  • 41. Hurricane Katrina • Hindcast that was (red) • Hindcast that might have been (blue) • 3km WRF
  • 42. • No detectable anthropogenic effect on cyclone intensity in 2005 • Accumulated precipitation increases at Clausius-Clapeyron rates. • 3km WRF Max wind speed
  • 43. • End of 21st century (RCP8.5) • But intensity increases in a much warmer world • 9 & 27km WRF Max wind speed
  • 44. • Not an ideal candidate • Track is not as stable to perturbations and simulation start date 00 UTC 25 Oct 2005 18 UTC 24 Oct 2005 Superstorm Sandy Factual Counterfactual
  • 45. Superstorm Sandy: No discernible change in intensity, but storm surge was worse because of sea level rise (GFDL ran detailed storm surge calculations).
  • 46. • Christina finds little anthropogenic effect on Hurricane Katrina in 2005 but an intensification if a similar storm occurs in 2100. • Andreas finds more MCS events and that they move slower in a warmer world. Maximum rain rates up to 40% more in 2100. • Our project at LBNL estimates that 28 sustained petaflops are required for a global 2km climate model. • We provided a technology path forward based on consumer electronics design practices (eliminate waste with a reduced instruction set). • Each hourly 2D variable would require 6TB/year and would need to be written at 200MB/sec. – But many variables are of interest so the total is a lot more than this. – Some but not all tasks would be better suited for in-line calculations. Cloud system resolving models
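The two output figures on this slide can be related with a short calculation. The slides do not say how fast the model is expected to run, so the sketch below assumes a throughput of about 1000 simulated days per wall-clock day. That speedup and the function name are assumptions for illustration only.

```python
SECONDS_PER_YEAR = 365 * 24 * 3600

def required_write_rate(bytes_per_simulated_year, speedup):
    """Sustained write rate (bytes/s) needed to keep up with the model.

    speedup is simulated time divided by wall-clock time (assumed value).
    """
    wall_clock_seconds = SECONDS_PER_YEAR / speedup
    return bytes_per_simulated_year / wall_clock_seconds

# 6 TB per simulated year for one hourly 2D variable (figure from the slide),
# with an assumed speedup of 1000 over real time.
rate = required_write_rate(6e12, speedup=1000.0)
print(f"{rate / 1e6:.0f} MB/s")
```

Under that assumption the result is a little under 200 MB/s for a single variable, in line with the figure on the slide. Writing many variables scales the requirement proportionally.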
  • 47. Available data. • Over 4PB of output from a single high-resolution global model is available now. • Community Atmospheric Model (CAM5.1) • 25km Done now • 5 realizations of a world that was (1996-2015) • 5 realizations of a world that Paris COP21 wanted (2105-2115) 1.5K over preindustrial • 5 realizations of a world that is also not very likely (2105-2115) 2.0K over preindustrial Done soon • 5 realizations of a world that might have been (1996-2005) • 5 realizations of a world that we currently are headed towards (2080-2100) – RCP8.5 (3.5K over preindustrial)
  • 49. • As climate models get to finer resolution, higher frequency data becomes more interesting, causing dataset sizes to increase yet more. • Better simulated storms. • More realistic extreme weather. • New questions can be asked. And answered! • Supervised machine learning works great for finding things we already know something about (i.e., storms). • Can unsupervised machine learning reveal other climate features? • New classes of storms? • New modes of variability? Conclusions
  • 50. New Journal! Intended as a bridge between the Statistics and climate/weather/ocean communities http://advances-statistical-climatology-meteorology-oceanography.net/index.html
  • 51. Contact me if you want some data! Thank you! mfwehner@lbl.gov