SlideShare a Scribd company logo
FMI Information
Management System
RoopeTervo
LeadArchitect
Finnish Meteorological Institute
If not stated otherwise, omages by FMI. Licence CC4BY
Finnish Meteorological Institute
2
• Roughly 650 FTE
• Research Institute
• Operative services
• Safety / Public / B2B
• Roughly half of the employees on operative side
https://www.youtube.com/watch?v=WzIVAEvLH2E
If not stated otherwise, omages by FMI. Licence CC4BY
Software Development at FMI
• ~40 - 50 developers in 5-7 units
• + contractors
• In principle, all new development as open source
• Many projects in collaboration (i.e. MetCoOp, Namcon, WFS3…)
3
If not stated otherwise, omages by FMI. Licence CC4BY
FMI Open Data
• Finnish Meteorological Institute opened its data in 2013
• Basically everything that FMI has property rights was opened
• Data is provided in freely in machine readable format
• https://en.ilmatieteenlaitos.fi/open-data
4
If not stated otherwise, omages by FMI. Licence CC4BY
5
If not stated otherwise, omages by FMI. Licence CC4BY
Distribution
Product
Human
intervention
Post
processing
Raw data
SATELLITES
• Polar
• Geostationary
RADAR NETWORK
• National
• Baltrad (Norhern Europe)
• Opera (Europe)
NUMERICAL MODELS
• Atmospheric / Ocean / Road
• Local 2.5 km / Global 25 km
• Ensembles
OBSERVATIONS
• Global Network
• Surface / Balloons / Airplanes
LIGHTNNING
DETECTION NETWORK
• Northern Europe
CROWDSOURCING
• Social sensor networks
• Smartphones
• 112 call data
KALMAN MOS LAPS KRIGING
WWW MOBILE TV NEWSPAPERS B2B OPENDATA SAFETY
VISUALIZA
TION
MODEL
SELECTIO
N
POST
PROCESSI
NG
FIELD
MODIFICA
TION
QUALITY
CONTROL
DATASERVERS
WFS
JSON
XML
MAPSERVERS
WMS
WCS
APPLICATIONS
ILMANET
B2B
MOBILE
6
If not stated otherwise, omages by FMI. Licence CC4BY
7
Observations
FMI operated -- operational
Automatic surface stations 184
Rainfall measurement sites 91
Sounding stations 3
Air quality stations 28
Buoys 16
Antennas for lightning detection 8
Mareographs 14
Solar 13
Total 481
Automation 95%
If not stated otherwise, omages by FMI. Licence CC4BY
8
External air quality 83
Skandinavian lightning detection 29
Baltic mareographs 24
Road weather stations 29
Runway weather stations 63
Foreign surface weather stations 13736
External water measurement stations 63
Road weather stations 629
NetAtmo ~100k
Observations
External
If not stated otherwise, omages by FMI. Licence CC4BY
9
Observations
Process
1 / 10
minute
aggregation
LAN / 3G Oracle
If not stated otherwise, omages by FMI. Licence CC4BY
Radars
10
• 10 radars
• Range 120-250 km
• Typical file formats HDF5, GeoTiff, PNG
• Sending power 250 000 W
• Receiving power
0,00000000001 – 0,00000001 W
If not stated otherwise, omages by FMI. Licence CC4BY
Radars
Products
• Single radars
• Reflectivity
• Cappi (‘rain intensity’)
• Doppler speed (‘wind’)
• Echotop (‘cloud base’)
• Echotop (‘thunder risk’)
• Composite
• Reflectivity
• Rain intensity
• Rain accumulation (1h, 3h, 6h…)
• Cross sections
11
If not stated otherwise, omages by FMI. Licence CC4BY
Satellites
Earth Observation
• In weather, used to observe:
• cloud, pressure, wind, temperature...
• Operated by EUMETSAT and NOAA
• Reception, processing and filing of data
• Geostationary satellites
• Received in Germany
• Delivered to Finland via telecommunication satellite and
land link
• Polar satellites
• Received in Helsinki and Sodankylä
12
If not stated otherwise, omages by FMI. Licence CC4BY
• Multicast technology
(DVB-S2)
• EUMETCast Europe, data rate
46.215 GB/hour, 1109.16
GB/day
Satellites
Data Transfer
13
Image: FMI
If not stated otherwise, omages by FMI. Licence CC4BY
Lightning Detection
• 9 sensor in Finland, 34 in NORDLIS
• Practically all ground strikes detected with < 1km accuracy
• Also part of cloud strikes
• Stored to climate database
• Can be delivered to clients in
approximately 20 seconds
14
If not stated otherwise, omages by FMI. Licence CC4BY
Numerical Weather Prediction (NWP)
• Physical modelling of atmospheric
• Models developed in international consortiums
• Typical programming languages: Fortran, C++
• Input: observations, boundaries from other model, satellites
• Output: grid from several surfaces
• Surface, pressure levels, model levels
• Typical data formats:
• GRIB1, GRIB2
(and NetCDF in marine models)
15
If not stated otherwise, omages by FMI. Licence CC4BY
Weather
forecasts are
calculated to a
regular grid
16
Products (like at
https://ilmatieteenlaitos.fi) are
often interpolations between
grid points
If not stated otherwise, omages by FMI. Licence CC4BY
Numerical Weather Prediction (NWP)
ECMWF
European Centre for Medium-Range
Weather Forecasts (ECMWF) to
create products that support weather
service
ECMWF model gridded data in
Meteorological Archival and
Retrieval System (MARS). It is the
repository of meteorological data at
ECMWF (storage size of PBs of
data)
17
Model ECMWF ECMWF EPS
Spatial
Resolution
15km 15 km
Coverage Global Global
Temporal
Resolution
1-3h 1-3h
Time range 10 days 10 days
Output data Surface,
pressure
levels
Surface 51
members
Update
interval
6h 6h
If not stated otherwise, omages by FMI. Licence CC4BY
Numerical Weather
Prediction (NWP)
18
Model Hirlam Harmonie
Spatial
Resolution
7,5 km 2,5 km
Coverage Europe Skandinavia
Temporal
Resolution
1h 15 min
Time range 48h 48h
Update
interval
6h 6h
Output data Surface,
pressure levels,
model levels
Surface,
pressure levels,
model levels
If not stated otherwise, omages by FMI. Licence CC4BY
Human Intervention
• Meteorologists selects the best source
model and refines forecast
• The forecast is updated as often as
necessary
• Output is used as primary data for
Scandinavian area
• Software ‘SmartMet’ is done at FMI (C++)
• Meteorologists draws an analysis maps,
fronts, jets, etc…
• Software ‘Mirwa’ is done at FMI (Java)
• The data is handled as objects
19
If not stated otherwise, omages by FMI. Licence CC4BY
Human Intervention
• Meteorologists generates warnings
• Software ‘SmartMet Alert’ is done at FMI
(Java)
• Meteorologists does aviation forecasts
• METARS, TAFs, SWC charts…
• Software ‘SmartMet Aviation’ is done at
FMI (Java)
• Several other tasks
• Writing texts, generating ice advisories,
consulting about weather
• Several web based tools
20
If not stated otherwise, omages by FMI. Licence CC4BY
Storing Data
• FMI total capacity: 6 PB
• Ceph (Sodankylä satellite center): 1.5 PB
• Ceph (Main site): 2.5 PB  coming 2020
• Fast file system: 300 TB (Sodankylä) + 300 TB
(main site)
• Archive: 1.5 PB (disk) + 2 PB (tape)
• Operational databases: Oracle, PostgreSQL,
MySQL and Redis
• FMI utilizes ECMWF data storage in daily
operations
• ECMWF MARS
• Archive capacity: 225 PB
• Current daily increase: 225 TB
• Data size grows exponentially in near future
21
71.7
287
929
0
100
200
300
400
500
600
700
800
900
1000
2017 2020 2025
Global Numerical Weather
Model Output
Model output per day (TiB)
AI inWeather
Forecasting
Examples of Operational Systems and
Possibilities
22
If not stated otherwise, omages by FMI. Licence CC4BY
ML can be used in many parts of the process
• Making observations (machine vision)
• Quality Check (unsupervised learning)
• Radar and satellite data (machine vision)
• Weather predictions (i.a. neural networks)
• Post-prosessing (traditional supervised
learning, neural networks)
• Impact analysis
23
If not stated otherwise, omages by FMI. Licence CC4BY
MachineVision in Observations
24
https://www.visivis.at/
If not stated otherwise, omages by FMI. Licence CC4BY
Precipitation NowCast (Radar)
https://arxiv.org/abs/1912.12132
25
2 years history of radar images
256km
U-Net
Image: Mehrdad Yazdani, Wikipedia, Licence CC BY-SA 4.0
Prediction (i.e. t +2h)
If not stated otherwise, omages by FMI. Licence CC4BY
Global SyntheticWeather Radar
Poster:
https://www.star.nesdis.noaa.gov/star/documents/meetings/2019AI/
posters/P2.3_Mattioli.pdf
26
Convolutional Neural Network to create a global
synthetic weather radar based on satellites, lightning
data and numerical weather prediction data.
Image:Mattioli presentation
If not stated otherwise, omages by FMI. Licence CC4BY
Level 1
• Post-processing
• Downscaling
Level 2
• Emulate existing
parametrization
scheme
• Correct existing
schemes
Level 3
• Learn new
parametrization
scheme from
simulations or data
Level 4
• Model replacement
Weather Models and Post-processing
27
Credits: Stephan Rasp, slides
If not stated otherwise, omages by FMI. Licence CC4BY
Post-processing with Blend
If not stated otherwise, omages by FMI. Licence CC4BY
 Method used: “Exponentially weighted moving average” / “Moving average” (Cui
et al. 2012)
 BCt = (1-α)BCt-1 + α(FCSTt-1 – OBSt-1)
 MAEt = (1-α)MAEt-1 + α|FCSTt-1 – OBSt-1|
BC = bias correction, α = decaying weight (0.05), OBS = observation, FCST = model forecast,
MAE = mean absolute error
 BC and MAE are calculated for each forecast model, analysis time and lead time
separately
 Bias corrected forecasts are used to calculate MAE values and calculating the
latest Blend forecast
Bias correction (BC) and MAE
If not stated otherwise, omages by FMI. Licence CC4BY
Biascorrection
How large effect will the error have and for how long with different alpha values:
equation: BCt = (1-α)BCt-1 (started with BC value 1)
Decaying weight, alpha
If not stated otherwise, omages by FMI. Licence CC4BY
 Verification results based on MAE are used to produce weights for each model
(Woodcock and Engel, 2005):
W1 = (1/MAE1) / (1/MAE1+1/MAE2+...+1/MAEn)
WhereW = weight, n= total number of models
 The Blend is calculated using weights and bias corrected model forecasts:
Blend =W1*BCFCST1 +W2*BCFCST2+...+Wn*BCFCSTn
Where BCFCST = Latest Bias corrected model forecast
Calculating weights and Blend
If not stated otherwise, omages by FMI. Licence CC4BY
Neural networks for post-processing ensemble
weather forecasts
Image: ECMWF
Image: Cecbur, source: Wikipedia., License: CC BY-SA 4.0
Loss:
where Φ and φ denote CDF and PDF of a
standardGaussian distribution
Features (X):
• Several weather parameters
• Embedded station location
Label (y): observations
Ouput:
normal distribution mean μ
and standard deviation σ
If not stated otherwise, omages by FMI. Licence CC4BY
PredictingWeather Using Neural Networks is
Under Research
Jonathan Weyn:
Can Machines Learn to Predict Weather? Using Deep Learning to Predict
Gridded 500‐hPa Geopotential Height From Historical Weather Data:
https://agupubs.onlinelibrary.wiley.com/doi/full/10.1029/2019MS001705
https://github.com/jweyn/DLWP
Peter Düben:
Deep learning for weather and climate (excellent presentation about the
topic):
https://ossg.bcs.org/wp-content/uploads/04-19-dueben-weather.pdf
Challenges and design choices for global weather and climate models based
on machine learning:
https://www.researchgate.net/publication/326043857_Challenges_and_de
sign_choices_for_global_weather_and_climate_models_based_on_machi
ne_learning
Image: Mehrdad Yazdani, Wikipedia, Licence CC BY-SA 4.0
TRAINS
Predicting Train Delays Inflicted by Weather
Photo by Kalevi Lehtonen 1955. Not published until Commons in 2014.
https://fi.wikipedia.org/wiki/Tiedosto:Finnish_class_Dm4_locomotive_number_1607_in_the_year_1955.jpg
If not stated otherwise, omages by FMI. Licence CC4BY
Predicting Weather Inflicted Train Delays
Data Liikennevirasto (CC4)
Delay between stations
• Passenger trains
• 514 stations
Weather observations
• 19 parameters
If not stated otherwise, omages by FMI. Licence CC4BY
Image: Venkata Jagannath. License: CC4-BY
Image: CC0
Image: BiObserver. License: CC4-BY
Results
RFR
RMSE: 5.37
MAE: 3.21
BSS: 0.11
LSTM
RMSE: 4.35
MAE: 2.75
BSS: 0.01
LR
RMSE: 5.59
MAE: 3.11
BSS: 0.08
𝐵𝑆𝑆 = 1 −
𝑅𝑀𝑆𝐸
𝑅𝑀𝑆𝐸𝑟𝑒𝑓
,
where 𝑅𝑀𝑆𝐸𝑟𝑒𝑓 denotes root mean square error
calculated with a mean value over the whole dataset
If not stated otherwise, omages by FMI. Licence CC4BY
LSTM shows no real skill
0
25
50
Delay(minutes)
Time
02/2011
Time
06/2016
Time
02/2017
Predicted vs. true delay, case Ahvenus
Predicted delay
True delay
If not stated otherwise, omages by FMI. Licence CC4BY
RFR works relatively well
0
50
100
Delay(minutes)
Time
02/2011
Time
06/2016
Time
02/2017
Predicted vs. true delay, average over all stations
Predicted delay
True delay
SASSE
Forecasting Electricity Outages Caused by
Convective Storms
Photo by Bryan Alexander. CC BY 2.0
If not stated otherwise, omages by FMI. Licence CC4BY
• Trained with observed
outages (Random
Forest Classifier)
• Several input features
• Few years of old data
required
• Based on storm cell
severity and
knowledge about
clients electricity grid
• For example category
2 storm cell is
expected to cause
problems in 10-50 %
of grid nodes
Turn
outages
to money
Power companies
has to give
compensation for
outages
Forecasting Electricity Outages
Extract storm cells
from radar data
Classify the storm
cells
Predict future
outages
Turn outages to
money
Resources
General:
https://ilmatieteenlaitos.fi/avoin-data
http://www.slideshare.net/tervo/
http://ilmatieteenlaitos.fi/tekniikka
http://www.ecmwf.int/
GRIB usage examples:
https://polar.ncep.noaa.gov/waves/examples/usingpython.shtml
https://confluence.ecmwf.int/pages/viewpage.action?pageId=57443489
41
If not stated otherwise, omages by FMI. Licence CC4BY
Most Important DataTypes
Data Data Type Software
Weather forecasts GRIB1, GRIB2, (NetCDF) PanoPly, IDV, ECMWF GRIB
tools, WGRIB, WGRIB2, gdal,
SmartMet Server, pangeo
Marine forecasts, air quality
forecasts…
NetCDF PanoPly, IDV, gdal, pangeo,
GeoServer
Observations XML, JSON, csv any
BUFR ecCodes, NCEP
Aviation products TAF, METAR, SIGMET… any
IWXXM fmi-avi-message-converter-iwxxm
Radar GeoTiff, HDF5 gdal, GeoServer (see also:
https://en.wikipedia.org/wiki/Hiera
rchical_Data_Format)
Satellites GeoTiff, NetCDF gdal, GeoServer, PyTroll
42

More Related Content

PPTX
Inspire Compliant Weather Data
PPT
2004-10-09 MANE-VU Status Report on CATT and FASTNET
PPTX
FMI Open Data Interface and Data Models
PDF
IRJET- Study of Real Time Kinematica Survey with Differential Global Position...
PDF
Emma Gibbons - Model uncertainty in the assessment of major infrastructure pr...
PPT
Fast Cat M V[1]
PPTX
Eposa english
Inspire Compliant Weather Data
2004-10-09 MANE-VU Status Report on CATT and FASTNET
FMI Open Data Interface and Data Models
IRJET- Study of Real Time Kinematica Survey with Differential Global Position...
Emma Gibbons - Model uncertainty in the assessment of major infrastructure pr...
Fast Cat M V[1]
Eposa english

What's hot (11)

PDF
BigDataEurope 1st SC5 Workshop Big Data in ECMWF by Ioannis Mallas
PPT
2005-01-08 MANE-VU Status Report on CATT and FASTNET
PPT
Fast Cat Mv3
PPTX
Using FME to Derive Fishnet Air Emission Values
PPTX
Polar Use Case in ExtremeEarth-phiweek19
PDF
Icelandic Bathy model
PPTX
NPOESS Transition to the Joint Polar Satellite System (JPSS) and Defense Weat...
PPTX
Prtr non point estimate method in japan
PDF
Galileo: System and Applications
PDF
Air Quality Hot Spot Mapper | Paul Monks| March 2015
PPTX
(Slides) A Method for Sharing Traffic Jam Information Using Inter-Vehicle C...
BigDataEurope 1st SC5 Workshop Big Data in ECMWF by Ioannis Mallas
2005-01-08 MANE-VU Status Report on CATT and FASTNET
Fast Cat Mv3
Using FME to Derive Fishnet Air Emission Values
Polar Use Case in ExtremeEarth-phiweek19
Icelandic Bathy model
NPOESS Transition to the Joint Polar Satellite System (JPSS) and Defense Weat...
Prtr non point estimate method in japan
Galileo: System and Applications
Air Quality Hot Spot Mapper | Paul Monks| March 2015
(Slides) A Method for Sharing Traffic Jam Information Using Inter-Vehicle C...
Ad

Similar to FMI Information Management System (20)

PPTX
Producing INSPIRE compliant datasets
PPTX
Producing INSPIRE Compliant Data Sets
PPTX
FMI Open Data Interface and Usage
PPTX
Possibilities of Open Source Code
PPTX
20161028 strahlendorff fmi experience in openness
PPTX
Open Weather Data as Part of Big Data
PPTX
Fmi Open Data on S3
PPTX
Why we need open data? FMI Open Data on AWS
PPTX
Strahlendorff - EO and insitu for weather, water and climate
PPTX
Meteorological and Aviation Weather Open Data implementation utilising OGC st...
PPTX
SmartMet Server in INSPIRE
PDF
Available data sources & Real-time data collection
PPTX
FMI Open Data on S3
PPTX
Open Data and and INSPIRE
PPTX
Open data & crowdsourcing of environmental observations in MMEA
PDF
Weather and Climate Data: Not Just for Meteorologists
PDF
Meteoio Introduction given by Mathias Bavey in Bozen
PPTX
Forecasting Electricity Outages Caused by Convective Storms
PDF
Efficiently Implementing INSPIRE & Creating INSPIRE Mashups with FME
PPTX
Complex Weather data and a Multi-platform Audience: Big Data at The Weather N...
Producing INSPIRE compliant datasets
Producing INSPIRE Compliant Data Sets
FMI Open Data Interface and Usage
Possibilities of Open Source Code
20161028 strahlendorff fmi experience in openness
Open Weather Data as Part of Big Data
Fmi Open Data on S3
Why we need open data? FMI Open Data on AWS
Strahlendorff - EO and insitu for weather, water and climate
Meteorological and Aviation Weather Open Data implementation utilising OGC st...
SmartMet Server in INSPIRE
Available data sources & Real-time data collection
FMI Open Data on S3
Open Data and and INSPIRE
Open data & crowdsourcing of environmental observations in MMEA
Weather and Climate Data: Not Just for Meteorologists
Meteoio Introduction given by Mathias Bavey in Bozen
Forecasting Electricity Outages Caused by Convective Storms
Efficiently Implementing INSPIRE & Creating INSPIRE Mashups with FME
Complex Weather data and a Multi-platform Audience: Big Data at The Weather N...
Ad

More from Roope Tervo (10)

PDF
FMI Open Data Impact Survey 2019
PDF
Predicting weather inflicted train delays
PPTX
FMI Open Data on AWS Public dataset program
PPTX
SmartMet Server OSGeo
PPTX
WMTS Performance Tests
PDF
AvoinData-workshop käyttöesimerkki
PDF
AvoinData aineistot
PDF
AvoinData-workshop aikasarjat
PDF
Avoindata workshop tekninen_yleiskuvaus
PDF
Aaltoes opendata 20130206
FMI Open Data Impact Survey 2019
Predicting weather inflicted train delays
FMI Open Data on AWS Public dataset program
SmartMet Server OSGeo
WMTS Performance Tests
AvoinData-workshop käyttöesimerkki
AvoinData aineistot
AvoinData-workshop aikasarjat
Avoindata workshop tekninen_yleiskuvaus
Aaltoes opendata 20130206

Recently uploaded (20)

PPTX
Biodiversity of nature in environmental studies.pptx
PPTX
IMPACTS OF CHEMICAL FERTILIZERS -CHEMPROJ (11).pptx
PPTX
Soil chemistry lecture 20 presentations agriculture
PPTX
Afro-Asia Literature.pptxmmmmmmmmmmmmmmm
PDF
1748933543SJA_41_2_826-834 SJA Ihsan ullha.pdf
DOCX
Biogas Balloon for Bio CNG Plants An efficient solution for biogas storage..docx
PPTX
Environmental pollutants for natural res
DOCX
Double Membrane Roofs for Bio-gas Tanks Reliable containment for biofuel gas....
PPTX
Biodiversity PPT by Gaithanlung Gonmei.pptx
DOCX
Double Membrane Roofs for Anaerobic Digesters Essential for capturing and sto...
PDF
Harnessing the Earth's Heat with Geothermal Systems Installation
PPT
Environmental pollution for educational study
PPTX
Definition, Causes And Effects Of Greenhouse.pptx
PPTX
RadiationSafetyPt120252026nucchemis.pptx
PDF
IWRM - City University Presentation 28 may 2018-v3.pdf
DOCX
Double Membrane Roofs for Biomethane Storage Holds upgraded biomethane fuel.docx
DOCX
Double Membrane Roofs for Digester Tank Wastewater Treatment Integral to biog...
PPTX
Importance of good air quality and different pollutants.
PPTX
STL Academy - Highlights & Impact 2020-21-v2 (1).pptx
PPTX
EME Aerospace.pptx basics of mechanical engineering
Biodiversity of nature in environmental studies.pptx
IMPACTS OF CHEMICAL FERTILIZERS -CHEMPROJ (11).pptx
Soil chemistry lecture 20 presentations agriculture
Afro-Asia Literature.pptxmmmmmmmmmmmmmmm
1748933543SJA_41_2_826-834 SJA Ihsan ullha.pdf
Biogas Balloon for Bio CNG Plants An efficient solution for biogas storage..docx
Environmental pollutants for natural res
Double Membrane Roofs for Bio-gas Tanks Reliable containment for biofuel gas....
Biodiversity PPT by Gaithanlung Gonmei.pptx
Double Membrane Roofs for Anaerobic Digesters Essential for capturing and sto...
Harnessing the Earth's Heat with Geothermal Systems Installation
Environmental pollution for educational study
Definition, Causes And Effects Of Greenhouse.pptx
RadiationSafetyPt120252026nucchemis.pptx
IWRM - City University Presentation 28 may 2018-v3.pdf
Double Membrane Roofs for Biomethane Storage Holds upgraded biomethane fuel.docx
Double Membrane Roofs for Digester Tank Wastewater Treatment Integral to biog...
Importance of good air quality and different pollutants.
STL Academy - Highlights & Impact 2020-21-v2 (1).pptx
EME Aerospace.pptx basics of mechanical engineering

FMI Information Management System

  • 2. If not stated otherwise, omages by FMI. Licence CC4BY Finnish Meteorological Institute 2 • Roughly 650 FTE • Research Institute • Operative services • Safety / Public / B2B • Roughly half of the employees on operative side https://www.youtube.com/watch?v=WzIVAEvLH2E
  • 3. If not stated otherwise, omages by FMI. Licence CC4BY Software Development at FMI • ~40 - 50 developers in 5-7 units • + contractors • In principle, all new development as open source • Many projects in collaboration (i.e. MetCoOp, Namcon, WFS3…) 3
  • 4. If not stated otherwise, omages by FMI. Licence CC4BY FMI Open Data • Finnish Meteorological Institute opened its data in 2013 • Basically everything that FMI has property rights was opened • Data is provided in freely in machine readable format • https://en.ilmatieteenlaitos.fi/open-data 4
  • 5. If not stated otherwise, omages by FMI. Licence CC4BY 5
  • 6. If not stated otherwise, omages by FMI. Licence CC4BY Distribution Product Human intervention Post processing Raw data SATELLITES • Polar • Geostationary RADAR NETWORK • National • Baltrad (Norhern Europe) • Opera (Europe) NUMERICAL MODELS • Atmospheric / Ocean / Road • Local 2.5 km / Global 25 km • Ensembles OBSERVATIONS • Global Network • Surface / Balloons / Airplanes LIGHTNNING DETECTION NETWORK • Northern Europe CROWDSOURCING • Social sensor networks • Smartphones • 112 call data KALMAN MOS LAPS KRIGING WWW MOBILE TV NEWSPAPERS B2B OPENDATA SAFETY VISUALIZA TION MODEL SELECTIO N POST PROCESSI NG FIELD MODIFICA TION QUALITY CONTROL DATASERVERS WFS JSON XML MAPSERVERS WMS WCS APPLICATIONS ILMANET B2B MOBILE 6
  • 7. If not stated otherwise, omages by FMI. Licence CC4BY 7 Observations FMI operated -- operational Automatic surface stations 184 Rainfall measurement sites 91 Sounding stations 3 Air quality stations 28 Buoys 16 Antennas for lightning detection 8 Mareographs 14 Solar 13 Total 481 Automation 95%
  • 8. If not stated otherwise, omages by FMI. Licence CC4BY 8 External air quality 83 Skandinavian lightning detection 29 Baltic mareographs 24 Road weather stations 29 Runway weather stations 63 Foreign surface weather stations 13736 External water measurement stations 63 Road weather stations 629 NetAtmo ~100k Observations External
  • 9. If not stated otherwise, omages by FMI. Licence CC4BY 9 Observations Process 1 / 10 minute aggregation LAN / 3G Oracle
  • 10. If not stated otherwise, omages by FMI. Licence CC4BY Radars 10 • 10 radars • Range 120-250 km • Typical file formats HDF5, GeoTiff, PNG • Sending power 250 000 W • Receiving power 0,00000000001 – 0,00000001 W
  • 11. If not stated otherwise, omages by FMI. Licence CC4BY Radars Products • Single radars • Reflectivity • Cappi (‘rain intensity’) • Doppler speed (‘wind’) • Echotop (‘cloud base’) • Echotop (‘thunder risk’) • Composite • Reflectivity • Rain intensity • Rain accumulation (1h, 3h, 6h…) • Cross sections 11
  • 12. If not stated otherwise, omages by FMI. Licence CC4BY Satellites Earth Observation • In weather, used to observe: • cloud, pressure, wind, temperature... • Operated by EUMETSAT and NOAA • Reception, processing and filing of data • Geostationary satellites • Received in Germany • Delivered to Finland via telecommunication satellite and land link • Polar satellites • Received in Helsinki and Sodankylä 12
  • 13. If not stated otherwise, omages by FMI. Licence CC4BY • Multicast technology (DVB-S2) • EUMETCast Europe, data rate 46.215 GB/hour, 1109.16 GB/day Satellites Data Transfer 13 Image: FMI
  • 14. If not stated otherwise, omages by FMI. Licence CC4BY Lightning Detection • 9 sensor in Finland, 34 in NORDLIS • Practically all ground strikes detected with < 1km accuracy • Also part of cloud strikes • Stored to climate database • Can be delivered to clients in approximately 20 seconds 14
  • 15. If not stated otherwise, omages by FMI. Licence CC4BY Numerical Weather Prediction (NWP) • Physical modelling of atmospheric • Models developed in international consortiums • Typical programming languages: Fortran, C++ • Input: observations, boundaries from other model, satellites • Output: grid from several surfaces • Surface, pressure levels, model levels • Typical data formats: • GRIB1, GRIB2 (and NetCDF in marine models) 15
  • 16. If not stated otherwise, omages by FMI. Licence CC4BY Weather forecasts are calculated to a regular grid 16 Products (like at https://ilmatieteenlaitos.fi) are often interpolations between grid points
  • 17. If not stated otherwise, omages by FMI. Licence CC4BY Numerical Weather Prediction (NWP) ECMWF European Centre for Medium-Range Weather Forecasts (ECMWF) to create products that support weather service ECMWF model gridded data in Meteorological Archival and Retrieval System (MARS). It is the repository of meteorological data at ECMWF (storage size of PBs of data) 17 Model ECMWF ECMWF EPS Spatial Resolution 15km 15 km Coverage Global Global Temporal Resolution 1-3h 1-3h Time range 10 days 10 days Output data Surface, pressure levels Surface 51 members Update interval 6h 6h
  • 18. If not stated otherwise, omages by FMI. Licence CC4BY Numerical Weather Prediction (NWP) 18 Model Hirlam Harmonie Spatial Resolution 7,5 km 2,5 km Coverage Europe Skandinavia Temporal Resolution 1h 15 min Time range 48h 48h Update interval 6h 6h Output data Surface, pressure levels, model levels Surface, pressure levels, model levels
  • 19. If not stated otherwise, omages by FMI. Licence CC4BY Human Intervention • Meteorologists selects the best source model and refines forecast • The forecast is updated as often as necessary • Output is used as primary data for Scandinavian area • Software ‘SmartMet’ is done at FMI (C++) • Meteorologists draws an analysis maps, fronts, jets, etc… • Software ‘Mirwa’ is done at FMI (Java) • The data is handled as objects 19
  • 20. If not stated otherwise, omages by FMI. Licence CC4BY Human Intervention • Meteorologists generates warnings • Software ‘SmartMet Alert’ is done at FMI (Java) • Meteorologists does aviation forecasts • METARS, TAFs, SWC charts… • Software ‘SmartMet Aviation’ is done at FMI (Java) • Several other tasks • Writing texts, generating ice advisories, consulting about weather • Several web based tools 20
  • 21. If not stated otherwise, omages by FMI. Licence CC4BY Storing Data • FMI total capacity: 6 PB • Ceph (Sodankylä satellite center): 1.5 PB • Ceph (Main site): 2.5 PB  coming 2020 • Fast file system: 300 TB (Sodankylä) + 300 TB (main site) • Archive: 1.5 PB (disk) + 2 PB (tape) • Operational databases: Oracle, PostgreSQL, MySQL and Redis • FMI utilizes ECMWF data storage in daily operations • ECMWF MARS • Archive capacity: 225 PB • Current daily increase: 225 TB • Data size grows exponentially in near future 21 71.7 287 929 0 100 200 300 400 500 600 700 800 900 1000 2017 2020 2025 Global Numerical Weather Model Output Model output per day (TiB)
  • 22. AI inWeather Forecasting Examples of Operational Systems and Possibilities 22
  • 23. If not stated otherwise, omages by FMI. Licence CC4BY ML can be used in many parts of the process • Making observations (machine vision) • Quality Check (unsupervised learning) • Radar and satellite data (machine vision) • Weather predictions (i.a. neural networks) • Post-prosessing (traditional supervised learning, neural networks) • Impact analysis 23
  • 24. If not stated otherwise, omages by FMI. Licence CC4BY MachineVision in Observations 24 https://www.visivis.at/
  • 25. If not stated otherwise, omages by FMI. Licence CC4BY Precipitation NowCast (Radar) https://arxiv.org/abs/1912.12132 25 2 years history of radar images 256km U-Net Image: Mehrdad Yazdani, Wikipedia, Licence CC BY-SA 4.0 Prediction (i.e. t +2h)
  • 26. If not stated otherwise, omages by FMI. Licence CC4BY Global SyntheticWeather Radar Poster: https://www.star.nesdis.noaa.gov/star/documents/meetings/2019AI/ posters/P2.3_Mattioli.pdf 26 Convolutional Neural Network to create a global synthetic weather radar based on satellites, lightning data and numerical weather prediction data. Image:Mattioli presentation
  • 27. If not stated otherwise, omages by FMI. Licence CC4BY Level 1 • Post-processing • Downscaling Level 2 • Emulate existing parametrization scheme • Correct existing schemes Level 3 • Learn new parametrization scheme from simulations or data Level 4 • Model replacement Weather Models and Post-processing 27 Credits: Stephan Rasp, slides
  • 28. If not stated otherwise, omages by FMI. Licence CC4BY Post-processing with Blend
  • 29. If not stated otherwise, omages by FMI. Licence CC4BY  Method used: “Exponentially weighted moving average” / “Moving average” (Cui et al. 2012)  BCt = (1-α)BCt-1 + α(FCSTt-1 – OBSt-1)  MAEt = (1-α)MAEt-1 + α|FCSTt-1 – OBSt-1| BC = bias correction, α = decaying weight (0.05), OBS = observation, FCST = model forecast, MAE = mean absolute error  BC and MAE are calculated for each forecast model, analysis time and lead time separately  Bias corrected forecasts are used to calculate MAE values and calculating the latest Blend forecast Bias correction (BC) and MAE
  • 30. If not stated otherwise, omages by FMI. Licence CC4BY Biascorrection How large effect will the error have and for how long with different alpha values: equation: BCt = (1-α)BCt-1 (started with BC value 1) Decaying weight, alpha
  • 31. If not stated otherwise, omages by FMI. Licence CC4BY  Verification results based on MAE are used to produce weights for each model (Woodcock and Engel, 2005): W1 = (1/MAE1) / (1/MAE1+1/MAE2+...+1/MAEn) WhereW = weight, n= total number of models  The Blend is calculated using weights and bias corrected model forecasts: Blend =W1*BCFCST1 +W2*BCFCST2+...+Wn*BCFCSTn Where BCFCST = Latest Bias corrected model forecast Calculating weights and Blend
  • 32. If not stated otherwise, omages by FMI. Licence CC4BY Neural networks for post-processing ensemble weather forecasts Image: ECMWF Image: Cecbur, source: Wikipedia., License: CC BY-SA 4.0 Loss: where Φ and φ denote CDF and PDF of a standardGaussian distribution Features (X): • Several weather parameters • Embedded station location Label (y): observations Ouput: normal distribution mean μ and standard deviation σ
  • 33. If not stated otherwise, omages by FMI. Licence CC4BY PredictingWeather Using Neural Networks is Under Research Jonathan Weyn: Can Machines Learn to Predict Weather? Using Deep Learning to Predict Gridded 500‐hPa Geopotential Height From Historical Weather Data: https://agupubs.onlinelibrary.wiley.com/doi/full/10.1029/2019MS001705 https://github.com/jweyn/DLWP Peter Düben: Deep learning for weather and climate (excellent presentation about the topic): https://ossg.bcs.org/wp-content/uploads/04-19-dueben-weather.pdf Challenges and design choices for global weather and climate models based on machine learning: https://www.researchgate.net/publication/326043857_Challenges_and_de sign_choices_for_global_weather_and_climate_models_based_on_machi ne_learning Image: Mehrdad Yazdani, Wikipedia, Licence CC BY-SA 4.0
  • 34. TRAINS Predicting Train Delays Inflicted by Weather Photo by Kalevi Lehtonen 1955. Not published until Commons in 2014. https://fi.wikipedia.org/wiki/Tiedosto:Finnish_class_Dm4_locomotive_number_1607_in_the_year_1955.jpg
  • 35. If not stated otherwise, omages by FMI. Licence CC4BY Predicting Weather Inflicted Train Delays Data Liikennevirasto (CC4) Delay between stations • Passenger trains • 514 stations Weather observations • 19 parameters
  • 36. If not stated otherwise, omages by FMI. Licence CC4BY Image: Venkata Jagannath. License: CC4-BY Image: CC0 Image: BiObserver. License: CC4-BY Results RFR RMSE: 5.37 MAE: 3.21 BSS: 0.11 LSTM RMSE: 4.35 MAE: 2.75 BSS: 0.01 LR RMSE: 5.59 MAE: 3.11 BSS: 0.08 𝐵𝑆𝑆 = 1 − 𝑅𝑀𝑆𝐸 𝑅𝑀𝑆𝐸𝑟𝑒𝑓 , where 𝑅𝑀𝑆𝐸𝑟𝑒𝑓 denotes root mean square error calculated with a mean value over the whole dataset
  • 37. If not stated otherwise, omages by FMI. Licence CC4BY LSTM shows no real skill 0 25 50 Delay(minutes) Time 02/2011 Time 06/2016 Time 02/2017 Predicted vs. true delay, case Ahvenus Predicted delay True delay
  • 38. If not stated otherwise, omages by FMI. Licence CC4BY RFR works relatively well 0 50 100 Delay(minutes) Time 02/2011 Time 06/2016 Time 02/2017 Predicted vs. true delay, average over all stations Predicted delay True delay
  • 39. SASSE Forecasting Electricity Outages Caused by Convective Storms Photo by Bryan Alexander. CC BY 2.0
  • 40. If not stated otherwise, omages by FMI. Licence CC4BY • Trained with observed outages (Random Forest Classifier) • Several input features • Few years of old data required • Based on storm cell severity and knowledge about clients electricity grid • For example category 2 storm cell is expected to cause problems in 10-50 % of grid nodes Turn outages to money Power companies has to give compensation for outages Forecasting Electricity Outages Extract storm cells from radar data Classify the storm cells Predict future outages Turn outages to money
  • 42. If not stated otherwise, omages by FMI. Licence CC4BY Most Important DataTypes Data Data Type Software Weather forecasts GRIB1, GRIB2, (NetCDF) PanoPly, IDV, ECMWF GRIB tools, WGRIB, WGRIB2, gdal, SmartMet Server, pangeo Marine forecasts, air quality forecasts… NetCDF PanoPly, IDV, gdal, pangeo, GeoServer Observations XML, JSON, csv any BUFR ecCodes, NCEP Aviation products TAF, METAR, SIGMET… any IWXXM fmi-avi-message-converter-iwxxm Radar GeoTiff, HDF5 gdal, GeoServer (see also: https://en.wikipedia.org/wiki/Hiera rchical_Data_Format) Satellites GeoTiff, NetCDF gdal, GeoServer, PyTroll 42