SlideShare a Scribd company logo
Scaling Deep Learning Models for Large Spatial
Time-Series Forecasting
Zainab Abbas1, Jon Reginbald Ivarsson1, Ahmad Al-Shishtawy2 and Vladimir Vlassov1
1 KTH Royal Institute of Technology
2 RISE SICS
Stockholm, Sweden
IEEE BIGDATA 2019, LOS ANGELES, DEC 9-12, 2019
Challenge of Scale
● Deep Neural networks are used for different machine learning tasks, such as
spatial time-series forecasting.
● At scale, training deep NNs is computationally and memory intensive.
● Partitioning and distribution is a general approach to the challenge of scale in
NN-based modelling
○ dividing the problem into smaller tasks.
○ these tasks comprise of smaller models working on subsets of data
2
Traffic Data
● Sensor ID
● GPS coordinates
● Time
● Flow (no. of cars per minute)
● Average Speed (Km per hr)
Flow / Speed = Density (Cars per Km)
3Scaling Deep Learning Models for Large Spatial Time-Series Forecasting
Large amount of sensor data
● Radar sensors deployed on
Stockholm highways
● More than 88 millions data
points collected by 2058
sensors
● Number of sensors
increasing every year
● Sensor values per minute
4Scaling Deep Learning Models for Large Spatial Time-Series Forecasting
Research Questions
● How to partition spatial time-series while preserving dependencies among
them?
● Which and how many spatial time-series do we take into account for a fast and
accurate forecast?
5
Scaling Deep Learning Models for Large Spatial Time-Series Forecasting
Graph Representation
● The traffic sensors are represented in the
form of a directed weighted graph
● Sensors that are at the same location but
on different lanes are represented as a
vertex
● The paths between sensors are
represented as edges
● An edge is weighted with the travel time
between the corresponding sensors
6
Graph Partitioning
7
2) Creation of Base Partitions
4) Additions of Partitions from Front and Behind.3) Creation of Base Partitions Graph
Do backward traversal from the
starting vertex in the graph till the
threshold is met.
1) Directed Weighted Graph
Data Representation
8
Spatio-temporal dependencies of sensor data Space-Time window representation of sensor data
Stacked LSTMs
● Stacked LSTMs are build of multiple layers of LSTM
placed over each other
● More powerful and deep neural network compared to
the conventional architecture
● Capable to learn non-linear dependencies in the data
9Scaling Deep Learning Models for Large Spatial Time-Series Forecasting
Technology
● Apache Spark 2.4.0
● Python 2.7.15
● Tensorflow 1.11.0
10Scaling Deep Learning Models for Large Spatial Time-Series Forecasting
Prediction Models
11Scaling Deep Learning Models for Large Spatial Time-Series Forecasting
1-1 Single Sensor Model (SSM)
12
1-1 Single Sensor Model (SSM)
13
n-n Entire Sensor Infrastructure Model (ESIM)
n n
Partition-Based Models (B-t)
15
3 min partitions (B3) 5 min partitions (B5)
Partition-Based Models (B-t)
16
10 min partitions (B10) 20 min partitions (B20)
Results
Partition-based model B15 has ≈2x less RMSE compared to SSM and ESIM.
17
Results
Sequential training time (sec) Parallel training time (sec)
18
* SSMs have very less prediction time, hence omitted. 19
Results
Conclusion
● Scalability
● Accuracy
● Performance
20Scaling Deep Learning Models for Large Spatial Time-Series Forecasting
Thank you :)
21Scaling Deep Learning Models for Large Spatial Time-Series Forecasting
Zainab Abbas
zainabab@kth.se
Vladimir Vlassov
vladv@kth.se
Ahmad Al-Shishtawy
ahmad.al-shishtawy@ri.se
Jon R. Ivarsson
mail@reginbald.com
Scaling Deep Learning Models for Large Spatial
Time-Series Forecasting
Zainab Abbas1, Jon Reginbald Ivarsson1, Ahmad Al-Shishtawy2 and Vladimir Vlassov1
1 KTH Royal Institute of Technology
2 RISE SICS
Stockholm, Sweden
IEEE BIGDATA 2019, LOS ANGELES, DEC 9-12, 2019

More Related Content

PDF
Using TensorFlow for Machine Learning
PDF
Cluster formation over huge volatile robotic data
PPTX
Applications of PARALLEL PROCESSING
PDF
Nephele pegasus
PDF
Gray-Box Models for Performance Assessment of Spark Applications
PDF
Centernet
PPTX
Labmeeting - 20150512 - New Secure Routing Method & Applications Facing MitM ...
PPTX
Point cloud library
Using TensorFlow for Machine Learning
Cluster formation over huge volatile robotic data
Applications of PARALLEL PROCESSING
Nephele pegasus
Gray-Box Models for Performance Assessment of Spark Applications
Centernet
Labmeeting - 20150512 - New Secure Routing Method & Applications Facing MitM ...
Point cloud library

What's hot (20)

PDF
Gravitational Billion Body Project
PDF
Resense: Transparent Record and Replay of Sensor Data in the Internet of Thin...
PDF
Ace an accurate and efficient multi entity device-free wlan localization system
PPT
Matteoli ieee gold_2010_clean
PPTX
Map Reduce in the Clouds (http://salsahpc.indiana.edu/mapreduceroles4azure/)
PPTX
Raster processing
PDF
2020 ml swarm ascend presentation
PDF
Graph neural networks overview
DOCX
IEEE 2014 JAVA NETWORKING PROJECTS Snapshot and continuous data collection in...
PDF
An Elastic Middleware Platform for Concurrent and Distributed Cloud and MapRe...
PDF
geographic information system pdf
PPTX
Summary of survey papers on deep learning method to 3D data
PPTX
Grid based method & model based clustering method
PDF
An adaptive distributed simulator for cloud andmap reduce algorithms and arch...
PDF
Deep Learning Applications to Satellite Imagery
PPTX
Attention in Deep Learning
PDF
Global Grid of Grapes
PDF
PDF
Continuous and Parallel LiDAR Point-cloud Clustering
PPTX
Raster data analysis
Gravitational Billion Body Project
Resense: Transparent Record and Replay of Sensor Data in the Internet of Thin...
Ace an accurate and efficient multi entity device-free wlan localization system
Matteoli ieee gold_2010_clean
Map Reduce in the Clouds (http://salsahpc.indiana.edu/mapreduceroles4azure/)
Raster processing
2020 ml swarm ascend presentation
Graph neural networks overview
IEEE 2014 JAVA NETWORKING PROJECTS Snapshot and continuous data collection in...
An Elastic Middleware Platform for Concurrent and Distributed Cloud and MapRe...
geographic information system pdf
Summary of survey papers on deep learning method to 3D data
Grid based method & model based clustering method
An adaptive distributed simulator for cloud andmap reduce algorithms and arch...
Deep Learning Applications to Satellite Imagery
Attention in Deep Learning
Global Grid of Grapes
Continuous and Parallel LiDAR Point-cloud Clustering
Raster data analysis
Ad

Similar to Scaling Deep Learning Models for Large Spatial Time-Series Forecasting (20)

PPTX
[20240628_LabSeminar_Huy]ScalableSTGNN.pptx
PPTX
[20240422_LabSeminar_Huy]Taming_Effect.pptx
PDF
Map r chicago_advanalytics_oct_meetup
PDF
Spatio-Temporal Data Analysis using Deep Learning
PDF
Predictive Maintenance Using Recurrent Neural Networks
PDF
PPTX
Traffic Prediction from Street Network images.pptx
PPTX
[20240520_LabSeminar_Huy]DSTAGNN: Dynamic Spatial-Temporal Aware Graph Neural...
PDF
2006.11583.pdf
PDF
VSSML18. Deepnets and Time Series
PPTX
[Seminar] hyunwook 0624
PDF
IRJET- Identification of Scene Images using Convolutional Neural Networks - A...
PDF
Pixel Recurrent Neural Networks
PDF
Time series deep learning
PPTX
[20240805_LabSeminar_Huy]GPT-ST: Generative Pre-Training of Spatio-Temporal G...
PPTX
[20240325_LabSeminar_Huy]Spatial-Temporal Fusion Graph Neural Networks for Tr...
PDF
mathematics-10-01599-v2.pdf
PDF
Deep Learning for Time Series Data
PDF
Memory Efficient Graph Convolutional Network based Distributed Link Prediction
PDF
Machine Learning for Time Series, Strata London 2018
[20240628_LabSeminar_Huy]ScalableSTGNN.pptx
[20240422_LabSeminar_Huy]Taming_Effect.pptx
Map r chicago_advanalytics_oct_meetup
Spatio-Temporal Data Analysis using Deep Learning
Predictive Maintenance Using Recurrent Neural Networks
Traffic Prediction from Street Network images.pptx
[20240520_LabSeminar_Huy]DSTAGNN: Dynamic Spatial-Temporal Aware Graph Neural...
2006.11583.pdf
VSSML18. Deepnets and Time Series
[Seminar] hyunwook 0624
IRJET- Identification of Scene Images using Convolutional Neural Networks - A...
Pixel Recurrent Neural Networks
Time series deep learning
[20240805_LabSeminar_Huy]GPT-ST: Generative Pre-Training of Spatio-Temporal G...
[20240325_LabSeminar_Huy]Spatial-Temporal Fusion Graph Neural Networks for Tr...
mathematics-10-01599-v2.pdf
Deep Learning for Time Series Data
Memory Efficient Graph Convolutional Network based Distributed Link Prediction
Machine Learning for Time Series, Strata London 2018
Ad

Recently uploaded (20)

PDF
annual-report-2024-2025 original latest.
PPTX
Introduction to Basics of Ethical Hacking and Penetration Testing -Unit No. 1...
PDF
Clinical guidelines as a resource for EBP(1).pdf
PPTX
mbdjdhjjodule 5-1 rhfhhfjtjjhafbrhfnfbbfnb
PPTX
Introduction-to-Cloud-ComputingFinal.pptx
PPTX
Business Acumen Training GuidePresentation.pptx
PPT
Reliability_Chapter_ presentation 1221.5784
PDF
Galatica Smart Energy Infrastructure Startup Pitch Deck
PPTX
oil_refinery_comprehensive_20250804084928 (1).pptx
PDF
TRAFFIC-MANAGEMENT-AND-ACCIDENT-INVESTIGATION-WITH-DRIVING-PDF-FILE.pdf
PPTX
Microsoft-Fabric-Unifying-Analytics-for-the-Modern-Enterprise Solution.pptx
PPTX
Business Ppt On Nestle.pptx huunnnhhgfvu
PPTX
ALIMENTARY AND BILIARY CONDITIONS 3-1.pptx
PPTX
Supervised vs unsupervised machine learning algorithms
PPTX
STUDY DESIGN details- Lt Col Maksud (21).pptx
PPT
Miokarditis (Inflamasi pada Otot Jantung)
PDF
168300704-gasification-ppt.pdfhghhhsjsjhsuxush
PPTX
Qualitative Qantitative and Mixed Methods.pptx
PPTX
1_Introduction to advance data techniques.pptx
PDF
Mega Projects Data Mega Projects Data
annual-report-2024-2025 original latest.
Introduction to Basics of Ethical Hacking and Penetration Testing -Unit No. 1...
Clinical guidelines as a resource for EBP(1).pdf
mbdjdhjjodule 5-1 rhfhhfjtjjhafbrhfnfbbfnb
Introduction-to-Cloud-ComputingFinal.pptx
Business Acumen Training GuidePresentation.pptx
Reliability_Chapter_ presentation 1221.5784
Galatica Smart Energy Infrastructure Startup Pitch Deck
oil_refinery_comprehensive_20250804084928 (1).pptx
TRAFFIC-MANAGEMENT-AND-ACCIDENT-INVESTIGATION-WITH-DRIVING-PDF-FILE.pdf
Microsoft-Fabric-Unifying-Analytics-for-the-Modern-Enterprise Solution.pptx
Business Ppt On Nestle.pptx huunnnhhgfvu
ALIMENTARY AND BILIARY CONDITIONS 3-1.pptx
Supervised vs unsupervised machine learning algorithms
STUDY DESIGN details- Lt Col Maksud (21).pptx
Miokarditis (Inflamasi pada Otot Jantung)
168300704-gasification-ppt.pdfhghhhsjsjhsuxush
Qualitative Qantitative and Mixed Methods.pptx
1_Introduction to advance data techniques.pptx
Mega Projects Data Mega Projects Data

Scaling Deep Learning Models for Large Spatial Time-Series Forecasting

  • 1. Scaling Deep Learning Models for Large Spatial Time-Series Forecasting Zainab Abbas1, Jon Reginbald Ivarsson1, Ahmad Al-Shishtawy2 and Vladimir Vlassov1 1 KTH Royal Institute of Technology 2 RISE SICS Stockholm, Sweden IEEE BIGDATA 2019, LOS ANGELES, DEC 9-12, 2019
  • 2. Challenge of Scale ● Deep Neural networks are used for different machine learning tasks, such as spatial time-series forecasting. ● At scale, training deep NNs is computationally and memory intensive. ● Partitioning and distribution is a general approach to the challenge of scale in NN-based modelling ○ dividing the problem into smaller tasks. ○ these tasks comprise of smaller models working on subsets of data 2
  • 3. Traffic Data ● Sensor ID ● GPS coordinates ● Time ● Flow (no. of cars per minute) ● Average Speed (Km per hr) Flow / Speed = Density (Cars per Km) 3Scaling Deep Learning Models for Large Spatial Time-Series Forecasting
  • 4. Large amount of sensor data ● Radar sensors deployed on Stockholm highways ● More than 88 millions data points collected by 2058 sensors ● Number of sensors increasing every year ● Sensor values per minute 4Scaling Deep Learning Models for Large Spatial Time-Series Forecasting
  • 5. Research Questions ● How to partition spatial time-series while preserving dependencies among them? ● Which and how many spatial time-series do we take into account for a fast and accurate forecast? 5 Scaling Deep Learning Models for Large Spatial Time-Series Forecasting
  • 6. Graph Representation ● The traffic sensors are represented in the form of a directed weighted graph ● Sensors that are at the same location but on different lanes are represented as a vertex ● The paths between sensors are represented as edges ● An edge is weighted with the travel time between the corresponding sensors 6
  • 7. Graph Partitioning 7 2) Creation of Base Partitions 4) Additions of Partitions from Front and Behind.3) Creation of Base Partitions Graph Do backward traversal from the starting vertex in the graph till the threshold is met. 1) Directed Weighted Graph
  • 8. Data Representation 8 Spatio-temporal dependencies of sensor data Space-Time window representation of sensor data
  • 9. Stacked LSTMs ● Stacked LSTMs are build of multiple layers of LSTM placed over each other ● More powerful and deep neural network compared to the conventional architecture ● Capable to learn non-linear dependencies in the data 9Scaling Deep Learning Models for Large Spatial Time-Series Forecasting
  • 10. Technology ● Apache Spark 2.4.0 ● Python 2.7.15 ● Tensorflow 1.11.0 10Scaling Deep Learning Models for Large Spatial Time-Series Forecasting
  • 11. Prediction Models 11Scaling Deep Learning Models for Large Spatial Time-Series Forecasting
  • 12. 1-1 Single Sensor Model (SSM) 12
  • 13. 1-1 Single Sensor Model (SSM) 13
  • 14. n-n Entire Sensor Infrastructure Model (ESIM) n n
  • 15. Partition-Based Models (B-t) 15 3 min partitions (B3) 5 min partitions (B5)
  • 16. Partition-Based Models (B-t) 16 10 min partitions (B10) 20 min partitions (B20)
  • 17. Results Partition-based model B15 has ≈2x less RMSE compared to SSM and ESIM. 17
  • 18. Results Sequential training time (sec) Parallel training time (sec) 18
  • 19. * SSMs have very less prediction time, hence omitted. 19 Results
  • 20. Conclusion ● Scalability ● Accuracy ● Performance 20Scaling Deep Learning Models for Large Spatial Time-Series Forecasting
  • 21. Thank you :) 21Scaling Deep Learning Models for Large Spatial Time-Series Forecasting Zainab Abbas zainabab@kth.se Vladimir Vlassov vladv@kth.se Ahmad Al-Shishtawy ahmad.al-shishtawy@ri.se Jon R. Ivarsson mail@reginbald.com
  • 22. Scaling Deep Learning Models for Large Spatial Time-Series Forecasting Zainab Abbas1, Jon Reginbald Ivarsson1, Ahmad Al-Shishtawy2 and Vladimir Vlassov1 1 KTH Royal Institute of Technology 2 RISE SICS Stockholm, Sweden IEEE BIGDATA 2019, LOS ANGELES, DEC 9-12, 2019

Editor's Notes