SlideShare a Scribd company logo
Amplifying OrganisationalIntelligence Intellify Pty Ltd
IntellifyAI
Intellify_AISydney
Level 8
11York Street
Sydney, NSW 2000
T. (02) 8089 4073
www.intellify.com.au
Melbourne
Level 28
303 Collins Street
Melbourne,VIC 3000
T. (03) 9132 9846
info@intellify.com.au 20 Bridge Street
AWS Forecast: DeepAR Predictor
Time-series
Amplifying OrganisationalIntelligence
Agenda
1. Introduction
2. Why time-series and how are they different?
3. Classical (Statistical) and Modern time-series methods
4. AWS Forecast: Modern Time-series
5. AWS Predictor: DeepAR
6. Demo on DeepAR
Amplifying OrganisationalIntelligence
Why are time-series methods important?
1
2
3
Time series are everywhere!
Most methods were designed for use on cross-sectional data
We can drive better business outcomes through the use of time-series methods
Amplifying OrganisationalIntelligence
How are time series problems different?
• Different states in a time series can make the problem harder to model.
• There could be multiple forecasting horizons; short, medium, long term.
• Typically you care about the prediction as well as the confidence in the prediction.
• Model testing and validation must be conducted in a different way to avoid data leakage and select
the best model.
Amplifying OrganisationalIntelligence
What are desirable properties of time series methods?
Multi-step multivariate prediction
Shares information across time-series
Leverages meta-information
Works on sparse data
Handles non-linearities/interactions
Works with high dimensional data
Models autocorrelation structure implicitly
Minimal feature pre-processing and engineering
Amplifying OrganisationalIntelligence
Traditional Models
Autoregressive models are remarkably flexible at handling a
wide range of different time series patterns, but … How
about ability to learn and generalized from similar series (to
learn more complex models without overfitting)
Benefits Challenges
• Interpretable
• Implicitly models auto-
correlation structure
• Works well when there
is little exogenous
information
• Doesn’t share information
across time-series
• Forecasting a large number of
individual or grouped time-
series
• Struggles with sparsity and
special events
Benefits Challenges
• Shares information across time
series
• Uses meta-information
• Models non-linearities as well
as interactions
• Some works with missing
values
• Struggles if little meta-
information
• Requires larger volumes of data
• Larger amounts of data
preprocessing needed.
• Tend to average predictions too
much across time series
Based on neural networks with a modified architecture.
Implicitly models interactions, non-linearities as well as
time-series features. LSTM’s (vs RNN) do a better job of
modelling long term time dependencies.
ML Models
Random Forest. Prophet. LSTM. AWS ForecastAutoregressive ARIMA. ETS
Classic vs Modern Models: Benefit & Challenges
Amplifying OrganisationalIntelligence
Benefits
• 50% more accurate
forecasts with machine
learning
• Reduce forecasting time
from months to hours
Use cases
• Product Demand
Planning
• Retail product demand
• Supply chain demand
• Operational metrics
• Business metrics
• Financial planning
• Resource planning
Statistical Machine Learning
Volume of data Works well with little
information
Needs data from
several series or
several features
Can share meta-
information
No (ARIMAX
exception)
Yes
Can handle sparse
data
No Yes
Can handle non-
linearities/interaction
s
No or only explicitly Yes
Can leverage shared
information between
time-series
No (VAR exception) Yes, but tends to
average too much
Can work with high
dimensional data
Limited Yes
AWS Forecast
Amazon Forecast is a fully managed service that uses machine learning to deliver highly accurate forecasts.
Amplifying OrganisationalIntelligence
Datasets and Dataset
Groups
Predictors
Forecasts
AWS Forecast
HowThis Works?
Setting Up:
• Sign Up for AWS
• Set Up the AWSCLI
• Set Up Permissions for Amazon
Forecast
• Autoregressive Integrated Moving Average(ARIMA)
• arn:aws:forecast:::algorithm/ARIMA
• DeepAR
• arn:aws:forecast:::algorithm/Deep_AR
• Prophet
• arn:aws:forecast:::algorithm/Prophet
Predictor:
Amplifying OrganisationalIntelligence
AWS Predictor: DeepAr
DeepAR is a forecasting model based on autoregressive RNNs, which learns a global model from historical
data of all time series in all datasets
DeepAr is
Multi-step multivariate time series:
• Given observed values of a series i for
t time-steps, estimating probability distribution
of the next T steps
Pros Cons
• Shares information across
groups of time series
• Models non-linearities as well
as interactions
• Minimal manual feature
engineering
• Ability to incorporate a wide
range of likelihood models,
including probabilistic forecasts
in the form of Monte Carlo
samples
• Struggles if little meta-
information
• Requires larger volumes of data
• Tend to average predictions too
much across time series
Amplifying OrganisationalIntelligence
Best Practices for using the DeepAR Algorithm
• Input/Output interface:
• Supports two data channels (Train and Test for evaluation)
• Format: JSON, gzip, and Parquet
• Best practice:
• Except for when splitting your dataset for train and test, always provide the entire time series. Why: the lagged value features
• Test points should start immediately after the last time point of training
• Avoid using very large values (>400) for the prediction length because it makes the model slow and less
accurate. Solution: consider aggregating your data at a higher frequency.
• ARIMA or ETS, might provide more accurate results on on a single time series. The DeepAR algorithm starts to
outperform the standard methods when your dataset contains hundreds of related time series.
• Train: on both GPU and CPU instances. Inference: only CPU
• Use small number for context_length, prediction_length, num_cells, num_layers, or mini_batch_size, in case of small
instances
Amplifying OrganisationalIntelligence
DeepAR: Demo
SageMaker/DeepAR demo on electricity dataset
Amplifying OrganisationalIntelligence
References
• https://docs.aws.amazon.com/forecast/latest/dg/forecast.dg.pdf
• https://aws.amazon.com/blogs/aws/amazon-forecast-time-series-forecasting-made-easy/
• https://docs.aws.amazon.com/sagemaker/latest/dg/deepar.html

More Related Content

PPTX
Artificial Intelligence Course | AI Tutorial For Beginners | Artificial Intel...
PPTX
Deep ar presentation
PDF
Time series deep learning
PDF
Default Credit Card Prediction
PDF
Knowledge Graphs are Worthless, Knowledge Graph Use Cases are Priceless
PPTX
No sql distilled-distilled
PDF
An Introduction to Anomaly Detection
PPTX
Data Science and Analysis.pptx
Artificial Intelligence Course | AI Tutorial For Beginners | Artificial Intel...
Deep ar presentation
Time series deep learning
Default Credit Card Prediction
Knowledge Graphs are Worthless, Knowledge Graph Use Cases are Priceless
No sql distilled-distilled
An Introduction to Anomaly Detection
Data Science and Analysis.pptx

What's hot (20)

PPTX
Time series forecasting with machine learning
PPTX
K-Means Clustering Algorithm - Cluster Analysis | Machine Learning Algorithm ...
PPTX
Schema-on-Read vs Schema-on-Write
PDF
How to become a data scientist in 6 months
PDF
Feature Engineering & Selection
PDF
Graph-Powered Machine Learning
PDF
Anomaly detection
PPTX
PPTX
Polyglot Persistence
PDF
Big query
PPTX
Support Vector Machines Simply
PPTX
Decision Tree Algorithm & Analysis | Machine Learning Algorithm | Data Scienc...
PPTX
Hierarchical clustering
PPTX
PPTX
Data preprocessing in Machine learning
PPTX
Data Science Training | Data Science For Beginners | Data Science With Python...
PPTX
Intro to Data Science by DatalentTeam at Data Science Clinic#11
PDF
Machine Learning Strategies for Time Series Prediction
PPTX
Ingestion in data pipelines with Managed Kafka Clusters in Azure HDInsight
PDF
Graph Gurus 15: Introducing TigerGraph 2.4
Time series forecasting with machine learning
K-Means Clustering Algorithm - Cluster Analysis | Machine Learning Algorithm ...
Schema-on-Read vs Schema-on-Write
How to become a data scientist in 6 months
Feature Engineering & Selection
Graph-Powered Machine Learning
Anomaly detection
Polyglot Persistence
Big query
Support Vector Machines Simply
Decision Tree Algorithm & Analysis | Machine Learning Algorithm | Data Scienc...
Hierarchical clustering
Data preprocessing in Machine learning
Data Science Training | Data Science For Beginners | Data Science With Python...
Intro to Data Science by DatalentTeam at Data Science Clinic#11
Machine Learning Strategies for Time Series Prediction
Ingestion in data pipelines with Managed Kafka Clusters in Azure HDInsight
Graph Gurus 15: Introducing TigerGraph 2.4
Ad

Similar to AWS Forcecast: DeepAR Predictor Time-series (20)

PDF
Auto-Train a Time-Series Forecast Model With AML + ADB
PPTX
Machine Learning for Forecasting: From Data to Deployment
PPTX
Gaussian Processes and Time Series.pptx
PDF
Time Series Made Easy
PDF
Forecasting time series powerful and simple
PDF
Time Series Forecasting Using Recurrent Neural Network and Vector Autoregress...
PDF
Social_Distancing_DIS_Time_Series
PPTX
Anshita_Timeseries forcast_usecase_inAI.pptx
PDF
Mastering Time Series Forecasting - Guide to Techniques, Applications, and Fu...
PDF
A Survey on Deep Learning for time series Forecasting
PDF
Multivariate Time Series
ODP
Power ai t-imeseries
PPTX
Gde time series_modeling
PDF
IBM SPSS Forecasting
PDF
Time-series prediction with neural networks
PPTX
Presentation On Time Series Analysis in Mechine Learning
PDF
Walk-Through Demand Sales Time Series Forecasting
PDF
A Review on the Comparison of Box Jenkins ARIMA and LSTM of Deep Learning
PPTX
Role of Hybrid Time Series Models (ARIMA-ANN) in Forecasting Scenario of Agri...
PPTX
Introduction to Time Series Analytics with Microsoft Azure
Auto-Train a Time-Series Forecast Model With AML + ADB
Machine Learning for Forecasting: From Data to Deployment
Gaussian Processes and Time Series.pptx
Time Series Made Easy
Forecasting time series powerful and simple
Time Series Forecasting Using Recurrent Neural Network and Vector Autoregress...
Social_Distancing_DIS_Time_Series
Anshita_Timeseries forcast_usecase_inAI.pptx
Mastering Time Series Forecasting - Guide to Techniques, Applications, and Fu...
A Survey on Deep Learning for time series Forecasting
Multivariate Time Series
Power ai t-imeseries
Gde time series_modeling
IBM SPSS Forecasting
Time-series prediction with neural networks
Presentation On Time Series Analysis in Mechine Learning
Walk-Through Demand Sales Time Series Forecasting
A Review on the Comparison of Box Jenkins ARIMA and LSTM of Deep Learning
Role of Hybrid Time Series Models (ARIMA-ANN) in Forecasting Scenario of Agri...
Introduction to Time Series Analytics with Microsoft Azure
Ad

More from PolarSeven Pty Ltd (20)

PDF
Aws user group #04 landing zones
PDF
Aws user group #03 - All things Iot
PDF
Aws user group #01 lets talk serverless
PDF
AWS Reinvent Recap 2018
PDF
AWS User Group October
PDF
AWS User Group August
PDF
AWS User Group November
PDF
AWS User Group September
PDF
Amazon Web Services User Group Sydney - March 2018
PDF
Amazon Web Services User Group Sydney - February 2018
PDF
Deep Dive on Cloud Policies and Automation
PDF
Securing Traffic Leaving A VPC
PDF
Telstra Programmable Networks & Scaling a Serverless Team with Automation
PDF
AWS User Group Sydney - Meetup #60
PPTX
Shared Security in AWS
PPTX
Visibility, Optimization & Governance for Cloud Services
PDF
AWS OpsWorks for Chef Automate
PPTX
AWS CloudFormation Automation, TrafficScript, and Serverless architecture wit...
PPTX
AWS User Group December 2016
PDF
AWS User Group Sydney - Atlassian 5-10-16
Aws user group #04 landing zones
Aws user group #03 - All things Iot
Aws user group #01 lets talk serverless
AWS Reinvent Recap 2018
AWS User Group October
AWS User Group August
AWS User Group November
AWS User Group September
Amazon Web Services User Group Sydney - March 2018
Amazon Web Services User Group Sydney - February 2018
Deep Dive on Cloud Policies and Automation
Securing Traffic Leaving A VPC
Telstra Programmable Networks & Scaling a Serverless Team with Automation
AWS User Group Sydney - Meetup #60
Shared Security in AWS
Visibility, Optimization & Governance for Cloud Services
AWS OpsWorks for Chef Automate
AWS CloudFormation Automation, TrafficScript, and Serverless architecture wit...
AWS User Group December 2016
AWS User Group Sydney - Atlassian 5-10-16

Recently uploaded (20)

PPTX
breach-and-attack-simulation-cybersecurity-india-chennai-defenderrabbit-2025....
PPT
“AI and Expert System Decision Support & Business Intelligence Systems”
PPT
Teaching material agriculture food technology
PDF
Modernizing your data center with Dell and AMD
PDF
Shreyas Phanse Resume: Experienced Backend Engineer | Java • Spring Boot • Ka...
PDF
Machine learning based COVID-19 study performance prediction
PDF
Review of recent advances in non-invasive hemoglobin estimation
PDF
Spectral efficient network and resource selection model in 5G networks
PDF
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
PDF
Dropbox Q2 2025 Financial Results & Investor Presentation
PDF
Advanced methodologies resolving dimensionality complications for autism neur...
PPTX
Big Data Technologies - Introduction.pptx
PPTX
20250228 LYD VKU AI Blended-Learning.pptx
PDF
[발표본] 너의 과제는 클라우드에 있어_KTDS_김동현_20250524.pdf
PDF
NewMind AI Weekly Chronicles - August'25 Week I
PPTX
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
PPTX
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
PDF
Reach Out and Touch Someone: Haptics and Empathic Computing
PDF
KodekX | Application Modernization Development
PPTX
PA Analog/Digital System: The Backbone of Modern Surveillance and Communication
breach-and-attack-simulation-cybersecurity-india-chennai-defenderrabbit-2025....
“AI and Expert System Decision Support & Business Intelligence Systems”
Teaching material agriculture food technology
Modernizing your data center with Dell and AMD
Shreyas Phanse Resume: Experienced Backend Engineer | Java • Spring Boot • Ka...
Machine learning based COVID-19 study performance prediction
Review of recent advances in non-invasive hemoglobin estimation
Spectral efficient network and resource selection model in 5G networks
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
Dropbox Q2 2025 Financial Results & Investor Presentation
Advanced methodologies resolving dimensionality complications for autism neur...
Big Data Technologies - Introduction.pptx
20250228 LYD VKU AI Blended-Learning.pptx
[발표본] 너의 과제는 클라우드에 있어_KTDS_김동현_20250524.pdf
NewMind AI Weekly Chronicles - August'25 Week I
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
Reach Out and Touch Someone: Haptics and Empathic Computing
KodekX | Application Modernization Development
PA Analog/Digital System: The Backbone of Modern Surveillance and Communication

AWS Forcecast: DeepAR Predictor Time-series

  • 1. Amplifying OrganisationalIntelligence Intellify Pty Ltd IntellifyAI Intellify_AISydney Level 8 11York Street Sydney, NSW 2000 T. (02) 8089 4073 www.intellify.com.au Melbourne Level 28 303 Collins Street Melbourne,VIC 3000 T. (03) 9132 9846 info@intellify.com.au 20 Bridge Street AWS Forecast: DeepAR Predictor Time-series
  • 2. Amplifying OrganisationalIntelligence Agenda 1. Introduction 2. Why time-series and how are they different? 3. Classical (Statistical) and Modern time-series methods 4. AWS Forecast: Modern Time-series 5. AWS Predictor: DeepAR 6. Demo on DeepAR
  • 3. Amplifying OrganisationalIntelligence Why are time-series methods important? 1 2 3 Time series are everywhere! Most methods were designed for use on cross-sectional data We can drive better business outcomes through the use of time-series methods
  • 4. Amplifying OrganisationalIntelligence How are time series problems different? • Different states in a time series can make the problem harder to model. • There could be multiple forecasting horizons; short, medium, long term. • Typically you care about the prediction as well as the confidence in the prediction. • Model testing and validation must be conducted in a different way to avoid data leakage and select the best model.
  • 5. Amplifying OrganisationalIntelligence What are desirable properties of time series methods? Multi-step multivariate prediction Shares information across time-series Leverages meta-information Works on sparse data Handles non-linearities/interactions Works with high dimensional data Models autocorrelation structure implicitly Minimal feature pre-processing and engineering
  • 6. Amplifying OrganisationalIntelligence Traditional Models Autoregressive models are remarkably flexible at handling a wide range of different time series patterns, but … How about ability to learn and generalized from similar series (to learn more complex models without overfitting) Benefits Challenges • Interpretable • Implicitly models auto- correlation structure • Works well when there is little exogenous information • Doesn’t share information across time-series • Forecasting a large number of individual or grouped time- series • Struggles with sparsity and special events Benefits Challenges • Shares information across time series • Uses meta-information • Models non-linearities as well as interactions • Some works with missing values • Struggles if little meta- information • Requires larger volumes of data • Larger amounts of data preprocessing needed. • Tend to average predictions too much across time series Based on neural networks with a modified architecture. Implicitly models interactions, non-linearities as well as time-series features. LSTM’s (vs RNN) do a better job of modelling long term time dependencies. ML Models Random Forest. Prophet. LSTM. AWS ForecastAutoregressive ARIMA. ETS Classic vs Modern Models: Benefit & Challenges
  • 7. Amplifying OrganisationalIntelligence Benefits • 50% more accurate forecasts with machine learning • Reduce forecasting time from months to hours Use cases • Product Demand Planning • Retail product demand • Supply chain demand • Operational metrics • Business metrics • Financial planning • Resource planning Statistical Machine Learning Volume of data Works well with little information Needs data from several series or several features Can share meta- information No (ARIMAX exception) Yes Can handle sparse data No Yes Can handle non- linearities/interaction s No or only explicitly Yes Can leverage shared information between time-series No (VAR exception) Yes, but tends to average too much Can work with high dimensional data Limited Yes AWS Forecast Amazon Forecast is a fully managed service that uses machine learning to deliver highly accurate forecasts.
  • 8. Amplifying OrganisationalIntelligence Datasets and Dataset Groups Predictors Forecasts AWS Forecast HowThis Works? Setting Up: • Sign Up for AWS • Set Up the AWSCLI • Set Up Permissions for Amazon Forecast • Autoregressive Integrated Moving Average(ARIMA) • arn:aws:forecast:::algorithm/ARIMA • DeepAR • arn:aws:forecast:::algorithm/Deep_AR • Prophet • arn:aws:forecast:::algorithm/Prophet Predictor:
  • 9. Amplifying OrganisationalIntelligence AWS Predictor: DeepAr DeepAR is a forecasting model based on autoregressive RNNs, which learns a global model from historical data of all time series in all datasets DeepAr is Multi-step multivariate time series: • Given observed values of a series i for t time-steps, estimating probability distribution of the next T steps Pros Cons • Shares information across groups of time series • Models non-linearities as well as interactions • Minimal manual feature engineering • Ability to incorporate a wide range of likelihood models, including probabilistic forecasts in the form of Monte Carlo samples • Struggles if little meta- information • Requires larger volumes of data • Tend to average predictions too much across time series
  • 10. Amplifying OrganisationalIntelligence Best Practices for using the DeepAR Algorithm • Input/Output interface: • Supports two data channels (Train and Test for evaluation) • Format: JSON, gzip, and Parquet • Best practice: • Except for when splitting your dataset for train and test, always provide the entire time series. Why: the lagged value features • Test points should start immediately after the last time point of training • Avoid using very large values (>400) for the prediction length because it makes the model slow and less accurate. Solution: consider aggregating your data at a higher frequency. • ARIMA or ETS, might provide more accurate results on on a single time series. The DeepAR algorithm starts to outperform the standard methods when your dataset contains hundreds of related time series. • Train: on both GPU and CPU instances. Inference: only CPU • Use small number for context_length, prediction_length, num_cells, num_layers, or mini_batch_size, in case of small instances
  • 12. Amplifying OrganisationalIntelligence References • https://docs.aws.amazon.com/forecast/latest/dg/forecast.dg.pdf • https://aws.amazon.com/blogs/aws/amazon-forecast-time-series-forecasting-made-easy/ • https://docs.aws.amazon.com/sagemaker/latest/dg/deepar.html

Editor's Notes

  • #4: Lets first start with why time-series series methods are important. The first reasons is that time series problems are everywhere; they appear in financial data, customer behavior data, property data and engineering problems. In fact, in our experience, we have that around 70% of our consulting projects have some time-series component or consideration that needs to be incorporated into the solution. The second reason is that most methods, especially the standard ones inside statistics and machine learning are built for cross-sectional problems. If you haven’t heard of this terminology before, cross sectional problems are where we take many observations at a point in time from many individuals.
  • #5: Time–series data is a chronological sequence of observations on a particular variable.
  • #6: Time–series data is a chronological sequence of observations on a particular variable.
  • #7: Exponential smoothing (ETS methods) Classical methods typically work through: Decomposition of time-series into each of its components Find average historical affects for each component Aggregate average historical affects and forecast one step ahead Modern time series methods follow the same patterns as traditional machine learning approaches with 3 major modifications: Time-series features are manually created by the user (time-series feature engineering) if the algorithm cannot implicitly model them. Specific Machine learning methods are applied that give us the point estimate as well as the distribution. Traditional time series validation (not random sampling) is used with specific metrics.
  • #8: Developers with no machine learning expertise can use the Amazon Forecast APIs, AWS Command Line Interface (AWS CLI), or Amazon Forecast console to import training data into one or more Amazon Forecast datasets, train predictors, and generate forecasts.
  • #9: When creating forecasting projects in Amazon Forecast, you work with the following resources: Before using Amazon Forecast to evaluate or forecast time-series data, create an AWS account, configure access permissions, and set up the AWS Command Line Interface (AWS CLI).
  • #10: Autoregression is a time series model that uses observations from previous time steps as input to a regression equation to predict the value at the next time step.  A recurrent neural network (RNN) is a class of artificial neural networks where connections between nodes form a directed graph along a temporal sequence
  • #12: https://github.com/awslabs/amazon-sagemaker-examples/blob/master/introduction_to_amazon_algorithms/deepar_electricity/DeepAR-Electricity.ipynb