SlideShare a Scribd company logo
BUSINESS ANALYTICS FOUNDATION WITH R
TOOLS
Lesson 4 - Predictive Modeling Techniques
Copyright 2016,Beamsync, All rights reserved.
• Hierarchical : Also known as nesting clusters as it also clusters to exist within bigger clusters to form
a tree.
• Partitioned clustering : Its simply a division of the set of data objects into non-overlapping clusters
such that each object is in exactly one subset.
• Exclusive clustering: They assign each object to a single cluster.
• Overlapping clustering : Its used to reflect the fact that an object can simultaneously belong to
more than one group.
• Fuzzy clustering : Every object belongs to every cluster with a membership weight that goes
between 0 – if it absolutely doesn’t belong to that cluster and 1 – if it absolutely belongs to the
cluster.
• Complete clustering : It performs a hierarchical cluster analysis using a set of dissimilarities on n
objects that are being clustered. They tend to find compact clusters of an approximately equal
diameters.
TYPES OF CLUSTERING
Copyright 2016,Beamsync, All rights reserved.
• Well – separated : The distance between any two points in different groups is greater than the
distance between any two points within a group. They need not be globular.
• Prototype – based : The prototype of a cluster is often a centroid for data with continuous
attributes. Such clusters tend to be globular.
• Graph – based : When data is represented as a graph where nodes are the objects and links
represent connection among the objects. They tend to be globular.
• Density – based : This method is employed when the clusters are irregular and when noise and
outliers are present.
• Shared – property : Also known as conceptual clustering its the process of identifying the pattern in
the clusters to successfully segregate into groups of clusters.
TYPES OF CLUSTERS
Copyright 2016,Beamsync, All rights reserved.
• K – means : It’s a prototype based clustering technique that attempts to define the number of
clusters (K). They are represented as centroids.
• Agglomerative Hierarchical Clustering : It refers to a collection of closely related clustering
techniques that produce a hierarchical clustering by starting with each point as singleton cluster
and repeatedly merging the closest clusters until a single, all encompassing cluster remains.
• DBSCAN : It’s a density based clustering algorithm that produces a partitioned clustering, in which
number of clusters is automatically determined by the algorithm.
METHODS TO FORM CLUSTERS.
Copyright 2016,Beamsync, All rights reserved.
• Time series data is an ordered sequence of observations on a quantitative variable measured over
an equally spaced time interval.
• Time series are used in statistics, signal processing, pattern recognition, econometrics, mathematic
finance, weather forecasting, earthquake prediction electroencephalography, control engineering,
astronomy , communications engineering and other places.
• The ultimate objective of time series is –we want to predict the future value of variable under
consideration.
• Time series analysis is
• a set of methods used for analyzing time series data
• and forecasting the future value of the variable under consideration.
• In time series analysis it is assumed that the data consist of set of identifiable components and
random errors which usually makes the pattern difficult to identify.
• E.g. Consider a store that sells blankets and quilts. Then the store will observe sale of the quilts to
be maximum during the winter season. This case comes under seasonal variation.
TIME SERIES
Copyright 2016,Beamsync, All rights reserved.
• Long term trend – The smooth long term direction of time series where the data can increase or
decrease in some pattern.
• Seasonal variation – Patterns of change in a time series within a year which tends to repeat every
year.
• Cyclical variation – Its much alike seasonal variation but the rise and fall of time series over periods
are longer than one year.
• Irregular variation – Any variation that is not explainable by any of the three above mentioned
components. But that doesn’t explain the variations to be random. They can be classified into –
stationary and non – stationary variation.
• When the data neither increases nor decreases, i.e. its completely random its called stationary
variation.
• When the data has some explainable portion remaining and can be analyzed further then such
case is called non – stationary variation.
COMPONENTS OF TIME SERIES ANALYSIS
Copyright 2016,Beamsync, All rights reserved.
CYCLICAL VERSUS SEASONAL ANALYSIS
Cyclical
•Regular and predictable changes which
recur over a period of time
• Eg. Boosted sales due to inflation
Seasonal
•Changes recurring over a period of one
year
•Eg. Increased sale of blankets and quilts in
winter
Copyright 2016,Beamsync, All rights reserved.
• Separating a time series into its constituent components, which are usually a trend
component and
an irregular component, and if it is a seasonal time series, a seasonal component.
• Time series value a function of three components –
• Trendcomponent,
• Seasonal component
• irregular or error component.
• An additive decomposition model takes the following
form: Yt = Trend + Seasonal + Irregular
• A multiplicative model takes the following form:
Yt = Trend X Seasonal XIrregular
• A time series that has seasonal effects removed is called deseasonalised time series.
DECOMPOSITION OF TIME SERIES
Copyright 2016,Beamsync, All rights reserved.
• Decompose() function for additive models
• The function “decompose()” returns a list object as its result.
• Here the estimates of the seasonal component, trend component and irregular component are
stored in “seasonal”, “trend”, and “random” respectively.
• Decompose()function
• Estimates the trend component by moving average
• Estimates seasonal component by averaging observations
• Calculates the error component by subtracting the earlier values from the time series.
• Works well on a complete time series data
DECOMPOSITION OF SEASONAL AND TREND DATA
Copyright 2016,Beamsync, All rights reserved.
• Use MovingAverage
• TTR package
• SMA
Smooth time series using a simple moving average
• EMA
Exponential Smoothing
• WMA
Weighted moving average
• Touse the SMA() function, we need to specify the order (span) of the simple moving average, using
the parameter “n”.
• For example, to calculate a simple moving average of order 5, we set n=5 in the SMA() function.
DECOMPOSING NON-SEASONAL TIME SERIES
Copyright 2016,Beamsync, All rights reserved.
Thank You
Beamsync is top training institute for business analytics course in Bangalore.
Beamsync is also providing certification along with training course. For more
details click here: http://beamsync.com/business-analytics-training-bangalore/
Copyright 2016,Beamsync, All rights reserved.

More Related Content

PPTX
Business Analytics Foundation with R tools - Part 2
PPTX
Business Analytics Foundation with R Tools Part 1
PPTX
Data mining Part 1
PPTX
Data Mining: Mining ,associations, and correlations
PPTX
Firebird: cost-based optimization and statistics, by Dmitry Yemanov (in English)
ODP
Understandung Firebird optimizer, by Dmitry Yemanov (in English)
PDF
HRUG - Linear regression with R
Business Analytics Foundation with R tools - Part 2
Business Analytics Foundation with R Tools Part 1
Data mining Part 1
Data Mining: Mining ,associations, and correlations
Firebird: cost-based optimization and statistics, by Dmitry Yemanov (in English)
Understandung Firebird optimizer, by Dmitry Yemanov (in English)
HRUG - Linear regression with R

What's hot (20)

PDF
Missing data handling
PPT
Sales Force Alignment
PPTX
MT6702 Unit 4 Analysis of Data
PPT
Fernandos Statistics
PDF
Delivery Feet Data using K Mean Clustering with Applied SPSS
PPT
Emilie Rousselin Stastistics
PPTX
Matlab Data And Statistics
PPTX
Matlab:Regression
PDF
A Comparative Study for Anomaly Detection in Data Mining
PDF
Portfolio Theory of Information Retrieval
PDF
Data Trend Analysis by Assigning Polynomial Function For Given Data Set
PPTX
Pca(principal components analysis)
PDF
The RuLIS approach to outliers (Marcello D'Orazio,FAO)
 
PDF
PCA (Principal component analysis)
PPT
5 6 Scatter Plots & Best Fit Lines
PPS
Scatter Plot
PPTX
Scatter diagram and control chart
PPTX
Data Configuration for Analysis with Matt Hansen at StatStuff
PPT
Stem & leaf, Bar graphs, and Histograms
PPTX
Introduction concepts of Statistics
Missing data handling
Sales Force Alignment
MT6702 Unit 4 Analysis of Data
Fernandos Statistics
Delivery Feet Data using K Mean Clustering with Applied SPSS
Emilie Rousselin Stastistics
Matlab Data And Statistics
Matlab:Regression
A Comparative Study for Anomaly Detection in Data Mining
Portfolio Theory of Information Retrieval
Data Trend Analysis by Assigning Polynomial Function For Given Data Set
Pca(principal components analysis)
The RuLIS approach to outliers (Marcello D'Orazio,FAO)
 
PCA (Principal component analysis)
5 6 Scatter Plots & Best Fit Lines
Scatter Plot
Scatter diagram and control chart
Data Configuration for Analysis with Matt Hansen at StatStuff
Stem & leaf, Bar graphs, and Histograms
Introduction concepts of Statistics
Ad

Similar to Business Analytics Foundation with R Tools - Part 3 (20)

PDF
Lecture9_Time_Series_2024_and_data_analysis (1).pdf
PPTX
Intro and Lesson 1 introduction to time series .pptx
PPTX
Time series analysis & forecasting-Day1.pptx
PPTX
Module 3 - Time Series.pptx
PPTX
Lesson 1 introduction_to_time_series
PDF
Similarity distance measures
PDF
Time series for yotube_1_data anlysis.pdf
DOCX
Time Series FORECASTING
PPT
Mba 532 2011_part_3_time_series_analysis
PPTX
Business Statistics_Time Series1232.pptx
PPTX
Time series Forecasting Time Series.pptx
PPTX
Presentation On Time Series Analysis in Mechine Learning
PDF
Module 5.pptx (Data science in engineering)
PDF
Data Science - Part X - Time Series Forecasting
DOCX
Time series analysis aids in decomposing a series into parts which.docx
PDF
lecture3.pdf
PDF
Time Series and Forecasting.pdf University Mathematics I: Olaniyi Evans
DOCX
TIME SERIES ANALYSIS.docx
PPTX
Time Series
PPTX
Time series
Lecture9_Time_Series_2024_and_data_analysis (1).pdf
Intro and Lesson 1 introduction to time series .pptx
Time series analysis & forecasting-Day1.pptx
Module 3 - Time Series.pptx
Lesson 1 introduction_to_time_series
Similarity distance measures
Time series for yotube_1_data anlysis.pdf
Time Series FORECASTING
Mba 532 2011_part_3_time_series_analysis
Business Statistics_Time Series1232.pptx
Time series Forecasting Time Series.pptx
Presentation On Time Series Analysis in Mechine Learning
Module 5.pptx (Data science in engineering)
Data Science - Part X - Time Series Forecasting
Time series analysis aids in decomposing a series into parts which.docx
lecture3.pdf
Time Series and Forecasting.pdf University Mathematics I: Olaniyi Evans
TIME SERIES ANALYSIS.docx
Time Series
Time series
Ad

More from Beamsync (6)

PPTX
Business Analytics Foundation with R tool - Part 5
PPTX
Basic Analytic Techniques - Using R Tool - Part 1
PPTX
Introduction to Business Analytics Course Part 10
PPTX
Introduction to Business Analytics Course Part 9
PPTX
Introduction to Business Analytics Course Part 7
PPTX
Introduction to Business Analytics Part 1
Business Analytics Foundation with R tool - Part 5
Basic Analytic Techniques - Using R Tool - Part 1
Introduction to Business Analytics Course Part 10
Introduction to Business Analytics Course Part 9
Introduction to Business Analytics Course Part 7
Introduction to Business Analytics Part 1

Recently uploaded (20)

PDF
Module 4: Burden of Disease Tutorial Slides S2 2025
PDF
STATICS OF THE RIGID BODIES Hibbelers.pdf
PPTX
human mycosis Human fungal infections are called human mycosis..pptx
PPTX
Pharmacology of Heart Failure /Pharmacotherapy of CHF
PDF
O7-L3 Supply Chain Operations - ICLT Program
PPTX
PPH.pptx obstetrics and gynecology in nursing
PDF
Classroom Observation Tools for Teachers
PDF
FourierSeries-QuestionsWithAnswers(Part-A).pdf
PPTX
IMMUNITY IMMUNITY refers to protection against infection, and the immune syst...
PDF
Basic Mud Logging Guide for educational purpose
PPTX
school management -TNTEU- B.Ed., Semester II Unit 1.pptx
PPTX
Introduction_to_Human_Anatomy_and_Physiology_for_B.Pharm.pptx
PPTX
Cell Structure & Organelles in detailed.
PPTX
GDM (1) (1).pptx small presentation for students
PDF
Insiders guide to clinical Medicine.pdf
PDF
ANTIBIOTICS.pptx.pdf………………… xxxxxxxxxxxxx
PDF
Computing-Curriculum for Schools in Ghana
PDF
Physiotherapy_for_Respiratory_and_Cardiac_Problems WEBBER.pdf
PDF
Black Hat USA 2025 - Micro ICS Summit - ICS/OT Threat Landscape
PDF
VCE English Exam - Section C Student Revision Booklet
Module 4: Burden of Disease Tutorial Slides S2 2025
STATICS OF THE RIGID BODIES Hibbelers.pdf
human mycosis Human fungal infections are called human mycosis..pptx
Pharmacology of Heart Failure /Pharmacotherapy of CHF
O7-L3 Supply Chain Operations - ICLT Program
PPH.pptx obstetrics and gynecology in nursing
Classroom Observation Tools for Teachers
FourierSeries-QuestionsWithAnswers(Part-A).pdf
IMMUNITY IMMUNITY refers to protection against infection, and the immune syst...
Basic Mud Logging Guide for educational purpose
school management -TNTEU- B.Ed., Semester II Unit 1.pptx
Introduction_to_Human_Anatomy_and_Physiology_for_B.Pharm.pptx
Cell Structure & Organelles in detailed.
GDM (1) (1).pptx small presentation for students
Insiders guide to clinical Medicine.pdf
ANTIBIOTICS.pptx.pdf………………… xxxxxxxxxxxxx
Computing-Curriculum for Schools in Ghana
Physiotherapy_for_Respiratory_and_Cardiac_Problems WEBBER.pdf
Black Hat USA 2025 - Micro ICS Summit - ICS/OT Threat Landscape
VCE English Exam - Section C Student Revision Booklet

Business Analytics Foundation with R Tools - Part 3

  • 1. BUSINESS ANALYTICS FOUNDATION WITH R TOOLS Lesson 4 - Predictive Modeling Techniques Copyright 2016,Beamsync, All rights reserved.
  • 2. • Hierarchical : Also known as nesting clusters as it also clusters to exist within bigger clusters to form a tree. • Partitioned clustering : Its simply a division of the set of data objects into non-overlapping clusters such that each object is in exactly one subset. • Exclusive clustering: They assign each object to a single cluster. • Overlapping clustering : Its used to reflect the fact that an object can simultaneously belong to more than one group. • Fuzzy clustering : Every object belongs to every cluster with a membership weight that goes between 0 – if it absolutely doesn’t belong to that cluster and 1 – if it absolutely belongs to the cluster. • Complete clustering : It performs a hierarchical cluster analysis using a set of dissimilarities on n objects that are being clustered. They tend to find compact clusters of an approximately equal diameters. TYPES OF CLUSTERING Copyright 2016,Beamsync, All rights reserved.
  • 3. • Well – separated : The distance between any two points in different groups is greater than the distance between any two points within a group. They need not be globular. • Prototype – based : The prototype of a cluster is often a centroid for data with continuous attributes. Such clusters tend to be globular. • Graph – based : When data is represented as a graph where nodes are the objects and links represent connection among the objects. They tend to be globular. • Density – based : This method is employed when the clusters are irregular and when noise and outliers are present. • Shared – property : Also known as conceptual clustering its the process of identifying the pattern in the clusters to successfully segregate into groups of clusters. TYPES OF CLUSTERS Copyright 2016,Beamsync, All rights reserved.
  • 4. • K – means : It’s a prototype based clustering technique that attempts to define the number of clusters (K). They are represented as centroids. • Agglomerative Hierarchical Clustering : It refers to a collection of closely related clustering techniques that produce a hierarchical clustering by starting with each point as singleton cluster and repeatedly merging the closest clusters until a single, all encompassing cluster remains. • DBSCAN : It’s a density based clustering algorithm that produces a partitioned clustering, in which number of clusters is automatically determined by the algorithm. METHODS TO FORM CLUSTERS. Copyright 2016,Beamsync, All rights reserved.
  • 5. • Time series data is an ordered sequence of observations on a quantitative variable measured over an equally spaced time interval. • Time series are used in statistics, signal processing, pattern recognition, econometrics, mathematic finance, weather forecasting, earthquake prediction electroencephalography, control engineering, astronomy , communications engineering and other places. • The ultimate objective of time series is –we want to predict the future value of variable under consideration. • Time series analysis is • a set of methods used for analyzing time series data • and forecasting the future value of the variable under consideration. • In time series analysis it is assumed that the data consist of set of identifiable components and random errors which usually makes the pattern difficult to identify. • E.g. Consider a store that sells blankets and quilts. Then the store will observe sale of the quilts to be maximum during the winter season. This case comes under seasonal variation. TIME SERIES Copyright 2016,Beamsync, All rights reserved.
  • 6. • Long term trend – The smooth long term direction of time series where the data can increase or decrease in some pattern. • Seasonal variation – Patterns of change in a time series within a year which tends to repeat every year. • Cyclical variation – Its much alike seasonal variation but the rise and fall of time series over periods are longer than one year. • Irregular variation – Any variation that is not explainable by any of the three above mentioned components. But that doesn’t explain the variations to be random. They can be classified into – stationary and non – stationary variation. • When the data neither increases nor decreases, i.e. its completely random its called stationary variation. • When the data has some explainable portion remaining and can be analyzed further then such case is called non – stationary variation. COMPONENTS OF TIME SERIES ANALYSIS Copyright 2016,Beamsync, All rights reserved.
  • 7. CYCLICAL VERSUS SEASONAL ANALYSIS Cyclical •Regular and predictable changes which recur over a period of time • Eg. Boosted sales due to inflation Seasonal •Changes recurring over a period of one year •Eg. Increased sale of blankets and quilts in winter Copyright 2016,Beamsync, All rights reserved.
  • 8. • Separating a time series into its constituent components, which are usually a trend component and an irregular component, and if it is a seasonal time series, a seasonal component. • Time series value a function of three components – • Trendcomponent, • Seasonal component • irregular or error component. • An additive decomposition model takes the following form: Yt = Trend + Seasonal + Irregular • A multiplicative model takes the following form: Yt = Trend X Seasonal XIrregular • A time series that has seasonal effects removed is called deseasonalised time series. DECOMPOSITION OF TIME SERIES Copyright 2016,Beamsync, All rights reserved.
  • 9. • Decompose() function for additive models • The function “decompose()” returns a list object as its result. • Here the estimates of the seasonal component, trend component and irregular component are stored in “seasonal”, “trend”, and “random” respectively. • Decompose()function • Estimates the trend component by moving average • Estimates seasonal component by averaging observations • Calculates the error component by subtracting the earlier values from the time series. • Works well on a complete time series data DECOMPOSITION OF SEASONAL AND TREND DATA Copyright 2016,Beamsync, All rights reserved.
  • 10. • Use MovingAverage • TTR package • SMA Smooth time series using a simple moving average • EMA Exponential Smoothing • WMA Weighted moving average • Touse the SMA() function, we need to specify the order (span) of the simple moving average, using the parameter “n”. • For example, to calculate a simple moving average of order 5, we set n=5 in the SMA() function. DECOMPOSING NON-SEASONAL TIME SERIES Copyright 2016,Beamsync, All rights reserved.
  • 11. Thank You Beamsync is top training institute for business analytics course in Bangalore. Beamsync is also providing certification along with training course. For more details click here: http://beamsync.com/business-analytics-training-bangalore/ Copyright 2016,Beamsync, All rights reserved.