SlideShare a Scribd company logo
1 I NAME OF PRESENTER
Periodic Pattern Mining in
Time Series Databases
Ashis Kumar Chanda
Swapnil Saha
Department of Computer Science and Engineering
University of Dhaka
2 I NAME OF PRESENTERCSE, DU2
Introduction
Key Terms
Suffix Tree Generation
Conclusion
>
>
>
Time Series Database>
Periodic Pattern Detection
>
Topics to be covered
>
3 I NAME OF PRESENTERCSE, DU3
Introduction
What is a time-series database?
A time-series database consists of
sequences of values or events obtained
over repeated measurements of time
A fixed time intervals (e.g., hourly, daily,
weekly).
A time series is a set of observation taken at
specified times
A time series involving a variable Y
If a time series is defined by y1, y2, y3 ...
Values at times t1, t2, t3 ... Then we can
write a function of time Y=F(t)
4
 Long term movements
 Cyclic movements
 Seasonal movements
 Irregular or random movements
We can define each movements as L, C, S, I
variables respectively
And Time series variables Y = L+C+S+I
or Y = L*C*S*I
5
 Symbol periodicity
axy apq amn
 Sequence periodicity
abxy abpq abmn
 Segment periodicity
abxy abxy abxy
6
 Perfect Periodicity
abxy abpq abmn
abxy acpq abmn
Here conf( 4,0, ab)= 2/3 = 0.67
7
 Periodicity in Subsection of a Time Series
T= gbxy asdf abpq abmn
Stpos = 8
endPos= 15
So, Subsection part gbxy asdf abpq abmn
8
 Periodicity with Time Tolerance
We can’t get always noise free time series data
So we check some more bit then our target
sequence
This extra bit is known as time tolerance (tt)
If X is a pattern of p length in T then we check
At stPos, stPos+p±tt, stPos+2p±tt . . . ..
9
 A period in a time series may be represented
by 5 tuple
( S, p, stPos, endPos, Conf)
S = sequence of periodic pattern
p = check pattern after p num of char
Conf= confidence
stPos, endPos is the starting and ending
position of segment where match pattern
10
 Suppose, T= abxy acpq abdd abmn
then ( ab, 4, 0, 11, 1) means
Find ab pattern in T from 0 position to 11
postion affter 4 char
a b x y a c p q a b d d abmn
0 1 2 3 4 5 6 7 8 9 10 11
11
Occurrence Vector:
a b c a b b a b b a $
0 1 2 3 4 5 6 7 8 9
Occurrence vector of a : (0 3 6 9)
Occurrence vector of ab : (0 3 6)
12
Difference Vector:
a b c a b b a b b a $
0 1 2 3 4 5 6 7 8 9
Occurrence vector of a : 0 3
Difference vector : 3
Occurrence vector of bb : 4 7
Difference vetor : 3
13
How to get a string format from
a Transactional database?
14
Discretization Technique
15
We need to define a range or group from DB
and characterized each range by a unique
ASCII character
Suppose,
In our previous example,
log in defined by a
log out ,, x
before log in ,, b
before log out ,, c
after log out ,, d
16
17
18
accx acxd axdd bacx
 ‘abcabbaabb$’ has following ten suffixes. We
can ignore the 10th suffix when generating
suffix tree
1. abcabbabb$
2. bcabbabb$
3. cabbabb$
4. abbabb$
5. bbabb$
6. babb$
7. abb$
8. bb$
9. b$
10. $
19
 Strings:
1. abcabbabb$
20
a
b
a
c
b
b
a
b
b
$
 Strings:
1. abcabbabb$
2. bcabbabb$
21
a
b
a
c
b
b
a
b
b
$
b
c
b
a
b
$
a
b
b
 Strings:
1. abcabbabb$
2. bcabbabb$
3. cabbabb$
22
a
b
a
c
b
b
a
b
b
$
b
c
b
a
b
$
a
b
b
c
b
a
b
$
a
b
b
 Strings:
1. abcabbabb$
2. bcabbabb$
3. cabbabb$
4. abbabb$
23
a
b
b
c
b
a
b
$
a
b
b
c
b
a
b
$
a
b
b
a
c
b
b
a
b
b
$
 Strings:
1. abcabbabb$
2. bcabbabb$
3. cabbabb$
4. abbabb$
24
a
b
b
c
b
a
b
$
a
b
b
c
b
a
b
$
a
b
b
a
c
b
b
a
b
b
$
b
a
b
b
$
 Strings:
1. abcabbabb$
2. bcabbabb$
3. cabbabb$
4. abbabb$
5. bbabb$
25
a
b b
c
b
a
b
$
a
b
b
a
c
b
b
a
b
b
$
b
a
b
b
$
c
b
a
b
$
a
b
b
b
a
b
b
$
 Strings:
1. abcabbabb$
2. bcabbabb$
3. cabbabb$
4. abbabb$
5. bbabb$
6. babb$
26
a
b b
c
b
a
b
$
a
b
b
a
c
b
b
a
b
b
$
b
a
b
b
$
c
b
a
b
$
a
b
b
b
a
b
b
$
a b
b
$
 Strings:
1. abcabbabb$
2. bcabbabb$
3. cabbabb$
4. abbabb$
5. bbabb$
6. babb$
7. abb$
27
a
b b
c
b
a
b
$
a
b
b
a
c
b
b
a
b
b
$
b
a
b
b
$
c
b
a
b
$
a
b
b
b
a
b
b
$
a b
b
$
$
 Strings:
1. abcabbabb$
2. bcabbabb$
3. cabbabb$
4. abbabb$
5. bbabb$
6. babb$
7. abb$
8. bb$
28
a
b b
c
b
a
b
$
a
b
b
a
c
b
b
a
b
b
$
b
a
b
b
$
c
b
a
b
$
a
b
b
b
a
a b
b
$
$
b
b
$
$
 Strings:
1. abcabbabb$
2. bcabbabb$
3. cabbabb$
4. abbabb$
5. bbabb$
6. babb$
7. abb$
8. bb$
9. b$
29
a
b b
c
b
a
b
$
a
b
b
a
c
b
b
a
b
b
$
b
a
b
b
$
c
b
a
b
$
a
b
b
b
a
a b
b
$
$
b
b
$
$
$
abcabbabb$
Edge leaf node holds
a number that represents
starting position
of the suffix
Each intermediate node holds a number which
is the length of the substring read from root
to the intermediate node
30
0
a
b
1
b
c
b
a
b
$
a
b
b
2a
c
b
b
a
b
b
$
2
6
b
a
b
b
$
c
b
a
b
$
a
b
b
1
4
b
a
5
a b
b
$
3
$
3
b
b
$
2
7
$
$
8
abcabbabb$
Find Occrrence Vector
31
0
a
b
1
b
c
b
a
b
$
a
b
b
2a
c
b
b
a
b
b
$
2
6
b
a
b
b
$
c
b
a
b
$
a
b
b
1
4
b
a
5
a b
b
$
3
$
3
b
b
$
2
7
$
$
8(3,6)
abcabbabb$
Find Occrrence Vector
32
0
a
b
1
b
c
b
a
b
$
a
b
b
2a
c
b
b
a
b
b
$
2
6
b
a
b
b
$
c
b
a
b
$
a
b
b
1
4
b
a
5
a b
b
$
3
$
3
b
b
$
2
7
$
$
8(3,6)
(0,3,6)
abcabbabb$
Find Occrrence Vector
33
0
a
b
1
b
c
b
a
b
$
a
b
b
2a
c
b
b
a
b
b
$
2
6
b
a
b
b
$
c
b
a
b
$
a
b
b
1
4
b
a
5
a b
b
$
3
$
3
b
b
$
2
7
$
$
8(3,6)
(0,3,6)
(4,7)
(1,5,8,4,7)
Input: a time series of Size n
Output: Positions of periodic patterns
Process:
for each occurrence vector of size k
find p
for 0 to k
check each position after p char
count confidence
add to list if greater than threshold
34
abcabbabb$
ab - (0,3,6)
abb - (3,6)
bb - (4,7)
b - (1,5,8,4,7)
35
stpos= 0
endPos= 6
P= 3-0 = 3
Now check occurrence vector of ab
if difference equal p
count increment
Check confidence
Add to pattern list if confidence >= Θ
abcdabcabcab$
ab - (0,4,7,10)
36
stpos= 0
endPos= 10
P= 4-0 = 4
Now check occurrence vector of ab
if difference equal p
count increment
Only one pattern get 0 to 10 with p=4
abcdabcabcab$
abcdabcabcab$
ab - (0,4,7,10)
37
stpos= 4
endPos= 10
P= 7-4 = 3
Now check occurrence vector of ab
if difference equal p
count increment
3 pattern get 4 to 10 with p=3
abcdabcabcab$
38
- Elfeky proposed two separate algorithms to
detect symbol & segment periodicity. (CONV)
& (WARP)
But it not used in sub-sequence & complexity
O(nlogn) & O(n^2)
- Han’s parper algorithm used in sub-sequence
But it need user input
39
- In this perspective, The algorithm discussed
here is better than previous
- Complexity O(nlogn)
- Works online
40
41 I NAME OF PRESENTERCSE, DU41
References
- Periodic pattern mining using suffix tree
by Rasheed, Al-Shalalfa, & Alhajj, 2011
- Effective periodic pattern mining in time series database
by Nishi, Farhan, Samiullah, Jeong
- Data Mining Concepts & Techniques
by J. Han & M. Kamber
- Database system Concept
by Abraham Sillberschatz, Korth, Sudarshan
42 I NAME OF PRESENTERCSE, DU42
Questions
43 I NAME OF PRESENTERCSE, DU43
Thank You

More Related Content

PDF
Lambda Calculus
PPTX
Introduction to the lambda calculus
PPT
Headerfiles
PPTX
Calculus- Basics
PDF
GeoGebra JavaScript CheatSheet
PDF
6. binary tree
PDF
2Bytesprog2 course_2014_c5_pointers
PPT
Datacompression1
Lambda Calculus
Introduction to the lambda calculus
Headerfiles
Calculus- Basics
GeoGebra JavaScript CheatSheet
6. binary tree
2Bytesprog2 course_2014_c5_pointers
Datacompression1

What's hot (14)

PDF
Constraint Programming in Haskell
PDF
Reg ex cheatsheet
PPTX
Array, string and pointer
PDF
CAPS_Discipline_Training
PPTX
Derivatives in graphing-dfs
PDF
Faster Python, FOSDEM
PPTX
Variables 2
PDF
Compiling fµn language
PPT
When youseeab
PPTX
Programming for engineers in python
PDF
Runtime Monitoring of Stream Logic Formulae (Talk @ FPS 2015)
DOCX
Type header file in c++ and its function
PDF
Activity Recognition Through Complex Event Processing: First Findings
PPTX
Radical functions
Constraint Programming in Haskell
Reg ex cheatsheet
Array, string and pointer
CAPS_Discipline_Training
Derivatives in graphing-dfs
Faster Python, FOSDEM
Variables 2
Compiling fµn language
When youseeab
Programming for engineers in python
Runtime Monitoring of Stream Logic Formulae (Talk @ FPS 2015)
Type header file in c++ and its function
Activity Recognition Through Complex Event Processing: First Findings
Radical functions
Ad

Viewers also liked (20)

PPTX
FPPM algorithm
PDF
Anomaly Detection @Twitter
PDF
Analysis of Various Periodicity Detection Algorithms in Time Series Data with...
PPTX
A Framework for Periodic Outlier Pattern Detection in Time-Series Sequences
PPT
Page rank
PPT
Data.Mining.C.8(Ii).Web Mining 570802461
PPTX
Mis(data mning)
PPT
Rule discovery from time series
PPTX
Page rank and hyperlink
PPTX
Survival of the Fittest: Using Genetic Algorithm for Data Mining Optimization
PPTX
Sequential pattern mining
PPT
Introduction to text mining
PPT
5.2 mining time series data
PPTX
Web mining (structure mining)
PPT
5.3 mining sequential patterns
PPTX
A review of machine learning based anomaly detection
PPTX
Text mining
PDF
Genetic Algorithms Made Easy
PPT
Regression
PPT
Big Data & Text Mining
FPPM algorithm
Anomaly Detection @Twitter
Analysis of Various Periodicity Detection Algorithms in Time Series Data with...
A Framework for Periodic Outlier Pattern Detection in Time-Series Sequences
Page rank
Data.Mining.C.8(Ii).Web Mining 570802461
Mis(data mning)
Rule discovery from time series
Page rank and hyperlink
Survival of the Fittest: Using Genetic Algorithm for Data Mining Optimization
Sequential pattern mining
Introduction to text mining
5.2 mining time series data
Web mining (structure mining)
5.3 mining sequential patterns
A review of machine learning based anomaly detection
Text mining
Genetic Algorithms Made Easy
Regression
Big Data & Text Mining
Ad

Similar to Periodic pattern mining (20)

PPTX
An efficient approach to mine flexible periodic patterns in time series datab...
PDF
Pattern matching programs
PPT
W9Presentation.ppt
PPTX
String matching algorithms(knuth morris-pratt)
PPTX
String-Matching algorithms KNuth-Morri-Pratt.pptx
PDF
C034011016
PPT
String matching algorithm
PDF
An Index Based K-Partitions Multiple Pattern Matching Algorithm
PPT
KMP Pattern Matching algorithm
PPTX
Knuth morris pratt string matching algo
PDF
Agrhwoowheh3hwjoeorhehehwjeoeoeooekekekekkekee
PDF
lecture13.pdfhejejejejekkeejejejejejejejej
PPTX
Searching.pptx
PDF
MMP-TREE FOR SEQUENTIAL PATTERN MINING WITH MULTIPLE MINIMUM SUPPORTS IN PROG...
PPTX
Daa unit 5
PPTX
Data structures notes for college students btech.pptx
PDF
Count-min sketch to Infinity.pdf
PPTX
4 sequential pattern mining
PPTX
Time series data mining techniques
An efficient approach to mine flexible periodic patterns in time series datab...
Pattern matching programs
W9Presentation.ppt
String matching algorithms(knuth morris-pratt)
String-Matching algorithms KNuth-Morri-Pratt.pptx
C034011016
String matching algorithm
An Index Based K-Partitions Multiple Pattern Matching Algorithm
KMP Pattern Matching algorithm
Knuth morris pratt string matching algo
Agrhwoowheh3hwjoeorhehehwjeoeoeooekekekekkekee
lecture13.pdfhejejejejekkeejejejejejejejej
Searching.pptx
MMP-TREE FOR SEQUENTIAL PATTERN MINING WITH MULTIPLE MINIMUM SUPPORTS IN PROG...
Daa unit 5
Data structures notes for college students btech.pptx
Count-min sketch to Infinity.pdf
4 sequential pattern mining
Time series data mining techniques

More from Ashis Kumar Chanda (20)

PPT
Word 2 vector
PPTX
Multi-class Image Classification using deep convolutional networks on extreme...
PPT
Full resolution image compression with recurrent neural networks
PPT
Understanding Natural Language Queries over Relational Databases
PPTX
03. Agile Development
PPT
Software Cost Estimation
PPT
Risk Management
PPT
Project Management
PPTX
Requirements engineering
PPT
2. Software process
PPT
1. Introduction
PDF
Secure software design
PPT
Sequential logic circuit optimization
PPT
Introduction to CS
PPT
Iterative deepening search
PPTX
PPTX
Linear Machine Decision Tree
PPT
Logical Operations on BDD
Word 2 vector
Multi-class Image Classification using deep convolutional networks on extreme...
Full resolution image compression with recurrent neural networks
Understanding Natural Language Queries over Relational Databases
03. Agile Development
Software Cost Estimation
Risk Management
Project Management
Requirements engineering
2. Software process
1. Introduction
Secure software design
Sequential logic circuit optimization
Introduction to CS
Iterative deepening search
Linear Machine Decision Tree
Logical Operations on BDD

Recently uploaded (20)

PDF
null (2) bgfbg bfgb bfgb fbfg bfbgf b.pdf
PDF
BIO-INSPIRED HORMONAL MODULATION AND ADAPTIVE ORCHESTRATION IN S-AI-GPT
PDF
Artificial Superintelligence (ASI) Alliance Vision Paper.pdf
PDF
distributed database system" (DDBS) is often used to refer to both the distri...
PDF
Soil Improvement Techniques Note - Rabbi
PDF
COURSE DESCRIPTOR OF SURVEYING R24 SYLLABUS
PPT
Occupational Health and Safety Management System
PPTX
Artificial Intelligence
PDF
Categorization of Factors Affecting Classification Algorithms Selection
PDF
A SYSTEMATIC REVIEW OF APPLICATIONS IN FRAUD DETECTION
PPTX
communication and presentation skills 01
PPTX
Information Storage and Retrieval Techniques Unit III
PPTX
introduction to high performance computing
PPTX
Feature types and data preprocessing steps
PDF
BIO-INSPIRED ARCHITECTURE FOR PARSIMONIOUS CONVERSATIONAL INTELLIGENCE : THE ...
PPTX
Fundamentals of safety and accident prevention -final (1).pptx
PPTX
Current and future trends in Computer Vision.pptx
PPTX
Module 8- Technological and Communication Skills.pptx
PDF
Visual Aids for Exploratory Data Analysis.pdf
PDF
Influence of Green Infrastructure on Residents’ Endorsement of the New Ecolog...
null (2) bgfbg bfgb bfgb fbfg bfbgf b.pdf
BIO-INSPIRED HORMONAL MODULATION AND ADAPTIVE ORCHESTRATION IN S-AI-GPT
Artificial Superintelligence (ASI) Alliance Vision Paper.pdf
distributed database system" (DDBS) is often used to refer to both the distri...
Soil Improvement Techniques Note - Rabbi
COURSE DESCRIPTOR OF SURVEYING R24 SYLLABUS
Occupational Health and Safety Management System
Artificial Intelligence
Categorization of Factors Affecting Classification Algorithms Selection
A SYSTEMATIC REVIEW OF APPLICATIONS IN FRAUD DETECTION
communication and presentation skills 01
Information Storage and Retrieval Techniques Unit III
introduction to high performance computing
Feature types and data preprocessing steps
BIO-INSPIRED ARCHITECTURE FOR PARSIMONIOUS CONVERSATIONAL INTELLIGENCE : THE ...
Fundamentals of safety and accident prevention -final (1).pptx
Current and future trends in Computer Vision.pptx
Module 8- Technological and Communication Skills.pptx
Visual Aids for Exploratory Data Analysis.pdf
Influence of Green Infrastructure on Residents’ Endorsement of the New Ecolog...

Periodic pattern mining

  • 1. 1 I NAME OF PRESENTER Periodic Pattern Mining in Time Series Databases Ashis Kumar Chanda Swapnil Saha Department of Computer Science and Engineering University of Dhaka
  • 2. 2 I NAME OF PRESENTERCSE, DU2 Introduction Key Terms Suffix Tree Generation Conclusion > > > Time Series Database> Periodic Pattern Detection > Topics to be covered >
  • 3. 3 I NAME OF PRESENTERCSE, DU3 Introduction What is a time-series database? A time-series database consists of sequences of values or events obtained over repeated measurements of time A fixed time intervals (e.g., hourly, daily, weekly).
  • 4. A time series is a set of observation taken at specified times A time series involving a variable Y If a time series is defined by y1, y2, y3 ... Values at times t1, t2, t3 ... Then we can write a function of time Y=F(t) 4
  • 5.  Long term movements  Cyclic movements  Seasonal movements  Irregular or random movements We can define each movements as L, C, S, I variables respectively And Time series variables Y = L+C+S+I or Y = L*C*S*I 5
  • 6.  Symbol periodicity axy apq amn  Sequence periodicity abxy abpq abmn  Segment periodicity abxy abxy abxy 6
  • 7.  Perfect Periodicity abxy abpq abmn abxy acpq abmn Here conf( 4,0, ab)= 2/3 = 0.67 7
  • 8.  Periodicity in Subsection of a Time Series T= gbxy asdf abpq abmn Stpos = 8 endPos= 15 So, Subsection part gbxy asdf abpq abmn 8
  • 9.  Periodicity with Time Tolerance We can’t get always noise free time series data So we check some more bit then our target sequence This extra bit is known as time tolerance (tt) If X is a pattern of p length in T then we check At stPos, stPos+p±tt, stPos+2p±tt . . . .. 9
  • 10.  A period in a time series may be represented by 5 tuple ( S, p, stPos, endPos, Conf) S = sequence of periodic pattern p = check pattern after p num of char Conf= confidence stPos, endPos is the starting and ending position of segment where match pattern 10
  • 11.  Suppose, T= abxy acpq abdd abmn then ( ab, 4, 0, 11, 1) means Find ab pattern in T from 0 position to 11 postion affter 4 char a b x y a c p q a b d d abmn 0 1 2 3 4 5 6 7 8 9 10 11 11
  • 12. Occurrence Vector: a b c a b b a b b a $ 0 1 2 3 4 5 6 7 8 9 Occurrence vector of a : (0 3 6 9) Occurrence vector of ab : (0 3 6) 12
  • 13. Difference Vector: a b c a b b a b b a $ 0 1 2 3 4 5 6 7 8 9 Occurrence vector of a : 0 3 Difference vector : 3 Occurrence vector of bb : 4 7 Difference vetor : 3 13
  • 14. How to get a string format from a Transactional database? 14 Discretization Technique
  • 15. 15
  • 16. We need to define a range or group from DB and characterized each range by a unique ASCII character Suppose, In our previous example, log in defined by a log out ,, x before log in ,, b before log out ,, c after log out ,, d 16
  • 17. 17
  • 19.  ‘abcabbaabb$’ has following ten suffixes. We can ignore the 10th suffix when generating suffix tree 1. abcabbabb$ 2. bcabbabb$ 3. cabbabb$ 4. abbabb$ 5. bbabb$ 6. babb$ 7. abb$ 8. bb$ 9. b$ 10. $ 19
  • 21.  Strings: 1. abcabbabb$ 2. bcabbabb$ 21 a b a c b b a b b $ b c b a b $ a b b
  • 22.  Strings: 1. abcabbabb$ 2. bcabbabb$ 3. cabbabb$ 22 a b a c b b a b b $ b c b a b $ a b b c b a b $ a b b
  • 23.  Strings: 1. abcabbabb$ 2. bcabbabb$ 3. cabbabb$ 4. abbabb$ 23 a b b c b a b $ a b b c b a b $ a b b a c b b a b b $
  • 24.  Strings: 1. abcabbabb$ 2. bcabbabb$ 3. cabbabb$ 4. abbabb$ 24 a b b c b a b $ a b b c b a b $ a b b a c b b a b b $ b a b b $
  • 25.  Strings: 1. abcabbabb$ 2. bcabbabb$ 3. cabbabb$ 4. abbabb$ 5. bbabb$ 25 a b b c b a b $ a b b a c b b a b b $ b a b b $ c b a b $ a b b b a b b $
  • 26.  Strings: 1. abcabbabb$ 2. bcabbabb$ 3. cabbabb$ 4. abbabb$ 5. bbabb$ 6. babb$ 26 a b b c b a b $ a b b a c b b a b b $ b a b b $ c b a b $ a b b b a b b $ a b b $
  • 27.  Strings: 1. abcabbabb$ 2. bcabbabb$ 3. cabbabb$ 4. abbabb$ 5. bbabb$ 6. babb$ 7. abb$ 27 a b b c b a b $ a b b a c b b a b b $ b a b b $ c b a b $ a b b b a b b $ a b b $ $
  • 28.  Strings: 1. abcabbabb$ 2. bcabbabb$ 3. cabbabb$ 4. abbabb$ 5. bbabb$ 6. babb$ 7. abb$ 8. bb$ 28 a b b c b a b $ a b b a c b b a b b $ b a b b $ c b a b $ a b b b a a b b $ $ b b $ $
  • 29.  Strings: 1. abcabbabb$ 2. bcabbabb$ 3. cabbabb$ 4. abbabb$ 5. bbabb$ 6. babb$ 7. abb$ 8. bb$ 9. b$ 29 a b b c b a b $ a b b a c b b a b b $ b a b b $ c b a b $ a b b b a a b b $ $ b b $ $ $
  • 30. abcabbabb$ Edge leaf node holds a number that represents starting position of the suffix Each intermediate node holds a number which is the length of the substring read from root to the intermediate node 30 0 a b 1 b c b a b $ a b b 2a c b b a b b $ 2 6 b a b b $ c b a b $ a b b 1 4 b a 5 a b b $ 3 $ 3 b b $ 2 7 $ $ 8
  • 34. Input: a time series of Size n Output: Positions of periodic patterns Process: for each occurrence vector of size k find p for 0 to k check each position after p char count confidence add to list if greater than threshold 34
  • 35. abcabbabb$ ab - (0,3,6) abb - (3,6) bb - (4,7) b - (1,5,8,4,7) 35 stpos= 0 endPos= 6 P= 3-0 = 3 Now check occurrence vector of ab if difference equal p count increment Check confidence Add to pattern list if confidence >= Θ
  • 36. abcdabcabcab$ ab - (0,4,7,10) 36 stpos= 0 endPos= 10 P= 4-0 = 4 Now check occurrence vector of ab if difference equal p count increment Only one pattern get 0 to 10 with p=4 abcdabcabcab$
  • 37. abcdabcabcab$ ab - (0,4,7,10) 37 stpos= 4 endPos= 10 P= 7-4 = 3 Now check occurrence vector of ab if difference equal p count increment 3 pattern get 4 to 10 with p=3 abcdabcabcab$
  • 38. 38
  • 39. - Elfeky proposed two separate algorithms to detect symbol & segment periodicity. (CONV) & (WARP) But it not used in sub-sequence & complexity O(nlogn) & O(n^2) - Han’s parper algorithm used in sub-sequence But it need user input 39
  • 40. - In this perspective, The algorithm discussed here is better than previous - Complexity O(nlogn) - Works online 40
  • 41. 41 I NAME OF PRESENTERCSE, DU41 References - Periodic pattern mining using suffix tree by Rasheed, Al-Shalalfa, & Alhajj, 2011 - Effective periodic pattern mining in time series database by Nishi, Farhan, Samiullah, Jeong - Data Mining Concepts & Techniques by J. Han & M. Kamber - Database system Concept by Abraham Sillberschatz, Korth, Sudarshan
  • 42. 42 I NAME OF PRESENTERCSE, DU42 Questions
  • 43. 43 I NAME OF PRESENTERCSE, DU43 Thank You