SlideShare a Scribd company logo
Theories and Applications of Spatial-Temporal Data Mining and Knowledge Discovery Yee Leung Email:  [email_address]   Department of Geography  and Resource Management  The Chinese University of Hong Kong
 
 
a) b)
a) b)
 
 
 
Daily rainfall data of two stations in Pearl River basin of China
 
The monthly sunspot time series.
The Portuguese Stock Index PSI-20 evolution from 1993 to 2002 (adopted from J.A.O. Matos et al. / Physica A 342 (2004) 665 – 676)
Outbreak of Avian Flu in different regions
 
What are the structures and processes hidden in spatial data? What are the Concepts hidden in the  information system? Do the Concepts form a knowledge  structure?
Typhoon Tracks Adapted from Wang and Chan
Typhoon/Hurricane Tracking Objective:  Intensity, track (land falling, recurvature) Object: The space-time track of unusually low sea-     surface air pressure in the x-y-z plane Data: potential temperature, horizontal velocity,    vertical velocity, relative humidity,  horizontal    wind, etc Data: Hundreds and thousands of gigabytes within a    specific time interval
 
 
 
Data Mining in Hyperspectral Images 1. Objective Classification, Pattern Recognition 2. Object Spectral Signatures of Objects   3. Data Spectral, Non-spectral Data 4. Data Volume e.g. : AVIRIS : from 0.4 to 2.45 micrometers, 224 bands   HYDICE : from 0.4 to 2.5 micrometers, 210 bands   Hyperion : from 0.4 to 2.5 micrometers, 220 bands,   30 meter resolution
The Objective of Knowledge Discovery and Data Mining Fayyad:  The discovery of non-trivial, novel,    potentially useful and interpretable   knowledge/information from data Data  Information  Knowledge  Decision
Characteristics of Spatial Data   1.   Voluminous 2.   Sparse 3.   Diversity 4.   Complex 5.   Dynamic 6.   Redundant 7.   Imperfect (random , fuzzy , granular ,  incomplete , noisy)  8.   Multi-scale
Main Tasks of Spatial Knowledge Discovery and Data Mining   1. Clustering   3. Association 2. Classification Spatial Relations Temporal Relations Spatial-temporal Relations *  In particular : the local-global issue 4. Processes
CLUSTERING The Scale-Space Filtering Approach The Regression-Classes Decomposition Approach
Scale Space Theory Given a primary   image  f (x) at a distance of  σ   from human eyes, the observed blur red  image   f (x,  σ ) can be mathematically determined by the following partial differential   equation : The solution of the above equation is explicitly expressed as where ‘∗’ denotes the convolution operation, g   (x,  σ  ) is the Gaussian function
If the training samples are treated as an imaginary image with expression: Then the corresponding blurred image  f (x,  σ ,   D l ) at scale   σ  can be specified by
Essentials of Clustering by  Scale-space Filtering 1.   Visual system simulation 2.   Cluster validity check 3.   Clustering validity check 4.   Relevant concepts   (a) life time of a cluster (b) life time of a clustering (c) compactness (d) isolatedness
 
 
 
 
 
 
 
 
 
 
 
 
Ms-time plot of clustering results for earthquakes (Ms≥6):  a) 3 clusters in the 59th~95th scale range; b) 17 clusters at the 6th scale step  a) b)
Temporal segmentation of Strong Earthquakes (Ms≥6.0) of 1290A.D. - 2000A.D.   Scale-space for earthquakes (Ms≥6)
Indices of clustering along the time scale for earthquakes (Ms≥6.0):  a) number of clusters; b) Lifetime, Isolation and Compactness of the clustering a) b)
a) b) Ms-time plot of clustering results for earthquakes (Ms≥4.7):  a) 2 clusters in the 74th~112th scale range; b) 18 clusters at the 10th scale step
Temporal Segmentation of Strong Earthquakes (Ms≥4.7) of 1484A.D. - 2000A.D. Indices of clustering along the time scale for earthquakes (Ms≥4.7):  a) number of clusters ( The vertical axis just shows the part no larger than 150 );  b) Lifetime, Isolation and Compactness of the clustering a) b)
Table 1   Seismic active periods and episodes obtained by the clustering algorithm and the seismologists ( The number in parentheses is the number of earthquakes in the cluster )
 
Advantages of Scale-space Filtering Free from solving global optimization problem Independent of initialization Robust Outliers Detection Generalization of scale-related algorithms Consistent with visual system
5. Scale Space Clustering Scale-Space Filtering for Simulated Data
5. Scale Space Clustering Scale-Space Filtering for Remote-Sensing Data Clustering Tree Quasi-Light
Clustering by Regression-Classes Decomposition Method
Simple Gaussian Class
Linear Structure
Identification of line objects in remotely sensed data
Ellipsoidal Structure
 
Two ellipsoidal feature extraction
General Curvilinear Structure
Complex Shape Structure
ANALYSIS OF SPATIAL RELATIONSHIP Global Description Moran’s I Geary’s c OLR Local Description Local Moran’s I Local Geary’s c G Statistic Geographically Weighted Regression Mixture Distribution
Geographically Weighted Regression Hypothesis testing 1.  Ho: No difference between OLR and GWR 2.  Ho: a 1k  = a 2k  = … = a nk
 
 
(Regression-Classes Decomposition Method)
CLASSIFICATION The Neural Network Approach The Classification and Regression Tree The Statistical Classifiers
Information Extraction and Classification Neural Networks for Classification--MLP-BP
Some Typical Feedforward Neural Networks  Perception   In late 1950s: layered feed forward networks named perceptron. Today:  Perceptron : single-layer, feed-forward networks. See Fig. 8, each multi-output unit  O  is fed  independently  from the input units. Figure 8. Perceptrons
Mulitlayer Feedforward Neural Network   Learning algorithms for multilayer networks are neither efficient nor guaranteed to converge to  global optimum . Most popular learning method:  back-propagation . Back-propagation learning   The restaurant problem: use a 2-layer network, 10 attributes = 10 input units, 4 hidden units. See Fig. 13. Some Typical Feedforward Neural Networks (con ’ t) Fig. 13. A 2-layer feedforward network for the restaurant problem.
 
 
 
 
 
 
 
Competitive Pattern Recognition by Recurrent NN
 
Typhoon Tracks Adapted from Wang and Chan
Trees by Classification and Regression Tree (CART)  MSW 6/12/18: Maximum Sustained Wind of TC 6/12/18 hours before recurvature.  0: Recurve,1: Straight
1. If MSW of a TC is smaller than or equal to 34 m/s and MSW of that TC is smaller than 2 m/s 6 hours later, then the TC will recurve in 12 hours with 96% accuracy. 2. If MSW of a TC is smaller than or equal to 34 m/s and MSW of that TC is larger than 2 m/s 6 hours later, the TC will move straight in 12 hours with 86.8% accuracy.  3. If MSW of a TC is larger than 34 m/s, it will recurve in 18 hours with 94.1% accuracy. Rules by CART
DISCOVERY OF TEMPORAL PROCESSES The Multifractal Approach Conventional Time Series Analyses
Mining of Scaling Behavior by Multifractal Analysis
Multiplicative Cascade An approach to the study of scaling behavior with multiple scales (granules). Multiplicative Binomial Cascade
Schematic representation of cascade (adopted from Puente and Lopez, 1995, Physical Letters A)
 
TEMPORAL ANALYSIS Linear Time Invariant System    Self Similarity    Multiscaling    Infinitely Divisible Cascade Stationary Process Non-stationary Process Random Walk Fractional Brownian Motion, fmb Multifractal Analysis of Stochastic Trends
The Multifractal Approach Establish a data model for stochastic time series Discovery of relevant models in stochastic time series
MF-DFA Detrended fluctuation analysis (DFA) is a method for detecting the long-range correlation and fractal property in the both stationary and non-stationary time series.  MF-DFA , which is based on DFA,   can give full description of more complicated scaling behavior of time series
MF-DFA Given a time series  with length  N. Step1:    i=1,2,…,N; Step2: Divide  Y(i)  into    non-overlapping segments of equal lengths  s.  In order not to disregard this part of the series, the same procedure is repeated starting from the opposite end. Thereby, 2 Ns  segments are obtained altogether.
MF-DFA  Step 3 . Calculate the local trend for each of the 2 N s   segments by a least squares fit of the series. Then determine the variance for each segment  ν ,  ν  = 1 , . . .,N s , and for  ν  =  N s   + 1 , . . . ,  2 N s . Here,  is the fitting polynomial in segment  ν,  whose order  m  can be 1, 2, 3  …  .  Step 4 . Average over all segments to obtain the  q th-order fluctuation function, defined Where  ,  s ≥ m  + 2.
MF-DFA Step5: Determine the scaling behavior of the fluctuation function  by analyzing log-log plots of  Fq(s)  versus  s  for each value of  q . If we have   , for large values of  s,  we get the exponent  h(q),  which may depend on  q  generally. H=h(q=2),   for stationary time series; H=h(q=2)-1,  for non-stationary time series.
MF-DFA For MF-DFA, if  h(q)  is constant for all  q , the corresponding time series is mono-fractal. However, if  h(q)  varies with  q,  that means multifractal.
( adopted from  Peng et. al., 1994 )
 
 
 
 
 
 
 
 
Daily rainfall data of two stations in Pearl River basin of China
Log-log plots of  F q  (s)  versus  s  for the daily rainfall time series  of station 56691 in Pearl River basin (left) and Station Chuantang in East River basin (right) with  q =2.
The  h ( q ) curves of daily rainfall time series of stations in the Pearl River basin (left) and stations in the East River basin (right).
The  curves of daily rainfall time series of stations in the Pearl River basin (left) and stations in the East River basin (right).
The  curves of daily rainfall time series of stations in the Pearl River basin (left) and stations in the East River basin (right)
The  curves of daily rainfall time series of 5 stations in the Pearl River basin
The  curves of daily rainfall time series of stations in the Pearl River basin (left) and stations in the East River basin (right).
The  curves of daily rainfall time series of stations in the Pearl River basin (left) and stations in the East River basin (right). The real lines are their cascade model fitting.
The correlation relationship between the altitude of the rainfall stations in the East River basin and the  D (2) value of the rainfall time series.
Elevation of rainfall stations in the East River basin with the  D2  values of their rainfall data.  Elevation (m above MSL)
DISCOVERY OF KNOWLEDGE STRUCTURES The Concept Lattice Approach
Discovery of Hierarchical Knowledge from Relational Spatial Data
 
Spatial Concept/Class and Data Encapsulation
Concept Hierarchy
Inheritance
Generalization and Specialization
  Summary (1)Concept lattice as a mathematical foundation for object-oriented spatial information system (2)Concpet lattice can be employed as method to unravel hierarchical structure from spatial information system (3)A bridge between relational spatial information system (vector-based, raster-based) and object-oriented spatial information system.
Yee Leung. Knowledge Discovery in Spatial Data. Berlin: Springer-Verlag, 2010. [email_address] IGU-Commission on Modeling Geographical Systems http://www.science.mcmaster.ca/~igu~cmgs/

More Related Content

PPT
4.2 spatial data mining
PPT
Spatial data mining
PPTX
Introduction to spatial data mining
PPT
Spatial Database Systems
PPTX
Spatial databases
PDF
Database gis fundamentals
PPTX
Spatial databases
PDF
Components of Spatial Data Quality in GIS
4.2 spatial data mining
Spatial data mining
Introduction to spatial data mining
Spatial Database Systems
Spatial databases
Database gis fundamentals
Spatial databases
Components of Spatial Data Quality in GIS

What's hot (20)

PPTX
ppt spatial data
PDF
GIS data structure
PPT
GIS Data Types
PPTX
TYBSC IT PGIS Unit I Chapter II Geographic Information and Spacial Database
PPTX
Geographic Phenomena and their Representations
PPTX
GIS & Raster
PPT
datamodel_vector
PDF
Finding Meaning in Points, Areas and Surfaces: Spatial Analysis in R
PPT
raster data model
PPTX
TYBSC IT PGIS Unit V Data Visualization
PDF
Scattered gis handbook
PPTX
Spatial analysis and modeling
PDF
Spatial data analysis 1
PDF
Mrp Intrim
PPTX
TYBSC IT PGIS Unit I Chapter I- Introduction to Geographic Information Systems
PDF
Geographic query and analysis
PDF
Understanding raster
PDF
Conceptual models of real world geographical phenomena (epm107_2007)
PPTX
Raster data model
PPT
215 spatial db
ppt spatial data
GIS data structure
GIS Data Types
TYBSC IT PGIS Unit I Chapter II Geographic Information and Spacial Database
Geographic Phenomena and their Representations
GIS & Raster
datamodel_vector
Finding Meaning in Points, Areas and Surfaces: Spatial Analysis in R
raster data model
TYBSC IT PGIS Unit V Data Visualization
Scattered gis handbook
Spatial analysis and modeling
Spatial data analysis 1
Mrp Intrim
TYBSC IT PGIS Unit I Chapter I- Introduction to Geographic Information Systems
Geographic query and analysis
Understanding raster
Conceptual models of real world geographical phenomena (epm107_2007)
Raster data model
215 spatial db
Ad

Viewers also liked (17)

PPT
Temporal data mining
PPT
The study on mining temporal patterns and related applications in dynamic soc...
PDF
Semantic Meta-Mining of Knowledge Discovery Processes
PDF
Temporal Topic Models for Probabilistic Motif Mining (SMiLe2014)
DOCX
Use of data mining techniques in the discovery of spatial and ...
PPT
Problem solution
PDF
Knowledge Discovery from Academic Data using Association Rule Mining, Paper P...
PPTX
Managing children’s behaviour
PPTX
RSC: Mining and Modeling Temporal Activity in Social Media
PPTX
Data mining and knowledge discovery
PDF
Temporal Data
PDF
Spatio-Temporal Data Mining and Classification of Ships' Trajectories
PPTX
Web mining (structure mining)
PPTX
Web Mining Presentation Final
PDF
Web mining slides
PPT
Data mining slides
 
PPTX
Data mining
Temporal data mining
The study on mining temporal patterns and related applications in dynamic soc...
Semantic Meta-Mining of Knowledge Discovery Processes
Temporal Topic Models for Probabilistic Motif Mining (SMiLe2014)
Use of data mining techniques in the discovery of spatial and ...
Problem solution
Knowledge Discovery from Academic Data using Association Rule Mining, Paper P...
Managing children’s behaviour
RSC: Mining and Modeling Temporal Activity in Social Media
Data mining and knowledge discovery
Temporal Data
Spatio-Temporal Data Mining and Classification of Ships' Trajectories
Web mining (structure mining)
Web Mining Presentation Final
Web mining slides
Data mining slides
 
Data mining
Ad

Similar to Theories and Applications of Spatial-Temporal Data Mining and Knowledge Discovery (20)

PDF
Identification of the Memory Process in the Irregularly Sampled Discrete Time...
PDF
Ill-posedness formulation of the emission source localization in the radio- d...
PDF
Application of stochastic modeling in geomagnetism
PDF
Application of stochastic modeling in geomagnetism
PDF
Inversão com sigmoides
PDF
Climate Extremes Workshop - Extreme Values of Vertical Wind Speed in Doppler ...
PPT
Recurrence Quantification Analysis : Tutorial & application to eye-movement data
PDF
My Prize Winning Physics Poster from 2006
PPSX
Characterization of Subsurface Heterogeneity: Integration of Soft and Hard In...
PDF
Investigatng MultIfractality of Solar Irradiance Data Through Wavelet Based M...
PDF
Gwendolyn Eadie: A New Method for Time Series Analysis in Astronomy with an A...
PDF
Wereszczynski Molecular Dynamics
PDF
Pakdd
PDF
Flood routing by kinematic wave model
PDF
PR12-225 Discovering Physical Concepts With Neural Networks
PDF
Instantons in 1D QM
PDF
Nature12888
PDF
PDF
Anomaly Detection in Sequences of Short Text Using Iterative Language Models
PDF
Pablo Estevez: "Computational Intelligence Applied to Time Series Analysis"
Identification of the Memory Process in the Irregularly Sampled Discrete Time...
Ill-posedness formulation of the emission source localization in the radio- d...
Application of stochastic modeling in geomagnetism
Application of stochastic modeling in geomagnetism
Inversão com sigmoides
Climate Extremes Workshop - Extreme Values of Vertical Wind Speed in Doppler ...
Recurrence Quantification Analysis : Tutorial & application to eye-movement data
My Prize Winning Physics Poster from 2006
Characterization of Subsurface Heterogeneity: Integration of Soft and Hard In...
Investigatng MultIfractality of Solar Irradiance Data Through Wavelet Based M...
Gwendolyn Eadie: A New Method for Time Series Analysis in Astronomy with an A...
Wereszczynski Molecular Dynamics
Pakdd
Flood routing by kinematic wave model
PR12-225 Discovering Physical Concepts With Neural Networks
Instantons in 1D QM
Nature12888
Anomaly Detection in Sequences of Short Text Using Iterative Language Models
Pablo Estevez: "Computational Intelligence Applied to Time Series Analysis"

More from Beniamino Murgante (20)

PDF
Analyzing and assessing ecological transition in building sustainable cities
PDF
Smart Cities: New Science for the Cities
PDF
The evolution of spatial analysis and modeling in decision processes
PDF
Smart City or Urban Science?
PDF
Involving citizens in smart energy approaches: the experience of an energy pa...
PPTX
Programmazione per la governance territoriale in tema di tutela della biodive...
PDF
Involving Citizens in a Participation Process for Increasing Walkability
PDF
Presentation of ICCSA 2019 at the University of Saint petersburg
PPTX
RISCHIO TERRITORIALE NEL GOVERNO DEL TERRITORIO: Ricerca e formazione nelle s...
PDF
Presentation of ICCSA 2017 at the University of trieste
PPT
GEOGRAPHIC INFORMATION – NEED TO KNOW (GI-N2K) Towards a more demand-driven g...
PPTX
Focussing Energy Consumers’ Behaviour Change towards Energy Efficiency and Lo...
PPTX
Socio-Economic Planning profiles: Sciences VS Daily activities in public sector 
PPT
GEOGRAPHIC INFORMATION – NEED TO KNOW (GI-N2K) Towards a more demand-driven g...
PPT
Garden in motion. An experience of citizens involvement in public space regen...
PPT
Planning and Smartness: the true challenge
PDF
GeoSDI: una piattaforma social di dati geografici basata sui principi di INSP...
PPT
Murgante smart energy
PPT
Informazione Geografica, Città, Smartness
PPT
Tecnologie, Territorio, Smartness
Analyzing and assessing ecological transition in building sustainable cities
Smart Cities: New Science for the Cities
The evolution of spatial analysis and modeling in decision processes
Smart City or Urban Science?
Involving citizens in smart energy approaches: the experience of an energy pa...
Programmazione per la governance territoriale in tema di tutela della biodive...
Involving Citizens in a Participation Process for Increasing Walkability
Presentation of ICCSA 2019 at the University of Saint petersburg
RISCHIO TERRITORIALE NEL GOVERNO DEL TERRITORIO: Ricerca e formazione nelle s...
Presentation of ICCSA 2017 at the University of trieste
GEOGRAPHIC INFORMATION – NEED TO KNOW (GI-N2K) Towards a more demand-driven g...
Focussing Energy Consumers’ Behaviour Change towards Energy Efficiency and Lo...
Socio-Economic Planning profiles: Sciences VS Daily activities in public sector 
GEOGRAPHIC INFORMATION – NEED TO KNOW (GI-N2K) Towards a more demand-driven g...
Garden in motion. An experience of citizens involvement in public space regen...
Planning and Smartness: the true challenge
GeoSDI: una piattaforma social di dati geografici basata sui principi di INSP...
Murgante smart energy
Informazione Geografica, Città, Smartness
Tecnologie, Territorio, Smartness

Recently uploaded (20)

PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
PDF
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
PDF
Advanced methodologies resolving dimensionality complications for autism neur...
PPTX
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
PPTX
MYSQL Presentation for SQL database connectivity
PDF
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
PDF
Per capita expenditure prediction using model stacking based on satellite ima...
PDF
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
PDF
Chapter 3 Spatial Domain Image Processing.pdf
PDF
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
PDF
Electronic commerce courselecture one. Pdf
PDF
NewMind AI Weekly Chronicles - August'25 Week I
PPTX
Big Data Technologies - Introduction.pptx
PDF
Empathic Computing: Creating Shared Understanding
PPT
Teaching material agriculture food technology
PDF
MIND Revenue Release Quarter 2 2025 Press Release
PPTX
Cloud computing and distributed systems.
PDF
cuic standard and advanced reporting.pdf
PDF
Network Security Unit 5.pdf for BCA BBA.
PDF
Spectral efficient network and resource selection model in 5G networks
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
Advanced methodologies resolving dimensionality complications for autism neur...
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
MYSQL Presentation for SQL database connectivity
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
Per capita expenditure prediction using model stacking based on satellite ima...
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
Chapter 3 Spatial Domain Image Processing.pdf
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
Electronic commerce courselecture one. Pdf
NewMind AI Weekly Chronicles - August'25 Week I
Big Data Technologies - Introduction.pptx
Empathic Computing: Creating Shared Understanding
Teaching material agriculture food technology
MIND Revenue Release Quarter 2 2025 Press Release
Cloud computing and distributed systems.
cuic standard and advanced reporting.pdf
Network Security Unit 5.pdf for BCA BBA.
Spectral efficient network and resource selection model in 5G networks

Theories and Applications of Spatial-Temporal Data Mining and Knowledge Discovery

  • 1. Theories and Applications of Spatial-Temporal Data Mining and Knowledge Discovery Yee Leung Email: [email_address] Department of Geography and Resource Management The Chinese University of Hong Kong
  • 2.  
  • 3.  
  • 6.  
  • 7.  
  • 8.  
  • 9. Daily rainfall data of two stations in Pearl River basin of China
  • 10.  
  • 11. The monthly sunspot time series.
  • 12. The Portuguese Stock Index PSI-20 evolution from 1993 to 2002 (adopted from J.A.O. Matos et al. / Physica A 342 (2004) 665 – 676)
  • 13. Outbreak of Avian Flu in different regions
  • 14.  
  • 15. What are the structures and processes hidden in spatial data? What are the Concepts hidden in the information system? Do the Concepts form a knowledge structure?
  • 16. Typhoon Tracks Adapted from Wang and Chan
  • 17. Typhoon/Hurricane Tracking Objective: Intensity, track (land falling, recurvature) Object: The space-time track of unusually low sea- surface air pressure in the x-y-z plane Data: potential temperature, horizontal velocity, vertical velocity, relative humidity, horizontal wind, etc Data: Hundreds and thousands of gigabytes within a specific time interval
  • 18.  
  • 19.  
  • 20.  
  • 21. Data Mining in Hyperspectral Images 1. Objective Classification, Pattern Recognition 2. Object Spectral Signatures of Objects 3. Data Spectral, Non-spectral Data 4. Data Volume e.g. : AVIRIS : from 0.4 to 2.45 micrometers, 224 bands HYDICE : from 0.4 to 2.5 micrometers, 210 bands Hyperion : from 0.4 to 2.5 micrometers, 220 bands, 30 meter resolution
  • 22. The Objective of Knowledge Discovery and Data Mining Fayyad: The discovery of non-trivial, novel, potentially useful and interpretable knowledge/information from data Data Information Knowledge Decision
  • 23. Characteristics of Spatial Data 1.   Voluminous 2.   Sparse 3.   Diversity 4.   Complex 5.   Dynamic 6.   Redundant 7.   Imperfect (random , fuzzy , granular , incomplete , noisy) 8.   Multi-scale
  • 24. Main Tasks of Spatial Knowledge Discovery and Data Mining 1. Clustering 3. Association 2. Classification Spatial Relations Temporal Relations Spatial-temporal Relations * In particular : the local-global issue 4. Processes
  • 25. CLUSTERING The Scale-Space Filtering Approach The Regression-Classes Decomposition Approach
  • 26. Scale Space Theory Given a primary image f (x) at a distance of σ from human eyes, the observed blur red image f (x, σ ) can be mathematically determined by the following partial differential equation : The solution of the above equation is explicitly expressed as where ‘∗’ denotes the convolution operation, g (x, σ ) is the Gaussian function
  • 27. If the training samples are treated as an imaginary image with expression: Then the corresponding blurred image f (x, σ , D l ) at scale σ can be specified by
  • 28. Essentials of Clustering by Scale-space Filtering 1.   Visual system simulation 2.   Cluster validity check 3.   Clustering validity check 4.   Relevant concepts (a) life time of a cluster (b) life time of a clustering (c) compactness (d) isolatedness
  • 29.  
  • 30.  
  • 31.  
  • 32.  
  • 33.  
  • 34.  
  • 35.  
  • 36.  
  • 37.  
  • 38.  
  • 39.  
  • 40.  
  • 41. Ms-time plot of clustering results for earthquakes (Ms≥6): a) 3 clusters in the 59th~95th scale range; b) 17 clusters at the 6th scale step a) b)
  • 42. Temporal segmentation of Strong Earthquakes (Ms≥6.0) of 1290A.D. - 2000A.D. Scale-space for earthquakes (Ms≥6)
  • 43. Indices of clustering along the time scale for earthquakes (Ms≥6.0): a) number of clusters; b) Lifetime, Isolation and Compactness of the clustering a) b)
  • 44. a) b) Ms-time plot of clustering results for earthquakes (Ms≥4.7): a) 2 clusters in the 74th~112th scale range; b) 18 clusters at the 10th scale step
  • 45. Temporal Segmentation of Strong Earthquakes (Ms≥4.7) of 1484A.D. - 2000A.D. Indices of clustering along the time scale for earthquakes (Ms≥4.7): a) number of clusters ( The vertical axis just shows the part no larger than 150 ); b) Lifetime, Isolation and Compactness of the clustering a) b)
  • 46. Table 1 Seismic active periods and episodes obtained by the clustering algorithm and the seismologists ( The number in parentheses is the number of earthquakes in the cluster )
  • 47.  
  • 48. Advantages of Scale-space Filtering Free from solving global optimization problem Independent of initialization Robust Outliers Detection Generalization of scale-related algorithms Consistent with visual system
  • 49. 5. Scale Space Clustering Scale-Space Filtering for Simulated Data
  • 50. 5. Scale Space Clustering Scale-Space Filtering for Remote-Sensing Data Clustering Tree Quasi-Light
  • 51. Clustering by Regression-Classes Decomposition Method
  • 54. Identification of line objects in remotely sensed data
  • 56.  
  • 60. ANALYSIS OF SPATIAL RELATIONSHIP Global Description Moran’s I Geary’s c OLR Local Description Local Moran’s I Local Geary’s c G Statistic Geographically Weighted Regression Mixture Distribution
  • 61. Geographically Weighted Regression Hypothesis testing 1. Ho: No difference between OLR and GWR 2. Ho: a 1k = a 2k = … = a nk
  • 62.  
  • 63.  
  • 65. CLASSIFICATION The Neural Network Approach The Classification and Regression Tree The Statistical Classifiers
  • 66. Information Extraction and Classification Neural Networks for Classification--MLP-BP
  • 67. Some Typical Feedforward Neural Networks Perception In late 1950s: layered feed forward networks named perceptron. Today: Perceptron : single-layer, feed-forward networks. See Fig. 8, each multi-output unit O is fed independently from the input units. Figure 8. Perceptrons
  • 68. Mulitlayer Feedforward Neural Network Learning algorithms for multilayer networks are neither efficient nor guaranteed to converge to global optimum . Most popular learning method: back-propagation . Back-propagation learning The restaurant problem: use a 2-layer network, 10 attributes = 10 input units, 4 hidden units. See Fig. 13. Some Typical Feedforward Neural Networks (con ’ t) Fig. 13. A 2-layer feedforward network for the restaurant problem.
  • 69.  
  • 70.  
  • 71.  
  • 72.  
  • 73.  
  • 74.  
  • 75.  
  • 77.  
  • 78. Typhoon Tracks Adapted from Wang and Chan
  • 79. Trees by Classification and Regression Tree (CART) MSW 6/12/18: Maximum Sustained Wind of TC 6/12/18 hours before recurvature. 0: Recurve,1: Straight
  • 80. 1. If MSW of a TC is smaller than or equal to 34 m/s and MSW of that TC is smaller than 2 m/s 6 hours later, then the TC will recurve in 12 hours with 96% accuracy. 2. If MSW of a TC is smaller than or equal to 34 m/s and MSW of that TC is larger than 2 m/s 6 hours later, the TC will move straight in 12 hours with 86.8% accuracy. 3. If MSW of a TC is larger than 34 m/s, it will recurve in 18 hours with 94.1% accuracy. Rules by CART
  • 81. DISCOVERY OF TEMPORAL PROCESSES The Multifractal Approach Conventional Time Series Analyses
  • 82. Mining of Scaling Behavior by Multifractal Analysis
  • 83. Multiplicative Cascade An approach to the study of scaling behavior with multiple scales (granules). Multiplicative Binomial Cascade
  • 84. Schematic representation of cascade (adopted from Puente and Lopez, 1995, Physical Letters A)
  • 85.  
  • 86. TEMPORAL ANALYSIS Linear Time Invariant System  Self Similarity  Multiscaling  Infinitely Divisible Cascade Stationary Process Non-stationary Process Random Walk Fractional Brownian Motion, fmb Multifractal Analysis of Stochastic Trends
  • 87. The Multifractal Approach Establish a data model for stochastic time series Discovery of relevant models in stochastic time series
  • 88. MF-DFA Detrended fluctuation analysis (DFA) is a method for detecting the long-range correlation and fractal property in the both stationary and non-stationary time series. MF-DFA , which is based on DFA, can give full description of more complicated scaling behavior of time series
  • 89. MF-DFA Given a time series with length N. Step1: i=1,2,…,N; Step2: Divide Y(i) into non-overlapping segments of equal lengths s. In order not to disregard this part of the series, the same procedure is repeated starting from the opposite end. Thereby, 2 Ns segments are obtained altogether.
  • 90. MF-DFA Step 3 . Calculate the local trend for each of the 2 N s segments by a least squares fit of the series. Then determine the variance for each segment ν , ν = 1 , . . .,N s , and for ν = N s + 1 , . . . , 2 N s . Here, is the fitting polynomial in segment ν, whose order m can be 1, 2, 3 … . Step 4 . Average over all segments to obtain the q th-order fluctuation function, defined Where , s ≥ m + 2.
  • 91. MF-DFA Step5: Determine the scaling behavior of the fluctuation function by analyzing log-log plots of Fq(s) versus s for each value of q . If we have , for large values of s, we get the exponent h(q), which may depend on q generally. H=h(q=2), for stationary time series; H=h(q=2)-1, for non-stationary time series.
  • 92. MF-DFA For MF-DFA, if h(q) is constant for all q , the corresponding time series is mono-fractal. However, if h(q) varies with q, that means multifractal.
  • 93. ( adopted from Peng et. al., 1994 )
  • 94.  
  • 95.  
  • 96.  
  • 97.  
  • 98.  
  • 99.  
  • 100.  
  • 101.  
  • 102. Daily rainfall data of two stations in Pearl River basin of China
  • 103. Log-log plots of F q (s) versus s for the daily rainfall time series of station 56691 in Pearl River basin (left) and Station Chuantang in East River basin (right) with q =2.
  • 104. The h ( q ) curves of daily rainfall time series of stations in the Pearl River basin (left) and stations in the East River basin (right).
  • 105. The curves of daily rainfall time series of stations in the Pearl River basin (left) and stations in the East River basin (right).
  • 106. The curves of daily rainfall time series of stations in the Pearl River basin (left) and stations in the East River basin (right)
  • 107. The curves of daily rainfall time series of 5 stations in the Pearl River basin
  • 108. The curves of daily rainfall time series of stations in the Pearl River basin (left) and stations in the East River basin (right).
  • 109. The curves of daily rainfall time series of stations in the Pearl River basin (left) and stations in the East River basin (right). The real lines are their cascade model fitting.
  • 110. The correlation relationship between the altitude of the rainfall stations in the East River basin and the D (2) value of the rainfall time series.
  • 111. Elevation of rainfall stations in the East River basin with the D2 values of their rainfall data. Elevation (m above MSL)
  • 112. DISCOVERY OF KNOWLEDGE STRUCTURES The Concept Lattice Approach
  • 113. Discovery of Hierarchical Knowledge from Relational Spatial Data
  • 114.  
  • 115. Spatial Concept/Class and Data Encapsulation
  • 119. Summary (1)Concept lattice as a mathematical foundation for object-oriented spatial information system (2)Concpet lattice can be employed as method to unravel hierarchical structure from spatial information system (3)A bridge between relational spatial information system (vector-based, raster-based) and object-oriented spatial information system.
  • 120. Yee Leung. Knowledge Discovery in Spatial Data. Berlin: Springer-Verlag, 2010. [email_address] IGU-Commission on Modeling Geographical Systems http://www.science.mcmaster.ca/~igu~cmgs/