SlideShare a Scribd company logo
CVPR 2012
Cross-view Activity Recognition using
            Hankelets
Binlong Li, Octavia I. Camps and Mario Sznaier
            Northeastern University




                                      Mobuddies
Dynamic Systems
 Dynamic systems have been recently used in a
  wide range of computer vision applications
 Given temporal sequence of observations
      (e.g. track coordinates) model temporal
  evolution as a function of low-dimensional state
  vector          that changes over time
 Simplest case – linear time invariant (LTI) system
  (w – noise)



 Practical limitation: given set of
  observations, triple      is not unique and is
Hankel Matrices
 Given a sequence of measurements
 , its block Hankel matrix is defined as:




 Columns correspond to overlapping
  subsequences of data
 Block anti-diagonals of the matrix are constant
 This structure encapsulates the dynamic
  information of the system
Initial condition invariance
 Linear time invariant system (LTI):


 In the absence of noice (w = 0):


 Then Hankel matrix is broken down to:




 Columns of Hankel matrix span the same
 subspace regardless of initial conditions
Autoregressive measurements
 Suppose the sequence of measurements is auto-
 regressive:

 Recall, that:




 Setting r = n in the above, we obtain:


 In other words, last column of Hankel matrix is a
 linear combination of other columns
Affine transformation invariance
 Suppose we have two Hankel matrices           and
  corresponding to a trajectory and its affine
  transformation. Auto-regressive property allows
  us to write:

 Suppose affine transformation is defined as
 Then, taking into account its linearity:


 In other words, sequences share the same
  autoregressor
 Recall, that
 Therefore, columns of two Hankel matrices span
Previous work
 B. Li et. al “Activity Recognition using Dynamic
    Subspace Angles”, CVPR 2011
   Considers initial condition invariance.
   Imagine that class of actions (e.g. “walk”) can be
    represented by a single dynamical system, and
    in-class variations are captured by different initial
    condition
   Then differentiating between two actions breaks
    down into determining whether columns of the
    two corresponding Hankel matrices lie in the
    same subspace
   Uses angles between subspaces as a measure of
Overview of the method
 Uses Dense trajectories to extract many short 15-
  frame tracklets.
 Builds Hankel matrix for each tracklet, capturing its
  velocity
 Employs BoF-like approach (BoHk)
 Does three experiments: single-view data, multiple
  view with knowledge transfer, multiple view without
  knowledge transfer
Hankelets
 Hankelet is a Hankel matrix for a short trajectory
 of 15 frames, formed by a sequence of
 normalized velocities:




 Normalize
  Hankelets:
Comparing Hankelets
 Introduce dissimilarity score between two
 Hankelets:

 Derivations show, that d ≈ 0 for Hankelets
 corresponding to noisy measurements of the
 same dynamical system
Building codebook: cluster center
 Modify the K-means algorithm for dissimilarity
  scores:
 Current Hankelet is assigned to a cluster whose
  “representative” has smallest dissimilarity with the
  current Hankelet
 Cluster’s “representative” is chosed as follows.
  Take random Hankelet within the class, find
  dissimilarities between the Hankelet and all other
  Hankelets in the cluster and compute their mean.
  The Hankelet with dissimilarity closest to the
  mean is selected as its “representative”
Building codebook: Gamma pdf
 The histogram of dissimilarities for a typical cluster in
  the dictionary of Hankelets:
 Represent each cluster
with its representative and
gamma pdf:



 Furthermore, each cluster
w has a prior probability
Bag of Hankelets (BoHk)
 Each activity video is represented with a
  histogram of labels from the dictionary of K
  Hankelets
 Cluster label is assigned using max probability:


   where       is cluster representative,    is
  cluster prior
 Finally, one-against-all non-linear SVM trained for
  activities recognition
Bi-Lingual Hankelets
 Bi-lingual Hankelets can be easily learned from
  unlabeled videos captured simultaneously from
  the different viewpoints by matching Hankelets
  across views (~80% are matched)
 Hankelets are matched using threshold on
  dissimilarity score, if their start times are the
  same (no spatial information)
Cross-view Activity Recognition using Hankelets
Cross-view action recognition
 A labeled dataset is given, with Source and Target
 views
  Training
 Extract and match Bi-lingual Hankelets with
  dissimilarity score
 Build codebook of Bi-lingual Hankelets using the K-
  means
 Label Hankelets in Source data using max posterior
  probability
 Train one-against-all non-linear SVM using Source
  data
  Testing
Experiments
  Single-view
 KTH dataset: 95.89% avg.
  Cross-View with data transfer
 Use only Bi-lingual Hankelets
 IXMAS dataset: 56.4% avg. (45.5%
 improvement)
  Cross-View without data transfer
 Use all Hankelets (not only Bi-lingual)
 IXMAS dataset: 90.57% avg. (20.28%
 improvement)
Thank you!

More Related Content

PDF
Approaches to online quantile estimation
PPTX
Video Multi-Object Tracking using Deep Learning
PDF
A Performance Analysis of Self-* Evolutionary Algorithms on Networks with Cor...
PPTX
Reading group nfm - 20170312
DOCX
PDF
Sensitivity Analysis of Checkpointing Strategies for Multimemetic Algorithms ...
PDF
Analyzing Resilience to Computational Glitches in Island-based Evolutionary A...
PDF
Approaches to online quantile estimation
Video Multi-Object Tracking using Deep Learning
A Performance Analysis of Self-* Evolutionary Algorithms on Networks with Cor...
Reading group nfm - 20170312
Sensitivity Analysis of Checkpointing Strategies for Multimemetic Algorithms ...
Analyzing Resilience to Computational Glitches in Island-based Evolutionary A...

What's hot (11)

PDF
Wavelet Signal Processing
PDF
Evaluating Island-based EAs on Unstable Networks with Complex Failure Patterns
PDF
Packet Classification using Support Vector Machines with String Kernels
PPTX
Data Applied:Forecast
ODP
Deep Recurrent Neural Network for Multi-target Filtering
PPTX
Machine Learning Algorithms (Part 1)
PDF
A Study of the Performance of Self-* Memetic Algorithms on Heterogeneous Ephe...
PPT
PDF
SEQUENTIAL CLUSTERING-BASED EVENT DETECTION FOR NONINTRUSIVE LOAD MONITORING
PPTX
Grid based method & model based clustering method
Wavelet Signal Processing
Evaluating Island-based EAs on Unstable Networks with Complex Failure Patterns
Packet Classification using Support Vector Machines with String Kernels
Data Applied:Forecast
Deep Recurrent Neural Network for Multi-target Filtering
Machine Learning Algorithms (Part 1)
A Study of the Performance of Self-* Memetic Algorithms on Heterogeneous Ephe...
SEQUENTIAL CLUSTERING-BASED EVENT DETECTION FOR NONINTRUSIVE LOAD MONITORING
Grid based method & model based clustering method
Ad

Similar to Cross-view Activity Recognition using Hankelets (20)

PDF
Termpaper ai
PPT
Special Aspects of Control for Quadrupedal Walking based on Biological Princi...
PPTX
Adaptive equalization
PDF
Test Generation for Analog and Mixed-Signal Circuits Using Hybrid System Mode...
PDF
TEST GENERATION FOR ANALOG AND MIXED-SIGNAL CIRCUITS USING HYBRID SYSTEM MODELS
PPTX
Kane/DeAlbert dynamics for multibody system
PDF
14 Machine Learning Single Layer Perceptron
PDF
Best data science course in pune. converted
PPTX
Data science course in pune
PPTX
Data science course in pune
PPTX
Data science training in mumbai
PPTX
Machine learning certification in gurgaon
PPTX
data science course
PPTX
Data science certification in mumbai
PPTX
machine learning training in bangalore
PPTX
data science course in pune
PPTX
Data science courses in bangalore
PPTX
Machine learning
PPTX
Data science training ang placements
Termpaper ai
Special Aspects of Control for Quadrupedal Walking based on Biological Princi...
Adaptive equalization
Test Generation for Analog and Mixed-Signal Circuits Using Hybrid System Mode...
TEST GENERATION FOR ANALOG AND MIXED-SIGNAL CIRCUITS USING HYBRID SYSTEM MODELS
Kane/DeAlbert dynamics for multibody system
14 Machine Learning Single Layer Perceptron
Best data science course in pune. converted
Data science course in pune
Data science course in pune
Data science training in mumbai
Machine learning certification in gurgaon
data science course
Data science certification in mumbai
machine learning training in bangalore
data science course in pune
Data science courses in bangalore
Machine learning
Data science training ang placements
Ad

Recently uploaded (20)

PPTX
sap open course for s4hana steps from ECC to s4
PDF
Chapter 3 Spatial Domain Image Processing.pdf
PPT
“AI and Expert System Decision Support & Business Intelligence Systems”
PDF
Approach and Philosophy of On baking technology
PPTX
Understanding_Digital_Forensics_Presentation.pptx
PDF
Advanced methodologies resolving dimensionality complications for autism neur...
DOCX
The AUB Centre for AI in Media Proposal.docx
PPTX
Digital-Transformation-Roadmap-for-Companies.pptx
PPTX
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
PDF
NewMind AI Weekly Chronicles - August'25 Week I
PDF
Diabetes mellitus diagnosis method based random forest with bat algorithm
PDF
Network Security Unit 5.pdf for BCA BBA.
PDF
Encapsulation theory and applications.pdf
PDF
Empathic Computing: Creating Shared Understanding
PPTX
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
PDF
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
PDF
cuic standard and advanced reporting.pdf
PDF
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
PPT
Teaching material agriculture food technology
PPTX
Cloud computing and distributed systems.
sap open course for s4hana steps from ECC to s4
Chapter 3 Spatial Domain Image Processing.pdf
“AI and Expert System Decision Support & Business Intelligence Systems”
Approach and Philosophy of On baking technology
Understanding_Digital_Forensics_Presentation.pptx
Advanced methodologies resolving dimensionality complications for autism neur...
The AUB Centre for AI in Media Proposal.docx
Digital-Transformation-Roadmap-for-Companies.pptx
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
NewMind AI Weekly Chronicles - August'25 Week I
Diabetes mellitus diagnosis method based random forest with bat algorithm
Network Security Unit 5.pdf for BCA BBA.
Encapsulation theory and applications.pdf
Empathic Computing: Creating Shared Understanding
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
cuic standard and advanced reporting.pdf
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
Teaching material agriculture food technology
Cloud computing and distributed systems.

Cross-view Activity Recognition using Hankelets

  • 1. CVPR 2012 Cross-view Activity Recognition using Hankelets Binlong Li, Octavia I. Camps and Mario Sznaier Northeastern University Mobuddies
  • 2. Dynamic Systems  Dynamic systems have been recently used in a wide range of computer vision applications  Given temporal sequence of observations (e.g. track coordinates) model temporal evolution as a function of low-dimensional state vector that changes over time  Simplest case – linear time invariant (LTI) system (w – noise)  Practical limitation: given set of observations, triple is not unique and is
  • 3. Hankel Matrices  Given a sequence of measurements , its block Hankel matrix is defined as:  Columns correspond to overlapping subsequences of data  Block anti-diagonals of the matrix are constant  This structure encapsulates the dynamic information of the system
  • 4. Initial condition invariance  Linear time invariant system (LTI):  In the absence of noice (w = 0):  Then Hankel matrix is broken down to:  Columns of Hankel matrix span the same subspace regardless of initial conditions
  • 5. Autoregressive measurements  Suppose the sequence of measurements is auto- regressive:  Recall, that:  Setting r = n in the above, we obtain:  In other words, last column of Hankel matrix is a linear combination of other columns
  • 6. Affine transformation invariance  Suppose we have two Hankel matrices and corresponding to a trajectory and its affine transformation. Auto-regressive property allows us to write:  Suppose affine transformation is defined as  Then, taking into account its linearity:  In other words, sequences share the same autoregressor  Recall, that  Therefore, columns of two Hankel matrices span
  • 7. Previous work  B. Li et. al “Activity Recognition using Dynamic Subspace Angles”, CVPR 2011  Considers initial condition invariance.  Imagine that class of actions (e.g. “walk”) can be represented by a single dynamical system, and in-class variations are captured by different initial condition  Then differentiating between two actions breaks down into determining whether columns of the two corresponding Hankel matrices lie in the same subspace  Uses angles between subspaces as a measure of
  • 8. Overview of the method  Uses Dense trajectories to extract many short 15- frame tracklets.  Builds Hankel matrix for each tracklet, capturing its velocity  Employs BoF-like approach (BoHk)  Does three experiments: single-view data, multiple view with knowledge transfer, multiple view without knowledge transfer
  • 9. Hankelets  Hankelet is a Hankel matrix for a short trajectory of 15 frames, formed by a sequence of normalized velocities:  Normalize Hankelets:
  • 10. Comparing Hankelets  Introduce dissimilarity score between two Hankelets:  Derivations show, that d ≈ 0 for Hankelets corresponding to noisy measurements of the same dynamical system
  • 11. Building codebook: cluster center  Modify the K-means algorithm for dissimilarity scores:  Current Hankelet is assigned to a cluster whose “representative” has smallest dissimilarity with the current Hankelet  Cluster’s “representative” is chosed as follows. Take random Hankelet within the class, find dissimilarities between the Hankelet and all other Hankelets in the cluster and compute their mean. The Hankelet with dissimilarity closest to the mean is selected as its “representative”
  • 12. Building codebook: Gamma pdf  The histogram of dissimilarities for a typical cluster in the dictionary of Hankelets:  Represent each cluster with its representative and gamma pdf:  Furthermore, each cluster w has a prior probability
  • 13. Bag of Hankelets (BoHk)  Each activity video is represented with a histogram of labels from the dictionary of K Hankelets  Cluster label is assigned using max probability: where is cluster representative, is cluster prior  Finally, one-against-all non-linear SVM trained for activities recognition
  • 14. Bi-Lingual Hankelets  Bi-lingual Hankelets can be easily learned from unlabeled videos captured simultaneously from the different viewpoints by matching Hankelets across views (~80% are matched)  Hankelets are matched using threshold on dissimilarity score, if their start times are the same (no spatial information)
  • 16. Cross-view action recognition  A labeled dataset is given, with Source and Target views Training  Extract and match Bi-lingual Hankelets with dissimilarity score  Build codebook of Bi-lingual Hankelets using the K- means  Label Hankelets in Source data using max posterior probability  Train one-against-all non-linear SVM using Source data Testing
  • 17. Experiments Single-view  KTH dataset: 95.89% avg. Cross-View with data transfer  Use only Bi-lingual Hankelets  IXMAS dataset: 56.4% avg. (45.5% improvement) Cross-View without data transfer  Use all Hankelets (not only Bi-lingual)  IXMAS dataset: 90.57% avg. (20.28% improvement)