SlideShare a Scribd company logo
Active Learning for Acoustic
Classification AND
Power Aware Feature Selection for
Audio/Sound Scene
Mulu W. Adhana
/Groep T Campus
Outline
2
• PhD Topic: Active Learning for Acoustic Scene/Event
Classification
o Active Learning: Introduction, principles, …
o Experimental Results
o Current Activities: adaptive, transfer learning, …
• Power Aware Feature Selection
o Proposed Method
o Experiments
o Conclusions
Active Learning: Introduction
3
• Traditional machine learning Techniques (Supervised)
o Abundant labeled data
o Labeling: labour intensive costly and error prone
o Domain expert
• Semi-supervised learning
• Active learning
o Seed information (very small in size)
o Sample unlabeled data points for annotation
o Involves user to actively annotate data point
• Reserved for informative/ambiguous ones
Active Learning Scheme
4
Active Learning in Practice
5
?
automatically
assigned
?
Small distance to separating
hyperplane
Ask the user
Active Learning: Sampling for annotation
6
• How ambiguous?
o Distance from separating line
o Probability assigned to the data point
o Entropy of the probability assigned to the data point
Active Learning: KLR
7
• KLR: alternative to SVM with probabilistic outcome
o Natural extension to multi-class problem
o Probabilistic outcome (𝑝𝑖 = 𝒀 = 𝑦𝑖 𝑿 = 𝑥 ,
𝑖 = 1, … , 𝐶)
o Put threshold on some uncertainty measure: e.g.
𝑝 𝐶 − 𝑝 𝐶−1 > 𝑇 [2]
Results: KLR Active Learning
8
NAR dataset:
• 21 classes
• 431 annotated
examples
• 10-fold cross-
validation
• Reduce 84%
manual annotation
Active, Transfer and Incremental Learning
9
Annotated dataset
from different domain
Small Annotated
dataset
Source-task
• From different env’t
• With different
sensing devices
Knowledge
Target-task
• Slightly different
from the source
task
Power Aware Feature Selection
10
• Wearable Sensor Networks draw energy from battery
o Sensing, processing raw data, packet size,
o bit-depth while acquiring sound scene/event, …
• On-node processing. e.g. movement monitoring
o Extracting expensive features => Energy depletion
• Select inexpensive and information carrying features
Proposed Approach
11
• Apply two criteria: Error rate and cost of extracting a
feature/subset of features(CPU-time)
• Incorporating dependency of feature extraction process
k<<n
Proposed Approach…
12
• Sequential forward selection, wrapper approach:
o Let 𝑌 holds already selected features, then
𝑆 = 𝑆 ∪ min
𝑋 𝑖
(𝑒𝑟𝑟𝑜𝑟(𝑆 ∪ 𝑋𝑖) + 𝜆 ∗ 𝐶𝑜𝑠𝑡(𝑆 ∪ 𝑋𝑖)/(|𝑆 ∪ 𝑋𝑖|)
where 𝜆 tradeoff ,
𝑆 ∪ 𝑋𝑖 −feature cardinality
𝑋𝑖 − random candidate feature
o Stopping criterion: add feature 𝑋𝑖 to Y such that
𝐶 𝑌 ∪ 𝑋𝑖 > 𝐶 𝑌
Feature Extraction: Dependency Graph
13
Experimental Setup
14
• Feature selection on the NAR dataset recorded by
humanoid robot NAO
Classes Audio Features
• 21 classes (door open,
door close, fridge open
close, moving chair, …)
• 431 examples (Sound
Scene)
35 (14-MFCC, 12-GTCC,
pitch Cross-correlation,
Spectral Rolloff, Spectral
Flatness, Spectral Flux,
Spectral Kurtosis , Spectral
Skewness , Spectral Slope,
Spectral Spread and ZCR)
Results
15
Questions

More Related Content

ODP
Deep Recurrent Neural Network for Multi-target Filtering
PPTX
Speaker Dependent WaveNet Vocoder
PDF
S1140183 Presentation
PDF
SuMo-SS: Submodular Optimization Sensor Scattering for Deploying Sensor Netwo...
PPTX
Extend Your Journey: Considering Signal Strength and Fluctuation in Location-...
PPTX
Seminar nov2017
PPTX
The ACTION Project: Applications Coordinate with Transport, IP and Optical Ne...
PDF
Deep Learning and Tensorflow Implementation(딥러닝, 텐서플로우, 파이썬, CNN)_Myungyon Ki...
Deep Recurrent Neural Network for Multi-target Filtering
Speaker Dependent WaveNet Vocoder
S1140183 Presentation
SuMo-SS: Submodular Optimization Sensor Scattering for Deploying Sensor Netwo...
Extend Your Journey: Considering Signal Strength and Fluctuation in Location-...
Seminar nov2017
The ACTION Project: Applications Coordinate with Transport, IP and Optical Ne...
Deep Learning and Tensorflow Implementation(딥러닝, 텐서플로우, 파이썬, CNN)_Myungyon Ki...

Similar to Active Learning for Acoustic Classification AND Power Aware Feature Selection for Audio/Sound Scene (20)

PDF
Lecture 11 - KNN and Clustering, a lecture in subject module Statistical & Ma...
PDF
l1_introduction.pdf
PPTX
Energy Monitoring With Self-taught Deep Network
PPT
Toward Rich, User-Defined Aggregation & Subset-Selection Services
PPT
Semantics in Sensor Networks
PDF
DutchMLSchool 2022 - History and Developments in ML
PDF
Training machine learning knn 2017
PPTX
K-Nearest Neighbor Classifier
PPT
Applications of ann_in_microwave_engineering
PDF
Data wrangling week 10
PDF
Sara Hooker & Sean McPherson, Delta Analytics, at MLconf Seattle 2017
PPTX
Density based clustering
PDF
Module 1-Introduction to Signals and Systems.pdf
PDF
Garuda Robotics x DataScience SG Meetup (Sep 2015)
PPT
Presentations on basic understanding of networm management
PPTX
background.pptx
PDF
03-06-ACA-Input-FeatureLearning.pdf
PPT
5.1 mining data streams
PPTX
Gradient Based Power Line Insulator Detection
PPTX
ETA Prediction with Graph Neural Networks in Google Maps
Lecture 11 - KNN and Clustering, a lecture in subject module Statistical & Ma...
l1_introduction.pdf
Energy Monitoring With Self-taught Deep Network
Toward Rich, User-Defined Aggregation & Subset-Selection Services
Semantics in Sensor Networks
DutchMLSchool 2022 - History and Developments in ML
Training machine learning knn 2017
K-Nearest Neighbor Classifier
Applications of ann_in_microwave_engineering
Data wrangling week 10
Sara Hooker & Sean McPherson, Delta Analytics, at MLconf Seattle 2017
Density based clustering
Module 1-Introduction to Signals and Systems.pdf
Garuda Robotics x DataScience SG Meetup (Sep 2015)
Presentations on basic understanding of networm management
background.pptx
03-06-ACA-Input-FeatureLearning.pdf
5.1 mining data streams
Gradient Based Power Line Insulator Detection
ETA Prediction with Graph Neural Networks in Google Maps
Ad

Recently uploaded (20)

PDF
The CXO Playbook 2025 – Future-Ready Strategies for C-Suite Leaders Cerebrai...
PDF
Model Code of Practice - Construction Work - 21102022 .pdf
PDF
Mitigating Risks through Effective Management for Enhancing Organizational Pe...
PPTX
CH1 Production IntroductoryConcepts.pptx
PDF
Evaluating the Democratization of the Turkish Armed Forces from a Normative P...
PDF
Digital Logic Computer Design lecture notes
PPTX
UNIT-1 - COAL BASED THERMAL POWER PLANTS
PPTX
Recipes for Real Time Voice AI WebRTC, SLMs and Open Source Software.pptx
PDF
composite construction of structures.pdf
PDF
Mohammad Mahdi Farshadian CV - Prospective PhD Student 2026
PPTX
FINAL REVIEW FOR COPD DIANOSIS FOR PULMONARY DISEASE.pptx
PPTX
bas. eng. economics group 4 presentation 1.pptx
PPTX
Infosys Presentation by1.Riyan Bagwan 2.Samadhan Naiknavare 3.Gaurav Shinde 4...
PDF
BMEC211 - INTRODUCTION TO MECHATRONICS-1.pdf
PPTX
OOP with Java - Java Introduction (Basics)
PPTX
M Tech Sem 1 Civil Engineering Environmental Sciences.pptx
PPTX
Welding lecture in detail for understanding
PPTX
web development for engineering and engineering
PPTX
MCN 401 KTU-2019-PPE KITS-MODULE 2.pptx
PPT
Mechanical Engineering MATERIALS Selection
The CXO Playbook 2025 – Future-Ready Strategies for C-Suite Leaders Cerebrai...
Model Code of Practice - Construction Work - 21102022 .pdf
Mitigating Risks through Effective Management for Enhancing Organizational Pe...
CH1 Production IntroductoryConcepts.pptx
Evaluating the Democratization of the Turkish Armed Forces from a Normative P...
Digital Logic Computer Design lecture notes
UNIT-1 - COAL BASED THERMAL POWER PLANTS
Recipes for Real Time Voice AI WebRTC, SLMs and Open Source Software.pptx
composite construction of structures.pdf
Mohammad Mahdi Farshadian CV - Prospective PhD Student 2026
FINAL REVIEW FOR COPD DIANOSIS FOR PULMONARY DISEASE.pptx
bas. eng. economics group 4 presentation 1.pptx
Infosys Presentation by1.Riyan Bagwan 2.Samadhan Naiknavare 3.Gaurav Shinde 4...
BMEC211 - INTRODUCTION TO MECHATRONICS-1.pdf
OOP with Java - Java Introduction (Basics)
M Tech Sem 1 Civil Engineering Environmental Sciences.pptx
Welding lecture in detail for understanding
web development for engineering and engineering
MCN 401 KTU-2019-PPE KITS-MODULE 2.pptx
Mechanical Engineering MATERIALS Selection
Ad

Active Learning for Acoustic Classification AND Power Aware Feature Selection for Audio/Sound Scene

  • 1. Active Learning for Acoustic Classification AND Power Aware Feature Selection for Audio/Sound Scene Mulu W. Adhana /Groep T Campus
  • 2. Outline 2 • PhD Topic: Active Learning for Acoustic Scene/Event Classification o Active Learning: Introduction, principles, … o Experimental Results o Current Activities: adaptive, transfer learning, … • Power Aware Feature Selection o Proposed Method o Experiments o Conclusions
  • 3. Active Learning: Introduction 3 • Traditional machine learning Techniques (Supervised) o Abundant labeled data o Labeling: labour intensive costly and error prone o Domain expert • Semi-supervised learning • Active learning o Seed information (very small in size) o Sample unlabeled data points for annotation o Involves user to actively annotate data point • Reserved for informative/ambiguous ones
  • 5. Active Learning in Practice 5 ? automatically assigned ? Small distance to separating hyperplane Ask the user
  • 6. Active Learning: Sampling for annotation 6 • How ambiguous? o Distance from separating line o Probability assigned to the data point o Entropy of the probability assigned to the data point
  • 7. Active Learning: KLR 7 • KLR: alternative to SVM with probabilistic outcome o Natural extension to multi-class problem o Probabilistic outcome (𝑝𝑖 = 𝒀 = 𝑦𝑖 𝑿 = 𝑥 , 𝑖 = 1, … , 𝐶) o Put threshold on some uncertainty measure: e.g. 𝑝 𝐶 − 𝑝 𝐶−1 > 𝑇 [2]
  • 8. Results: KLR Active Learning 8 NAR dataset: • 21 classes • 431 annotated examples • 10-fold cross- validation • Reduce 84% manual annotation
  • 9. Active, Transfer and Incremental Learning 9 Annotated dataset from different domain Small Annotated dataset Source-task • From different env’t • With different sensing devices Knowledge Target-task • Slightly different from the source task
  • 10. Power Aware Feature Selection 10 • Wearable Sensor Networks draw energy from battery o Sensing, processing raw data, packet size, o bit-depth while acquiring sound scene/event, … • On-node processing. e.g. movement monitoring o Extracting expensive features => Energy depletion • Select inexpensive and information carrying features
  • 11. Proposed Approach 11 • Apply two criteria: Error rate and cost of extracting a feature/subset of features(CPU-time) • Incorporating dependency of feature extraction process k<<n
  • 12. Proposed Approach… 12 • Sequential forward selection, wrapper approach: o Let 𝑌 holds already selected features, then 𝑆 = 𝑆 ∪ min 𝑋 𝑖 (𝑒𝑟𝑟𝑜𝑟(𝑆 ∪ 𝑋𝑖) + 𝜆 ∗ 𝐶𝑜𝑠𝑡(𝑆 ∪ 𝑋𝑖)/(|𝑆 ∪ 𝑋𝑖|) where 𝜆 tradeoff , 𝑆 ∪ 𝑋𝑖 −feature cardinality 𝑋𝑖 − random candidate feature o Stopping criterion: add feature 𝑋𝑖 to Y such that 𝐶 𝑌 ∪ 𝑋𝑖 > 𝐶 𝑌
  • 14. Experimental Setup 14 • Feature selection on the NAR dataset recorded by humanoid robot NAO Classes Audio Features • 21 classes (door open, door close, fridge open close, moving chair, …) • 431 examples (Sound Scene) 35 (14-MFCC, 12-GTCC, pitch Cross-correlation, Spectral Rolloff, Spectral Flatness, Spectral Flux, Spectral Kurtosis , Spectral Skewness , Spectral Slope, Spectral Spread and ZCR)

Editor's Notes

  • #8: [1] Karsmakers, Peter, Kristiaan Pelckmans, and Johan AK Suykens. "Multi-class kernel logistic regression: a fixed-size implementation." Neural Networks, 2007. IJCNN 2007. International Joint Conference on. IEEE, 2007. [2] Adhana, Mulu Weldegebreal, Bart Vanrumste, and Peter Karsmakers. "Active Learning for Audio-based Home Monitoring." Proceedings of Benelearn 2016. No. Epub ahead of print. 2016.