SlideShare a Scribd company logo
2 December 2005
Fusion in Multimodal Interactive Systems:
An HMM-Based Algorithm for User-Induced
Adaptation
Bruno Dumas1, Beat Signer1 and Denis Lalanne2
1 WISE Research Lab, Vrije Universiteit Brussel, Belgium
2 DIVA Research Group, University of Fribourg, Switzerland
Bruno Dumas – WISE resarch lab – Department of Computer Science - bdumas@vub.ac.be 2
June 26, 2012
Outline
• Fusion of multimodal input
• Multimodal interaction: from design to implementation
• Algorithms for fusion of multimodal input
• A Hidden Markov Model-based fusion algorithm
• Instantiation of the algorithm
• Evaluation of the algorithm
• Qualitative test
• Quantitative evaluation
• Performance assessment
• Conclusion
Bruno Dumas – WISE resarch lab – Department of Computer Science - bdumas@vub.ac.be 3
June 26, 2012
Fusion of Multimodal Input
 Challenges in multimodal interaction engineering:
continous and probabilistic inputs, real-time…
Bruno Dumas – WISE resarch lab – Department of Computer Science - bdumas@vub.ac.be 4
June 26, 2012
Challenging/Ambiguous Fusion Cases
• “Put-that-there”
 Complementarity case
• “Play next track”
 Different meanings based on order and context
Bruno Dumas – WISE resarch lab – Department of Computer Science - bdumas@vub.ac.be 5
June 26, 2012
CARE Properties at the Design Level
 Describe how modalities can be combined
 Complementarity
 All modalities are necessary
 Assignment
 Absence of choice
 Redundancy
 Same expressive power between modalities
 Equivalence
 Any modality is sufficient
Bruno Dumas – WISE resarch lab – Department of Computer Science - bdumas@vub.ac.be 6
June 26, 2012
HephaisTK Multimodal Framework
Bruno Dumas – WISE resarch lab – Department of Computer Science - bdumas@vub.ac.be 7
June 26, 2012
Description of Multimodal Dialogues
• Syntactic-level description
• For more details: check
SMUIML language
Bruno Dumas – WISE resarch lab – Department of Computer Science - bdumas@vub.ac.be 8
June 26, 2012
Fusion Algorithms
• Frame-based algorithms & Unification-
based algorithms
• Symbolic approaches
• Both are a de facto standard
• Symbolic-statistical algorithms
Bruno Dumas – WISE resarch lab – Department of Computer Science - bdumas@vub.ac.be 9
June 26, 2012
Symbolic-Statistical Approaches
 Symbolic data
 Interpretation of low-level data
 Statistical processing
 Machine learning algorithms
Bruno Dumas – WISE resarch lab – Department of Computer Science - bdumas@vub.ac.be 10
June 26, 2012
Our HMM-Based Fusion Algorithm
 Why hidden Markov models?
 Very good modelling of time-related events
 Takes advantage of input data coming from
probabilistic input modalities
 On-the-fly adaptation of the model
- e.g. based on user feedback
Bruno Dumas – WISE resarch lab – Department of Computer Science - bdumas@vub.ac.be 11
June 26, 2012
Put-That-There Example
Bruno Dumas – WISE resarch lab – Department of Computer Science - bdumas@vub.ac.be 12
June 26, 2012
Put-That-There Example
Bruno Dumas – WISE resarch lab – Department of Computer Science - bdumas@vub.ac.be 13
June 26, 2012
Put-That-There Example
Bruno Dumas – WISE resarch lab – Department of Computer Science - bdumas@vub.ac.be 14
June 26, 2012
Put-That-There Example
Bruno Dumas – WISE resarch lab – Department of Computer Science - bdumas@vub.ac.be 15
June 26, 2012
Put-That-There Example
Bruno Dumas – WISE resarch lab – Department of Computer Science - bdumas@vub.ac.be 16
June 26, 2012
Put-That-There Example
Bruno Dumas – WISE resarch lab – Department of Computer Science - bdumas@vub.ac.be 17
June 26, 2012
Instantiation of the Algorithm
Bruno Dumas – WISE resarch lab – Department of Computer Science - bdumas@vub.ac.be 18
June 26, 2012
Evaluation of the Algorithm
 Evaluation on three levels
 Qualitative “gut check” test
 Benchmarks on real-world examples
 Performance assessment
Bruno Dumas – WISE resarch lab – Department of Computer Science - bdumas@vub.ac.be 19
June 26, 2012
Evaluation: Qualitative Test
• Three expert users were asked to assess the
behaviour of the algorithm
 frames-based and HMM-based fusion
 5 minutes per condition + interview
• Coherent behaviour between the algorithms
• No noticeable difference
in responsiveness
Bruno Dumas – WISE resarch lab – Department of Computer Science - bdumas@vub.ac.be 20
June 26, 2012
Evaluation: Benchmark
Bruno Dumas – WISE resarch lab – Department of Computer Science - bdumas@vub.ac.be 21
June 26, 2012
Evaluation: Benchmark
Bruno Dumas – WISE resarch lab – Department of Computer Science - bdumas@vub.ac.be 22
June 26, 2012
Benchmark Results (1)
• CARE properties: equivalence and redundancy tests
Bruno Dumas – WISE resarch lab – Department of Computer Science - bdumas@vub.ac.be 23
June 26, 2012
Benchmark Results (2)
• Sequential and non-sequential complementarity tests
Bruno Dumas – WISE resarch lab – Department of Computer Science - bdumas@vub.ac.be 24
June 26, 2012
Benchmark Results (3)
• “Play next track” example
Meaning
frames
HMM-based
Bruno Dumas – WISE resarch lab – Department of Computer Science - bdumas@vub.ac.be 25
June 26, 2012
Evaluation: Performance
• Per fusion algorithm:
• 5 runs of 40 input pieces
• 5 x 20 expected fusion results
• Frame-based: 18.2 ms
• Standard deviation: 12.7 ms
• HMM-based: 16.6 ms
• Standard deviation: 11.6 ms
Bruno Dumas – WISE resarch lab – Department of Computer Science - bdumas@vub.ac.be 26
June 26, 2012
Conclusion
• A new symbolic-statistical multimodal fusion
algorithm based on Hidden Markov Models
• Integrated into the HephaisTK framework
• Superior recognition rates compared to existing
algorithms
• Handling of ambiguous cases
• Processing of real-time user feedback
http://wise.vub.ac.be/bruno-dumas

More Related Content

PPTX
Resarch methodology
PPTX
Magazine resarch
PPTX
Vibration Energy Harvesting: Going Beyond Idealization
PDF
How to Become a Data Scientist
KEY
Intro to Data Science for Enterprise Big Data
PDF
The Science of Memorable Presentations
PPT
Organization Development
PPTX
Payment and Settlement Systems(SWIFT,NEFT and Securities Cycle)
Resarch methodology
Magazine resarch
Vibration Energy Harvesting: Going Beyond Idealization
How to Become a Data Scientist
Intro to Data Science for Enterprise Big Data
The Science of Memorable Presentations
Organization Development
Payment and Settlement Systems(SWIFT,NEFT and Securities Cycle)

Similar to Fusion in Multimodal Interactive Systems: An HMM-Based Algorithm for User-Induced Adaptation (20)

PPTX
21st century skills 03FEB15
PPTX
Bb on Tour 2016 | Innovation and Your Institution (Part 2) – Open Source Road...
PPTX
BPM Cluster Meeting 2018
PDF
ECIR Recommendation Challenges
PPTX
MBUID survey
PPTX
Lecture-2 Applied ML .pptx
PPTX
Validation of Design Tools-PPT for CEDAR Meeting-04-15-2016
PPTX
Lecture_2_Deep_Learning_Overview (1).pptx
PPTX
BIM, Big Data and Mashup in Architectural Computing – Experimenting with Digi...
PDF
Architectual Models Distributed System Presentation
PDF
Distributed System Presentation Chapter 2
PPT
Selection of an appropriate project approach
PDF
Challenges of Cloud Monitoring
PDF
Privacy and Security in Multi-modal User Interface Modeling for Social Media
PPTX
Usability evaluation methods (part 2) and performance metrics
PDF
Course Review - Lecture 13 - Introduction to Databases (1007156ANR)
PDF
Innovation day 2012 16. koenraad rombaut & michiel de paepe - verhaert - mo...
PDF
CASCON06.ppt
PPTX
Formal Method
PPTX
Introduction to mechatronics
21st century skills 03FEB15
Bb on Tour 2016 | Innovation and Your Institution (Part 2) – Open Source Road...
BPM Cluster Meeting 2018
ECIR Recommendation Challenges
MBUID survey
Lecture-2 Applied ML .pptx
Validation of Design Tools-PPT for CEDAR Meeting-04-15-2016
Lecture_2_Deep_Learning_Overview (1).pptx
BIM, Big Data and Mashup in Architectural Computing – Experimenting with Digi...
Architectual Models Distributed System Presentation
Distributed System Presentation Chapter 2
Selection of an appropriate project approach
Challenges of Cloud Monitoring
Privacy and Security in Multi-modal User Interface Modeling for Social Media
Usability evaluation methods (part 2) and performance metrics
Course Review - Lecture 13 - Introduction to Databases (1007156ANR)
Innovation day 2012 16. koenraad rombaut & michiel de paepe - verhaert - mo...
CASCON06.ppt
Formal Method
Introduction to mechatronics
Ad

Recently uploaded (20)

PDF
Encapsulation_ Review paper, used for researhc scholars
PDF
Unlocking AI with Model Context Protocol (MCP)
PDF
Getting Started with Data Integration: FME Form 101
PDF
Approach and Philosophy of On baking technology
PDF
Hybrid model detection and classification of lung cancer
PDF
A comparative study of natural language inference in Swahili using monolingua...
PDF
Hindi spoken digit analysis for native and non-native speakers
PDF
1 - Historical Antecedents, Social Consideration.pdf
PPTX
Chapter 5: Probability Theory and Statistics
PDF
Transform Your ITIL® 4 & ITSM Strategy with AI in 2025.pdf
PDF
Web App vs Mobile App What Should You Build First.pdf
PDF
Agricultural_Statistics_at_a_Glance_2022_0.pdf
PDF
Microsoft Solutions Partner Drive Digital Transformation with D365.pdf
PPTX
Tartificialntelligence_presentation.pptx
PPTX
TLE Review Electricity (Electricity).pptx
PDF
DP Operators-handbook-extract for the Mautical Institute
PDF
Accuracy of neural networks in brain wave diagnosis of schizophrenia
PDF
A comparative analysis of optical character recognition models for extracting...
PDF
Video forgery: An extensive analysis of inter-and intra-frame manipulation al...
PPTX
OMC Textile Division Presentation 2021.pptx
Encapsulation_ Review paper, used for researhc scholars
Unlocking AI with Model Context Protocol (MCP)
Getting Started with Data Integration: FME Form 101
Approach and Philosophy of On baking technology
Hybrid model detection and classification of lung cancer
A comparative study of natural language inference in Swahili using monolingua...
Hindi spoken digit analysis for native and non-native speakers
1 - Historical Antecedents, Social Consideration.pdf
Chapter 5: Probability Theory and Statistics
Transform Your ITIL® 4 & ITSM Strategy with AI in 2025.pdf
Web App vs Mobile App What Should You Build First.pdf
Agricultural_Statistics_at_a_Glance_2022_0.pdf
Microsoft Solutions Partner Drive Digital Transformation with D365.pdf
Tartificialntelligence_presentation.pptx
TLE Review Electricity (Electricity).pptx
DP Operators-handbook-extract for the Mautical Institute
Accuracy of neural networks in brain wave diagnosis of schizophrenia
A comparative analysis of optical character recognition models for extracting...
Video forgery: An extensive analysis of inter-and intra-frame manipulation al...
OMC Textile Division Presentation 2021.pptx
Ad

Fusion in Multimodal Interactive Systems: An HMM-Based Algorithm for User-Induced Adaptation

  • 1. 2 December 2005 Fusion in Multimodal Interactive Systems: An HMM-Based Algorithm for User-Induced Adaptation Bruno Dumas1, Beat Signer1 and Denis Lalanne2 1 WISE Research Lab, Vrije Universiteit Brussel, Belgium 2 DIVA Research Group, University of Fribourg, Switzerland
  • 2. Bruno Dumas – WISE resarch lab – Department of Computer Science - bdumas@vub.ac.be 2 June 26, 2012 Outline • Fusion of multimodal input • Multimodal interaction: from design to implementation • Algorithms for fusion of multimodal input • A Hidden Markov Model-based fusion algorithm • Instantiation of the algorithm • Evaluation of the algorithm • Qualitative test • Quantitative evaluation • Performance assessment • Conclusion
  • 3. Bruno Dumas – WISE resarch lab – Department of Computer Science - bdumas@vub.ac.be 3 June 26, 2012 Fusion of Multimodal Input  Challenges in multimodal interaction engineering: continous and probabilistic inputs, real-time…
  • 4. Bruno Dumas – WISE resarch lab – Department of Computer Science - bdumas@vub.ac.be 4 June 26, 2012 Challenging/Ambiguous Fusion Cases • “Put-that-there”  Complementarity case • “Play next track”  Different meanings based on order and context
  • 5. Bruno Dumas – WISE resarch lab – Department of Computer Science - bdumas@vub.ac.be 5 June 26, 2012 CARE Properties at the Design Level  Describe how modalities can be combined  Complementarity  All modalities are necessary  Assignment  Absence of choice  Redundancy  Same expressive power between modalities  Equivalence  Any modality is sufficient
  • 6. Bruno Dumas – WISE resarch lab – Department of Computer Science - bdumas@vub.ac.be 6 June 26, 2012 HephaisTK Multimodal Framework
  • 7. Bruno Dumas – WISE resarch lab – Department of Computer Science - bdumas@vub.ac.be 7 June 26, 2012 Description of Multimodal Dialogues • Syntactic-level description • For more details: check SMUIML language
  • 8. Bruno Dumas – WISE resarch lab – Department of Computer Science - bdumas@vub.ac.be 8 June 26, 2012 Fusion Algorithms • Frame-based algorithms & Unification- based algorithms • Symbolic approaches • Both are a de facto standard • Symbolic-statistical algorithms
  • 9. Bruno Dumas – WISE resarch lab – Department of Computer Science - bdumas@vub.ac.be 9 June 26, 2012 Symbolic-Statistical Approaches  Symbolic data  Interpretation of low-level data  Statistical processing  Machine learning algorithms
  • 10. Bruno Dumas – WISE resarch lab – Department of Computer Science - bdumas@vub.ac.be 10 June 26, 2012 Our HMM-Based Fusion Algorithm  Why hidden Markov models?  Very good modelling of time-related events  Takes advantage of input data coming from probabilistic input modalities  On-the-fly adaptation of the model - e.g. based on user feedback
  • 11. Bruno Dumas – WISE resarch lab – Department of Computer Science - bdumas@vub.ac.be 11 June 26, 2012 Put-That-There Example
  • 12. Bruno Dumas – WISE resarch lab – Department of Computer Science - bdumas@vub.ac.be 12 June 26, 2012 Put-That-There Example
  • 13. Bruno Dumas – WISE resarch lab – Department of Computer Science - bdumas@vub.ac.be 13 June 26, 2012 Put-That-There Example
  • 14. Bruno Dumas – WISE resarch lab – Department of Computer Science - bdumas@vub.ac.be 14 June 26, 2012 Put-That-There Example
  • 15. Bruno Dumas – WISE resarch lab – Department of Computer Science - bdumas@vub.ac.be 15 June 26, 2012 Put-That-There Example
  • 16. Bruno Dumas – WISE resarch lab – Department of Computer Science - bdumas@vub.ac.be 16 June 26, 2012 Put-That-There Example
  • 17. Bruno Dumas – WISE resarch lab – Department of Computer Science - bdumas@vub.ac.be 17 June 26, 2012 Instantiation of the Algorithm
  • 18. Bruno Dumas – WISE resarch lab – Department of Computer Science - bdumas@vub.ac.be 18 June 26, 2012 Evaluation of the Algorithm  Evaluation on three levels  Qualitative “gut check” test  Benchmarks on real-world examples  Performance assessment
  • 19. Bruno Dumas – WISE resarch lab – Department of Computer Science - bdumas@vub.ac.be 19 June 26, 2012 Evaluation: Qualitative Test • Three expert users were asked to assess the behaviour of the algorithm  frames-based and HMM-based fusion  5 minutes per condition + interview • Coherent behaviour between the algorithms • No noticeable difference in responsiveness
  • 20. Bruno Dumas – WISE resarch lab – Department of Computer Science - bdumas@vub.ac.be 20 June 26, 2012 Evaluation: Benchmark
  • 21. Bruno Dumas – WISE resarch lab – Department of Computer Science - bdumas@vub.ac.be 21 June 26, 2012 Evaluation: Benchmark
  • 22. Bruno Dumas – WISE resarch lab – Department of Computer Science - bdumas@vub.ac.be 22 June 26, 2012 Benchmark Results (1) • CARE properties: equivalence and redundancy tests
  • 23. Bruno Dumas – WISE resarch lab – Department of Computer Science - bdumas@vub.ac.be 23 June 26, 2012 Benchmark Results (2) • Sequential and non-sequential complementarity tests
  • 24. Bruno Dumas – WISE resarch lab – Department of Computer Science - bdumas@vub.ac.be 24 June 26, 2012 Benchmark Results (3) • “Play next track” example Meaning frames HMM-based
  • 25. Bruno Dumas – WISE resarch lab – Department of Computer Science - bdumas@vub.ac.be 25 June 26, 2012 Evaluation: Performance • Per fusion algorithm: • 5 runs of 40 input pieces • 5 x 20 expected fusion results • Frame-based: 18.2 ms • Standard deviation: 12.7 ms • HMM-based: 16.6 ms • Standard deviation: 11.6 ms
  • 26. Bruno Dumas – WISE resarch lab – Department of Computer Science - bdumas@vub.ac.be 26 June 26, 2012 Conclusion • A new symbolic-statistical multimodal fusion algorithm based on Hidden Markov Models • Integrated into the HephaisTK framework • Superior recognition rates compared to existing algorithms • Handling of ambiguous cases • Processing of real-time user feedback http://wise.vub.ac.be/bruno-dumas