User-Driven Quality Enhancement
for Audio Signal Processing
Convention Paper 8823
D. Comminiello, S. Scardapane, M. Scarpiniti, A. Uncini
134th AES CONVENTION
Rome, Italy – 2013 May 4-7
Outline of the Talk
(1)
• Problem Definition
• Our Framework
(2)
• Applications
• Immersive Speech, Games…
(3)
• Preliminary Results
• Conclusions
Understanding the User
Audio Processor
Personal Judgement
Development Stage
???
Evaluation Procedures
Objective Indexes
Psychoacoustic Models
Subjective Tests
Classical Approach
Our Approach
Possible
Enhanced
Audio
Interactive Evolutionary Algorithm
Pool of Possible Settings
Subjective Evaluation
«Reproduction»
Selection
New Pool
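The cycle on this slide can be sketched in a few lines of Python. This is only an illustrative sketch, not the authors' implementation: the function `interactive_evolution`, the parameters `keep` and `sigma`, the crossover and mutation operators, and the `simulated_listener` stand-in are all assumptions made for the example. In a real session, the rating function would play the processed audio and collect a score from the listener.

```python
import random

def interactive_evolution(pool, rate, generations=5, keep=2, sigma=0.1, seed=0):
    """Evolve a pool of settings using listener ratings as the fitness.

    pool : list of settings, each a list of floats (hypothetical parameters)
    rate : callable returning a coarse score for one setting (assumed here;
           a real session would ask the listener after playback)
    """
    rng = random.Random(seed)
    size = len(pool)
    for _ in range(generations):
        # Subjective evaluation: rank the candidates by their rating.
        scored = sorted(pool, key=rate, reverse=True)
        # Selection: the best-rated settings survive unchanged.
        parents = scored[:keep]
        # "Reproduction": uniform crossover of two parents plus Gaussian mutation.
        children = []
        while len(parents) + len(children) < size:
            a, b = rng.sample(parents, 2)
            child = [rng.choice(pair) + rng.gauss(0.0, sigma) for pair in zip(a, b)]
            children.append(child)
        # New pool for the next round of listening.
        pool = parents + children
    # Best-rated setting from the last evaluated pool.
    return parents[0]

def simulated_listener(setting):
    # Stand-in for a person: 1 to 5 stars, higher when closer to a hidden target.
    target = [0.5, 0.2]
    dist = sum(abs(x - t) for x, t in zip(setting, target))
    return max(1, 5 - int(dist * 4))

rng0 = random.Random(1)
initial_pool = [[rng0.random(), rng0.random()] for _ in range(6)]
best = interactive_evolution(initial_pool, simulated_listener)
```

Because the top-rated settings are carried over unchanged, the best rating never decreases from one round to the next, as long as the rater is consistent. The small pool and the few generations reflect the fact that every evaluation costs listening time.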
Main Drawbacks
User Fatigue
User Discrimination
Fast convergence obtained with few possible fitness values
Time Constraint
Partial Ordering
Our Proposal
1. Interactive evolutionary computation (IEC) has seldom been used in audio processing applications.
2. We believe it to be of practical interest for a wide range of tasks.
3. This is the main reason we are proposing this framework.
Applications – Games
Audio for Games
Applications – Forensic Audio
Forensic Audio
(Image property of SoundAndSound)
Applications – Immersive Experience
Immersive Audio
(Image property of Integrated Media Systems Center)
Interactive Acoustic Echo Cancellation (AEC)
Test Setup
1. Five signals distorted by a female voice.
2. Echo cancellation through an affine projection algorithm (APA) with 4 parameters.
3. A standard genetic algorithm (GA) minimizes the normalized misalignment.
4. An interactive GA (IGA) should instead optimize for the user's preferences.
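The formula for the normalized misalignment is not preserved in this text. As a hedged illustration, the sketch below assumes the definition commonly used in the echo cancellation literature, 20·log10(‖w_true − w_est‖ / ‖w_true‖), where `w_true` is the true echo path and `w_est` is the adaptive filter's estimate. The function and variable names are hypothetical, and the exact expression on the original slide may differ.

```python
import math

def normalized_misalignment_db(w_true, w_est):
    """Normalized misalignment in dB (assumed common definition).

    w_true : coefficients of the true echo path (known only in simulation)
    w_est  : coefficients estimated by the adaptive filter
    """
    err = math.sqrt(sum((a - b) ** 2 for a, b in zip(w_true, w_est)))
    ref = math.sqrt(sum(a ** 2 for a in w_true))
    if err == 0.0:
        # Perfect identification: the misalignment is unbounded below.
        return float("-inf")
    return 20.0 * math.log10(err / ref)

# Toy example: the estimate is off by 10% on the only non-zero tap.
example_db = normalized_misalignment_db([1.0, 0.0], [0.9, 0.0])
```

A standard GA can use a value like this directly as the cost of one parameter setting, since lower values mean a closer estimate of the echo path. An interactive GA has no such closed-form cost and relies on the listener's score instead.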
Test Setup - Workflow
Original Signals
IEC Minimization: Set 1
Standard Minimization: Set 2
Subjective Comparison
Results
Preferences: IGA 55%, GA 28%, UNKNOWN 17%
Conclusions
1. Good results of our framework on AEC.
2. User fatigue is the main drawback to be confronted.
3. Several applications await in the future.
4. Possible combination of objective and subjective measurements.
Thanks for your attention!
simone.scardapane@uniroma1.it

Editor's Notes

  • #4: «Trivial» paradox.