SlideShare a Scribd company logo
An Experiment with Voice Recognition to
Improve Call Center Quality
Lew Berman, PhD, MS
Don Allen
Chuck Akin
Josh Duell
Matt Jans, PhD
May 21, 2018
2018 International Field Directors and Technology Conference
Denver, Colorado
Agenda
 Framing the problem
 Monitoring metrics
 Experimental design
 Results
 Discussion
2
Framing the problem
 Call center monitoring for large-scale national phone surveys
 Current approach
 Record all interviews and save recordings for 60 days
 Randomly select ~10% of ‘survey time’
 Include a variety of dispositions
 Challenges
 Many files and considerable disk storage, much of which is not actively used
 Manually monitor calls
 Costly to review calls and store files
 Difficult to track performance over time or in real-time
 Thus, investigating automation
3
Current Approaches
 Common elements, varying by mode
 Silent monitoring during call
 Direct observations
 Telephone verification interviews
 Record and review interview (CARI)
 Data checks
 Typically review about 5-10% of all interviews [FEDCASIC 2018]
 RTI Quality Monitoring System [SPEIZER 2008]
 Standardized approach to telephone and in-person survey
 Monitoring of protocols, metrics, and feedback
 Collection of trend data
 Increased efficiency of monitoring operations
4
Monitoring Metrics – Anticipated
Automation Challenges
Automation Challenge
Least challenging,
but not simple
Moderately
challenging
Very challenging
Interviewer Speech
Acceptable speech tempo 
Reads questions verbatim 
Enunciation is clear and not breathy, crackly, strained, etc. 
Avoids monotonic speech patterns 
Avoids repeating words, pauses, filled pauses (“um”), dead air 
Maintains neutrality / impartial and not leading 
Data Capture
Correctly record data 
Correctly record dispositions 
Demeanor / behavior
Addresses resistance 
Converts refusals appropriately 
Engages respondent 
Maintains control of the interview 
Stays on script and does not lead 
Tone is professional and pleasant 
Transitions in acceptable manner 
5
Our Study: Correctly Record Data - Low
Hanging Fruit
 Operational Definition: interviewer correctly records respondent answer and
does not inadvertently or purposefully miscode answers or dispositions
 Reasons to monitor incorrectly recorded data
 Attention diverted
 Interviewers moving between surveys and gets stuck in keystroke patterns
 Falsify data (changing or omitting data)
 Fabricate data (make up data)
 Improve production rates
 Measurement
 Compare response in database to that provided by respondent
6
Experimental Design
 Respondents: 51 ICF employees, friends, and family invited
 42 participated: 20 female (48%), 22 male (52%)
 Native and non-native English speakers
 Interviewers: 3 experienced female interviews
 Received basic training on study
 Study conducted June 2017, Voxco platform, cell phones only
 7-item instrument, but respondents provided with random responses
 Cell phones
 Carriers primarily Verizon (48%) and AT&T (31%)
 Devices primarily iPhones (79%)
 Determined to be non-human subjects research
7
Conceptual View of Voice Recognition Process
for Correctly Record Data
Synonym
Matching
Male same as boy, mail
Female same as girl, woman
* Photos by Unknown Author and are licensed under
CC BY-NC-SA. Changes have been made to
diagrams.
JSON File Output with
Voice Recognition Data
Phone
Interview
with
Respondent
Voxco Survey
Software Platform
Microsoft Voice
Recognition REST API
Wav files for recorded
questions and
responses
Word
Filtering
Hesitations, pausing,
emotional expressions, etc.
8
Voice Recognition Results: Correctly
recorded data exact matches
Total Female Male
Category N % Correct N % Correct N % Correct
Gender Question 42 20 22
Incorrectly Recognized 29 69.05% 11 55.00% 18 81.82%
Correctly Recognized 13 30.95% 9 45.00% 4 18.18%
Number Question 42 20 22
Incorrectly Recognized 14 33.33% 7 35.00% 7 31.82%
Correctly Recognized 28 66.67% 13 65.00% 15 68.18%
Yes-No Question 210 100 110
Incorrectly Recognized 42 20.00% 24 24.00% 18 16.36%
Correctly Recognized 168 80.00% 76 76.00% 92 83.64%
Total 294 71.09% 140 70.00% 154 72.08%
9
Impact of Word Filtering & Synonym
Matching
45%
65%
76%
18%
68%
84%
31%
67%
80%
80%
80%
95%
73%
77%
99%
76%
79%
97%
0%
10%
20%
30%
40%
50%
60%
70%
80%
90%
100%
Gender Number Yes-No Gender Number Yes-No Gender Number Yes-No
Female Male Total
% Respondent Answers Correctly Recognized and Synonym Matched
by Respondent Gender and Question Type
% Correctly Recognized % Correctly Recognized + Synonym Matched
10
Impact of Word Filtering & Synonym
Matching
Respondent
Group /
Question Type
N
% Correctly
Recognized
# Incorrectly
Recognized
# Synonym
Matches
% Synonym
Matched
Total
Recognized
+ Matched
% Recognized +
Synonym
Matched
Female
Gender 20 45.00% 11 7 63.64% 16 80.00%
Number 20 65.00% 7 3 42.86% 16 80.00%
Yes-No 100 76.00% 24 19 79.17% 95 95.00%
140 70% 42 29 69.05% 127 90.71%
Male
Gender 22 18.18% 18 12 66.67% 16 72.73%
Number 22 68.18% 7 2 28.57% 17 77.27%
Yes-No 110 83.64% 18 17 94.44% 109 99.09%
154 72.08% 43 31 72.09% 142 92.21%
Total
Gender 42 30.95% 29 19 65.52% 32 76.19%
Number 42 66.67% 14 5 35.71% 33 78.57%
Yes-No 210 80.00% 42 36 85.71% 204 97.14%
Total 294 71.09% 85 60 70.59% 269 91.50%
Impact of Synonym Matching Overall Improvement 11
Discussion – Technical Aspects
 Results are promising for verification of correctly recording responses
 Filtering
 Currently manual operation, but can be automated
 Requires word corpus (e.g., “um”) and rules such as removing repeats
 Synonym corpus
 Well understood for gender, yes-no
 However, other questions such as scales / categorical questions will need additional effort
12
Discussion – Human Impacts
 Interviewers
 Provide timely feedback for new interviewers
 Follow-up with objective information for interviewers with poor performance
 Should not be used in a punitive manner, but as a tool for corrective action
 Management
 Understand impact of coaching and training
 Produce daily, weekly, monthly trends reports for supervisors, interviewers, and clients
 Sponsor perspective
 Increase level of monitoring
 Focus on key interview questions for 100% monitoring
 Utilize objective measures
13
General Limitations
 No noise-filtering done on sound files
 Occasionally no clean break between interviewer and respondent
 Only considered English interviews
 Did not account for non-native speakers in analyses
 Occasionally the Microsoft Cognition API would respond with errors
14
Voxco Limitations
 Typically records interview as one “.wav” file
 Prefer separate “.wav” files per question and response
 Implications for setup, interviewer adjustments, and run-on question/response
 Manually broke up “.wav” files into interviewer and respondent files
 Future work required to differentiate
 Signal & noise
 Interviewer, respondent, dog, baby, …
15
Future
 Automate the division between interviewer and respondent
 Utilize noise filtering prior to voice recognition
 Look at other question types such as Likert scales and dates
 Automate speech tempo
16
Lew Berman, PhD, MS
Email: lewis.berman@icf.com
Phone: 301-407-6833
17
References
[FEDCASIC 2018] Federal Computer Assisted Survey Information Collection (FedCASIC)Workshops: Roundtable: What Makes a Good
Interviewer? Metrics and Methods for Ensuring Data Quality. (2018) Organizers: Matt Jans, Lew Berman. Moderator: Matt Jans. April
18, 2018, Suitland, MD. https://www.census.gov/fedcasic/fc2018/ppt/full_program_2018.pdf.
[JACEWICZ 2009] Jacewicz E, Fox RA, O’Neill C, Salmons J. Articulation rate across dialect, age, and gender. Language variation and
change. 2009;21(2):233-256. doi:10.1017/S0954394509990093.
[NCVS 150] National Center for Voice and Speech. Tutorials - Voice Production. Voice Qualities.
http://www.ncvs.org/ncvs/tutorials/voiceprod/tutorial/quality.html
[SPEIZER 2008] Speizer H, Kinsey S, Heman-Ackah R, Thissen M. Rita. (2008) Developing a Common, Mode-independent, Approach
for Evaluating Interview Quality and Interviewer Performance. Research Triangle Institute Internal Report.
https://www.rti.org/sites/default/files/resources/kinsey_iii-c.pdf.
[TSAO 2006] Tsao Ying-Chiao, Weismer Gary, Iqbal Kamran. Interspeaker variation in habitual speaking rate: Additional evidence.
Journal of Speech, Language, and Hearing Research. 2006;49:1156–1164.
18

More Related Content

PPT
Rm 5
DOCX
PDF
Improving inferences from poor quality samples
PDF
2 Studies UX types should know about (Straub UXPA unconference13)
PDF
Finding the right wheel
PDF
Improving Your Literature Reviews with NVivo 10 for Windows
PDF
IUE14 Presentation - Studies UX Pros should know
PPSX
Workshop session 5 - the effectiveness of standard methods to improve general...
Rm 5
Improving inferences from poor quality samples
2 Studies UX types should know about (Straub UXPA unconference13)
Finding the right wheel
Improving Your Literature Reviews with NVivo 10 for Windows
IUE14 Presentation - Studies UX Pros should know
Workshop session 5 - the effectiveness of standard methods to improve general...

What's hot (20)

DOC
Online surveys
PPT
Testing group’s work on fault localization
PPTX
Carma internet research module: Sampling for internet
PDF
Self-service design - Eye-tracking Findings That Will Help You Design Forms T...
PPT
网络调查方法研究 -我们该往哪里走
PPTX
Internet-based research
PPT
problem
PPTX
Online survey
ODP
Web application of Antonucci's hierarchical approach for measuring social net...
PPTX
A study of the availability and use of assistive technology with dyslexic pup...
PPT
The mr process
PPTX
Using text-to-speech in exams - practical solutions and pitfalls, UK perspective
PDF
Data and Perception: How American Cancer Society Uses Surveys to Increase Eng...
PPT
Total Survey Error & Institutional Research: A case study of the University E...
PDF
Build your own gene panels 2013
PDF
Usability Testing for Survey Research:How to and Best Practices
PDF
Getting Started and Finishing your Dissertation Using NVivo
PDF
On Good Behavior: Human Factors + Building Performance
PPTX
Meenakshi
PPT
Is our two way communication a one way street? Informal Surveys & Polls in Li...
Online surveys
Testing group’s work on fault localization
Carma internet research module: Sampling for internet
Self-service design - Eye-tracking Findings That Will Help You Design Forms T...
网络调查方法研究 -我们该往哪里走
Internet-based research
problem
Online survey
Web application of Antonucci's hierarchical approach for measuring social net...
A study of the availability and use of assistive technology with dyslexic pup...
The mr process
Using text-to-speech in exams - practical solutions and pitfalls, UK perspective
Data and Perception: How American Cancer Society Uses Surveys to Increase Eng...
Total Survey Error & Institutional Research: A case study of the University E...
Build your own gene panels 2013
Usability Testing for Survey Research:How to and Best Practices
Getting Started and Finishing your Dissertation Using NVivo
On Good Behavior: Human Factors + Building Performance
Meenakshi
Is our two way communication a one way street? Informal Surveys & Polls in Li...
Ad

Similar to IFD&TC 2018: An Experiment with Voice Recognition to Improve Call Center Quality (20)

PPTX
FedCASIC 2019: On Using Cognitive Computing and Machine Learning Tools to Imp...
PPTX
accm-brfss-2022-presentation-draft.pptx
PPTX
IFD&TC 2019: Automating Call Center Monitoring
PPT
Dolování dat z řeči pro bezpečnostní aplikace - Jan Černocký
PDF
De4201715719
PDF
NLP BASED INTERVIEW ASSESSMENT SYSTEM
PPTX
Aspiring Minds | Svar
PDF
Identity authentication using voice biometrics technique
PDF
Cognitive Pre-testing
PDF
Assessing Speaking Skill A Summary
PPT
Class 1: Investigation Interview Preparation Recording
PPT
Chapter4B.ppt
DOCX
PPTX
Stages of test development and common test techniques (1)
PDF
MediaEval2015 - The SPL-IT-UC Query by Example Search on Speech system for Me...
PPT
Employee introduction to PI Selection.ppt
PPTX
Improving Your Surveys and Questionnaires with Cognitive Interviewing
PPTX
ARBS 2014 Presentation
PPT
Benefits of Concurrent Cognitive and Usability Testing
PDF
Voice Signal Synthesis using Non Negative Matrix Factorization
FedCASIC 2019: On Using Cognitive Computing and Machine Learning Tools to Imp...
accm-brfss-2022-presentation-draft.pptx
IFD&TC 2019: Automating Call Center Monitoring
Dolování dat z řeči pro bezpečnostní aplikace - Jan Černocký
De4201715719
NLP BASED INTERVIEW ASSESSMENT SYSTEM
Aspiring Minds | Svar
Identity authentication using voice biometrics technique
Cognitive Pre-testing
Assessing Speaking Skill A Summary
Class 1: Investigation Interview Preparation Recording
Chapter4B.ppt
Stages of test development and common test techniques (1)
MediaEval2015 - The SPL-IT-UC Query by Example Search on Speech system for Me...
Employee introduction to PI Selection.ppt
Improving Your Surveys and Questionnaires with Cognitive Interviewing
ARBS 2014 Presentation
Benefits of Concurrent Cognitive and Usability Testing
Voice Signal Synthesis using Non Negative Matrix Factorization
Ad

More from Lew Berman (14)

PPTX
2022-06-07 Berman Lew Great Plains Conference FINAL.pptx
PPTX
FedCASIC 2019: Survey Respondent Segmentation: Trust in Government Surveys
PPTX
FedCASIC 2019: Dimensions of Participation: Physical Measures
PPTX
FedCASIC 2019: Topic Salience and Propensity to Respond to Surveys: Findings ...
PPTX
FedCASIC 2019: Designing, implementing, and analyzing Leverage Saliency Theor...
PPTX
FedCASIC 2017: Childhood Immunization Attitudes and Behavior: National Survey...
PPTX
IFD&TC 2012: Validating in-home Measures for the National Health Interview Su...
PPTX
IFD&TC 2012: Use of Text Messaging for NHANES
PPTX
IFD&TC 2019: Technical Challenges and Solutions in Center Management
PDF
Data Science Training and Workforce Development
PPTX
Willingness and Reasons for Unlikeliness to Share Child Immunization Records ...
PDF
IFD&TC 2018: A Novel Approach for Conveniently and Securely Collecting Person...
PDF
Berman pcori challenge powerpoint
PDF
Berman pcori challenge document
2022-06-07 Berman Lew Great Plains Conference FINAL.pptx
FedCASIC 2019: Survey Respondent Segmentation: Trust in Government Surveys
FedCASIC 2019: Dimensions of Participation: Physical Measures
FedCASIC 2019: Topic Salience and Propensity to Respond to Surveys: Findings ...
FedCASIC 2019: Designing, implementing, and analyzing Leverage Saliency Theor...
FedCASIC 2017: Childhood Immunization Attitudes and Behavior: National Survey...
IFD&TC 2012: Validating in-home Measures for the National Health Interview Su...
IFD&TC 2012: Use of Text Messaging for NHANES
IFD&TC 2019: Technical Challenges and Solutions in Center Management
Data Science Training and Workforce Development
Willingness and Reasons for Unlikeliness to Share Child Immunization Records ...
IFD&TC 2018: A Novel Approach for Conveniently and Securely Collecting Person...
Berman pcori challenge powerpoint
Berman pcori challenge document

Recently uploaded (20)

PPTX
2025-08-10 Joseph 02 (shared slides).pptx
PPTX
Impressionism_PostImpressionism_Presentation.pptx
PPTX
Self management and self evaluation presentation
PPTX
Tablets And Capsule Preformulation Of Paracetamol
PPTX
3RD-Q 2022_EMPLOYEE RELATION - Copy.pptx
PDF
PM Narendra Modi's speech from Red Fort on 79th Independence Day.pdf
PDF
Nykaa-Strategy-Case-Fixing-Retention-UX-and-D2C-Engagement (1).pdf
PDF
COLEAD A2F approach and Theory of Change
DOCX
ENGLISH PROJECT FOR BINOD BIHARI MAHTO KOYLANCHAL UNIVERSITY
PPTX
nose tajweed for the arabic alphabets for the responsive
PPTX
Research Process - Research Methods course
PDF
Tunisia's Founding Father(s) Pitch-Deck 2022.pdf
PPTX
NORMAN_RESEARCH_PRESENTATION.in education
PPTX
Sustainable Forest Management ..SFM.pptx
PPTX
FINAL TEST 3C_OCTAVIA RAMADHANI SANTOSO-1.pptx
PDF
natwest.pdf company description and business model
PPTX
chapter8-180915055454bycuufucdghrwtrt.pptx
PPTX
PHIL.-ASTRONOMY-AND-NAVIGATION of ..pptx
PPTX
Tour Presentation Educational Activity.pptx
PPT
The Effect of Human Resource Management Practice on Organizational Performanc...
2025-08-10 Joseph 02 (shared slides).pptx
Impressionism_PostImpressionism_Presentation.pptx
Self management and self evaluation presentation
Tablets And Capsule Preformulation Of Paracetamol
3RD-Q 2022_EMPLOYEE RELATION - Copy.pptx
PM Narendra Modi's speech from Red Fort on 79th Independence Day.pdf
Nykaa-Strategy-Case-Fixing-Retention-UX-and-D2C-Engagement (1).pdf
COLEAD A2F approach and Theory of Change
ENGLISH PROJECT FOR BINOD BIHARI MAHTO KOYLANCHAL UNIVERSITY
nose tajweed for the arabic alphabets for the responsive
Research Process - Research Methods course
Tunisia's Founding Father(s) Pitch-Deck 2022.pdf
NORMAN_RESEARCH_PRESENTATION.in education
Sustainable Forest Management ..SFM.pptx
FINAL TEST 3C_OCTAVIA RAMADHANI SANTOSO-1.pptx
natwest.pdf company description and business model
chapter8-180915055454bycuufucdghrwtrt.pptx
PHIL.-ASTRONOMY-AND-NAVIGATION of ..pptx
Tour Presentation Educational Activity.pptx
The Effect of Human Resource Management Practice on Organizational Performanc...

IFD&TC 2018: An Experiment with Voice Recognition to Improve Call Center Quality

  • 1. An Experiment with Voice Recognition to Improve Call Center Quality Lew Berman, PhD, MS Don Allen Chuck Akin Josh Duell Matt Jans, PhD May 21, 2018 2018 International Field Directors and Technology Conference Denver, Colorado
  • 2. Agenda  Framing the problem  Monitoring metrics  Experimental design  Results  Discussion 2
  • 3. Framing the problem  Call center monitoring for large-scale national phone surveys  Current approach  Record all interviews and save recordings for 60 days  Randomly select ~10% of ‘survey time’  Include a variety of dispositions  Challenges  Many files and considerable disk storage, much of which is not actively used  Manually monitor calls  Costly to review calls and store files  Difficult to track performance over time or in real-time  Thus, investigating automation 3
  • 4. Current Approaches  Common elements, varying by mode  Silent monitoring during call  Direct observations  Telephone verification interviews  Record and review interview (CARI)  Data checks  Typically review about 5-10% of all interviews [FEDCASIC 2018]  RTI Quality Monitoring System [SPEIZER 2008]  Standardized approach to telephone and in-person survey  Monitoring of protocols, metrics, and feedback  Collection of trend data  Increased efficiency of monitoring operations 4
  • 5. Monitoring Metrics – Anticipated Automation Challenges Automation Challenge Least challenging, but not simple Moderately challenging Very challenging Interviewer Speech Acceptable speech tempo  Reads questions verbatim  Enunciation is clear and not breathy, crackly, strained, etc.  Avoids monotonic speech patterns  Avoids repeating words, pauses, filled pauses (“um”), dead air  Maintains neutrality / impartial and not leading  Data Capture Correctly record data  Correctly record dispositions  Demeanor / behavior Addresses resistance  Converts refusals appropriately  Engages respondent  Maintains control of the interview  Stays on script and does not lead  Tone is professional and pleasant  Transitions in acceptable manner  5
  • 6. Our Study: Correctly Record Data - Low Hanging Fruit  Operational Definition: interviewer correctly records respondent answer and does not inadvertently or purposefully miscode answers or dispositions  Reasons to monitor incorrectly recorded data  Attention diverted  Interviewers moving between surveys and gets stuck in keystroke patterns  Falsify data (changing or omitting data)  Fabricate data (make up data)  Improve production rates  Measurement  Compare response in database to that provided by respondent 6
  • 7. Experimental Design  Respondents: 51 ICF employees, friends, and family invited  42 participated: 20 female (48%), 22 male (52%)  Native and non-native English speakers  Interviewers: 3 experienced female interviews  Received basic training on study  Study conducted June 2017, Voxco platform, cell phones only  7-item instrument, but respondents provided with random responses  Cell phones  Carriers primarily Verizon (48%) and AT&T (31%)  Devices primarily iPhones (79%)  Determined to be non-human subjects research 7
  • 8. Conceptual View of Voice Recognition Process for Correctly Record Data Synonym Matching Male same as boy, mail Female same as girl, woman * Photos by Unknown Author and are licensed under CC BY-NC-SA. Changes have been made to diagrams. JSON File Output with Voice Recognition Data Phone Interview with Respondent Voxco Survey Software Platform Microsoft Voice Recognition REST API Wav files for recorded questions and responses Word Filtering Hesitations, pausing, emotional expressions, etc. 8
  • 9. Voice Recognition Results: Correctly recorded data exact matches Total Female Male Category N % Correct N % Correct N % Correct Gender Question 42 20 22 Incorrectly Recognized 29 69.05% 11 55.00% 18 81.82% Correctly Recognized 13 30.95% 9 45.00% 4 18.18% Number Question 42 20 22 Incorrectly Recognized 14 33.33% 7 35.00% 7 31.82% Correctly Recognized 28 66.67% 13 65.00% 15 68.18% Yes-No Question 210 100 110 Incorrectly Recognized 42 20.00% 24 24.00% 18 16.36% Correctly Recognized 168 80.00% 76 76.00% 92 83.64% Total 294 71.09% 140 70.00% 154 72.08% 9
  • 10. Impact of Word Filtering & Synonym Matching 45% 65% 76% 18% 68% 84% 31% 67% 80% 80% 80% 95% 73% 77% 99% 76% 79% 97% 0% 10% 20% 30% 40% 50% 60% 70% 80% 90% 100% Gender Number Yes-No Gender Number Yes-No Gender Number Yes-No Female Male Total % Respondent Answers Correctly Recognized and Synonym Matched by Respondent Gender and Question Type % Correctly Recognized % Correctly Recognized + Synonym Matched 10
  • 11. Impact of Word Filtering & Synonym Matching Respondent Group / Question Type N % Correctly Recognized # Incorrectly Recognized # Synonym Matches % Synonym Matched Total Recognized + Matched % Recognized + Synonym Matched Female Gender 20 45.00% 11 7 63.64% 16 80.00% Number 20 65.00% 7 3 42.86% 16 80.00% Yes-No 100 76.00% 24 19 79.17% 95 95.00% 140 70% 42 29 69.05% 127 90.71% Male Gender 22 18.18% 18 12 66.67% 16 72.73% Number 22 68.18% 7 2 28.57% 17 77.27% Yes-No 110 83.64% 18 17 94.44% 109 99.09% 154 72.08% 43 31 72.09% 142 92.21% Total Gender 42 30.95% 29 19 65.52% 32 76.19% Number 42 66.67% 14 5 35.71% 33 78.57% Yes-No 210 80.00% 42 36 85.71% 204 97.14% Total 294 71.09% 85 60 70.59% 269 91.50% Impact of Synonym Matching Overall Improvement 11
  • 12. Discussion – Technical Aspects  Results are promising for verification of correctly recording responses  Filtering  Currently manual operation, but can be automated  Requires word corpus (e.g., “um”) and rules such as removing repeats  Synonym corpus  Well understood for gender, yes-no  However, other questions such as scales / categorical questions will need additional effort 12
  • 13. Discussion – Human Impacts  Interviewers  Provide timely feedback for new interviewers  Follow-up with objective information for interviewers with poor performance  Should not be used in a punitive manner, but as a tool for corrective action  Management  Understand impact of coaching and training  Produce daily, weekly, monthly trends reports for supervisors, interviewers, and clients  Sponsor perspective  Increase level of monitoring  Focus on key interview questions for 100% monitoring  Utilize objective measures 13
  • 14. General Limitations  No noise-filtering done on sound files  Occasionally no clean break between interviewer and respondent  Only considered English interviews  Did not account for non-native speakers in analyses  Occasionally the Microsoft Cognition API would respond with errors 14
  • 15. Voxco Limitations  Typically records interview as one “.wav” file  Prefer separate “.wav” files per question and response  Implications for setup, interviewer adjustments, and run-on question/response  Manually broke up “.wav” files into interviewer and respondent files  Future work required to differentiate  Signal & noise  Interviewer, respondent, dog, baby, … 15
  • 16. Future  Automate the division between interviewer and respondent  Utilize noise filtering prior to voice recognition  Look at other question types such as Likert scales and dates  Automate speech tempo 16
  • 17. Lew Berman, PhD, MS Email: lewis.berman@icf.com Phone: 301-407-6833 17
  • 18. References [FEDCASIC 2018] Federal Computer Assisted Survey Information Collection (FedCASIC)Workshops: Roundtable: What Makes a Good Interviewer? Metrics and Methods for Ensuring Data Quality. (2018) Organizers: Matt Jans, Lew Berman. Moderator: Matt Jans. April 18, 2018, Suitland, MD. https://www.census.gov/fedcasic/fc2018/ppt/full_program_2018.pdf. [JACEWICZ 2009] Jacewicz E, Fox RA, O’Neill C, Salmons J. Articulation rate across dialect, age, and gender. Language variation and change. 2009;21(2):233-256. doi:10.1017/S0954394509990093. [NCVS 150] National Center for Voice and Speech. Tutorials - Voice Production. Voice Qualities. http://www.ncvs.org/ncvs/tutorials/voiceprod/tutorial/quality.html [SPEIZER 2008] Speizer H, Kinsey S, Heman-Ackah R, Thissen M. Rita. (2008) Developing a Common, Mode-independent, Approach for Evaluating Interview Quality and Interviewer Performance. Research Triangle Institute Internal Report. https://www.rti.org/sites/default/files/resources/kinsey_iii-c.pdf. [TSAO 2006] Tsao Ying-Chiao, Weismer Gary, Iqbal Kamran. Interspeaker variation in habitual speaking rate: Additional evidence. Journal of Speech, Language, and Hearing Research. 2006;49:1156–1164. 18