SlideShare a Scribd company logo
What paradata can tell you about the
quality of web surveys?
Mario Callegaro Ph.D.
Senior Survey Research Scientist
User Insights team, Brand Studio
Google London
Qualtrics Converge Europe, London April 26, 2017
Disclaimer
The opinions expressed in this presentation are the author's own and do not reflect the views of
Google
2
How do we know if a question works?
How do we know if a question measures what is intended to measure?
How do we know if respondents understand the question and can appropriately respond to it?
3
What are paradata?
Paradata are data about the process of answering the survey itself
Taxonomy of paradata types
Paradata for web surveys can be classified into the following groups:
1. Direct paradata
• Contact-info
• Device-type paradata
• Questionnaire navigation paradata
2. Indirect paradata
• E.g. eye tracking, video recording, behavioral coding
5
Contact info paradata
Direct paradata: Contact info
• Outcomes of an email invitation
• Access to the questionnaire introduction page
• Last question answered before breakoff
7
Survey breakoffs by question
8
(Sakshaug & Crawford, 2010) Data courtesy from Sakshaug
75
80
85
90
95
100
Permission asked to use
school records (grades)
for research purposes
Device type paradata
Direct paradata: Device type
• User-agent string
• Screen resolution
• Browser window size
• Javascript and Flash active
• IP Address (mostly considered Personal Identifiable Information)
• GPS coordinates (mostly considered Personal Identifiable Information)
• Cookies
10
Device type: GPS coordinates example
11Dayton, J & H. Driscoll: The Next CAPI Evolution - Completing Web Surveys on Cell-Enabled iPads. AAPOR
Device type: GPS coordinates example (cont.)
12Dayton, J & H. Driscoll: The Next CAPI Evolution - Completing Web Surveys on Cell-Enabled iPads. AAPOR 2011
Questionnaire navigation paradata
part 1
Direct paradata: Questionnaire navigation 1
Mouse clicks and mouse coordinates
Mouse clicks and its position can be captured with a JavaScript. Excessive mouse movements can
be a sign of problems with the question
Change of answers
Change of answers is an indicator of potential confusion with a question and can be used to improve
questionnaire design
Typing and keystrokes
Typing and keystrokes can create an audit trail for each survey and used to detect unusual behavior
both from the respondent side and the interviewer side
14
Questionnaire navigation paradata example
lXNtoilre7_2|1|M677|13|1320#
M548|174|830#
M160|101|1750#
M366|192|550#
M728|4|7690#
M489|247|610#
C493|229|3301#
R110|1#
C493|280|4301#
R110|3#
C493|345|3901#
R110|5#
C521|399|3801#
SU521|399|60|undefined#|
15
Stieger and Reips (2010, p. 1490)
Change of answers ex. (Haraldsen et al, 2005)
16
Fully labeled vs. polar point vs. polar point with numbers vs. answer box
17
Stern (2008, p. 384)
Fully labeled vs. polar point vs. polar point with numbers vs. answer box
Mean ratings
18
2
2 2
3
1
2
3
4
5
Fully labeled Polar point Polar point w/#'s Answer box
Stern (2008) & Christian (2003)
Fully labeled vs. polar point vs. polar point with numbers vs. answer box
% of reciprocal changes
19
2
7
6
8
0
2
4
6
8
10
Fully labeled Polar point Polar point w/ #'s Answer box
Stern (2008)
Questionnaire navigation paradata
part 2
Direct paradata: Questionnaire navigation 2
Order of answering
In a page with multiple questions the order of answering is an indicator on how the respondent reads
the questions
Movements across the questionnaire (forward/backward)
If the questionnaire allows going backward or going forward by skipping questions, unusual
movements are a symptom of issues with the questionnaire or the respondent
Scrolling
The amount of scrolling depends on the screen size of the device used and on the size of the
browser window used by the respondent
21
Time latency paradata
Time spent per question/screen
This is the most published topic in paradata research: time latency information.
There are many studies focusing on major themes:
• Attitude strength
• Response uncertainty
• Question wording
• Response error (e.g. speeding)
• Satisficing / Optimizing
22
Order of response categories:
Positive vs. negative orientation
POSITIVE
How accessible have your
instructors been both in and
outside of class?
Very accessible
Somewhat accessible
Neutral
Somewhat inaccessible
Very inaccessible
Don’t know
23
NEGATIVE
How accessible have your
instructors been both in and
outside of class?
Very inaccessible
Somewhat inaccessible
Neutral
Somewhat accessible
Very accessible
Don’t know
Christian, Parsons & Dillman (2009)
Positive vs. negative orientation
Results in %
24
0
10
20
30
40
50
Positive order Negative order
Christian, Parsons & Dillman (2009)
Positive vs. negative orientation
Time spent answering the question
25
0
0.4
0.8
1.2
1.6
2
2.4
Positive order Negative order
Christian, Parsons & Dillman (2009)
Privacy and ethical issues in collecting paradata
Should we tell respondents we are collecting paradata?
What happens when we tell respondents we are collecting paradata and we ask permission to use
them?
• 59.5% agreed in the LISS Dutch panel (across experimental manipulations)
• 65.6% agreed in the Knowledge Networks U.S. panel (across experiment manipulations)
• 69.3% agreed in a U.S. volunteer non-probability panel (across experimental manipulations)
(Couper and Singer, 2013, studies done using vignettes)
26
Conclusions & references
Conclusions on paradata
• The amount of paradata that can be collected grow as the technological capabilities grow
• Although paradata can be collected “easily” and at a low cost, we should not underestimate the
cost of managing and analysing paradata (Nicolaas, 2011)
• Paradata should not replace other ways of pretesting the questionnaire because it does not
answer all the research questions
• Paradata analysis is another tool to use in assessing the quality of a survey and in making
improvements to the questionnaire and the entire online survey experience
28
References on Paradata for web surveys
Callegaro, M. (2013). Paradata in web surveys
(Chapter 11).
In F. Kreuter (Ed.), Improving surveys with paradata:
Analytic use of process information (pp. 261–279).
Hoboken, NJ: Wiley.
PDF available at
http://research.google.com/pubs/MarioCallegaro.html
Callegaro, Lozar Manfreda & Vehovar (2015). Web
survey methodology. London: Sage
29
30
Q & A

More Related Content

PDF
RvizPlugin作成入門
PDF
Rで学ぶ離散選択モデル
PDF
スパース推定
PPTX
凡人の凡人による凡人のためのデザインパターン第一幕 Public
PDF
実務と論文で学ぶジョブレコメンデーション最前線2022
PDF
Sliced Wasserstein Distance for Learning Gaussian Mixture Models
PDF
文脈自由文法の話
PPTX
RubyとRのおいしい関係
RvizPlugin作成入門
Rで学ぶ離散選択モデル
スパース推定
凡人の凡人による凡人のためのデザインパターン第一幕 Public
実務と論文で学ぶジョブレコメンデーション最前線2022
Sliced Wasserstein Distance for Learning Gaussian Mixture Models
文脈自由文法の話
RubyとRのおいしい関係

What's hot (20)

PDF
NLP2019 松田寛 - GiNZA
PDF
遺伝的アルゴリズムによるNクイーン問題の解法
PDF
単一物体追跡論文のサーベイ
PDF
Tokyor42 ggplot2
PPTX
Rで学ぶ観察データでの因果推定
PDF
統計モデリングで癌の5年生存率データから良い病院を探す
 
PDF
[Tokyor08] Rによるデータサイエンス 第2部 第3章 対応分析
PDF
EU GMP Annex 1 Draft - Closed System Design Consideration with Single-Use Sys...
PDF
Pythonではじめる競技プログラミング
PDF
計算機アーキテクチャを考慮した高能率画像処理プログラミング
PDF
tf,tf2完全理解
PDF
マーケティングサイエンス徹底入門と実践Part2
PDF
4bit-CPU : TD4の解説
PPTX
為替取引(FX)でのtickdataの加工とMySQLで管理
PDF
06 第5.1節-第5.7節 ROS2に対応したツール/パッケージ
PDF
おやつ神社
PDF
Amebaソシャゲ分析事例のご紹介
PDF
[Slide]闇アジャイラーvs光アジャイラーforDevLOVE(EnergizedWorkLT祭)
PDF
関連記事レコメンドエンジン@Yahoo! JAPAN
PDF
Optimizer入門&最新動向
NLP2019 松田寛 - GiNZA
遺伝的アルゴリズムによるNクイーン問題の解法
単一物体追跡論文のサーベイ
Tokyor42 ggplot2
Rで学ぶ観察データでの因果推定
統計モデリングで癌の5年生存率データから良い病院を探す
 
[Tokyor08] Rによるデータサイエンス 第2部 第3章 対応分析
EU GMP Annex 1 Draft - Closed System Design Consideration with Single-Use Sys...
Pythonではじめる競技プログラミング
計算機アーキテクチャを考慮した高能率画像処理プログラミング
tf,tf2完全理解
マーケティングサイエンス徹底入門と実践Part2
4bit-CPU : TD4の解説
為替取引(FX)でのtickdataの加工とMySQLで管理
06 第5.1節-第5.7節 ROS2に対応したツール/パッケージ
おやつ神社
Amebaソシャゲ分析事例のご紹介
[Slide]闇アジャイラーvs光アジャイラーforDevLOVE(EnergizedWorkLT祭)
関連記事レコメンドエンジン@Yahoo! JAPAN
Optimizer入門&最新動向
Ad

Similar to What paradata can tell you about the quality of web surveys? (20)

PPTX
How to find out about the usability of your web site using a survey by @cjforms
PDF
Essentials of Marketing Research 6th Edition Babin Solutions Manual
DOCX
Data collection methods
PDF
Data collection, Data Integration, Data Understanding e Data Cleaning & Prepa...
PDF
Exploring Marketing Research 11th Edition Babin Solutions Manual
PPT
Business Research : Data Collection and Questionnaire Construction
PDF
Essentials of Marketing Research 6th Edition Babin Solutions Manual
PPTX
SurveyMonkey Basics (TechCamp presentation 5-12-12)
PPT
Alberto abisso
PPTX
D3 Project: Creating digital content - Explorer level
PPT
Malhotra04....
PPT
Malhotra04....
PDF
Mastering Online Surveys
PPTX
Lars Lyberg, Inizio: Ett föränderligt surveylandskap
PPTX
00 Data Collection in QLT Research - Design & Procedures.pptx
PPTX
Three Studies on Supplementing Survey Data with Active Data
PDF
Online Research Coming of age - Brownbag Presentation at Universitty of Preto...
PPTX
10 tips for a better UX survey
PPTX
Creating google forms
PPTX
Smart Data Module 3 d drive_external data
How to find out about the usability of your web site using a survey by @cjforms
Essentials of Marketing Research 6th Edition Babin Solutions Manual
Data collection methods
Data collection, Data Integration, Data Understanding e Data Cleaning & Prepa...
Exploring Marketing Research 11th Edition Babin Solutions Manual
Business Research : Data Collection and Questionnaire Construction
Essentials of Marketing Research 6th Edition Babin Solutions Manual
SurveyMonkey Basics (TechCamp presentation 5-12-12)
Alberto abisso
D3 Project: Creating digital content - Explorer level
Malhotra04....
Malhotra04....
Mastering Online Surveys
Lars Lyberg, Inizio: Ett föränderligt surveylandskap
00 Data Collection in QLT Research - Design & Procedures.pptx
Three Studies on Supplementing Survey Data with Active Data
Online Research Coming of age - Brownbag Presentation at Universitty of Preto...
10 tips for a better UX survey
Creating google forms
Smart Data Module 3 d drive_external data
Ad

More from Qualtrics (20)

PPTX
WEBINAR: K12 - How to shape student experiences
PPTX
3 CX Myths That Can Kill Your Brand
PPTX
Closing the Experience Gap with Qualtrics XM
PPTX
Qualtrics CX Masterclass
PPTX
The 5 Competencies for Customer Journey Mapping
PPTX
Stop The Fighting, Find Consensus: How To Manage Your Citizen Experience
PPTX
The Changing CX Environment
PPTX
Increasing your Value-Based Purchasing Score through 5 Patient Rounding Best ...
PPTX
Creating an employee value proposition that recruits and engages today's top ...
PPTX
Qualtrics CX Live Auckland
PPTX
Employee engagement in a high-pressure environment
PPTX
Development and evaluation of digital solutions for weight loss maintenance
PPTX
The Global Shapers Annual Surveys
PPTX
Digital Research in Low-Resource Countries
PPTX
Best Practices for Survey Design
PPTX
Recipe for success: balancing the art & science of employee feedback
PPTX
A journey to customer centricity
PPTX
The Challenges of implementing a CX programme across the Belron International...
PPTX
The Age of Customer Empowerment and its Impact on Brand Experience
PPTX
Brand experience – a Ticketmaster Case Study
WEBINAR: K12 - How to shape student experiences
3 CX Myths That Can Kill Your Brand
Closing the Experience Gap with Qualtrics XM
Qualtrics CX Masterclass
The 5 Competencies for Customer Journey Mapping
Stop The Fighting, Find Consensus: How To Manage Your Citizen Experience
The Changing CX Environment
Increasing your Value-Based Purchasing Score through 5 Patient Rounding Best ...
Creating an employee value proposition that recruits and engages today's top ...
Qualtrics CX Live Auckland
Employee engagement in a high-pressure environment
Development and evaluation of digital solutions for weight loss maintenance
The Global Shapers Annual Surveys
Digital Research in Low-Resource Countries
Best Practices for Survey Design
Recipe for success: balancing the art & science of employee feedback
A journey to customer centricity
The Challenges of implementing a CX programme across the Belron International...
The Age of Customer Empowerment and its Impact on Brand Experience
Brand experience – a Ticketmaster Case Study

Recently uploaded (20)

PPTX
The THESIS FINAL-DEFENSE-PRESENTATION.pptx
PDF
Mega Projects Data Mega Projects Data
PPTX
Data_Analytics_and_PowerBI_Presentation.pptx
PPTX
Introduction to Basics of Ethical Hacking and Penetration Testing -Unit No. 1...
PDF
Recruitment and Placement PPT.pdfbjfibjdfbjfobj
PDF
22.Patil - Early prediction of Alzheimer’s disease using convolutional neural...
PPTX
DISORDERS OF THE LIVER, GALLBLADDER AND PANCREASE (1).pptx
PPTX
05. PRACTICAL GUIDE TO MICROSOFT EXCEL.pptx
PPTX
Introduction to Knowledge Engineering Part 1
PPTX
ALIMENTARY AND BILIARY CONDITIONS 3-1.pptx
PDF
.pdf is not working space design for the following data for the following dat...
PDF
168300704-gasification-ppt.pdfhghhhsjsjhsuxush
PPTX
Business Ppt On Nestle.pptx huunnnhhgfvu
PDF
Foundation of Data Science unit number two notes
PPTX
Supervised vs unsupervised machine learning algorithms
PDF
Lecture1 pattern recognition............
PDF
Clinical guidelines as a resource for EBP(1).pdf
PPTX
STUDY DESIGN details- Lt Col Maksud (21).pptx
PPTX
Introduction to Firewall Analytics - Interfirewall and Transfirewall.pptx
The THESIS FINAL-DEFENSE-PRESENTATION.pptx
Mega Projects Data Mega Projects Data
Data_Analytics_and_PowerBI_Presentation.pptx
Introduction to Basics of Ethical Hacking and Penetration Testing -Unit No. 1...
Recruitment and Placement PPT.pdfbjfibjdfbjfobj
22.Patil - Early prediction of Alzheimer’s disease using convolutional neural...
DISORDERS OF THE LIVER, GALLBLADDER AND PANCREASE (1).pptx
05. PRACTICAL GUIDE TO MICROSOFT EXCEL.pptx
Introduction to Knowledge Engineering Part 1
ALIMENTARY AND BILIARY CONDITIONS 3-1.pptx
.pdf is not working space design for the following data for the following dat...
168300704-gasification-ppt.pdfhghhhsjsjhsuxush
Business Ppt On Nestle.pptx huunnnhhgfvu
Foundation of Data Science unit number two notes
Supervised vs unsupervised machine learning algorithms
Lecture1 pattern recognition............
Clinical guidelines as a resource for EBP(1).pdf
STUDY DESIGN details- Lt Col Maksud (21).pptx
Introduction to Firewall Analytics - Interfirewall and Transfirewall.pptx

What paradata can tell you about the quality of web surveys?

  • 1. What paradata can tell you about the quality of web surveys? Mario Callegaro Ph.D. Senior Survey Research Scientist User Insights team, Brand Studio Google London Qualtrics Converge Europe, London April 26, 2017
  • 2. Disclaimer The opinions expressed in this presentation are the author's own and do not reflect the views of Google 2
  • 3. How do we know if a question works? How do we know if a question measures what is intended to measure? How do we know if respondents understand the question and can appropriately respond to it? 3
  • 4. What are paradata? Paradata are data about the process of answering the survey itself
  • 5. Taxonomy of paradata types Paradata for web surveys can be classified into the following groups: 1. Direct paradata • Contact-info • Device-type paradata • Questionnaire navigation paradata 2. Indirect paradata • E.g. eye tracking, video recording, behavioral coding 5
  • 7. Direct paradata: Contact info • Outcomes of an email invitation • Access to the questionnaire introduction page • Last question answered before breakoff 7
  • 8. Survey breakoffs by question 8 (Sakshaug & Crawford, 2010) Data courtesy from Sakshaug 75 80 85 90 95 100 Permission asked to use school records (grades) for research purposes
  • 10. Direct paradata: Device type • User-agent string • Screen resolution • Browser window size • Javascript and Flash active • IP Address (mostly considered Personal Identifiable Information) • GPS coordinates (mostly considered Personal Identifiable Information) • Cookies 10
  • 11. Device type: GPS coordinates example 11Dayton, J & H. Driscoll: The Next CAPI Evolution - Completing Web Surveys on Cell-Enabled iPads. AAPOR
  • 12. Device type: GPS coordinates example (cont.) 12Dayton, J & H. Driscoll: The Next CAPI Evolution - Completing Web Surveys on Cell-Enabled iPads. AAPOR 2011
  • 14. Direct paradata: Questionnaire navigation 1 Mouse clicks and mouse coordinates Mouse clicks and its position can be captured with a JavaScript. Excessive mouse movements can be a sign of problems with the question Change of answers Change of answers is an indicator of potential confusion with a question and can be used to improve questionnaire design Typing and keystrokes Typing and keystrokes can create an audit trail for each survey and used to detect unusual behavior both from the respondent side and the interviewer side 14
  • 15. Questionnaire navigation paradata example lXNtoilre7_2|1|M677|13|1320# M548|174|830# M160|101|1750# M366|192|550# M728|4|7690# M489|247|610# C493|229|3301# R110|1# C493|280|4301# R110|3# C493|345|3901# R110|5# C521|399|3801# SU521|399|60|undefined#| 15 Stieger and Reips (2010, p. 1490)
  • 16. Change of answers ex. (Haraldsen et al, 2005) 16
  • 17. Fully labeled vs. polar point vs. polar point with numbers vs. answer box 17 Stern (2008, p. 384)
  • 18. Fully labeled vs. polar point vs. polar point with numbers vs. answer box Mean ratings 18 2 2 2 3 1 2 3 4 5 Fully labeled Polar point Polar point w/#'s Answer box Stern (2008) & Christian (2003)
  • 19. Fully labeled vs. polar point vs. polar point with numbers vs. answer box % of reciprocal changes 19 2 7 6 8 0 2 4 6 8 10 Fully labeled Polar point Polar point w/ #'s Answer box Stern (2008)
  • 21. Direct paradata: Questionnaire navigation 2 Order of answering In a page with multiple questions the order of answering is an indicator on how the respondent reads the questions Movements across the questionnaire (forward/backward) If the questionnaire allows going backward or going forward by skipping questions, unusual movements are a symptom of issues with the questionnaire or the respondent Scrolling The amount of scrolling depends on the screen size of the device used and on the size of the browser window used by the respondent 21
  • 22. Time latency paradata Time spent per question/screen This is the most published topic in paradata research: time latency information. There are many studies focusing on major themes: • Attitude strength • Response uncertainty • Question wording • Response error (e.g. speeding) • Satisficing / Optimizing 22
  • 23. Order of response categories: Positive vs. negative orientation POSITIVE How accessible have your instructors been both in and outside of class? Very accessible Somewhat accessible Neutral Somewhat inaccessible Very inaccessible Don’t know 23 NEGATIVE How accessible have your instructors been both in and outside of class? Very inaccessible Somewhat inaccessible Neutral Somewhat accessible Very accessible Don’t know Christian, Parsons & Dillman (2009)
  • 24. Positive vs. negative orientation Results in % 24 0 10 20 30 40 50 Positive order Negative order Christian, Parsons & Dillman (2009)
  • 25. Positive vs. negative orientation Time spent answering the question 25 0 0.4 0.8 1.2 1.6 2 2.4 Positive order Negative order Christian, Parsons & Dillman (2009)
  • 26. Privacy and ethical issues in collecting paradata Should we tell respondents we are collecting paradata? What happens when we tell respondents we are collecting paradata and we ask permission to use them? • 59.5% agreed in the LISS Dutch panel (across experimental manipulations) • 65.6% agreed in the Knowledge Networks U.S. panel (across experiment manipulations) • 69.3% agreed in a U.S. volunteer non-probability panel (across experimental manipulations) (Couper and Singer, 2013, studies done using vignettes) 26
  • 28. Conclusions on paradata • The amount of paradata that can be collected grow as the technological capabilities grow • Although paradata can be collected “easily” and at a low cost, we should not underestimate the cost of managing and analysing paradata (Nicolaas, 2011) • Paradata should not replace other ways of pretesting the questionnaire because it does not answer all the research questions • Paradata analysis is another tool to use in assessing the quality of a survey and in making improvements to the questionnaire and the entire online survey experience 28
  • 29. References on Paradata for web surveys Callegaro, M. (2013). Paradata in web surveys (Chapter 11). In F. Kreuter (Ed.), Improving surveys with paradata: Analytic use of process information (pp. 261–279). Hoboken, NJ: Wiley. PDF available at http://research.google.com/pubs/MarioCallegaro.html Callegaro, Lozar Manfreda & Vehovar (2015). Web survey methodology. London: Sage 29