SlideShare a Scribd company logo
Predicting User Engagement with Direct Displays Using
Mouse Cursor Information
Ioannis Arapakis (Eurecat), Luis Leiva (Sciling)
Contents
§Introduction & motivation
§Merits of mouse cursor analysis
§Experimental setup
§Predictive modelling
§Performance assessment
§Results
§Conclusions
Introduction
§ In recent years direct displays (DDs)
have become a standard component
on the SERPs of all major web search
engines
§ DDs serve two main purposes:
• Provide well-structured summary of
information which is difficult or time-
consuming to access
• Help tidy up the SERP section that contains
the universal search results
Knowledge Module
§ One such prominent example is the Knowledge Module (KM)
display which provides users with information about the named
entities they are searching
§ The content presented in the KM display is obtained in a semi-
structured format from curated entity databases (e.g., Freebase,
Wikipedia)
§ This raw information is further enriched by the search engine,
e.g., ranking of related entities, explanations of their relationship
or with related multimedia and social media content
Motivation
§ In this context, most research has focused on general backend
system tasks, the most important being knowledge base
construction, or more specific backend tasks such as related
entity recommendation
§ This work attempts to understand how users engage with a DD
like the KM display in entity-centric search tasks
§ We are interested in predicting user engagement with a DD in
the absence of explicit feedback (e.g., self-report data)
Addressing the gap
§ Existing modelling techniques make a simplifying assumption
when analysing web search log data: the user is assumed to
be equally engaged with all parts of the SERP
§ In practice this assumption is not always true:
• A user may click on certain links on the page, but not all links
• May read a certain result snippet in the SERP but not necessarily the entire
list of results
• May ignore the SERP content completely and focus only on the images
shown in the KM display or other DDs
Mouse cursor tracking
§ Navigation & interaction with a digital
environment usually involves the use of a
mouse (i.e., selecting, hovering, clicking)
§ Can be easily performed in a non-invasive
manner, without removing users from their
natural setting
§ Several works have shown that the mouse
cursor is a proxy of gaze (attention)
§ Low-cost, scalable alternative to eye-
tracking
sigir16
Crowdsourcing study
§ We conducted a crowdsourcing study and examine how users
engage with DDs like the knowledge module (KM) display
§ We collected and analysed more than 115K mouse cursor
positions from 300 users
§ With this study we aim to predict:
• When a user notices the KM display on the SERP
• If it is perceived as a useful aid to their search tasks
• Whether interacting with the KM display alters the users’ perception of how
fast they complete the search tasks
Experimental design
§ Repeated-measures design
§ One independent variable: KM display (with two levels: “visible”
or “hidden”)
§ Three dependent variables: (i) KM display noticeability, (ii) KM
display usefulness and (iii) perceived task accomplishment
speed
§ Two short search tasks were completed using the Yahoo search
engine: one task with the KM display on the SERP and one
without it*
* The KM display visibility was controlled with client-side scripting.
Search UI
§ Participants accessed the search engine through a custom proxy
which did not alter the original look and feel of the SERPs
§ This allowed us to capture user interactions with the SERP
without interfering with the actual web search engine interface in
production
§ For each search task, participants were presented with a
question and were suggested a search query to begin with
Search query sample
§ Query set consisted of 32 unique query patterns (144 different queries in total)
§ The selected query patterns belonged to four different topics (famous people,
movies, athletes, sport teams) and required either single or multiple answers
Mouse cursor tracking tool
§ To collect mouse cursor data we used EVTRACK*, an open
source JavaScript event tracking library that is part of the smt2ε
system
§ EVTRACK allows to specify what browser events should be
captured and how they should be captured, i.e., via event
listeners (the event is captured as soon as it is fired) or via event
polling
* https://github.com/luileito/evtrack
Self-reported measures of engagement
§ A mini-questionnaire on the SERPs gathered
ground truth labels for the mouse cursor data
§ The mini-questionnaire was initially hidden
and was shown to the user just before
leaving the SERP
§ It comprised 3 questions:
• Did you notice the knowledge module?
• To what extent did you find the knowledge module
useful in answering the question?
• To what extent did the knowledge module help you
answer the question faster?
Procedure
§ Participants were asked to evaluate two different backend
systems of Yahoo web search by performing two search tasks
§ For each task, participants had to answer a question by
searching for relevant information on the proxified search engine
§ In one task the KM display would be hidden (control condition)
and in the other task it would be visible (experimental condition)
§ The order of the tasks was randomized for each participant
§ Participants were presented with a suggested query to begin
their search but were free to submit additional queries
§ We used informational, entity-centric queries to introduce a
common starting point across all participants
Modelling user engagement
§ Our final dataset consists of ~115K cursor positions, collected
during 600 search task sessions
§ Out of those 600 search task sessions we analysed the 300
cases that correspond to the experimental condition with the
visible KM in the SERP
§ Our dataset is generally balanced, with 176 users having
reported noticing the KM display
§ We normalised the values for each feature so that feature values
that those that fall in greater numeric ranges do not dominate
over those in smaller numeric ranges
Feature Engineering
* These functions are computed for most base and meta-features.
Users who did not engage with the KG Users who engaged with the KG
Feature Engineering (cont.)
Predictive Modelling
§ We trained 10 RF* models (90% of data) and used
them to obtain the predictions for each of the held-
out set (10% of data) among the ten folds**
§ Excluded highly correlated and linearly dependent
features
§ Performed feature selection using recursive
feature elimination
§ We used a subset of our training data for fine-
tuning the classifier’s hyperparameters
* R packages “Caret” and “randomForest”.
** With stratified sampling.
Performance evaluation
§ Baselines:
• If the user clicked on the KM display (hasClickedKM, binary)
• If the mouse cursor hovered over the KM display (hasHoveredKM, binary)
• Time spent on the page (dwellTime) as a feature to the RF classifier
§ Performance evaluation:
• Precision / Recall
• Accuracy
• F-Measure
• AUC
Results
Attention
Precision Recall F-Measure Accuracy AUC
0
0.2
0.4
0.6
0.8
1
Attention
Click
Hover
DwellTime
Ours
Usefulness
Precision Recall F-Measure Accuracy AUC
0
0.2
0.4
0.6
0.8
1
Usefulness
Click
Hover
DwellTime
Ours
Perceived Task Duration
Precision Recall F-Measure Accuracy AUC
0
0.2
0.4
0.6
0.8
1
Task Duration
Click
Hover
DwellTime
Ours
Computational complexity
§ Mouse gesture techniques that rely on PCA
preprocessing and k-means clustering
• Covariance matrix computation + eigenvalue
decomposition ☞ O(p2N + p3)
• K-means ☞ O(icN)
§ Cursor Motifs that use Dynamic Time Warping
(DTW) and k-nearest neighbours (kNN)
• DTW ☞ O(N2)
• kNN ☞ O(N2k2)
§ Proposed method has linear ☞ O(N) or
quasilinear cost ☞ O(NlogN)
Conclusions
§ We conducted a crowdsourcing study that revealed the potential
benefits of using mouse cursor data to predict user
engagement with DDs
§ We demonstrated that our feature selection model outperforms
the standard baselines to measure three user engagement
proxies with the KM display
§ Our initial results suggest that it is possible to predict when the
user attention is captured by a DD using only simple, yet highly
discriminative features derived from mouse cursor activity
Conclusions (cont.)
§ Predicting accurately if a DD was truly noticed can:
• Increase the true negative prediction rate
• Reduce the false negative rate
§ Knowing when a user finds a DD useful has important
implications on the methodology for understanding the impact
of launching a new DD, modifying its existing design, and how
that change may affect search UIs
Conclusions (cont.)
§ Information about perceived task duration can be combined with
the previous grounds truths to understand better how users
engage with ads or multimedia content
§ The main practical use of our models is perhaps to
automatically select or lay out the DDs
§ DDs are optional for the SERPs and the user behaviour could
provide signals about whether DDs should be shown or not in
particular queries
§ Our method offers a computationally efficient way to analyse
mouse cursor data
Acknowledgments
§ We thank B. Barla Cambazoglu and Marios Koulakis for fruitful
discussions
Thank you for your attention!
iarapakis
arapakis.ioannis@gmail.com
https://es.linkedin.com/in/ioannisarapakis
http://www.slideshare.net/iarapakis/sigir16

More Related Content

PDF
fmelleHumanActivityRecognitionWithMobileSensors
PDF
Mobile Location Indexing Based On Synthetic Moving Objects
PPT
Quick introduction to the click-through filter
PPT
Better Search: Click-through filter plugin – a flexible tool for improving se...
PPT
Better Search: Prototype click-through filter results
PPTX
How to Build a Recommendation Engine on Spark
PPTX
Attentive Relational Networks for Mapping Images to Scene Graphs
PPTX
"Agro-Market Prediction by Fuzzy based Neuro-Genetic Algorithm"
fmelleHumanActivityRecognitionWithMobileSensors
Mobile Location Indexing Based On Synthetic Moving Objects
Quick introduction to the click-through filter
Better Search: Click-through filter plugin – a flexible tool for improving se...
Better Search: Prototype click-through filter results
How to Build a Recommendation Engine on Spark
Attentive Relational Networks for Mapping Images to Scene Graphs
"Agro-Market Prediction by Fuzzy based Neuro-Genetic Algorithm"

Similar to sigir16 (20)

PDF
Elements of AI Luxembourg - session 4
PPTX
Discovering Common Motifs in Cursor Movement Data
PDF
pedersen
PPTX
Search engine patterns
PDF
Design the Search Experience
PDF
CIKM 2014 - Understanding Within-Content Engagement through Pattern Analysis ...
PDF
CAPTURING USER BROWSING BEHAVIOUR INDICATORS
PDF
Cursorcomp ipm
PPT
Tech_overview
PDF
Context Sensitive Search String Composition Algorithm using User Intention to...
PDF
Learning Efficient Representations of Mouse Movements to Predict User Attention
PDF
Personalized search
PDF
From “Selena Gomez” to “Marlon Brando”: Understanding Explorative Entity Search
PDF
Search Engine Switching
PPT
SERPs: From keyword to click. BrightonSEO (18th September 2015)
PDF
IxD meets DS
PDF
Data driven search
PDF
Framework for web personalization using web mining
PDF
Framework for web personalization using web mining
PPTX
Detecting Good Abandonment in Mobile Search
Elements of AI Luxembourg - session 4
Discovering Common Motifs in Cursor Movement Data
pedersen
Search engine patterns
Design the Search Experience
CIKM 2014 - Understanding Within-Content Engagement through Pattern Analysis ...
CAPTURING USER BROWSING BEHAVIOUR INDICATORS
Cursorcomp ipm
Tech_overview
Context Sensitive Search String Composition Algorithm using User Intention to...
Learning Efficient Representations of Mouse Movements to Predict User Attention
Personalized search
From “Selena Gomez” to “Marlon Brando”: Understanding Explorative Entity Search
Search Engine Switching
SERPs: From keyword to click. BrightonSEO (18th September 2015)
IxD meets DS
Data driven search
Framework for web personalization using web mining
Framework for web personalization using web mining
Detecting Good Abandonment in Mobile Search
Ad

More from Telefonica Research (7)

PDF
A Price-Per-Attention Auction Scheme Using Mouse Cursor Information
PDF
User Behaviour Modelling - Online and Offline Methods, Metrics, and Challenges
PDF
SIGIR15-NeuroIR
PDF
System and User Aspects of Web Search Latency
PDF
SocInfo14 - On the Feasibility of Predicting News Popularity at Cold Start
PDF
SIGIR2014 - Impact of Response Latency on User Behavior in Web Search
A Price-Per-Attention Auction Scheme Using Mouse Cursor Information
User Behaviour Modelling - Online and Offline Methods, Metrics, and Challenges
SIGIR15-NeuroIR
System and User Aspects of Web Search Latency
SocInfo14 - On the Feasibility of Predicting News Popularity at Cold Start
SIGIR2014 - Impact of Response Latency on User Behavior in Web Search
Ad

Recently uploaded (20)

PPTX
Seminar Hypertension and Kidney diseases.pptx
PPTX
Hypertension_Training_materials_English_2024[1] (1).pptx
PDF
Sciences of Europe No 170 (2025)
PPTX
perinatal infections 2-171220190027.pptx
PPTX
CORDINATION COMPOUND AND ITS APPLICATIONS
PPT
1. INTRODUCTION TO EPIDEMIOLOGY.pptx for community medicine
PDF
Cosmic Outliers: Low-spin Halos Explain the Abundance, Compactness, and Redsh...
DOCX
Q1_LE_Mathematics 8_Lesson 5_Week 5.docx
PDF
BET Eukaryotic signal Transduction BET Eukaryotic signal Transduction.pdf
PPT
Heredity-grade-9 Heredity-grade-9. Heredity-grade-9.
PDF
Formation of Supersonic Turbulence in the Primordial Star-forming Cloud
PPTX
The Minerals for Earth and Life Science SHS.pptx
PDF
Warm, water-depleted rocky exoplanets with surfaceionic liquids: A proposed c...
PPTX
Application of enzymes in medicine (2).pptx
PPTX
C1 cut-Methane and it's Derivatives.pptx
PDF
Lymphatic System MCQs & Practice Quiz – Functions, Organs, Nodes, Ducts
PDF
Assessment of environmental effects of quarrying in Kitengela subcountyof Kaj...
PDF
Placing the Near-Earth Object Impact Probability in Context
PPTX
Science Quipper for lesson in grade 8 Matatag Curriculum
PPTX
Biomechanics of the Hip - Basic Science.pptx
Seminar Hypertension and Kidney diseases.pptx
Hypertension_Training_materials_English_2024[1] (1).pptx
Sciences of Europe No 170 (2025)
perinatal infections 2-171220190027.pptx
CORDINATION COMPOUND AND ITS APPLICATIONS
1. INTRODUCTION TO EPIDEMIOLOGY.pptx for community medicine
Cosmic Outliers: Low-spin Halos Explain the Abundance, Compactness, and Redsh...
Q1_LE_Mathematics 8_Lesson 5_Week 5.docx
BET Eukaryotic signal Transduction BET Eukaryotic signal Transduction.pdf
Heredity-grade-9 Heredity-grade-9. Heredity-grade-9.
Formation of Supersonic Turbulence in the Primordial Star-forming Cloud
The Minerals for Earth and Life Science SHS.pptx
Warm, water-depleted rocky exoplanets with surfaceionic liquids: A proposed c...
Application of enzymes in medicine (2).pptx
C1 cut-Methane and it's Derivatives.pptx
Lymphatic System MCQs & Practice Quiz – Functions, Organs, Nodes, Ducts
Assessment of environmental effects of quarrying in Kitengela subcountyof Kaj...
Placing the Near-Earth Object Impact Probability in Context
Science Quipper for lesson in grade 8 Matatag Curriculum
Biomechanics of the Hip - Basic Science.pptx

sigir16

  • 1. Predicting User Engagement with Direct Displays Using Mouse Cursor Information Ioannis Arapakis (Eurecat), Luis Leiva (Sciling)
  • 2. Contents §Introduction & motivation §Merits of mouse cursor analysis §Experimental setup §Predictive modelling §Performance assessment §Results §Conclusions
  • 3. Introduction § In recent years direct displays (DDs) have become a standard component on the SERPs of all major web search engines § DDs serve two main purposes: • Provide well-structured summary of information which is difficult or time- consuming to access • Help tidy up the SERP section that contains the universal search results
  • 4. Knowledge Module § One such prominent example is the Knowledge Module (KM) display which provides users with information about the named entities they are searching § The content presented in the KM display is obtained in a semi- structured format from curated entity databases (e.g., Freebase, Wikipedia) § This raw information is further enriched by the search engine, e.g., ranking of related entities, explanations of their relationship or with related multimedia and social media content
  • 5. Motivation § In this context, most research has focused on general backend system tasks, the most important being knowledge base construction, or more specific backend tasks such as related entity recommendation § This work attempts to understand how users engage with a DD like the KM display in entity-centric search tasks § We are interested in predicting user engagement with a DD in the absence of explicit feedback (e.g., self-report data)
  • 6. Addressing the gap § Existing modelling techniques make a simplifying assumption when analysing web search log data: the user is assumed to be equally engaged with all parts of the SERP § In practice this assumption is not always true: • A user may click on certain links on the page, but not all links • May read a certain result snippet in the SERP but not necessarily the entire list of results • May ignore the SERP content completely and focus only on the images shown in the KM display or other DDs
  • 7. Mouse cursor tracking § Navigation & interaction with a digital environment usually involves the use of a mouse (i.e., selecting, hovering, clicking) § Can be easily performed in a non-invasive manner, without removing users from their natural setting § Several works have shown that the mouse cursor is a proxy of gaze (attention) § Low-cost, scalable alternative to eye- tracking
  • 9. Crowdsourcing study § We conducted a crowdsourcing study and examine how users engage with DDs like the knowledge module (KM) display § We collected and analysed more than 115K mouse cursor positions from 300 users § With this study we aim to predict: • When a user notices the KM display on the SERP • If it is perceived as a useful aid to their search tasks • Whether interacting with the KM display alters the users’ perception of how fast they complete the search tasks
  • 10. Experimental design § Repeated-measures design § One independent variable: KM display (with two levels: “visible” or “hidden”) § Three dependent variables: (i) KM display noticeability, (ii) KM display usefulness and (iii) perceived task accomplishment speed § Two short search tasks were completed using the Yahoo search engine: one task with the KM display on the SERP and one without it* * The KM display visibility was controlled with client-side scripting.
  • 11. Search UI § Participants accessed the search engine through a custom proxy which did not alter the original look and feel of the SERPs § This allowed us to capture user interactions with the SERP without interfering with the actual web search engine interface in production § For each search task, participants were presented with a question and were suggested a search query to begin with
  • 12. Search query sample § Query set consisted of 32 unique query patterns (144 different queries in total) § The selected query patterns belonged to four different topics (famous people, movies, athletes, sport teams) and required either single or multiple answers
  • 13. Mouse cursor tracking tool § To collect mouse cursor data we used EVTRACK*, an open source JavaScript event tracking library that is part of the smt2ε system § EVTRACK allows to specify what browser events should be captured and how they should be captured, i.e., via event listeners (the event is captured as soon as it is fired) or via event polling * https://github.com/luileito/evtrack
  • 14. Self-reported measures of engagement § A mini-questionnaire on the SERPs gathered ground truth labels for the mouse cursor data § The mini-questionnaire was initially hidden and was shown to the user just before leaving the SERP § It comprised 3 questions: • Did you notice the knowledge module? • To what extent did you find the knowledge module useful in answering the question? • To what extent did the knowledge module help you answer the question faster?
  • 15. Procedure § Participants were asked to evaluate two different backend systems of Yahoo web search by performing two search tasks § For each task, participants had to answer a question by searching for relevant information on the proxified search engine § In one task the KM display would be hidden (control condition) and in the other task it would be visible (experimental condition) § The order of the tasks was randomized for each participant § Participants were presented with a suggested query to begin their search but were free to submit additional queries § We used informational, entity-centric queries to introduce a common starting point across all participants
  • 16. Modelling user engagement § Our final dataset consists of ~115K cursor positions, collected during 600 search task sessions § Out of those 600 search task sessions we analysed the 300 cases that correspond to the experimental condition with the visible KM in the SERP § Our dataset is generally balanced, with 176 users having reported noticing the KM display § We normalised the values for each feature so that feature values that those that fall in greater numeric ranges do not dominate over those in smaller numeric ranges
  • 17. Feature Engineering * These functions are computed for most base and meta-features.
  • 18. Users who did not engage with the KG Users who engaged with the KG
  • 20. Predictive Modelling § We trained 10 RF* models (90% of data) and used them to obtain the predictions for each of the held- out set (10% of data) among the ten folds** § Excluded highly correlated and linearly dependent features § Performed feature selection using recursive feature elimination § We used a subset of our training data for fine- tuning the classifier’s hyperparameters * R packages “Caret” and “randomForest”. ** With stratified sampling.
  • 21. Performance evaluation § Baselines: • If the user clicked on the KM display (hasClickedKM, binary) • If the mouse cursor hovered over the KM display (hasHoveredKM, binary) • Time spent on the page (dwellTime) as a feature to the RF classifier § Performance evaluation: • Precision / Recall • Accuracy • F-Measure • AUC
  • 23. Attention Precision Recall F-Measure Accuracy AUC 0 0.2 0.4 0.6 0.8 1 Attention Click Hover DwellTime Ours
  • 24. Usefulness Precision Recall F-Measure Accuracy AUC 0 0.2 0.4 0.6 0.8 1 Usefulness Click Hover DwellTime Ours
  • 25. Perceived Task Duration Precision Recall F-Measure Accuracy AUC 0 0.2 0.4 0.6 0.8 1 Task Duration Click Hover DwellTime Ours
  • 26. Computational complexity § Mouse gesture techniques that rely on PCA preprocessing and k-means clustering • Covariance matrix computation + eigenvalue decomposition ☞ O(p2N + p3) • K-means ☞ O(icN) § Cursor Motifs that use Dynamic Time Warping (DTW) and k-nearest neighbours (kNN) • DTW ☞ O(N2) • kNN ☞ O(N2k2) § Proposed method has linear ☞ O(N) or quasilinear cost ☞ O(NlogN)
  • 27. Conclusions § We conducted a crowdsourcing study that revealed the potential benefits of using mouse cursor data to predict user engagement with DDs § We demonstrated that our feature selection model outperforms the standard baselines to measure three user engagement proxies with the KM display § Our initial results suggest that it is possible to predict when the user attention is captured by a DD using only simple, yet highly discriminative features derived from mouse cursor activity
  • 28. Conclusions (cont.) § Predicting accurately if a DD was truly noticed can: • Increase the true negative prediction rate • Reduce the false negative rate § Knowing when a user finds a DD useful has important implications on the methodology for understanding the impact of launching a new DD, modifying its existing design, and how that change may affect search UIs
  • 29. Conclusions (cont.) § Information about perceived task duration can be combined with the previous grounds truths to understand better how users engage with ads or multimedia content § The main practical use of our models is perhaps to automatically select or lay out the DDs § DDs are optional for the SERPs and the user behaviour could provide signals about whether DDs should be shown or not in particular queries § Our method offers a computationally efficient way to analyse mouse cursor data
  • 30. Acknowledgments § We thank B. Barla Cambazoglu and Marios Koulakis for fruitful discussions
  • 31. Thank you for your attention! iarapakis arapakis.ioannis@gmail.com https://es.linkedin.com/in/ioannisarapakis http://www.slideshare.net/iarapakis/sigir16