Evaluating the Practical Applications
Evaluating the Practical Applications
        of Eye Tracking in Museums
                   Ed Bachta & Silvia Filippini-Fantoni
Exploring Visitors’ Engagement
With Artworks
Research results
Research so far has indicated
that visitors spend little time
looking at artworks:

• Hein, 1998

• Smith, 2001

• Worths, 2003 (‘grazing’)

• Viewing Project, IMA 2012
Observation vs. eye tracking
                                      • Use of observational rubric is
                                        labor-intensive and not always
                                        precise

                                      • Eye tracking technology has
                                        the potential of being more
                                        precise and less time-
                                        consuming

                                      • More recent development of
                                        less intrusive devices (non
                                        head mounted)
Photo from Milekic MW 2010
Sparks! grant objectives
    •   Gauge the practicality of
        using such devices in a
        museum gallery setting.

    •   Assess the ability of current
        eye tracking technology to
        reveal what visitors are
        looking at and for how long.

    •   Explore the potential use of
        this equipment in a practical
        setting (e.g. VTS discussion)
Infrared Emitters




                       Camera




                    EyeTech VT2
Device initial testing
•   Eye tracker range
    limitations.

•   Too much variation in
    height when the person is
    standing.

•   For the experiments the
    viewer has to be seated,
    with the tracker placed in a
    fixed position between the
    tracker and the painting.
Experiment 1
               © Edward Hopper.
Experiment 1: objectives
     •   Distinguish when the
         participant looks
         inside/outside of the
         painting.

     •   Measure time spent looking
         inside/outside of the
         painting.

     •   Track where looking inside
         the field of the artwork.

           Calibration performed once
The device was installed on a cart between the work of art and
the seated participant and calibrated to the first participant.
Participants’ standing and seated height were measured and
distance from mid-eye to floor.
Experiment 1: part 1
 •   22 participants were asked to look in
     and outside the painting for 1 minute.

 •   First 10 participants could not adjust
     their chair position to optimize eye
     tracking, while the next 12 were asked
     to do so.

 •   Participants’ gazes (inside the field of
     the painting) were tracked by 2
     research assistants with stopwatches.

 •   The times were averaged and
     compared to the time tracked by the
     device
Experiment 1: part 2
 •   A subset of participants (8 of
     the 22) were asked to look
     (over a period of 60 seconds)
     at 6 different areas of the
     work for 10 seconds each in
     sequence prompted by a
     research assistant.

 •   Tracker data was logged in the
     same manner as the previous
     experiment.
Relative quantity of valid gaze data
                                             Missing data for >25% of session time for 6 participants
                                 20
                                 10
difference (% of session time)




                                  0
                                 -10
                                 -20
                                 -30
                                 -40
                                 -50
                                 -60
                                 -70
                                 -80
                                             Fixed seat position                      Adjusted seat position
                                       1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22
Relative quantity of valid gaze data
                                          Poor performance for 4 of 6 participants at low eye level (<50”)
                                 20
                                 10
difference (% of session time)




                                  0
                                 -10
                                 -20
                                 -30
                                 -40
                                 -50
                                 -60
                                 -70
                                 -80
                                       1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22
Relative quantity of valid gaze data
                                                  Missing > 10% for 7 of 10 glasses wearers
                                 20
                                 10
difference (% of session time)




                                  0
                                 -10
                                 -20
                                 -30
                                 -40
                                 -50
                                 -60
                                 -70
                                 -80
                                       1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22
Comparison against
                                      manual measurement
                            100
                             90
error (% of session time)




                             80
                             70
                             60
                             50
                             40
                             30
                             20
                             10
                              0
                                  1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22
                                                         Participant
Comparison against
             manual measurement

                      Fixed seat position    Adjusted seat position
Within 5%                          0                     5
Within 10%                         1                     7



• Allowing the participant to adjust the seat position produces
  better results

• We were still hoping for better accuracy
Gap Handling
                            100
                             90
error (% of session time)




                             80
                             70
                             60
                                                                                             Raw
                             50
                             40                                                              100ms
                             30                                                              500ms
                             20                                                              1s
                             10
                              0
                                  1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22
                                                        Participant
Gap Handling
                         Raw   100ms   500ms    1s
Within 5% (fixed)         0      1       1      1
Within 10% (fixed)        1      3       3      4
Within 5% (adjusted)      5      5       8      7
Within 10% (adjusted)     7      7       8      8


• Applying a gap algorithm appears to improve
  results with a 500ms threshold

• Accuracy was within 10% for two thirds of
  participants when allowing the seat position to
  be adjusted and using the algorithm
Gaze locations (calibrated)
       0   10   20   30   40   50   60   70   80   90   100
  0

  10

  20

  30

  40

  50

  60

  70

  80

  90

 100
Gaze locations (calibrated)
Gaze locations (typical uncalibrated)
Experiment 1: Summary
    •   Device was not able to
        continuously track the gaze
        of a seated viewer.

    •   An attempt to improve
        results by handling gaps in
        the data were successful, but
        only to a degree.

    •   Vertical gaze location was not
        accurate for uncalibrated
        viewers.
Experiment 2
Experiment 2: objectives
                    & methodology
               •   Measure whether the
                   device could be more precise
                   when calibrated for each
                   participant

               •   We repeated experiment 1
                   (part 1 and 2) with 12
                   participants but calibrated the
                   devices individually

               •   This second experiment was set
                   in a lab, where the exact size of
                   the painting was reproduced on a
                   board
Relative quantity of valid gaze data
                                 100




                                  80
difference (% of session time)




                                  60




                                  40




                                  20




                                   0
                                       1   2   3   4   5   6   7   8   9   10   11   12


                                 -20
Gaze Duration
       Delta          Max. calibration        Glasses          Seated eye
(% of session time)      “score”                                elevation

      7.719                6.70          did without glasses       49
      15.159               2.76                  yes              51.5
      6.306                13.90                 no                54
      6.728                6.13                  yes               52
      5.164                3.30                  no              51.25
      10.071               6.78                  no                48
      0.007                5.13                  no              53.75
      6.082                4.07          did without glasses     47.75
      4.572                4.36                  yes              50.5
      2.732                10.10                 no                52
      12.240               12.15                 yes              49.5
      3.790                3.77                  no                51
Gaze Duration
Accuracy            Raw data     100ms threshold   500ms threshold
Within 2%              1               4                 6
Within 5%              4               7                 8
Within 10%             9               12                12



• An improvement over the first experiment

• One third of participants were in the 5-10% range
Gaze Location
    Best session
Gaze Location
    Worst session
Comparisons
  Average error     Max. calibration “score”   Glasses
(degrees of FOV)
     0.96                    6.70               no
     1.60                    2.76               yes
     1.72                    13.90              no
     2.47                    6.13               yes
     0.88                    3.30               no
     1.61                    6.78               no
     0.90                    5.13               no
     1.24                    4.07               no
     1.18                    4.36               yes
     1.76                    10.10              no
     3.00                    12.15              yes
     2.22                    3.77               no
Experiment 2: Summary
   •   Applying the gap handling
       algorithm brought all sessions
       within 10% of the manual
       measurement

   •   Gaze duration results were
       better than in the uncalibrated
       study, but still not what we
       hoped for

   •   Gaze location results were also
       better than in the uncalibrated
       study, but not as accurate as
       expected
Future Work

                            • Experiment 3 will
                              make use of the
                              tracker during a VTS
                              session

                            • We will evaluate
                              whether the data
                              recorded assists in
                              understanding what
                              VTS participants look
Photo from PAAM.org
                              at during a session
Thank You

ebachta@imamuseum.org
sfantoni@imamuseum.org

More Related Content

PPTX
Quality&amp;Info Management Ed Presentation
KEY
Privacy protection of visual information
PDF
DYNAMIC REGION OF INTEREST TRANSCODING FOR MULTIPOINT VIDEO ...
PDF
Scrambling For Video Surveillance
PPTX
Pros and Cons of Eyetracking
PPTX
Dmitry Stepanov - Detector of interest point from region of interest on NBI ...
PDF
ICME 2016 - Tutorial on Interactive Search in Video & Lifelog Repositories
PPTX
Extraction of region of interest in an image
Quality&amp;Info Management Ed Presentation
Privacy protection of visual information
DYNAMIC REGION OF INTEREST TRANSCODING FOR MULTIPOINT VIDEO ...
Scrambling For Video Surveillance
Pros and Cons of Eyetracking
Dmitry Stepanov - Detector of interest point from region of interest on NBI ...
ICME 2016 - Tutorial on Interactive Search in Video & Lifelog Repositories
Extraction of region of interest in an image

Viewers also liked (12)

PDF
Video Forgery Detection: Literature review
PDF
Eye Tracking & Consumer Behavior
PDF
Eye tracking and its economic feasibility
PPTX
An eye tracker analysis of the influence of applicant attractiveness on emplo...
PPTX
Interactive Video Search - Tutorial at ACM Multimedia 2015
PPT
Region Of Interest Extraction
PPTX
Image Processing Based Signature Recognition and Verification Technique Using...
PPT
Eye-tracking presentation
PPT
Theeye tribe, it s a eye tracking device which makes the usage of PC, laptops...
PPTX
Eye Tracking & Design
PPTX
Real time image processing ppt
PDF
Workshopvin4 Region Of Interest Advanced Video Coding
Video Forgery Detection: Literature review
Eye Tracking & Consumer Behavior
Eye tracking and its economic feasibility
An eye tracker analysis of the influence of applicant attractiveness on emplo...
Interactive Video Search - Tutorial at ACM Multimedia 2015
Region Of Interest Extraction
Image Processing Based Signature Recognition and Verification Technique Using...
Eye-tracking presentation
Theeye tribe, it s a eye tracking device which makes the usage of PC, laptops...
Eye Tracking & Design
Real time image processing ppt
Workshopvin4 Region Of Interest Advanced Video Coding
Ad

Similar to Mw2012 eyetracking (20)

PDF
Lookingglass Presentation4
PDF
Sentient Computing For Innovation
PDF
Usability Study Mouse Zoom
PDF
Logovinsky Alzforum Feb 28 2013 final
PDF
Measuring User Experience
PPTX
GReAT Aphasia Technology Event January 2012
PDF
The Evolution Of Eye Tracking
PPT
Qualifications And Experience Presentation
PPTX
Process Improvement using Lean Six Sigma Methodology
PPTX
Towards Interaction Models Derived From Eye-tracking Data .
PPT
Motion diagrams, data charts, graphs
PPT
Motion diagrams, data charts, graphs
PPT
Motion diagrams, data charts, graphs
PPT
Motion diagrams, data charts, graphs
PDF
Evaluating Touch Gesture Usability
PDF
Session 61 Albert Kircher
PDF
Towards Neuro–Information Science
PPT
Wesnes Alzforum Feb 28 2013
PPTX
Final Presentation
PDF
Keynote qa&test okt 2012 testing for the business, it’s a people’s job v03
Lookingglass Presentation4
Sentient Computing For Innovation
Usability Study Mouse Zoom
Logovinsky Alzforum Feb 28 2013 final
Measuring User Experience
GReAT Aphasia Technology Event January 2012
The Evolution Of Eye Tracking
Qualifications And Experience Presentation
Process Improvement using Lean Six Sigma Methodology
Towards Interaction Models Derived From Eye-tracking Data .
Motion diagrams, data charts, graphs
Motion diagrams, data charts, graphs
Motion diagrams, data charts, graphs
Motion diagrams, data charts, graphs
Evaluating Touch Gesture Usability
Session 61 Albert Kircher
Towards Neuro–Information Science
Wesnes Alzforum Feb 28 2013
Final Presentation
Keynote qa&test okt 2012 testing for the business, it’s a people’s job v03
Ad

More from Silvia Fantoni (7)

PDF
To charge or not to charge. The admission dilemma
PDF
The Challenge of Engaging Millennials in Art Museums
PDF
Visitor-Centered Exhibition Design: A paradigm Shift for Art Museums
PPTX
Engaging Visitors in Innovative Ways: Process, Data and Participation
PPTX
Whats the point? Two case studies of introducing digital in-gallery experiences
PPTX
Participatory Exhibition Design
PPTX
Mw2012 eyetracking
To charge or not to charge. The admission dilemma
The Challenge of Engaging Millennials in Art Museums
Visitor-Centered Exhibition Design: A paradigm Shift for Art Museums
Engaging Visitors in Innovative Ways: Process, Data and Participation
Whats the point? Two case studies of introducing digital in-gallery experiences
Participatory Exhibition Design
Mw2012 eyetracking

Mw2012 eyetracking

  • 1. Evaluating the Practical Applications Evaluating the Practical Applications of Eye Tracking in Museums Ed Bachta & Silvia Filippini-Fantoni
  • 3. Research results Research so far has indicated that visitors spend little time looking at artworks: • Hein, 1998 • Smith, 2001 • Worths, 2003 (‘grazing’) • Viewing Project, IMA 2012
  • 4. Observation vs. eye tracking • Use of observational rubric is labor-intensive and not always precise • Eye tracking technology has the potential of being more precise and less time- consuming • More recent development of less intrusive devices (non head mounted) Photo from Milekic MW 2010
  • 5. Sparks! grant objectives • Gauge the practicality of using such devices in a museum gallery setting. • Assess the ability of current eye tracking technology to reveal what visitors are looking at and for how long. • Explore the potential use of this equipment in a practical setting (e.g. VTS discussion)
  • 6. Infrared Emitters Camera EyeTech VT2
  • 7. Device initial testing • Eye tracker range limitations. • Too much variation in height when the person is standing. • For the experiments the viewer has to be seated, with the tracker placed in a fixed position between the tracker and the painting.
  • 8. Experiment 1 © Edward Hopper.
  • 9. Experiment 1: objectives • Distinguish when the participant looks inside/outside of the painting. • Measure time spent looking inside/outside of the painting. • Track where looking inside the field of the artwork. Calibration performed once
  • 10. The device was installed on a cart between the work of art and the seated participant and calibrated to the first participant.
  • 11. Participants’ standing and seated height were measured and distance from mid-eye to floor.
  • 12. Experiment 1: part 1 • 22 participants were asked to look in and outside the painting for 1 minute. • First 10 participants could not adjust their chair position to optimize eye tracking, while the next 12 were asked to do so. • Participants’ gazes (inside the field of the painting) were tracked by 2 research assistants with stopwatches. • The times were averaged and compared to the time tracked by the device
  • 13. Experiment 1: part 2 • A subset of participants (8 of the 22) were asked to look (over a period of 60 seconds) at 6 different areas of the work for 10 seconds each in sequence prompted by a research assistant. • Tracker data was logged in the same manner as the previous experiment.
  • 14. Relative quantity of valid gaze data Missing data for >25% of session time for 6 participants 20 10 difference (% of session time) 0 -10 -20 -30 -40 -50 -60 -70 -80 Fixed seat position Adjusted seat position 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22
  • 15. Relative quantity of valid gaze data Poor performance for 4 of 6 participants at low eye level (<50”) 20 10 difference (% of session time) 0 -10 -20 -30 -40 -50 -60 -70 -80 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22
  • 16. Relative quantity of valid gaze data Missing > 10% for 7 of 10 glasses wearers 20 10 difference (% of session time) 0 -10 -20 -30 -40 -50 -60 -70 -80 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22
  • 17. Comparison against manual measurement 100 90 error (% of session time) 80 70 60 50 40 30 20 10 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 Participant
  • 18. Comparison against manual measurement Fixed seat position Adjusted seat position Within 5% 0 5 Within 10% 1 7 • Allowing the participant to adjust the seat position produces better results • We were still hoping for better accuracy
  • 19. Gap Handling 100 90 error (% of session time) 80 70 60 Raw 50 40 100ms 30 500ms 20 1s 10 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 Participant
  • 20. Gap Handling Raw 100ms 500ms 1s Within 5% (fixed) 0 1 1 1 Within 10% (fixed) 1 3 3 4 Within 5% (adjusted) 5 5 8 7 Within 10% (adjusted) 7 7 8 8 • Applying a gap algorithm appears to improve results with a 500ms threshold • Accuracy was within 10% for two thirds of participants when allowing the seat position to be adjusted and using the algorithm
  • 21. Gaze locations (calibrated) 0 10 20 30 40 50 60 70 80 90 100 0 10 20 30 40 50 60 70 80 90 100
  • 23. Gaze locations (typical uncalibrated)
  • 24. Experiment 1: Summary • Device was not able to continuously track the gaze of a seated viewer. • An attempt to improve results by handling gaps in the data were successful, but only to a degree. • Vertical gaze location was not accurate for uncalibrated viewers.
  • 26. Experiment 2: objectives & methodology • Measure whether the device could be more precise when calibrated for each participant • We repeated experiment 1 (part 1 and 2) with 12 participants but calibrated the devices individually • This second experiment was set in a lab, where the exact size of the painting was reproduced on a board
  • 27. Relative quantity of valid gaze data 100 80 difference (% of session time) 60 40 20 0 1 2 3 4 5 6 7 8 9 10 11 12 -20
  • 28. Gaze Duration Delta Max. calibration Glasses Seated eye (% of session time) “score” elevation 7.719 6.70 did without glasses 49 15.159 2.76 yes 51.5 6.306 13.90 no 54 6.728 6.13 yes 52 5.164 3.30 no 51.25 10.071 6.78 no 48 0.007 5.13 no 53.75 6.082 4.07 did without glasses 47.75 4.572 4.36 yes 50.5 2.732 10.10 no 52 12.240 12.15 yes 49.5 3.790 3.77 no 51
  • 29. Gaze Duration Accuracy Raw data 100ms threshold 500ms threshold Within 2% 1 4 6 Within 5% 4 7 8 Within 10% 9 12 12 • An improvement over the first experiment • One third of participants were in the 5-10% range
  • 30. Gaze Location Best session
  • 31. Gaze Location Worst session
  • 32. Comparisons Average error Max. calibration “score” Glasses (degrees of FOV) 0.96 6.70 no 1.60 2.76 yes 1.72 13.90 no 2.47 6.13 yes 0.88 3.30 no 1.61 6.78 no 0.90 5.13 no 1.24 4.07 no 1.18 4.36 yes 1.76 10.10 no 3.00 12.15 yes 2.22 3.77 no
  • 33. Experiment 2: Summary • Applying the gap handling algorithm brought all sessions within 10% of the manual measurement • Gaze duration results were better than in the uncalibrated study, but still not what we hoped for • Gaze location results were also better than in the uncalibrated study, but not as accurate as expected
  • 34. Future Work • Experiment 3 will make use of the tracker during a VTS session • We will evaluate whether the data recorded assists in understanding what VTS participants look Photo from PAAM.org at during a session

Editor's Notes

  • #3: Visitor engagement with works of art has been the subject of numerous studies in the course of the last 30 years.
  • #4: Data shows that visitors spend little time at individual exhibit components and seldom read labels (Hein, 1998)Studies at the MET found a mean time of less than 30 seconds viewing an object (Smith, 2001)Visitors wander slowly past many artworks spending only seconds looking at a particular one (“grazing” – Worts, 2003)Time spent looking at an artwork between 4 and 31 seconds (median time) covering a number of installations (IMA’s Viewing Project Report, 2012)
  • #5: Techniques used by researchers for measuring visitor gaze are based on observational rubrics.These techniques are labor intensive and unable to tell us at exactly what the visitor is looking.Eye tracking technology can be useful in this respect, especially more recently with the development of less intrusive (as opposed to a head-mounted device) eye tracking devices that can be used to track gaze from afar.
  • #6: Given the recent developments in eye tracking technology and its ability to track visitors gaze less obtrusively, the Indianapolis Museum of Art has been awarded in the spring of 2011 an IMLS grant with the aim of:1) Gauging the practicality of using such devices in a gallery setting.2) Determining the ability of current eye tracking technology to measure precisely how long and what people are looking at.3)Explore the potential use of this equipment in a practical setting like a VTS session or as a way to activate content relevant to a work of art.These issues were to be addressed and tested in the context of 3 separate experiments to be conducted at the IMA between July 2011 and June 2012.
  • #7: In determining which hardware technology to use for our research, head mounted trackers were ruled out because one of the project goals is to determine whether an eye tracking system can be used to detect the gaze of a general museum visitor. Of the non head-mounted models available, the EyeTech Digital Systems VT2 eye tracker was chosen to be used in the experiments (http://www.eyetechds.com/). How does it work? During operation, the eye tracker emits IR light toward the viewer and captures a series of images (called frames) with a camera. Each frame is analyzed by the tracker to determine the position of the reflections of the IR light on the surface of the eyes. The tracker reports a set of data related to each frame, the calculated gaze point relative to the points used for calibration (if calibrated), and other parameters derived during the calculation.
  • #8: In our discussion with the representatives from EyeTech and experience during development, we found that the tracker has a series of limitations when it comes to its range. The tracker has an ideal viewing range of 25 inches from thefront panel. It is also unable to detect the eyes if they are looking too far past the left or right edges of the device or toohigh above the device. What this meant for our experiments was that detecting the eyes of a person with arbitrary height when standing was not going to be reliable, as adjustments to the position and/or angle of the tracker would be required.Our experimentation was therefore restricted to a scenario in which the viewer is seated, with the tracker placed in a fixed position between the tracker and the painting.
  • #9: The piece chosen was Edward Hopper’s Hotel Lobby due to:It being on display for the grant period.It being suitable for a VTS discussion or to trigger audio content.There being noticeable representational details that could be used to focus one’s attention.
  • #10: Understanding the device’s effectiveness when non-calibrated for each individual user:Distinguish when the participant looks inside/outside of the painting.Measure time spent looking inside/outside of the painting.Track where looking inside the field of the artwork.
  • #16: Overall, invalid frames surpassed the amount of time spent looking away by 20% of the session time for 6 of the participants. While this was the case during phase 1 (participants 1-10) for 4 people (in fact no valid frames were reported during two of these sessions), an improvement can be seen in phase 2 (participants 11-22), where the seat position was adjusted.
  • #17: The duration represented by invalid frames surpassed the amount of time spent looking away by over 40% of the session time for 4 of the 6 participants whose eye level was below 50” from the ground. The tracker was not able to register any valid data for two of these participants in phase 1, and two participants in phase 2 had difficulty getting into a good position. There did not appear to be a correlation to the quantity of valid data for seated eye levels between 50 and 53.75 inches (the highest seated eye level in the study).
  • #21: With 500ms threshold, 9 sessions are within 5% (was 5)Also 11 sessions are within 10% (was 7)Increasing to 1s introduces error
  • #24: The distribution of points within a cluster is approx 5%This corresponds to roughly 1.5% of the field of view, which is the region within which the eye moves involuntarily while fixating
  • #26: Clustering was performed without removing outliers
  • #29: As issues of accuracy in determining exactly where and for how long the participant is looking have emerged from experiment 1, it was decided to repeat the procedure for experiment 2 but this time we calibrated the device for each participant (12 in total) to see if the results would be more accurate.Unlike the first experiment which took place in the gallery, this second experiment was set in one of the IMA Adult Lecture room, where the exact size of the painting was reproduced on a white board. The use of the board as opposed to a painting reduced the chance of distraction during this test of accuracy, and more exact reference points allowed for higher precision in evaluation.
  • #38: Average error: 1.6 degrees