SlideShare a Scribd company logo
Creating Session Data from
eTextbook Event Streams
SAMNYEONG HEO, MOHAMMED
FARGHALLY, MOSTAFA MOHAMMED,
CLIFF SHAFFER
DEPARTMENT OF COMPUTER SCIENCE,
VIRGINIA TECH
JULY 3, 2023
FIFTH WORKSHOP ON INTELLIGENT TEXTBOOKS ( ITEXTBOOKS)
PROBLEM
 Intelligent Textbooks typically provide text and
interactive elements
 Easy to collect detailed event-level data
 Raw (event-level) data can be hard to interpret
 A large class can generate a million+ events
OBJECTIVES
 Group raw event data into study sessions
 Recognize behaviors
 Find correlations behavior and student
performance
BACKGROUND - OpenDSA
 Open-source electronic textbook system
 Provides contents on CS-related topics
• DSA, Formal Languages, Finite Automata
 Provides algorithm visualizations, interactive
exercises, proficiency exercises
Module
• A fundamental unit of
OpenDSA
• Resembles a specific topic or
section in a traditional
textbook
• Groups of modules form
“chapters”
Structure of OpenDSA
Creating Session Data from eTextbook Event Streams
Proficiency Exercises
• A type of exercise used in
FLA course
• Student creates a state
machine represented by a
graph
• Solutions are tested using test
cases
OpenDSA Exercises
PIFrameset: Programmed
Instruction Frameset
• Customized slideshows used
in FLA course
• Must answer questions on
slides in order to move to the
next
• A credit will be awarded after
completing the slide set
OpenDSA PIFramesets
User Interaction Data
 Core information to analyze students’ behavior
 Records user activities in OpenDSA
 Window events
 Button clicks
 Credit events
User Interaction Data
Session
• A session is a key organizing unit of behavior
• Study sessions are a continuous stream of
interactions without a “significant” gap
• The time threshold for the gap is 10 minutes
SESSION DATA
Abstracting User Interaction Data
Abstracting User Interaction Data
Abstracted Interaction Data
EXPERIMENT DATA: Courses
 Data collected from 4 classes
 FLA Fall 2020 / Spring 2021
 DSA Fall 2020 / Spring 2021 (Data
aggregated)
EXPERIMENT - Behaviors
 Session-level data is rather a lot of information
for instructors to absorb
 “Behaviors” are a higher-level of abstraction
BEHAVIOR: Reading Prose
BEHAVIOR: Reading Prose
 A transition from “module opening” to “exercise
state” (load-to-exercise) or leaving the page is
made within the time threshold (15 to 120
seconds)
 Expect to see three clusters
 1. Not reading – Less than 15 seconds
 2. Reading – Between 15 and 120 seconds
 3. Stepped Away – More than 120 seconds
SESSION STATES
 State
• Series of events on the same content
element
• Such as Document State, Window State, PE
State, FF State, Other State
• Defining session states allows us to recognized
multi-state behaviors.
BEHAVIOR: Credit Seeking
 A transition from “module opening” to “exercise
state” (load-to-exercise) is made less than the
time threshold (15 seconds).
 Students seeking credits from completing
slideshows and solving exercises without
reading the material.
BEHAVIOR: PI Credit Seeking
 A transition from FF state to the same FF state is
triggered within the threshold (8 seconds)
 Student advances to the next slide within 8
seconds, indicating a possible credit seeking
(guessing)
 Instead of raw counts of the behavior, we
consider the ratio of the total number of PIFrame
related events and the PI credit seeking events
SESSION STATES: Uses
 Instructor investigation: Present abstract session
data to instructors or researchers to examine.
 Ex: Instructor can see from session data that
student X is exhibiting high credit-seeking
behavior. Useful to know when counseling
student.
 Automated Analysis: Preprocessing step
BEHAVIOR: Automated Analysis
1. Define various behaviors
2. Recognize those behaviors from session data
3. Check correlation between behaviors and performance
• Assign students into two groups: High vs Low
performers
• Investigate whether there are differences between
the groups on candidate behaviors
• If significant difference exists, then examine whether
behavior affects performance
Reading Prose Behavior
C S 5 0 4 0
H i g h P e r f o r m e r L o w p e r f o r m e r
# of Students 35 21
Lowest occurrence 0 0
Highest occurrence 61 69
Average occurrence 18 15
Std. Deviation 15 16
P-value 0.54
PI Credit Seeking Behavior
C S 4 1 1 4 F C S 4 1 1 4 S
H i g h P e r f o r m e r L o w p e r f o r m e r H i g h P e r f o r m e r L o w p e r f o r m e r
# of Students 33 25 24 23
Lowest Ratio 1.302 1.472 1.32 0
Highest Ratio 2.452 2.732 2.69 7.67
Average Ratio 1.790 1.883 1.67 1.82
Std. Deviation 0.265 0.310 0.32 1.32
P-value 0.24 0.60
• Only Activity Sessions behavior
shows a significant relationship
to the performance in terms of
overall course grades
• The HP group tends to
generate more Activity
Sessions behavior, and less PI
Credit Seeking behavior ratio
C S 4 1 1 4 F C S 4 1 1 4 S
H P L P H P L P
Activity
Sessions
Avg.
occurrence
287 182 441 231
P-value 0.013 0.01
PI Credit
Seeking
Avg. Ratio 1.79 1.88 1.67 1.82
P-value 0.24 0.60
FLA Activity Sessions
CONCLUSIONS
 Main goal is to group the raw interaction data
into sessions, then generate abstracted
session-level data
 Suggested uses for abstracted data
 Defined possible behaviors that can be
observed
FUTURE WORK
 Additional Learning Behaviors
• Frequently jumping in and out of a module
• Opens multiple modules simultaneously
 Time considerations for sessions
• Tracking scroll events
Questions
?

More Related Content

PPT
Cst analysis overview sdusd princ 11-16-10
PPT
Assessing for Improvement: learning outcomes assessment for library instruction
DOCX
Running Header PROJECT PHASE 1 .docx
PDF
2016 McCabe-Brown Student Feedback - Benchmark Results
PDF
2016 McCabe Brown Student Feedback - Benchmark Results
PPTX
Gamifying Research Activity Support System
PDF
Framing Blended learning, teaching, and education
PPTX
Technology Enhanced Learning of Modeling Skills in the Field of Information S...
Cst analysis overview sdusd princ 11-16-10
Assessing for Improvement: learning outcomes assessment for library instruction
Running Header PROJECT PHASE 1 .docx
2016 McCabe-Brown Student Feedback - Benchmark Results
2016 McCabe Brown Student Feedback - Benchmark Results
Gamifying Research Activity Support System
Framing Blended learning, teaching, and education
Technology Enhanced Learning of Modeling Skills in the Field of Information S...

Similar to Creating Session Data from eTextbook Event Streams (20)

PPTX
Analytics - Presentation in DkIT
PPT
PeerPigeon - eLearn 2008
PPTX
UXD II - Usability Study Final Presentation
PPTX
University of Washington Global Health E-Learning
PPTX
The why and what of testa
PDF
A Learning Analytics Approach
PPT
Modern information Retrieval-Relevance Feedback
PDF
(Ebook) Teaching Online: A Practical Guide by Susan Ko; Steve Rossen ISBN 978...
PPTX
Getting to grips with TESTA methods
PPT
Model Blended Course TESL Ontario2009
PDF
Addictive links, Keynote talk at WWW 2014 workshop
PPT
Ipt syllabus overview_zarkovic
PPTX
Assessment Tools and Strategies
DOCX
S3 9789-mfy293-managerial and cost accounting
PPTX
Jisc learning analytics service oct 2016
PPTX
A Course by Any Other Name.v3
PPT
Distance Learning Amatyc Access 2008
PPT
Sample for ETEC 603
PPT
Lifemikeh
PPT
Lifemikeh
Analytics - Presentation in DkIT
PeerPigeon - eLearn 2008
UXD II - Usability Study Final Presentation
University of Washington Global Health E-Learning
The why and what of testa
A Learning Analytics Approach
Modern information Retrieval-Relevance Feedback
(Ebook) Teaching Online: A Practical Guide by Susan Ko; Steve Rossen ISBN 978...
Getting to grips with TESTA methods
Model Blended Course TESL Ontario2009
Addictive links, Keynote talk at WWW 2014 workshop
Ipt syllabus overview_zarkovic
Assessment Tools and Strategies
S3 9789-mfy293-managerial and cost accounting
Jisc learning analytics service oct 2016
A Course by Any Other Name.v3
Distance Learning Amatyc Access 2008
Sample for ETEC 603
Lifemikeh
Lifemikeh
Ad

More from Sergey Sosnovsky (20)

PPTX
Harnessing Textbooks for High-Quality Labeled Data: An Approach to Automatic ...
PDF
Toward Eliminating Hallucinations: GPT-based Explanatory AI for Intelligent T...
PDF
Layout- and Activity-based Textbook Modeling for Automatic PDF Textbook Extra...
PPTX
Exploring the Content Ecosystem of the First Open-source Adaptive Tutor and i...
PPTX
Advancing Intelligent Textbooks with Automatically Generated Practice: A Larg...
PDF
Augmenting Digital Textbooks with Reusable Smart Learning Content: Solutions ...
PDF
Interactions of reading and assessment activities
PDF
Parallel Construction: A Parallel Corpus Approach for Automatic Question Gene...
PDF
YAI4Edu: an Explanatory AI to Generate Interactive e-Books for Education
PDF
Automatic Question Generation for Evidence-based Online Courseware Engineering
PDF
Reading Comprehension Quiz Generation using Generative Pre-trained Transformers
PPTX
Mathematical Language Processing via Tree Embeddings
PPTX
Contextual Definition Generation
PPTX
Transforming Textbooks into Learning by Doing Environments: An Evaluation of ...
PPTX
Generation of Assessment Questions from Textbooks Enriched with Knowledge Models
PPTX
Using Semantics of Textbook Highlights to Predict Student Comprehension and K...
PPTX
Dental TutorBot: Exploitation of Dental Textbooks for Automated Learning
PDF
What's in a textbook
PPTX
Using Programmed Instruction to Help Students Engage with eTextbook Content
PPTX
Adding Intelligence to a Textbook for Human Anatomy with a Causal Concept Map...
Harnessing Textbooks for High-Quality Labeled Data: An Approach to Automatic ...
Toward Eliminating Hallucinations: GPT-based Explanatory AI for Intelligent T...
Layout- and Activity-based Textbook Modeling for Automatic PDF Textbook Extra...
Exploring the Content Ecosystem of the First Open-source Adaptive Tutor and i...
Advancing Intelligent Textbooks with Automatically Generated Practice: A Larg...
Augmenting Digital Textbooks with Reusable Smart Learning Content: Solutions ...
Interactions of reading and assessment activities
Parallel Construction: A Parallel Corpus Approach for Automatic Question Gene...
YAI4Edu: an Explanatory AI to Generate Interactive e-Books for Education
Automatic Question Generation for Evidence-based Online Courseware Engineering
Reading Comprehension Quiz Generation using Generative Pre-trained Transformers
Mathematical Language Processing via Tree Embeddings
Contextual Definition Generation
Transforming Textbooks into Learning by Doing Environments: An Evaluation of ...
Generation of Assessment Questions from Textbooks Enriched with Knowledge Models
Using Semantics of Textbook Highlights to Predict Student Comprehension and K...
Dental TutorBot: Exploitation of Dental Textbooks for Automated Learning
What's in a textbook
Using Programmed Instruction to Help Students Engage with eTextbook Content
Adding Intelligence to a Textbook for Human Anatomy with a Causal Concept Map...
Ad

Recently uploaded (20)

PPTX
neck nodes and dissection types and lymph nodes levels
PPTX
Classification Systems_TAXONOMY_SCIENCE8.pptx
PDF
IFIT3 RNA-binding activity primores influenza A viruz infection and translati...
PDF
SEHH2274 Organic Chemistry Notes 1 Structure and Bonding.pdf
PPTX
Microbiology with diagram medical studies .pptx
PPTX
Taita Taveta Laboratory Technician Workshop Presentation.pptx
PPTX
Comparative Structure of Integument in Vertebrates.pptx
PPTX
cpcsea ppt.pptxssssssssssssssjjdjdndndddd
PPTX
2. Earth - The Living Planet Module 2ELS
PPTX
7. General Toxicologyfor clinical phrmacy.pptx
PPTX
Introduction to Fisheries Biotechnology_Lesson 1.pptx
PDF
Unveiling a 36 billion solar mass black hole at the centre of the Cosmic Hors...
PPTX
ANEMIA WITH LEUKOPENIA MDS 07_25.pptx htggtftgt fredrctvg
PDF
Phytochemical Investigation of Miliusa longipes.pdf
PDF
An interstellar mission to test astrophysical black holes
PDF
HPLC-PPT.docx high performance liquid chromatography
PDF
The scientific heritage No 166 (166) (2025)
PPT
The World of Physical Science, • Labs: Safety Simulation, Measurement Practice
PPTX
DRUG THERAPY FOR SHOCK gjjjgfhhhhh.pptx.
PDF
bbec55_b34400a7914c42429908233dbd381773.pdf
neck nodes and dissection types and lymph nodes levels
Classification Systems_TAXONOMY_SCIENCE8.pptx
IFIT3 RNA-binding activity primores influenza A viruz infection and translati...
SEHH2274 Organic Chemistry Notes 1 Structure and Bonding.pdf
Microbiology with diagram medical studies .pptx
Taita Taveta Laboratory Technician Workshop Presentation.pptx
Comparative Structure of Integument in Vertebrates.pptx
cpcsea ppt.pptxssssssssssssssjjdjdndndddd
2. Earth - The Living Planet Module 2ELS
7. General Toxicologyfor clinical phrmacy.pptx
Introduction to Fisheries Biotechnology_Lesson 1.pptx
Unveiling a 36 billion solar mass black hole at the centre of the Cosmic Hors...
ANEMIA WITH LEUKOPENIA MDS 07_25.pptx htggtftgt fredrctvg
Phytochemical Investigation of Miliusa longipes.pdf
An interstellar mission to test astrophysical black holes
HPLC-PPT.docx high performance liquid chromatography
The scientific heritage No 166 (166) (2025)
The World of Physical Science, • Labs: Safety Simulation, Measurement Practice
DRUG THERAPY FOR SHOCK gjjjgfhhhhh.pptx.
bbec55_b34400a7914c42429908233dbd381773.pdf

Creating Session Data from eTextbook Event Streams

  • 1. Creating Session Data from eTextbook Event Streams SAMNYEONG HEO, MOHAMMED FARGHALLY, MOSTAFA MOHAMMED, CLIFF SHAFFER DEPARTMENT OF COMPUTER SCIENCE, VIRGINIA TECH JULY 3, 2023 FIFTH WORKSHOP ON INTELLIGENT TEXTBOOKS ( ITEXTBOOKS)
  • 2. PROBLEM  Intelligent Textbooks typically provide text and interactive elements  Easy to collect detailed event-level data  Raw (event-level) data can be hard to interpret  A large class can generate a million+ events
  • 3. OBJECTIVES  Group raw event data into study sessions  Recognize behaviors  Find correlations behavior and student performance
  • 4. BACKGROUND - OpenDSA  Open-source electronic textbook system  Provides contents on CS-related topics • DSA, Formal Languages, Finite Automata  Provides algorithm visualizations, interactive exercises, proficiency exercises
  • 5. Module • A fundamental unit of OpenDSA • Resembles a specific topic or section in a traditional textbook • Groups of modules form “chapters” Structure of OpenDSA
  • 7. Proficiency Exercises • A type of exercise used in FLA course • Student creates a state machine represented by a graph • Solutions are tested using test cases OpenDSA Exercises
  • 8. PIFrameset: Programmed Instruction Frameset • Customized slideshows used in FLA course • Must answer questions on slides in order to move to the next • A credit will be awarded after completing the slide set OpenDSA PIFramesets
  • 9. User Interaction Data  Core information to analyze students’ behavior  Records user activities in OpenDSA  Window events  Button clicks  Credit events
  • 11. Session • A session is a key organizing unit of behavior • Study sessions are a continuous stream of interactions without a “significant” gap • The time threshold for the gap is 10 minutes SESSION DATA
  • 15. EXPERIMENT DATA: Courses  Data collected from 4 classes  FLA Fall 2020 / Spring 2021  DSA Fall 2020 / Spring 2021 (Data aggregated)
  • 16. EXPERIMENT - Behaviors  Session-level data is rather a lot of information for instructors to absorb  “Behaviors” are a higher-level of abstraction
  • 18. BEHAVIOR: Reading Prose  A transition from “module opening” to “exercise state” (load-to-exercise) or leaving the page is made within the time threshold (15 to 120 seconds)  Expect to see three clusters  1. Not reading – Less than 15 seconds  2. Reading – Between 15 and 120 seconds  3. Stepped Away – More than 120 seconds
  • 19. SESSION STATES  State • Series of events on the same content element • Such as Document State, Window State, PE State, FF State, Other State • Defining session states allows us to recognized multi-state behaviors.
  • 20. BEHAVIOR: Credit Seeking  A transition from “module opening” to “exercise state” (load-to-exercise) is made less than the time threshold (15 seconds).  Students seeking credits from completing slideshows and solving exercises without reading the material.
  • 21. BEHAVIOR: PI Credit Seeking  A transition from FF state to the same FF state is triggered within the threshold (8 seconds)  Student advances to the next slide within 8 seconds, indicating a possible credit seeking (guessing)  Instead of raw counts of the behavior, we consider the ratio of the total number of PIFrame related events and the PI credit seeking events
  • 22. SESSION STATES: Uses  Instructor investigation: Present abstract session data to instructors or researchers to examine.  Ex: Instructor can see from session data that student X is exhibiting high credit-seeking behavior. Useful to know when counseling student.  Automated Analysis: Preprocessing step
  • 23. BEHAVIOR: Automated Analysis 1. Define various behaviors 2. Recognize those behaviors from session data 3. Check correlation between behaviors and performance • Assign students into two groups: High vs Low performers • Investigate whether there are differences between the groups on candidate behaviors • If significant difference exists, then examine whether behavior affects performance
  • 24. Reading Prose Behavior C S 5 0 4 0 H i g h P e r f o r m e r L o w p e r f o r m e r # of Students 35 21 Lowest occurrence 0 0 Highest occurrence 61 69 Average occurrence 18 15 Std. Deviation 15 16 P-value 0.54
  • 25. PI Credit Seeking Behavior C S 4 1 1 4 F C S 4 1 1 4 S H i g h P e r f o r m e r L o w p e r f o r m e r H i g h P e r f o r m e r L o w p e r f o r m e r # of Students 33 25 24 23 Lowest Ratio 1.302 1.472 1.32 0 Highest Ratio 2.452 2.732 2.69 7.67 Average Ratio 1.790 1.883 1.67 1.82 Std. Deviation 0.265 0.310 0.32 1.32 P-value 0.24 0.60
  • 26. • Only Activity Sessions behavior shows a significant relationship to the performance in terms of overall course grades • The HP group tends to generate more Activity Sessions behavior, and less PI Credit Seeking behavior ratio C S 4 1 1 4 F C S 4 1 1 4 S H P L P H P L P Activity Sessions Avg. occurrence 287 182 441 231 P-value 0.013 0.01 PI Credit Seeking Avg. Ratio 1.79 1.88 1.67 1.82 P-value 0.24 0.60 FLA Activity Sessions
  • 27. CONCLUSIONS  Main goal is to group the raw interaction data into sessions, then generate abstracted session-level data  Suggested uses for abstracted data  Defined possible behaviors that can be observed
  • 28. FUTURE WORK  Additional Learning Behaviors • Frequently jumping in and out of a module • Opens multiple modules simultaneously  Time considerations for sessions • Tracking scroll events