Enhancing Online Problems Through Instructor-Centered Tools for Randomized Experiments
Joseph Jay Williams
University of Toronto Computer Science (Nat. U. of Singapore)
www.josephjaywilliams.com/papers, tiny.cc/icepdf
Anna Rafferty, Andrew Ang, Dustin Tingley, Walter Lasecki, Juho Kim
[I’m originally from the Caribbean,
Trinidad and Tobago]
Postdoc at U of T (www.josephjaywilliams.com/postdoc)
Computer Science PhD positions to do Education Research
CHI 2019 subcommittee on “Learning/Families” (Amy Ogan &
I are SCs)
How Can We Help Instructors Conduct A/B Experiments?
• Opportunity: Collect data about alternative instructional
approaches, instead of relying on intuition
• Obstacle: Time and effort to program experiments
• Elaboration Messages in online problems
x = matrix(rnorm(m*n),m,n)
What is the standard error?
Answer:
A z-score is defined as the number of
standard deviations a specific point is
away from the mean.
Elaboration Messages:
Explanations,
Hints,
Learning Tips
Related Work
• Technology for A/B Experimentation (Optimizely,
edX, ASSISTments) (Heffernan & Heffernan, 2014)
• Involving instructors in research (Barab & Squire, 2014)
• Elaboration messages in online problems (Shute, 2008)
Overview
• Design goals for instructor-centered
experimentation
• DynamicProblem, an end-user tool (on-campus
courses & MOOCs)
• Reinforcement learning for dynamic
experimentation
• Insights from deployment with 3 instructors
Goals for Instructor-Centered Experimentation
• 1. Deploy experiments and obtain data with
minimal programming
– Provide end-user plug-in, DynamicProblem
• 2. Use data for practical improvement
– Use reinforcement learning to automatically give
more effective conditions to future students
DynamicProblem Plug-In for Courses
• Embed into any Learning Management System (e.g. Canvas)
or MOOC, via Learning Tools Interoperability Standard
Student View of DynamicProblem
Linda is training for a marathon, which is a race that is 26
miles long.
Her average training time for the 26 miles is 208 minutes,
but the day of the marathon she was x minutes faster than
her average time.
What was Linda's running speed for the marathon in miles
per minute?
Elaboration Message
Linda's speed is the distance she ran divided by the time it took. The
distance Linda ran was 26 miles. The time it took her was 208 – x.
Linda's speed was 26/(208 - x)
26/(208 - x)
How helpful was the above information for your learning?
0 (Completely Unhelpful) 1 2 3 4 5 6 7 8 9 10 (Perfectly Helpful)
ACM Learning @ Scale 2016
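The elaboration message in the student view above reduces to one line of arithmetic. A minimal sketch follows; the function name and the input check are illustrative and not part of DynamicProblem.

```python
def marathon_speed(x, distance_miles=26, average_minutes=208):
    """Speed in miles per minute when the race is x minutes faster than average."""
    race_minutes = average_minutes - x  # time it took on marathon day
    if race_minutes <= 0:
        raise ValueError("x must be smaller than the average time")
    return distance_miles / race_minutes  # speed = distance / time


speed_at_average = marathon_speed(0)    # 26 / 208
speed_when_faster = marathon_speed(78)  # 26 / 130
```

For example, with x = 78 the race takes 130 minutes, so the speed is 26/130 = 0.2 miles per minute.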
Instructor View of DynamicProblem
View Elaboration Messages
Add Elaboration Messages
View Elaboration Messages
Learning Tip
Data Dashboard (Instructor 3)
[Figure: dashboard plot for the "Learning Tip" messages; axis label: Probability of Message]
Observations from Deployment with 3 Instructors
• Lowered Barriers: “not aware of any tools that do this
sort of thing”, “even if I found one, wouldn’t have the
technical expertise to incorporate it in my course”
• Reflection on pedagogy: “I never really seriously
considered [testing] multiple versions as we are now
doing. So even if we don't get any significant data, that
will have been a benefit in my mind”
• Making research practical: “a valuable tool. Putting in
the hands of the teacher to understand how their
students learn. Not just in broad terms, but specifically
in their course”….
2. Use Data For Practical Improvement
• Instructor concerns:
– Experiments advance researchers’ goals, but do not
directly help their students
– Ethics of giving students unhelpful conditions
• Approach of Dynamic Experimentation:
– Analyze data in real-time
– Give higher-rated messages to future students
Dynamic Experimentation: Exploration vs Exploitation
• Multi-Armed Bandit (Reinforcement Learning)
• Action a: Elaboration Message A or B
– Elaboration Message A: The probability is 3/7 * 5/8, because the number of cookies is changing.
– Elaboration Message B: The number of cookies is changing.
• Reward R: Rating (0 to 10 Rating by Student)
– How helpful was the above information for your learning? 0 1 2 3 4 5 6 7 8 9 10
• Model: Probability of Message being Helpful (A: 70%, B: 30%)
• Policy: Randomized Probability Matching (Thompson Sampling)
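The bandit loop on this slide (model, policy, action, reward) can be sketched roughly as below. This is a minimal illustration, not the DynamicProblem implementation. It assumes a Beta posterior per message with a uniform prior, and it treats a 0 to 10 rating as a fractional "helpful" observation. The `BetaArm` class, the simulated students, and their mean ratings are all hypothetical.

```python
import random


class BetaArm:
    """Beta posterior over the probability that one message is helpful."""

    def __init__(self, name):
        self.name = name
        self.alpha = 1.0  # assumed uniform prior: Beta(1, 1)
        self.beta = 1.0

    def sample(self, rng):
        # Draw one plausible "probability helpful" from the current posterior.
        return rng.betavariate(self.alpha, self.beta)

    def update(self, rating):
        # Assumption: a 0-10 rating is rescaled to [0, 1] and added as a
        # fractional success; the remainder counts as a fractional failure.
        reward = rating / 10.0
        self.alpha += reward
        self.beta += 1.0 - reward


def choose_message(arms, rng):
    """Thompson Sampling: sample every posterior, show the arm with the top draw."""
    return max(arms, key=lambda arm: arm.sample(rng))


rng = random.Random(0)
arms = [BetaArm("A"), BetaArm("B")]
true_mean_rating = {"A": 7, "B": 4}  # hypothetical simulated students
counts = {"A": 0, "B": 0}

for _ in range(500):
    arm = choose_message(arms, rng)  # action
    noisy = rng.gauss(true_mean_rating[arm.name], 2)
    rating = min(10, max(0, round(noisy)))  # reward, clipped to the 0-10 scale
    arm.update(rating)  # model update
    counts[arm.name] += 1
```

In this simulation the message with the higher simulated ratings ends up shown to most students, while the other message is still shown occasionally. That is the exploration versus exploitation trade-off named in the slide title.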
Dynamically Weighted Randomization
• 50/50 probability of assignment
• 60/40
• 70/30
• … 100/0
• Probability that a student is assigned to a message = probability of that message being the highest rated
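One way to turn this rule into assignment weights is to estimate, by Monte Carlo, how often each message comes out on top under the current posteriors. The sketch below assumes each posterior is a Beta distribution over "probability helpful". The function names and example parameters are illustrative and not taken from the tool.

```python
import random


def prob_highest_rated(posteriors, n_draws=20000, seed=0):
    """Estimate P(message is best) for each message.

    posteriors maps a message name to assumed Beta parameters (alpha, beta).
    """
    rng = random.Random(seed)
    wins = {name: 0 for name in posteriors}
    for _ in range(n_draws):
        draws = {name: rng.betavariate(a, b) for name, (a, b) in posteriors.items()}
        wins[max(draws, key=draws.get)] += 1  # which message won this draw
    return {name: wins[name] / n_draws for name in posteriors}


def assign(probs, rng):
    """Assign one student to a message with probability equal to its weight."""
    names = list(probs)
    return rng.choices(names, weights=[probs[n] for n in names], k=1)[0]


# With no data, both messages share the same prior, so weights are near 50/50.
no_data = prob_highest_rated({"A": (1, 1), "B": (1, 1)})

# After some hypothetical ratings favouring A, the weights shift toward A.
some_data = prob_highest_rated({"A": (8, 4), "B": (6, 6)})

next_message = assign(some_data, random.Random(1))
```

As more ratings accumulate for the better message, its weight moves from 50/50 toward 100/0, matching the progression listed on the slide.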
Instructor 2’s Experiment
[Figure: plot labeled "Probability of Message"]
Observations from Deployment with 3 Instructors
• Directly helping students: “improved the experience
of many of the students by giving them answers that
are more helpful… the earlier ones can help improve
the experience of the later students. That’s pretty
neat”
Student Perceptions?
• Students weren’t surprised by, and appreciated the
approach:
• “I assume companies are always A/B testing on me”
• “now the data I provide can help other people learn”
Limitations & Future Work
• Conduct better experiments
– You can use the hosted tool: www.josephjaywilliams.com/dynamicproblem
github.com/kunanit/adaptive-mooclet-lti
• Go beyond subjective student judgments
• Instructor-researcher collaboration
• Personalization of Elaboration Messages
– MOOClet WebService for dynamic A/B testing
(www.josephjaywilliams.com/mooclet)
Review
• Design goals for instructor experimentation
• DynamicProblem, an end-user tool (Use it at
www.josephjaywilliams.com/dynamicproblem)
• Multi-armed bandits for dynamic experimentation
• Insights from deployment with 3 instructors
• Limitations & Future Work
• PhD students can do education research at U of T Computer Science
• www.josephjaywilliams.com/postdoc
• Learning/Families subcommittee for CHI 2019 (Amy
Ogan & I are SCs)
Thank you!
• National University of Singapore Information
Systems & Analytics
• Harvard VPAL Research Group

CHI (Computer Human Interaction) 2019 enhancing online problems through instructor centered tools for randomized experiments
