SlideShare a Scribd company logo
Learning with PurposeLearning with Purpose
Generating Adequate Distractors for
Multiple-Choice Questions
Authors: Cheng Zhang, Yicheng Sun,
Hejia Chen and Jie Wang
Presenter: Cheng Zhang
University of Massachusetts Lowell, USA
Learning with Purpose
1. Introduction
2. Distractor Generation
3. Evaluations
4. Conclusions
Overview
Learning with Purpose
An approach to automatic generation of adequate distractors for
a given question answer pair (QAP) to form an adequate
multiple-choice question (MCQ).
Combination of part-of-speech tagging, named-entity tagging,
semantic-role labeling, regular expressions, domain knowledge
bases, word embeddings, word edit distance, WordNet, and
other algorithms.
Evaluations by human judges, each MCQ has at least one
adequate distractor and 84% of MCQs have three adequate
distractors
Introduction
Abstract
Learning with Purpose
Methods of generating adequate distractors are typically
following two directions (Pho et al., 2014; Rao and Saha, 2018):
• 1. Domain specific knowledge bases
• 2. Semantic similarity
Previous efforts have focused on finding some forms of
distractors, instead of making them look more distracting.
Introduction
Background
Learning with Purpose
The Generated adequate distractor must satisfy the following
requirements:
• It is an incorrect answer to the question.
• It is grammatically correct.
• It is semantically related to the correct answer.
• It must provide enough distraction.
Introduction
Our Goals
Learning with Purpose
Input:
• Original article
• Answer in QAP
The fixed order of distractor generation for each target word:
1. Subjects,
2. Objects,
3. Adjectives for subjects,
4. Adjectives for objects,
5. Predicates,
6. Adverbs
Distractor Generation
Output:
• Distractors
Learning with Purpose
Three type of target word:
• Type-1: time point, time range,
numerical number, ordinal
number.
• Type-2: person, location,
organization.
• Type-3: others.
Distractor Generation
Learning with Purpose
Distractor candidates for Type-3:
• Semantic similarly words
• Hypernyms
• Antonyms
Filter out unsuitable candidates:
• Distractor candidates that contain the target word.
• Distractor candidates that have the same prefix of the target word
with edit distance less than three.
• E.g. Misspelled: “knowledge” vs “knowladge”
• Different tense: “try” vs “tries”
Distractor Generation
Target word in Type-3
Learning with Purpose
For each distractor candidate 𝑊𝑐 with target word 𝑊𝑡 :
𝑆 𝑣 = Word embedding cosine similarity score.
𝑆 𝑛 = WordNet WUP (Wu and Palmer, 1994) similarity score.
𝑆 𝑑 = Edit distance score.
where E is the edit distance.
Distractor Generation
Ranking Algorithm
Learning with Purpose
R = Ranking score.
if 𝑊𝑐 is an antonym of 𝑊𝑡
otherwise
Note that 𝑆 𝑣, 𝑆 𝑛, 𝑆 𝑑 are each between 0 and 1, and so 𝑅′(𝑊𝑐, 𝑊𝑡)
is between 0 and 1, which implies that log 𝑅′
𝑊𝑐, 𝑊𝑡 > 0 .
Also note that we give more weight to antonyms.
Distractor Generation
Ranking Algorithm
Learning with Purpose
U.S. SAT practice reading tests as a dataset.
Total of 303 distractors for evaluation.
Evaluated by human judgment.
Evaluation result:
• All distractors generated by our method are grammatically correct.
• 98% distractors are relevant to the QAP with distraction.
• 96% distractors provide sufficient distraction.
• 84% MCQs are adequate.
• All MCQs are acceptable (i.e., with at least one adequate distractor).
Evaluation
Learning with Purpose
What did Chie hear? (SAT practice test 1 article 1)
• her soft scuttling footsteps, the creak of the driveway.
• her soft scuttling footsteps, the creak of the stairwell.
• her soft scuttling footsteps, the knock of the door.
• her soft scuttling footsteps, the creak of the door. (Correct answer)
When should ethics apply? (SAT practice test 2 article 2)
• when someone makes an economic request.
• when someone makes an economic proposition.
• when someone makes a political decision.
• when someone makes an economic decision. (Correct answer)
Evaluation
Examples
Learning with Purpose
We presented a novel method using various NLP tools for
generating adequate distractors.
Improve the ranking measure to help select a better distractor
for a target word from a list of candidates.
Explore how to produce generative distractors using neural
networks, instead of just replacing a few target words in a given
answer.
Conclusions
Learning with PurposeLearning with Purpose
Thank you

More Related Content

PPTX
Rajesh babajee from the back of the class to the front
PDF
Questionnaires
PDF
Questionnaire Design - Meaning, Types, Layout and Process of Designing Questi...
PDF
Englsih language investigation
PPT
The Rhetoric of Argument
PPTX
Math-English Continuum - How To Improve Delivery of Algebra Content
PPTX
Questionnaire
PDF
Handout: Cite It Right! Scoring GED RLA Test Extended Responses
Rajesh babajee from the back of the class to the front
Questionnaires
Questionnaire Design - Meaning, Types, Layout and Process of Designing Questi...
Englsih language investigation
The Rhetoric of Argument
Math-English Continuum - How To Improve Delivery of Algebra Content
Questionnaire
Handout: Cite It Right! Scoring GED RLA Test Extended Responses

What's hot (20)

PPTX
True or false
PDF
Marketing Research: Quantitative Research(data - survey)
PPTX
Survey Methodology and Questionnaire Design Theory Part II
PDF
Case study evaluation rubric
PPTX
Sentiment Analysis
DOCX
Liberty university psyc 341 module 1 exam
PDF
Sample mba 3 spring 2015
PDF
How to Create Problem Solving Test Items
PPT
Ml ppt
PPT
Objective Test Guide
ODP
Binary Choice Presentation
PDF
Step Up Your Survey Research - Dawn of the Data Age Lecture Series
DOCX
Mb0050 research methodology
DOCX
Ms 95 - research methodology for management decisions
ODT
Rubric to assess a debate
PPTX
Questionnarie
PDF
Testrocker presentation
PPTX
Preparation of questionnaires
PPTX
Lesson 5 writing a research title
PDF
Selasturkiye Rm Social Surveys
True or false
Marketing Research: Quantitative Research(data - survey)
Survey Methodology and Questionnaire Design Theory Part II
Case study evaluation rubric
Sentiment Analysis
Liberty university psyc 341 module 1 exam
Sample mba 3 spring 2015
How to Create Problem Solving Test Items
Ml ppt
Objective Test Guide
Binary Choice Presentation
Step Up Your Survey Research - Dawn of the Data Age Lecture Series
Mb0050 research methodology
Ms 95 - research methodology for management decisions
Rubric to assess a debate
Questionnarie
Testrocker presentation
Preparation of questionnaires
Lesson 5 writing a research title
Selasturkiye Rm Social Surveys
Ad

Similar to Generating Adequate Distractors for Multiple-Choice Questions (20)

PDF
A Gentle Introduction to Text Analysis :)
PDF
A Gentle Introduction to Text Analysis I
PPTX
Qualitative approaches to learning analytics
PPT
Research design ii
PPT
Exploratory research design
PPT
Research design ii
PPT
Multiple choice tests
PPT
Coding.ppt
PPS
Cue Forum2008
DOCX
Assignment Surveys and R Assignment Surveys and Response R.docx
PPTX
Qualitative research methods kanhaiya sapkota
PDF
II-SDV 2016 Srinivasan Parthiban - KOL Analytics from Biomedical Literature
PPTX
Known knowns & unknown unknowns
PPTX
Semi supervised approach for word sense disambiguation
PPTX
Methodology and research process
DOCX
In this module we learned about who may be included in our research .docx
PPTX
Decoding word association 3 - sentence completion test
PPTX
The Key Challenge in Behavioural Research
PPT
QualitativeAnalysis_W2015.ppt
A Gentle Introduction to Text Analysis :)
A Gentle Introduction to Text Analysis I
Qualitative approaches to learning analytics
Research design ii
Exploratory research design
Research design ii
Multiple choice tests
Coding.ppt
Cue Forum2008
Assignment Surveys and R Assignment Surveys and Response R.docx
Qualitative research methods kanhaiya sapkota
II-SDV 2016 Srinivasan Parthiban - KOL Analytics from Biomedical Literature
Known knowns & unknown unknowns
Semi supervised approach for word sense disambiguation
Methodology and research process
In this module we learned about who may be included in our research .docx
Decoding word association 3 - sentence completion test
The Key Challenge in Behavioural Research
QualitativeAnalysis_W2015.ppt
Ad

Recently uploaded (20)

PDF
Softaken Excel to vCard Converter Software.pdf
PDF
Cost to Outsource Software Development in 2025
PPTX
Oracle E-Business Suite: A Comprehensive Guide for Modern Enterprises
PPTX
Embracing Complexity in Serverless! GOTO Serverless Bengaluru
PPTX
L1 - Introduction to python Backend.pptx
PDF
Nekopoi APK 2025 free lastest update
PPTX
Computer Software and OS of computer science of grade 11.pptx
PPTX
Introduction to Artificial Intelligence
PPTX
CHAPTER 2 - PM Management and IT Context
PDF
Wondershare Filmora 15 Crack With Activation Key [2025
PDF
Digital Strategies for Manufacturing Companies
PPTX
assetexplorer- product-overview - presentation
PDF
Which alternative to Crystal Reports is best for small or large businesses.pdf
PDF
SAP S4 Hana Brochure 3 (PTS SYSTEMS AND SOLUTIONS)
PPTX
Odoo POS Development Services by CandidRoot Solutions
PDF
Navsoft: AI-Powered Business Solutions & Custom Software Development
PDF
Why TechBuilder is the Future of Pickup and Delivery App Development (1).pdf
PDF
Design an Analysis of Algorithms II-SECS-1021-03
PPTX
Transform Your Business with a Software ERP System
PPTX
Operating system designcfffgfgggggggvggggggggg
Softaken Excel to vCard Converter Software.pdf
Cost to Outsource Software Development in 2025
Oracle E-Business Suite: A Comprehensive Guide for Modern Enterprises
Embracing Complexity in Serverless! GOTO Serverless Bengaluru
L1 - Introduction to python Backend.pptx
Nekopoi APK 2025 free lastest update
Computer Software and OS of computer science of grade 11.pptx
Introduction to Artificial Intelligence
CHAPTER 2 - PM Management and IT Context
Wondershare Filmora 15 Crack With Activation Key [2025
Digital Strategies for Manufacturing Companies
assetexplorer- product-overview - presentation
Which alternative to Crystal Reports is best for small or large businesses.pdf
SAP S4 Hana Brochure 3 (PTS SYSTEMS AND SOLUTIONS)
Odoo POS Development Services by CandidRoot Solutions
Navsoft: AI-Powered Business Solutions & Custom Software Development
Why TechBuilder is the Future of Pickup and Delivery App Development (1).pdf
Design an Analysis of Algorithms II-SECS-1021-03
Transform Your Business with a Software ERP System
Operating system designcfffgfgggggggvggggggggg

Generating Adequate Distractors for Multiple-Choice Questions

  • 1. Learning with PurposeLearning with Purpose Generating Adequate Distractors for Multiple-Choice Questions Authors: Cheng Zhang, Yicheng Sun, Hejia Chen and Jie Wang Presenter: Cheng Zhang University of Massachusetts Lowell, USA
  • 2. Learning with Purpose 1. Introduction 2. Distractor Generation 3. Evaluations 4. Conclusions Overview
  • 3. Learning with Purpose An approach to automatic generation of adequate distractors for a given question answer pair (QAP) to form an adequate multiple-choice question (MCQ). Combination of part-of-speech tagging, named-entity tagging, semantic-role labeling, regular expressions, domain knowledge bases, word embeddings, word edit distance, WordNet, and other algorithms. Evaluations by human judges, each MCQ has at least one adequate distractor and 84% of MCQs have three adequate distractors Introduction Abstract
  • 4. Learning with Purpose Methods of generating adequate distractors are typically following two directions (Pho et al., 2014; Rao and Saha, 2018): • 1. Domain specific knowledge bases • 2. Semantic similarity Previous efforts have focused on finding some forms of distractors, instead of making them look more distracting. Introduction Background
  • 5. Learning with Purpose The Generated adequate distractor must satisfy the following requirements: • It is an incorrect answer to the question. • It is grammatically correct. • It is semantically related to the correct answer. • It must provide enough distraction. Introduction Our Goals
  • 6. Learning with Purpose Input: • Original article • Answer in QAP The fixed order of distractor generation for each target word: 1. Subjects, 2. Objects, 3. Adjectives for subjects, 4. Adjectives for objects, 5. Predicates, 6. Adverbs Distractor Generation Output: • Distractors
  • 7. Learning with Purpose Three type of target word: • Type-1: time point, time range, numerical number, ordinal number. • Type-2: person, location, organization. • Type-3: others. Distractor Generation
  • 8. Learning with Purpose Distractor candidates for Type-3: • Semantic similarly words • Hypernyms • Antonyms Filter out unsuitable candidates: • Distractor candidates that contain the target word. • Distractor candidates that have the same prefix of the target word with edit distance less than three. • E.g. Misspelled: “knowledge” vs “knowladge” • Different tense: “try” vs “tries” Distractor Generation Target word in Type-3
  • 9. Learning with Purpose For each distractor candidate 𝑊𝑐 with target word 𝑊𝑡 : 𝑆 𝑣 = Word embedding cosine similarity score. 𝑆 𝑛 = WordNet WUP (Wu and Palmer, 1994) similarity score. 𝑆 𝑑 = Edit distance score. where E is the edit distance. Distractor Generation Ranking Algorithm
  • 10. Learning with Purpose R = Ranking score. if 𝑊𝑐 is an antonym of 𝑊𝑡 otherwise Note that 𝑆 𝑣, 𝑆 𝑛, 𝑆 𝑑 are each between 0 and 1, and so 𝑅′(𝑊𝑐, 𝑊𝑡) is between 0 and 1, which implies that log 𝑅′ 𝑊𝑐, 𝑊𝑡 > 0 . Also note that we give more weight to antonyms. Distractor Generation Ranking Algorithm
  • 11. Learning with Purpose U.S. SAT practice reading tests as a dataset. Total of 303 distractors for evaluation. Evaluated by human judgment. Evaluation result: • All distractors generated by our method are grammatically correct. • 98% distractors are relevant to the QAP with distraction. • 96% distractors provide sufficient distraction. • 84% MCQs are adequate. • All MCQs are acceptable (i.e., with at least one adequate distractor). Evaluation
  • 12. Learning with Purpose What did Chie hear? (SAT practice test 1 article 1) • her soft scuttling footsteps, the creak of the driveway. • her soft scuttling footsteps, the creak of the stairwell. • her soft scuttling footsteps, the knock of the door. • her soft scuttling footsteps, the creak of the door. (Correct answer) When should ethics apply? (SAT practice test 2 article 2) • when someone makes an economic request. • when someone makes an economic proposition. • when someone makes a political decision. • when someone makes an economic decision. (Correct answer) Evaluation Examples
  • 13. Learning with Purpose We presented a novel method using various NLP tools for generating adequate distractors. Improve the ranking measure to help select a better distractor for a target word from a list of candidates. Explore how to produce generative distractors using neural networks, instead of just replacing a few target words in a given answer. Conclusions
  • 14. Learning with PurposeLearning with Purpose Thank you