SlideShare a Scribd company logo
Reading Comprehension Quiz Generation
using Generative Pre-trained Transformers
Ramon Dijkstra, Zülküf Genç, Subhradeep Kayal and Jaap Kamps
The 23th International Conference on Artificial Intelligence in Education (AIED’2022)
Fourth Workshop on Intelligent Textbooks (iTextbooks)
27 July 2022
Agenda
• Background
• Goal
• Demo
• Approach
• Experimental Setup
• Experimental Results
• Analysis
• Revisiting the demo
• Main Takeaways
• Q&A
Background
Quiz Generation
• Question Generation
• Question Answering
• Distractor Generation
Goal
Educational Text → Multiple-choice quiz
Why?
• Enhance intelligent textbooks with assessments
• Students could test themselves during the learning phase
• Teachers could use the tool to generate assessments
Demo
Approach
Large pre-trained transformers have shown superior perfomances on several text
generation tasks.
Generative Pre-trained Transformer 3 (GPT-3) can be finetuned to downstream
tasks using the API of OpenAI:
• Train on prompt-completion pairs
• Give a never-seen before prompt during inference
Approach
Prompt: Educational Text
Completion: Quiz
End-to-End Quiz Generation Template
Question: . . .
True answer: . . .
False answer: . . .
False answer: . . .
False answer: . . .
We will call this finetuned model EduQuiz.
Experimental Setup – quiz generation techniques
Two quiz generation techniques:
• Step-Wise Quiz Generation (SWQG)
• End-to-End Quiz Generation (EEQG)
Experimental Setup – models
Two models:
• GPT-3
• Macaw-11b
Macaw-11b is a general-purpose model trained on various text generation tasks.
Experimental Setup – dataset
EQG-RACE dataset:
• 18.501 train
• 1.035 validation
• 950 test
Processed RACE dataset where only examination questions are kept.
Experimental Setup – automatic evaluation
Metrics:
• BLEU-4: measures the 4-gram similarity between a prediction and ground truth
instances
• ROUGE-L: measures the longest common sub-sequence between the prediction
and ground truth instances
• METEOR: is similar in comparison to BLEU-4 but also takes synonyms, stemming,
and paraphrasing into account
Experimental Results
Macaw-11b did have the configurations to perform EEQG but our experiments
showed that only 14% of the generated quizzes contained all the elements of the
quiz. Therefore, we will not use Macaw-11b for EEQG.
Experimental Results
Original quiz:
Question: What is Edell’s strongest character?
True answer: Liking to try new things.
False answer: Liking to remember things.
False answer: Liking to help others.
False answer: Liking to think of problems differently.
Macaw-11b SWQG generated quiz (BLEU-4 = 0.0):
Question: The writer’s mother told her that if she wanted
to be successful in life, she should be prepared for any
kind of change. What did Edell do to prepare for the
Asian MTV Awards?
True answer: she read the biographies of the guests
False answer: she practiced her English with her friends
False answer: she made a list of the names of the guests
False answer: she made a plan to talk with the guests
GPT-3 SWQG generated quiz (BLEU-4 = 0.0):
Question: What advice does Edell give us?
True answer: Get ready for any opportunity in life.
False answer: Try to live a colorful life.
False answer: Take any chance that comes up.
False answer: Explore your potential talents.
EduQuiz-generated quiz (BLEU-4 = 0.0):
Question: What advice does Edell give to young people?
True answer: Try to get yourself well-prepared in life.
False answer: Have a rich collection of CDs.
False answer: Never miss an opportunity to learn ballet.
False answer: Be a hostess of the Asian MTV Awards.
Analysis - metrics
Analysis - results
Revisiting the demo
Revisiting the demo
Main Takeaways
• Already useful for formative feedback and to increase engagement during the
learning phase
• Currently only limited to English language and reading comprehension texts
• Too early to replace educational professionals
• Current performances require a human-in-the-loop to check the quality
Q&A

More Related Content

PDF
SRE 讀書會 - 導讀:第一章
PPT
Querying the Semantic Web with SPARQL
PPTX
Workshop Trend Micro
PPTX
Azure App Service
PPTX
Edge Computing.pptx
PPTX
Service Oriented Architecture (SOA)
PPTX
DevOps and Continuous Delivery Reference Architectures (including Nexus and o...
PPTX
Microservices Decomposition Patterns
SRE 讀書會 - 導讀:第一章
Querying the Semantic Web with SPARQL
Workshop Trend Micro
Azure App Service
Edge Computing.pptx
Service Oriented Architecture (SOA)
DevOps and Continuous Delivery Reference Architectures (including Nexus and o...
Microservices Decomposition Patterns

What's hot (20)

PDF
Cloud Migration Checklist | Microsoft Azure Migration
PPTX
What is cloud backup?
PDF
MuleSoft Sizing Guidelines - VirtualMuleys
PPTX
Implementing white box testing
PPTX
Oracle Cloud Infrastructure Overview Deck.pptx
PDF
Refactoring 101
PDF
Bounded Context - DDD Europe Foundation Track
PDF
Datacenter migration using vmware
PPT
Introduction to Service Oriented Architecture
PPTX
Cloud Native: what is it? Why?
PDF
Microsoft Azure Active Directory
PPTX
Network Virtualization
PDF
Docker vs VM | | Containerization or Virtualization - The Differences | DevOp...
PDF
Universal React apps in Next.js
PDF
VMware Tanzu Application Service as an Integration Platform
PDF
Ethermint 2.0: An Ethereum Scaling Solution by Cosmos
PPTX
WebRTC presentation
PDF
AWS Concepts - Internship Presentation - week 10
PPTX
PDF
Low code development platform
Cloud Migration Checklist | Microsoft Azure Migration
What is cloud backup?
MuleSoft Sizing Guidelines - VirtualMuleys
Implementing white box testing
Oracle Cloud Infrastructure Overview Deck.pptx
Refactoring 101
Bounded Context - DDD Europe Foundation Track
Datacenter migration using vmware
Introduction to Service Oriented Architecture
Cloud Native: what is it? Why?
Microsoft Azure Active Directory
Network Virtualization
Docker vs VM | | Containerization or Virtualization - The Differences | DevOp...
Universal React apps in Next.js
VMware Tanzu Application Service as an Integration Platform
Ethermint 2.0: An Ethereum Scaling Solution by Cosmos
WebRTC presentation
AWS Concepts - Internship Presentation - week 10
Low code development platform
Ad

Similar to Reading Comprehension Quiz Generation using Generative Pre-trained Transformers (20)

PPTX
Chat GPT and Generative AI in Higher Education - Empowering Educators and Lea...
PPTX
Teaching with ChatGPT-Practical Tips and Strategies
PDF
Intro to LLMs
PDF
How to Use ChatGPT to Generate Content ? - By PrepAI
PDF
ChatGPT: Friend or Foe?
PPTX
ChatGPT and Moodle: An Interesting Mix
PDF
Automatic Question Generation for Evidence-based Online Courseware Engineering
PPTX
Generation of Assessment Questions from Textbooks Enriched with Knowledge Models
PPTX
The updated non-technical introduction to ChatGPT SEDA March 2023.pptx
PDF
Generative Models and ChatGPT
PPTX
AI and ChatGPT in Online Education
PPTX
A study on impact of Chat GPT in Education.pptx
PDF
ITB 2023 - Chatgpt Box! AI All The Things - Scott Steinbeck.pdf
PDF
ITB_2023_Chatgpt_Box_Scott_Steinbeck.pdf
PDF
Exploring ChatGPT For Effective Teaching
PPTX
Exploring ChatGPT for Effective Teaching and Learning.pptx
PDF
PDF
slideshareClasec07255.pdf
PDF
exploringchatgptforeffectiveteachingandlearning-230319174748-fbc07255.pdf
PPTX
exploringchatgptforeffectiveteachingandlearning-230319174748-fbc07255.pptx
Chat GPT and Generative AI in Higher Education - Empowering Educators and Lea...
Teaching with ChatGPT-Practical Tips and Strategies
Intro to LLMs
How to Use ChatGPT to Generate Content ? - By PrepAI
ChatGPT: Friend or Foe?
ChatGPT and Moodle: An Interesting Mix
Automatic Question Generation for Evidence-based Online Courseware Engineering
Generation of Assessment Questions from Textbooks Enriched with Knowledge Models
The updated non-technical introduction to ChatGPT SEDA March 2023.pptx
Generative Models and ChatGPT
AI and ChatGPT in Online Education
A study on impact of Chat GPT in Education.pptx
ITB 2023 - Chatgpt Box! AI All The Things - Scott Steinbeck.pdf
ITB_2023_Chatgpt_Box_Scott_Steinbeck.pdf
Exploring ChatGPT For Effective Teaching
Exploring ChatGPT for Effective Teaching and Learning.pptx
slideshareClasec07255.pdf
exploringchatgptforeffectiveteachingandlearning-230319174748-fbc07255.pdf
exploringchatgptforeffectiveteachingandlearning-230319174748-fbc07255.pptx
Ad

More from Sergey Sosnovsky (20)

PPTX
Harnessing Textbooks for High-Quality Labeled Data: An Approach to Automatic ...
PDF
Toward Eliminating Hallucinations: GPT-based Explanatory AI for Intelligent T...
PDF
Layout- and Activity-based Textbook Modeling for Automatic PDF Textbook Extra...
PPTX
Exploring the Content Ecosystem of the First Open-source Adaptive Tutor and i...
PPTX
Advancing Intelligent Textbooks with Automatically Generated Practice: A Larg...
PPTX
Creating Session Data from eTextbook Event Streams
PDF
Augmenting Digital Textbooks with Reusable Smart Learning Content: Solutions ...
PDF
Interactions of reading and assessment activities
PDF
Parallel Construction: A Parallel Corpus Approach for Automatic Question Gene...
PDF
YAI4Edu: an Explanatory AI to Generate Interactive e-Books for Education
PPTX
Mathematical Language Processing via Tree Embeddings
PPTX
Contextual Definition Generation
PPTX
Transforming Textbooks into Learning by Doing Environments: An Evaluation of ...
PPTX
Using Semantics of Textbook Highlights to Predict Student Comprehension and K...
PPTX
Dental TutorBot: Exploitation of Dental Textbooks for Automated Learning
PDF
What's in a textbook
PPTX
Using Programmed Instruction to Help Students Engage with eTextbook Content
PPTX
Adding Intelligence to a Textbook for Human Anatomy with a Causal Concept Map...
PPTX
Interlingua: Linking Textbooks Across Different Languages
PPTX
Student Modeling with Automatic Knowledge Component Extraction for Adaptive T...
Harnessing Textbooks for High-Quality Labeled Data: An Approach to Automatic ...
Toward Eliminating Hallucinations: GPT-based Explanatory AI for Intelligent T...
Layout- and Activity-based Textbook Modeling for Automatic PDF Textbook Extra...
Exploring the Content Ecosystem of the First Open-source Adaptive Tutor and i...
Advancing Intelligent Textbooks with Automatically Generated Practice: A Larg...
Creating Session Data from eTextbook Event Streams
Augmenting Digital Textbooks with Reusable Smart Learning Content: Solutions ...
Interactions of reading and assessment activities
Parallel Construction: A Parallel Corpus Approach for Automatic Question Gene...
YAI4Edu: an Explanatory AI to Generate Interactive e-Books for Education
Mathematical Language Processing via Tree Embeddings
Contextual Definition Generation
Transforming Textbooks into Learning by Doing Environments: An Evaluation of ...
Using Semantics of Textbook Highlights to Predict Student Comprehension and K...
Dental TutorBot: Exploitation of Dental Textbooks for Automated Learning
What's in a textbook
Using Programmed Instruction to Help Students Engage with eTextbook Content
Adding Intelligence to a Textbook for Human Anatomy with a Causal Concept Map...
Interlingua: Linking Textbooks Across Different Languages
Student Modeling with Automatic Knowledge Component Extraction for Adaptive T...

Recently uploaded (20)

PPTX
Introduction to Cardiovascular system_structure and functions-1
PPTX
ECG_Course_Presentation د.محمد صقران ppt
PDF
Biophysics 2.pdffffffffffffffffffffffffff
PPTX
BIOMOLECULES PPT........................
PPTX
EPIDURAL ANESTHESIA ANATOMY AND PHYSIOLOGY.pptx
PDF
Phytochemical Investigation of Miliusa longipes.pdf
PPTX
GEN. BIO 1 - CELL TYPES & CELL MODIFICATIONS
PDF
Unveiling a 36 billion solar mass black hole at the centre of the Cosmic Hors...
DOCX
Viruses (History, structure and composition, classification, Bacteriophage Re...
PPTX
ognitive-behavioral therapy, mindfulness-based approaches, coping skills trai...
PPTX
Microbiology with diagram medical studies .pptx
PPTX
INTRODUCTION TO EVS | Concept of sustainability
PDF
Mastering Bioreactors and Media Sterilization: A Complete Guide to Sterile Fe...
PDF
VARICELLA VACCINATION: A POTENTIAL STRATEGY FOR PREVENTING MULTIPLE SCLEROSIS
PPTX
TOTAL hIP ARTHROPLASTY Presentation.pptx
PPTX
2Systematics of Living Organisms t-.pptx
PDF
An interstellar mission to test astrophysical black holes
PDF
HPLC-PPT.docx high performance liquid chromatography
PPTX
ANEMIA WITH LEUKOPENIA MDS 07_25.pptx htggtftgt fredrctvg
PDF
AlphaEarth Foundations and the Satellite Embedding dataset
Introduction to Cardiovascular system_structure and functions-1
ECG_Course_Presentation د.محمد صقران ppt
Biophysics 2.pdffffffffffffffffffffffffff
BIOMOLECULES PPT........................
EPIDURAL ANESTHESIA ANATOMY AND PHYSIOLOGY.pptx
Phytochemical Investigation of Miliusa longipes.pdf
GEN. BIO 1 - CELL TYPES & CELL MODIFICATIONS
Unveiling a 36 billion solar mass black hole at the centre of the Cosmic Hors...
Viruses (History, structure and composition, classification, Bacteriophage Re...
ognitive-behavioral therapy, mindfulness-based approaches, coping skills trai...
Microbiology with diagram medical studies .pptx
INTRODUCTION TO EVS | Concept of sustainability
Mastering Bioreactors and Media Sterilization: A Complete Guide to Sterile Fe...
VARICELLA VACCINATION: A POTENTIAL STRATEGY FOR PREVENTING MULTIPLE SCLEROSIS
TOTAL hIP ARTHROPLASTY Presentation.pptx
2Systematics of Living Organisms t-.pptx
An interstellar mission to test astrophysical black holes
HPLC-PPT.docx high performance liquid chromatography
ANEMIA WITH LEUKOPENIA MDS 07_25.pptx htggtftgt fredrctvg
AlphaEarth Foundations and the Satellite Embedding dataset

Reading Comprehension Quiz Generation using Generative Pre-trained Transformers

  • 1. Reading Comprehension Quiz Generation using Generative Pre-trained Transformers Ramon Dijkstra, Zülküf Genç, Subhradeep Kayal and Jaap Kamps The 23th International Conference on Artificial Intelligence in Education (AIED’2022) Fourth Workshop on Intelligent Textbooks (iTextbooks) 27 July 2022
  • 2. Agenda • Background • Goal • Demo • Approach • Experimental Setup • Experimental Results • Analysis • Revisiting the demo • Main Takeaways • Q&A
  • 3. Background Quiz Generation • Question Generation • Question Answering • Distractor Generation
  • 4. Goal Educational Text → Multiple-choice quiz Why? • Enhance intelligent textbooks with assessments • Students could test themselves during the learning phase • Teachers could use the tool to generate assessments
  • 6. Approach Large pre-trained transformers have shown superior perfomances on several text generation tasks. Generative Pre-trained Transformer 3 (GPT-3) can be finetuned to downstream tasks using the API of OpenAI: • Train on prompt-completion pairs • Give a never-seen before prompt during inference
  • 7. Approach Prompt: Educational Text Completion: Quiz End-to-End Quiz Generation Template Question: . . . True answer: . . . False answer: . . . False answer: . . . False answer: . . . We will call this finetuned model EduQuiz.
  • 8. Experimental Setup – quiz generation techniques Two quiz generation techniques: • Step-Wise Quiz Generation (SWQG) • End-to-End Quiz Generation (EEQG)
  • 9. Experimental Setup – models Two models: • GPT-3 • Macaw-11b Macaw-11b is a general-purpose model trained on various text generation tasks.
  • 10. Experimental Setup – dataset EQG-RACE dataset: • 18.501 train • 1.035 validation • 950 test Processed RACE dataset where only examination questions are kept.
  • 11. Experimental Setup – automatic evaluation Metrics: • BLEU-4: measures the 4-gram similarity between a prediction and ground truth instances • ROUGE-L: measures the longest common sub-sequence between the prediction and ground truth instances • METEOR: is similar in comparison to BLEU-4 but also takes synonyms, stemming, and paraphrasing into account
  • 12. Experimental Results Macaw-11b did have the configurations to perform EEQG but our experiments showed that only 14% of the generated quizzes contained all the elements of the quiz. Therefore, we will not use Macaw-11b for EEQG.
  • 13. Experimental Results Original quiz: Question: What is Edell’s strongest character? True answer: Liking to try new things. False answer: Liking to remember things. False answer: Liking to help others. False answer: Liking to think of problems differently. Macaw-11b SWQG generated quiz (BLEU-4 = 0.0): Question: The writer’s mother told her that if she wanted to be successful in life, she should be prepared for any kind of change. What did Edell do to prepare for the Asian MTV Awards? True answer: she read the biographies of the guests False answer: she practiced her English with her friends False answer: she made a list of the names of the guests False answer: she made a plan to talk with the guests GPT-3 SWQG generated quiz (BLEU-4 = 0.0): Question: What advice does Edell give us? True answer: Get ready for any opportunity in life. False answer: Try to live a colorful life. False answer: Take any chance that comes up. False answer: Explore your potential talents. EduQuiz-generated quiz (BLEU-4 = 0.0): Question: What advice does Edell give to young people? True answer: Try to get yourself well-prepared in life. False answer: Have a rich collection of CDs. False answer: Never miss an opportunity to learn ballet. False answer: Be a hostess of the Asian MTV Awards.
  • 18. Main Takeaways • Already useful for formative feedback and to increase engagement during the learning phase • Currently only limited to English language and reading comprehension texts • Too early to replace educational professionals • Current performances require a human-in-the-loop to check the quality
  • 19. Q&A