SlideShare a Scribd company logo
Using corpora to
enhance language
learning
Michael Barlow
Overview
wordlists
collocation lists
online concordancers
text analysis software
concordancers
ParaConc and Collocate
web-based exercises
data-driven learning materials
Wordlists – general and
specialised
Wordlists have been around since before the
invention of computers. General wordlists are
used for curriculum development, textbook
writing etc.
Also possible to produce a word list for a
reading (or a possibly textbook)
Wordlists – general
Use existing wordlists such as West's General
Service List and recent updates. Coxhead's
Academic Wordlist. Kilgarriff's Wordlists
based on the BNC.
Kilgarriff Page
Academic Word List
Academic Word List
Academic Word List
• receptive list (based on morphological
derivations)
• the list excludes words found in non-academic
texts (even if they occur in academic texts)
• do we need subject or genre-specific
wordlists? (Hyland)
Specialised Word List
• Create a wordlist from a corpus (using
concordancer or other utilities)
• May need to create your own corpus –
BootCaT ?? Silvia Bernadini
BootCaT
Vocab Profile
• Tom Cobb's Vocab Profile
• http://www.lextutor.ca/vp/eng/
Collocation lists
• More difficult to find – use Collocation
Dictionary??
• Biber's work on lexical bundles
• Use concordancer or utility to create ngram
lists or locate collocations
• Collocate – shown below
Concordancers
• Online concordancer
Concordancers
Concordancers –
americancorpus.org
Concordancers
• Using a concordancer in the classroom
• Corpus as a reference tool – query the corpus
– can you say “the government are”
– what is the difference between “for
instance” and “for example”
– Tim Johns – Data-driven Learning
• (...caused economic
development...)
Concordancers – text
reconstruction exercises
Data-driven learning
(deductive)
Data-driven learning
(inductive)
Concordance data
• DDL – highlighting/noticing/discovery learning
• Highlight unexpected (for the learner)
distinctions, uses etc.
• Sequence data to build up knowledge
Parallel concordance
data
• Parallel concordance works on translation
corpus
• Students need to have same L1
Concordance data
issues
• KWIC format
• Google effect
• Data overload
• Reauthenticating data
– Sabine Braun – includes discourse
perspective (Why did the speaker use
that form?)
Parallel Corpora – DDL
(CHUJO, Kiyomi)
Parallel Corpora – DDL
(Chujo, Kiyomi)
Collocate
Software to extract collocations/terms
Word search + Span (2 words, 3 words etc.)
n-gram (bigram, trigram) list
Full extract -- collocations in a corpus
Enhancing Language Learning Using Corpora
Search for analysis
(Span = 2)
analysis - frequency
analysis - t-score
analysis - MI
Enhancing Language Learning Using Corpora
Trigram search
Trigram -- by freq
Trigram -- alphabetical
Trigram -- by MI
Using batch mode –
Corpuslab.com
Familiar exercise authoring
Currently offline
Aims
avoid duplication of tasks -- identifying
common collocations in Business English
Provide corpus/analysis resources
Bring corpus resources together with
familiar exercise authoring
Enhancing Language Learning Using Corpora
Student View
Student View
Student View
Student View
Exercise types
Matching
Fill-the-gap
Multiple Choice
Reorder
Categorise
Exercise types
Matching*
Fill-the-gap
Multiple Choice
Reorder
Categorise*
Teacher view
Teacher view
Teacher view
Enhancing Language Learning Using Corpora
Teacher view -
Resources
Resources
Teacher-generated resources
uploaded frequency lists
worksheets
Tracking
Teachers can track their exercises
“Class teachers” track students in their class
Tracking
Report for exercise Cat1
Tracking of student
School view
Register as a school
Create class names
Assign teachers to classes
Track students in classes
School view
School view
Resources
Site resources
corpora and simple concordancer
text analysis utilities
Text analysis utilities
Create frequency lists
Text analysis in terms of frequency bands
Collocational analysis of texts
Enhancing Language Learning Using Corpora
Enhancing Language Learning Using Corpora
Enhancing Language Learning Using Corpora
Corpora
Teacher/Author resource
Sample corpus -- CSPAE
Add other corpora such as MICASE
Create various options for searching that
make use of corpus annotation
Simple searching
Enhancing Language Learning Using Corpora
Enhancing Language Learning Using Corpora
Aims
Create a language learning site
Encourage and facilitate use of corpus data
Matching exercise (up to 5 columns)
Provide access to word lists etc
Provide text analysis tools
Aims
Use traditional exercise types that teachers
are familiar with
Give examples of creative uses of these
standard exercises
Enhancing Language Learning Using Corpora
Thank you

More Related Content

PPT
An Intuitive Natural Language Understanding System
PPTX
Enriching the semantic web tutorial session 1
PPTX
Using the internet for search
PPT
Bridging Formal and Informal Learning for Second Language Writing in FLAX
PPT
Pedagogical applications of corpus data for English for General and Specific ...
PPT
Corpus linguistics in language learning
PDF
Corpus Tools for Language Teaching
PPTX
Corpus linguistics
An Intuitive Natural Language Understanding System
Enriching the semantic web tutorial session 1
Using the internet for search
Bridging Formal and Informal Learning for Second Language Writing in FLAX
Pedagogical applications of corpus data for English for General and Specific ...
Corpus linguistics in language learning
Corpus Tools for Language Teaching
Corpus linguistics

Similar to Enhancing Language Learning Using Corpora (20)

PDF
Corpus linguistics intro
PPT
Concordancing 1
PPTX
Not just for reference: Dictionaries and corpora as language acquisition tools
PPT
The Corpus In The Classroom
PDF
Corpus Linguistics for Language Teaching and Learning
PPTX
2017.09.26 corpus
PDF
2018/06/13 Corpus
PDF
2018/3/07 corpus
PPTX
Corpus Construction & Specialist Vocabulary Learning
PPTX
online references tool
PPT
Concordancing and ESL
PPTX
Using online corpus for literacy teachers
PPTX
Making English Real Anna Gates
PDF
New Trends In Corpora And Language Learning Ana Frankenberggarcia Lynne Flowe...
PPTX
Corpus study design
PPTX
Eccup webinar part 1
PPTX
Concordancer
PPTX
Updated concordancer
PPTX
Two Hot Topics in Online Language Learning: Corpus Linguistics and Telecollab...
PPTX
New Pedagogical Trends in the ENGLISH Classroom
Corpus linguistics intro
Concordancing 1
Not just for reference: Dictionaries and corpora as language acquisition tools
The Corpus In The Classroom
Corpus Linguistics for Language Teaching and Learning
2017.09.26 corpus
2018/06/13 Corpus
2018/3/07 corpus
Corpus Construction & Specialist Vocabulary Learning
online references tool
Concordancing and ESL
Using online corpus for literacy teachers
Making English Real Anna Gates
New Trends In Corpora And Language Learning Ana Frankenberggarcia Lynne Flowe...
Corpus study design
Eccup webinar part 1
Concordancer
Updated concordancer
Two Hot Topics in Online Language Learning: Corpus Linguistics and Telecollab...
New Pedagogical Trends in the ENGLISH Classroom
Ad

Recently uploaded (20)

PDF
O7-L3 Supply Chain Operations - ICLT Program
PDF
Anesthesia in Laparoscopic Surgery in India
PDF
grade 11-chemistry_fetena_net_5883.pdf teacher guide for all student
PDF
RMMM.pdf make it easy to upload and study
PDF
FourierSeries-QuestionsWithAnswers(Part-A).pdf
PDF
Computing-Curriculum for Schools in Ghana
PDF
STATICS OF THE RIGID BODIES Hibbelers.pdf
PPTX
Presentation on HIE in infants and its manifestations
PPTX
PPT- ENG7_QUARTER1_LESSON1_WEEK1. IMAGERY -DESCRIPTIONS pptx.pptx
PDF
01-Introduction-to-Information-Management.pdf
PPTX
master seminar digital applications in india
PDF
Module 4: Burden of Disease Tutorial Slides S2 2025
PDF
Supply Chain Operations Speaking Notes -ICLT Program
PDF
Microbial disease of the cardiovascular and lymphatic systems
PPTX
Final Presentation General Medicine 03-08-2024.pptx
PPTX
1st Inaugural Professorial Lecture held on 19th February 2020 (Governance and...
PPTX
school management -TNTEU- B.Ed., Semester II Unit 1.pptx
PDF
3rd Neelam Sanjeevareddy Memorial Lecture.pdf
PPTX
Pharma ospi slides which help in ospi learning
PPTX
Final Presentation General Medicine 03-08-2024.pptx
O7-L3 Supply Chain Operations - ICLT Program
Anesthesia in Laparoscopic Surgery in India
grade 11-chemistry_fetena_net_5883.pdf teacher guide for all student
RMMM.pdf make it easy to upload and study
FourierSeries-QuestionsWithAnswers(Part-A).pdf
Computing-Curriculum for Schools in Ghana
STATICS OF THE RIGID BODIES Hibbelers.pdf
Presentation on HIE in infants and its manifestations
PPT- ENG7_QUARTER1_LESSON1_WEEK1. IMAGERY -DESCRIPTIONS pptx.pptx
01-Introduction-to-Information-Management.pdf
master seminar digital applications in india
Module 4: Burden of Disease Tutorial Slides S2 2025
Supply Chain Operations Speaking Notes -ICLT Program
Microbial disease of the cardiovascular and lymphatic systems
Final Presentation General Medicine 03-08-2024.pptx
1st Inaugural Professorial Lecture held on 19th February 2020 (Governance and...
school management -TNTEU- B.Ed., Semester II Unit 1.pptx
3rd Neelam Sanjeevareddy Memorial Lecture.pdf
Pharma ospi slides which help in ospi learning
Final Presentation General Medicine 03-08-2024.pptx
Ad

Enhancing Language Learning Using Corpora