How to Use Corpora in
Language Teaching
Brody Bluemel
Department of Applied Linguistics
The Pennsylvania State University

LANGUAGE TEACHING WORKSHOP SERIES
The Pennsylvania State University, February 2014
Sponsored by the Center for Language Acquisition (CLA) and the
Center for Advanced Language Proficiency Education and
Research (CALPER).
Outline
• URL: https://sites.google.com/site/corpusteaching/
• Presentation:
    ◦ What are language corpora?
    ◦ Approaches to using corpora in language teaching
    ◦ Introduction to several available resources
• Collaborate:
    ◦ What ideas do you have for using corpora in your classroom?
• Discussion:
    ◦ Share ideas
What are corpora?
• Leech (1992): “an unexciting phenomenon, a helluva lot of text, stored on a computer”
• Sinclair (1991): “a collection of naturally-occurring language text, chosen to characterize a state or a variety of language”
• Sinclair (2004): “a collection of pieces of language text in electronic form, selected according to external criteria to represent, as far as possible, a language or language variety as a source of data for linguistic research”
• Corpora: systematized sets of texts, typically accessed electronically, that are used for linguistic research and pedagogy.
Types of Corpora
• General vs. Specialized
• Native vs. Learner Corpora
• Monolingual vs. Translation Corpora
    ◦ Parallel Corpora, Comparable Corpora, Equivalent Corpora
• Language Variation Corpora
• Synchronic vs. Diachronic Corpora
• Spoken vs. Written Corpora
Approaches to using corpora in language teaching
• General vs. Specialized Corpora
    ◦ Grammar, lexicon, rhetoric, style, expressions, formulaic speech
    ◦ British National Corpus
    ◦ American National Corpus
    ◦ BYU Corpus Interface
    ◦ MICASE
Approaches to using corpora in language teaching
• Native vs. Learner Corpora
    ◦ Comparison, analysis, error analysis, L1-specific challenges
    ◦ International Corpus of Learner English (ICLE)
    ◦ Extensive List of Multilingual Learner Corpora
Approaches to using corpora in language teaching
• Translation Corpora
• Parallel Corpora
    ◦ Phrasing, conceptualizing complex concepts, reading comprehension
    ◦ www.parallelcorpus.com
    ◦ EU Joint Research Centre
    ◦ E-C Concord
    ◦ www.linguee.com
Approaches to using corpora in language teaching
• Language Variation Corpora
    ◦ Exploration of dialects
    ◦ Phonemica
    ◦ International Corpus of English (ICE)
• Synchronic vs. Diachronic Corpora
    ◦ Language change, modern speech, understanding novels and other texts
• Spoken vs. Written Corpora
    ◦ Genre and use
Online Resources
• Presentation URL: https://sites.google.com/site/corpusteaching/
• Multilingual Corpora:
• Additional Resources:
• Non-English Corpora
• Corpus Tools & Websites
• www.linguee.com
• Extensive list of Online Corpora
• Learner Corpora
• Bookmarks for Corpus-based Linguists
• Athel Corpus Resources
• The corpora list
• CALPER Corpus Tutorial
• One of my favorites:
    ◦ http://dict.bing.com.cn/
Primary Resources
• Books and journals
    ◦ Aijmer (2009): Corpora and Language Teaching
    ◦ Hunston (2002): Corpora in Applied Linguistics
    ◦ McEnery (2006): Corpus-Based Language Studies
    ◦ Sinclair (2004): How to Use Corpora in Language Teaching
    ◦ International Journal of Corpus Linguistics
    ◦ Corpora
Collaborate
• In groups of 3-4, discuss ideas, innovations, and questions you have about applying corpus technology in the classroom.
• Specific questions to consider:
    ◦ Questions or applications of corpora that haven’t been discussed?
    ◦ What challenges do you foresee in applying corpora in teaching?
    ◦ What unique features about YOUR classroom should be considered? (characteristics of the language you teach, student population, etc.)
    ◦ How would this technology benefit you in your teaching?
    ◦ How do you plan to use corpus technology in your classroom?
Discussion
• Share:
    ◦ Ideas and possible applications of corpora generated in your group discussion
    ◦ Any key features or aspects of corpora we haven’t yet considered
• Questions:
    ◦ Any questions regarding using corpora, finding resources, or anything else.
Thank You!
Contact: Brody Bluemel (btb5129@psu.edu)
The Pennsylvania State University
Department of Applied Linguistics

Editor's Notes

  • #2: Dear Colleagues,

    A friendly reminder of tomorrow's language teaching workshop:

    "How to Use Corpus Tools in Language Teaching"
    Wednesday, February 5, 2014
    4:40-5:45 p.m.
    267 Willard

    This workshop offers an overview of how language corpora (collections of authentic textual and/or spoken language samples) can be highly valuable resources for the teaching and learning of second languages. Examples of available corpora in various languages, including a new corpus tool for learning Chinese, will be shown as models.

    Topics to be addressed include:

    The event is free and open to the public. Light refreshments will be provided.

    For further information, please contact mcd15@psu.edu. We hope you will join us!

    This workshop is sponsored by the Center for Language Acquisition (CLA) and the Center for Advanced Language Proficiency Education and Research (CALPER).
  • #3: What is a language corpus?
    How can learners benefit from working with corpus materials?
    What do corpus-based activities and assignments look like?
    How can teachers find and use language corpora in their teaching?
  • #8: Chinese – learning and using the orthographic system (Bluemel, in press; Tsai & Choi, 2005)
    German – learning gender, case, prepositions, and word order (St. John, 2001)
    EFL/ESL – learning articles, prepositions, and aspect (Frankenberg-Garcia, 2005; McEnery & Wilson, 2001)
    Italian – verb tense (Laviosa, 2002)
    Spanish – lexical and semantic analysis and differentiation (Lavid, Hita, & Zamorano-Mansilla, 2010)
  • #10: Source info – Learner: L1, gender, program
    Sample – date, mode, task, genre

    Which numbers matter
    ◦ Number of tokens, types, categories, samples in each category, and words in each sample

    Descriptive adequacy
    ◦ Bigger corpus generally better for low-frequency words, but note Zipf’s Law (1935)
    ◦ 100K words of spontaneous speech enough for descriptive studies of prosody
    ◦ 0.5 million words enough for study of verb-form morphology
    ◦ 0.5-1 million words enough for studies of most syntactic processes and high-frequency vocabulary
    ◦ Reliability of smaller corpus can be empirically tested against larger corpus

    Biber (1990)
    ◦ Measured internal variation of 50 pairs of samples from same texts
    ◦ Samples: 2000-5000 words enough

    Biber (1993)
    ◦ Used multivariate techniques of factor analysis and cluster analysis to study variation
    ◦ Pilot studies necessary to fine-tune structure
    ◦ One million words good for grammatical studies
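
The counts named in the note above (tokens, types, words in each sample) and the Zipf's Law caveat can be tried on a small scale. The following is only a minimal sketch in Python, not a tool from the workshop: the function names `tokenize`, `corpus_stats`, and `rank_frequency` are hypothetical, the tokenizer is deliberately naive (lowercased runs of letters and apostrophes), and the three toy sentences are invented for illustration.

```python
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into word tokens (a naive rule, assumed here)."""
    return re.findall(r"[a-z']+", text.lower())


def corpus_stats(samples):
    """Count tokens, types, and words per sample.

    `samples` is a list of strings, one string per text sample in the corpus.
    """
    freq = Counter()
    words_per_sample = []
    for sample in samples:
        tokens = tokenize(sample)
        words_per_sample.append(len(tokens))
        freq.update(tokens)
    return {
        "tokens": sum(freq.values()),   # running words
        "types": len(freq),             # distinct word forms
        "samples": len(samples),
        "words_per_sample": words_per_sample,
        "freq": freq,
    }


def rank_frequency(freq, top=5):
    """Return (rank, word, count, rank * count) for the most frequent words.

    Zipf's law predicts that rank * count stays roughly constant in a large
    corpus; a toy corpus like the one below is far too small to show this.
    """
    return [
        (rank, word, count, rank * count)
        for rank, (word, count) in enumerate(freq.most_common(top), start=1)
    ]


if __name__ == "__main__":
    # Invented toy samples; a real corpus would be read from files.
    toy = [
        "The cat sat on the mat.",
        "The dog sat on the log.",
        "A cat and a dog met on the mat.",
    ]
    stats = corpus_stats(toy)
    print(stats["tokens"], "tokens,", stats["types"], "types")
    for row in rank_frequency(stats["freq"]):
        print(row)
```

Running the same functions on a small sample and on a larger corpus, then comparing the resulting frequency lists, is one simple way to approach the reliability check mentioned in the note. Most words in any corpus occur only a handful of times, which is why low-frequency vocabulary calls for much larger corpora.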