Professional Documents
Culture Documents
Learning
J. Scott Payne
Penn State University
jspayne@psu.edu
Corpus-Based Approaches to L2
Instruction
Traditional Paradigm:
present > practice > produce
Data-Driven Learning:
observe > hypothesize > experiment
What is a corpus?
"A corpus is a body of text assembled according
to explicit design criteria for a specific purpose"
(Atkins & Clear, 1992: p.5).
John Sinclair (1991) on the development of the
field of Corpus Linguistics:
"Thirty years ago when this research started it was
considered impossible to process texts of several million
words in length. Twenty years ago it was considered
marginally possible but lunatic. Ten years ago it was
considered quite possible, but still lunatic. Today it is
very popular."
Comparing Genre
Qualitative techniques:
Concordances or keyword-in-context
Quantitative techniques:
Frequencies analysis of individual words and
collocations
Lexical diversity
Lexical density
What is Data-driven Learning
(DDL)?
Application of tools (concordancers) and
techniques from corpus linguistics in the service
of language learning.
Concordances as a tool for developing
instructional exercises.
Places “raw” linguistic primary source material in
the hands of learners - learners as “researchers”.
Learners have the opportunity to discover
language rules by themselves.
DDL Examples
Writing instruction.
Cobb 2004 used Lextutor to help students correct their own
writing errors. Only 8% of students indicated that the
concordance had helped them.
Corpus Tool for Data-Driven
Learning
KWICionary - http://conic.la.psu.edu/kwic/
OCAT - http://conic.la.psu.edu/ocat/
Activity