Links to Corpora and Corpus Resources
Here are a few links to online-searchable corpora, useful corpus tools and corpus projects that might be of interest.
The Michigan Corpus of Academic Spoken English; roughly 1.8 million words of academic spoken English
search MICASE
A simple search and browse interface to the Michigan Corpus of Upper-level Student Papers; 829 A-graded student papers from 16 disciplines; roughly 2.6 million words
search MICUSP
The British National Corpus; roughly 100 million words of spoken and written British English
search BNC
Brigham Young University interface to the British National Corpus; roughly 100 million words of spoken and written British English
search BYU-BNC
Brigham Young University Corpus of Contemporary American English; currently over 385 million words of spoken and written American English (20 million words are added each year)
search COCA
Brigham Young University TIME Magazine corpus of American English; roughly 100 million words; 275,000 articles from TIME Magazine (1923-2006)
search TIME corpus
free software package for corpus analysis, developed by Laurence Anthony
AntConc homepage and download information
software package for corpus analysis, developed by Mike Scott (license required)
WordSmith Tools homepage and purchase information
software package for corpus analysis, developed by Michael Barlow (license required)
MonoConc Pro homepage and purchase information
free software for n-gram and phrase-frame extraction from corpora, developed by William H. Fletcher
kfNgram homepage and download information
software for n-gram and collocation extraction from corpora, developed by Michael Barlow (license required)
Collocate homepage and purchase information
The British Academic Spoken English corpus; roughly 1.6 million words of academic spoken British English
BASE project website
The British Academic Written English corpus; roughly 6.5 million words of academic writing by British university students
BAWE project website
The International Corpus of Learner English; roughly 3 million words of written learner English (21 different L1 backgrounds)
ICLE project website
See David Lee’s Bookmarks for Corpus-based Linguists
If you are a researcher or visiting scholar at the University of Michigan and would like to learn more about corpus analysis and share findings from your own corpus-based research, you may be interested in joining our Corpus Analysis Group.
The MCL team frequently provides introductions to corpus analysis and training in the use of corpus tools, both for scholars visiting the ELI and in writing classes offered at the English Language Institute. Researchers can benefit from these resources through the ELI Visiting Scholar Programs.
This is a project to explore the factors involved in the measurement of repeated word sequences in language sampled from a range of corpora.
Using computational corpus analysis and experimental data, this project aims to produce an extensive inventory of English Verb Argument constructions and to quantify aspects related to the frequency, semantic coherence and speaker accessibility of verbs in constructions.
The projects on this page provide examples of the kinds of research carried out by the researchers in the MCL team relating to a broad range of issues in academic discourse analysis and corpus and applied linguistics.
On these pages you will find information about conferences and colloquia organized by and involving members of the MCL team.