Links to Corpora and Corpus Resources

Online Corpora

MICASE online

The Michigan Corpus of Academic Spoken English; roughly 1.8 million words of academic spoken English
search MICASE

MICUSP Simple

A simple search and browse interface to the Michigan Corpus of Upper-level Student Papers; 829 A-graded student papers from 16 disciplines; roughly 2.6 million words
search MICUSP

BNC online (simple search)

The British National Corpus; roughly 100 million words of spoken and written British English
search BNC

BNC online (BYU interface)

Brigham Young University interface to the British National Corpus; roughly 100 million words of spoken and written British English
search BYU-BNC

COCA online

Brigham Young University Corpus of Contemporary American English; currently over 385 million words of spoken and written American English (20 million words are added each year)
search COCA

TIME corpus

Brigham Young University TIME Magazine corpus of American English; roughly 100 million words; 275,000 article from TIME Magazine (1923-2006)
search TIME corpus

Corpus Tools

AntConc

free software package for corpus analysis, developed by Laurence Anthony
AntConc homepage and download information

WordSmith Tools

software package for corpus analysis, developed by Mike Scott (license required)
WordSmith Tools homepage and purchase information

MonoConc Pro

software package for corpus analysis, developed by Michael Barlow (license required)
MonoConc Pro homepage and purchase information

kfNgram

free software for n-gram and phrase-frame extraction from corpora, developed by William H. Fletcher
kfNgram homepage and download information

Collocate

software for n-gram and collocation extraction from corpora, developed by Michael Barlow (license required)
Collocate homepage and purchase information

Corpus Projects

BASE

The British Academic Spoken English corpus; roughly 1.6 million words of academic spoken British English
BASE project website

BAWE

The British Academic Written English corpus; roughly 6.5 million words of academic writing by British university students
BAWE project website

ICLE

The International Corpus of Learner English; roughly 3 million words of written learner English (21 different L1 backgrounds)
ICLE project website

For more relevant projects and resources…

See David Lee’s Bookmarks for Corpus-based Linguists

Contact / About Us