Ongoing Research of MCL Team Members
These projects illustrate the range of research being carried out by members of the MCL team. Many of the projects have particular relevance to issues of pedagogy, theories of language learning and tools and resources for language teaching.
This project examines the forms, functions and textual distribution of repeated word combinations in advanced student academic writing on different proficiency levels and across academic disciplines. It makes use of the newly compiled Michigan Corpus of Upper-Level Student Papers (MICUSP) to explore the distribution of phrases such as ‘the fact that’ and ‘it is clear’ across different academic disciplines and also the places within a text they tend to occur, e.g. beginning, middle or end of sentences and paragraphs.
Related publications: coming soon
Building on recent work that has applied the theory and methods of complex systems research to the study of language structure, acquisition and change (see LaCAS Conference page) we are investigating the distribution and behavior of formulaic language across a range of text-types. Scale-free or Zipfian distributions are one of the hallmarks of a complex system, found in some surprisingly different domains such as the number of in- and out-going links on web pages, the population size of major cities in the USA and protein interactions in yeast cells! Zipf’s original observation of the distribution was in the context of word frequency lists of texts and subsequent analysis shows how it holds for morphological, syntactic and semantic features of language. In this project we are are looking at recurrent sequences extracted from corpora (see Formulaic Language Project) to discover whether they follow a Zipfian distribution in terms of:
Related publications: coming soon
This project focuses on the identification and examination of meaningful units in a corpus of academic book reviews. It devises a new analytical model that leads to a profile of the central phraseological items in the selected text type.
Related publications:
Drawing on Michael Hoey’s corpus-driven theory of Lexical Priming and specifically his claims regarding the associations that exist between lexical items (words and phrases) and text linguistic features (cohesion, text structure and clause relations), the Textual Priming Project at Liverpool University, UK carried out work on a corpus of newspaper text to test out the claim that:
‘Every word is primed to occur in, or avoid, certain positions within the discourse; these are its textual colligations’ (Hoey 2005).
We found a surprising number of words and clusters that exhibited particularly preferences for text-initial and paragraph-initial positions within newspaper text.
The next stage aims to extend the analysis and methodological framework to look at textual colligation in academic writing.
Related publications:
In this project, we address a prominent topic in writing research and teaching: attended vs. unattended this. We revisit the variable realization of the demonstrative pronoun this attended by a noun or noun phrase, as in This behavior may also be due to the materials non-linearity, or standing alone, as in This may have implications for instructors who want students to produce academic text (examples taken from MICUSP). Our analyses of attended and unattended this in MICUSP show how a corpus approach can uncover aspects of the distribution, function and use of this language feature in apprentice academic writing across disciplines.
Related publications:
In this project we investigate citation practices and phraseological items in MICUSP and compare their use across academic disciplines and student levels.
Related publications: coming soon
Nick Ellis has been awarded NSF funding to research adult language acquisition in terms of the cognitive psychology of learning and transfer. This research bridges the psychologies of learning and development, cognitive science, linguistics, language acquisition, and education. Our interdisciplinary team here at the ELI and at Penn State University brings together psychology, applied linguistics, and second language acquisition and instruction to adopt a variety of triangulating methods: laboratory learning of temporal reference in a small subset of Latin under experimental conditions, eye-movement studies of attention in reading second language Spanish, analyses of development in regular university Spanish foreign language courses, training studies, and the development of classroom/language-lab based foreign language instructional interventions.
Related publications: coming soon
This project investigates the naturalistic second language acquisition of English verb-argument constructions (VACs: VL verb locative, VOL verb object locative, VOO ditransitive) with particular reference to: (1) Construction learning as concept learning following the general cognitive and associative processes of the induction of categories from experience of exemplars in usage obtained through interaction with conversation partners; (2) The empirical analysis of naturalistic usage by means of corpus linguistic descriptions of native speakers and non-native speech and of longitudinal acquisition (the interlanguage of second language learners); (3) The effects of the frequency and Zipfian type/token frequency distribution of exemplars within the Verb and other islands of the construction archipelago (e.g. [Subj V Obj Oblpath/loc]), by their prototypicality, their generic coverage, and their contingency of form-function mapping, and (4) Computational (Emergent connectionist) models of these various factors as they play out in the emergence of constructions as generalized linguistic schema.
Related publications: coming soon
If you are an instructor at the University of Michigan and would like to learn more about corpus analysis and how to use corpora with your students, you may be interested in joining our Corpus Analysis Group.
The MCL team frequently provide introductions to corpus analysis and training in the use of corpus tools for scholars visiting the ELI and also in writing classes offered in the English Language Institute.
This is a project to explore the factors involved in the measurement of repeated word sequences in language sampled from a range of corpora.
These projects illustrate the range of research being carried out by members of the MCL team.
This page lists a number of useful corpus tools and corpora you can search online to retrieve examples of real spoken and written English.
On these pages you will find information about conferences members of the MCL team presented at or helped organize.