Course contents
In this course we learn how to conduct corpus-based linguistic analysis, using methodologies and tools developed within corpus linguistics and computational linguistics. The student will be asked to set up and carry out an in-depth corpus-based analysis of a chosen linguistic phenomenon.
1. Types of corpora.
2. What counts as an annotated corpus. Annotation and mark-up.
3. Corpus query systems and methods.
4. Regular expressions.
5. Corpora for language learning and second language teaching.
6. Examples of linguistic analyses based on empirical evidence.
Recommended or required readings
Introduction
Lenci A., Montemagni S., Pirrelli V. 2005. Testo e Computer, Roma: Carocci, Ch. 1: “I dati della lingua”.
Sinclair J. 2005. “Corpus and Text - Basic Principles”. In Wynne M. (ed.) Developing Linguistic Corpora: a Guide to Good Practice, Oxford: Oxbow Books: 1-16.
Annotation and mark-up
Lenci A., Montemagni S., Pirrelli V. 2005. Testo e Computer, Roma: Carocci, Ch. 8: “L’annotazione linguistica del testo”.
Leech G. 2005. “Adding Linguistic Annotation”. In Wynne M. (ed.) Developing Linguistic Corpora: a Guide to Good Practice, Oxford: Oxbow Books: 17-29.
Ide N. 2004. "Preparation and Analysis of Linguistic Corpora". In Schreibman S., Siemens R., Unsworth J. (eds.) A Companion to Digital Humanities, Oxford: Blackwell.
http://www.digitalhumanities.org/companion/
Corpus Query Tools
Kilgarriff A., Rychlý P., Smrž P., Tugwell D. 2004. "The Sketch Engine". In Williams G. and S. Vessier (eds.), Proceedings of the XI Euralex International Congress, July 6-10, 2004, Lorient, France, 105-111.
Regular expressions
Thomas, J. 2016. "Corpus Query Language in Details". In Discovering English with the Sketch Engine: A corpus-based approach to Language Exploration, Versatile Publisher, pp. 139-156.
Corpora for language learning and teaching
Baisa, V. and V. Suchomel. 2014. "SkELL Web Interface for English Language Learning". In A. Horák and P. Rychlý (eds.) Proceedings of Recent Advances in Slavonic Natural Language Processing, 63–70.
Available on ResearchGate.
Granger, S. 2012. "Learner corpora". In Chapelle, C.A. (ed.) The Encyclopedia of Applied Linguistics. Oxford, UK: Wiley-Blackwell, 3235–3242.
Available on ResearchGate.