Learning outcomes
The course introduces the basic concepts, methodologies and tools of the discipline, providing students with skills to manage and automatically process linguistic data
Course contents
After a brief historical introduction, the course is divided into two sections: (a) presentation of different types of language resources and methods for extracting information out of them, and (b) automatic production of linguistic metadata at both morphological and syntactic level (Natural Language Processing)
Reccomended or required readings
AA.VV. 1962. Le due culture. Almanacco Letterario 1962, pp. 143-144 & 313-318.
Abeillé Anne. 2003. Introduction. In Treebanks. Building and Using Parsed Corpora, Dordrecht (The Netherlands), Kluwer Academic Publisher, pp. xiii-xxvi.
Busa Roberto. 1962. L'Analisi linguistica nell'evoluzione mondiale dei mezzi d'informazione. Almanacco Letterario 1962, pp. 103-108.
Hutchins John. 1996. ALPAC: the (in)famous report. MT News International, no. 14, June 1996, pp. 9-12. http://www.hutchinsweb.me.uk/MTNI-14-1996.pdf
Mikheev Andrei. 2005. Text Segmentation. In Mitkov, Ruslan. The Oxford handbook of computational linguistics. Oxford University Press, pp. 201-218.
Snow, Charles Percy. 1959. The Two Cultures. In Snow, Charles Percy, The Two Cultures and the Scientific Revolution. The Rede Lecture. New York: Cambridge University Press, pp. 1-22.
http://www.jstor.org/stable/pdf/1578601.pdf
Trost Harald. 2005. Morphology. In Mitkov, Ruslan. The Oxford handbook of computational linguistics. Oxford University Press, pp. 25-47.
Voutilainen Atro. 2005. Part-of-Speech tagging. In Mitkov, Ruslan. The Oxford handbook of computational linguistics. Oxford University Press, pp. 219-232.
Wynne, M (editor). 2005. Developing Linguistic Corpora: a Guide to Good Practice. Oxford: Oxbow Books. Available online from http://www.ahds.ac.uk/creating/guides/linguistic-corpora/
Capitoli 1, 2 e 3