Studying at the University of Verona
Here you can find information on the organisational aspects of the Programme, lecture timetables, learning activities and useful contact details for your time at the University, from enrolment to graduation.
Study Plan
This information is intended exclusively for students already enrolled in this course.If you are a new student interested in enrolling, you can find information about the course of study on the course page:
Laurea in Lingue e letterature per l'editoria e i media digitali - Enrollment from 2025/2026The Study Plan includes all modules, teaching and learning activities that each student will need to undertake during their time at the University.
Please select your Study Plan based on your enrollment year.
1° Year
Modules | Credits | TAF | SSD |
---|
1st foreign language
2nd foreign language
1st foreign literature
2nd foreign literature
2° Year activated in the A.Y. 2020/2021
Modules | Credits | TAF | SSD |
---|
1st foreign language
2nd foreign language
1st foreign literature
2nd foreign literature
Philology related to 1st or 2nd foreign language
3° Year activated in the A.Y. 2021/2022
Modules | Credits | TAF | SSD |
---|
1st foreign language
2nd foreign language
Modules | Credits | TAF | SSD |
---|
1st foreign language
2nd foreign language
1st foreign literature
2nd foreign literature
Modules | Credits | TAF | SSD |
---|
1st foreign language
2nd foreign language
1st foreign literature
2nd foreign literature
Philology related to 1st or 2nd foreign language
Modules | Credits | TAF | SSD |
---|
1st foreign language
2nd foreign language
Legend | Type of training activity (TTA)
TAF (Type of Educational Activity) All courses and activities are classified into different types of educational activities, indicated by a letter.
Linguistics and Computer Science (2020/2021)
Teaching code
4S008105
Teacher
Coordinator
Credits
6
Language
Italian
Scientific Disciplinary Sector (SSD)
L-LIN/01 - HISTORICAL AND GENERAL LINGUISTICS
Period
II semestre (Lingue e letterature straniere) dal Feb 15, 2021 al May 29, 2021.
Learning outcomes
The course illustrates concepts of linguistics combined with those of computer science. It will introduce the students to the basics of the science of language, of the different dimensions of natural languages and of the CA (Computer-Aided) tools relevant to the representation and to the treatment of data. At the end of the course, the student: - will acquire basic knowledge about natural language, CA methods and tools for the linguistic analysis; - will apply the knowledge using some tools during class for the representation and the treatment of linguistic data; - will start developing critical thinking about theories, methods and tools; - will be able to communicate and interact more effectively with technicians and experts in multimedia and IT processes; - will be able to find out more autonomously further bibliography, data and tools.
Program
The course is split into two main sections, roughly equivalent in size, each with the aim of presenting Computational Linguistics starting from one of its two extremes: the first section will focus on text, its representation and the formalisation of linguistical data, while the second section will introduce mathematical/statistical concepts and methods for the analysis of language (mainly intended as written text).
After a general introduction about the objectives of Computational Linguistics and a sketch of the discipline's history, the first part will regard the following topics:
- linguistic resources: corpora
- text encoding and standardisation: Unicode & UTF8
- linguistic units and annotational layers
- from sentences to trees: Universal Dependencies as an example of morphosyntactic annotation
The second part will deal with the following topics:
- statistical concepts related to the linguistic phenomenon: Zipf's law and word graphs
- automated analysis and instruments: Markov chains, entropy...
- mentions to different possibilities of automatic analysis: morphological analysis, Named Entity Recognition, Sentiment Analysis...
- non-projectivity as a particular case study for parsing (i.e. automated syntactic analysis)
- evaluation cycles and methods
For what concerns the second part in particular, the course has among its goals to back some of the theory with practical sessions involving the use of methods and programs from the own terminal. To this end, further directions will be given at the start of the course.
The course might be subjected to slight variations, based on previous knowledge by part of the students, on the possible need to tackle some topics more in-depth, and on the management of technical details.
Author | Title | Publishing house | Year | ISBN | Notes |
---|---|---|---|---|---|
Chris Manning, Hinrich Schütze | Foundations of Statistical Natural Language Processing | MIT Press | 1999 | 9780262133609 | Sito di riferimento: https://nlp.stanford.edu/fsnlp/promo/ Sito della casa editrice: https://mitpress.mit.edu/books/foundations-statistical-natural-language-processing |
Gaetano Berruto, Massimo Cerruti | La linguistica. Un corso introduttivo | UTET Università | 2017 | 9788860084835 | Più approfondito ed esaustivo rispetto alla Lingusitica elementare del De Mauro. |
Maria Freddi | Linguistica dei corpora | Carocci | 2014 | 9788843071166 | Introduzione ai corpus per la linguistica, con spunti pratici |
Tullio De Mauro | Linguistica elementare | Laterza | 2003 | 9788842069775 | Panoramica delle nozioni di base della linguistica esposti sinteticamente, con esercizi, mappe e tabelle |
Jacob Perkins | Python 3 Text Processing with NLTK 3 Cookbook | Packt | 2014 | 9781782167853 | Prontuario estremamente pratico per molte operazioni comuni e non solo di linguistica computazionale ed elaborazione del linguaggio naturale. Sito della casa editrice: https://www.packtpub.com/product/python-3-text-processing-with-nltk-3-cookbook/9781782167853 |
Jan Goyvaerts, Steven Levithan | Regular Expressions Cookbook (Edizione 2) | O'Reilly Media | 2012 | 9781449319434 | Manuale che tratta la maggior parte delle implementazioni di espressioni regolari e fornisce una ricca casistica. Dai gestori del sito Regular-Expressions.info |
Dan Jurafsky, James H. Martin | Speech and Language Processing | Prentice Hall | 2008 | 9780131873216 | Completamente disponibile nell'ultima versione sul sito: https://web.stanford.edu/~jurafsky/slp3/ |
Alessandro Lenci, Simonetta Montemagni, Vito Pirrelli | Testo e computer: elementi di linguistica computazionale | Carocci Aulamagna | 2016 | 9788843083060 | Testo di riferimento principale |
Daniel Zeman | The world of tokens, tags and trees | Ústav formální a aplikované lingvistiky (ÚFAL) | 2018 | 9788088132097 | Panoramica generale sull'annotazione morfosintattica di un corpus, con presentazione e motivazione del formalismo delle Universal Dependencies. Estratto scaricabile dalla pagina ufficiale https://ufal.mff.cuni.cz/books/2018-zeman |
Examination Methods
The exam will consist of two phases: a written composition of the student's choice but agreed upon by the professor, and an oral interrogation, both equivalent with regard to the final grade.
The form of the elaborate is not fixed, but it will need to focus on or take inspiration from the topics dealt with during the course and should be i nthe format of a 4-10-page article. It will be posssible to touch upon more theoretical or more practical aspects. Some ideas:
- create a small corpus and comment how automated linguistic analysis works for it
- use Python to write a script for tokenisation, parser evaluation, or similar
- develop and discuss an annotation strategy following the Universal Dependencies formalism for a language at your choosing (preferably NOT English nor Italian), and possibly confront it with already existing annotation styles
Among the main evaluation criteria, formal correctness will come before originality of the topic.
The oral interrogation is expected to last around half an hour (duration might vary according to the number of students taking the exam) and, starting from the composition, it will touch upon any of the course's topics, possibly comprehending reasoning questions.