Studying at the University of Verona
Here you can find information on the organisational aspects of the Programme, lecture timetables, learning activities and useful contact details for your time at the University, from enrolment to graduation.
Study Plan
The Study Plan includes all modules, teaching and learning activities that each student will need to undertake during their time at the University.
Please select your Study Plan based on your enrollment year.
1° Year
| Modules | Credits | TAF | SSD |
|---|
2° Year It will be activated in the A.Y. 2026/2027
| Modules | Credits | TAF | SSD |
|---|
| Modules | Credits | TAF | SSD |
|---|
| Modules | Credits | TAF | SSD |
|---|
2 modules among:
- 1st year - Knowledge representation, Natural Language Processing, HCI - Multimodal Systems - delivered in 2025/2026
- 2nd year - AI & cloud - delivered in 2026/2027
- 1st and 2nd year - Advanced programming for AI, Computer vision & deep learning - delivered in 2025/2026 and in 2026/2027
2 courses among (mutually exclusive with the previous ones):
- 1st year - Knowledge representation, Natural language processing, HCI - multimodal systems - delivered in 2025/2026
- 2nd year - AI & cloud, Visual intelligence - delivered in 2026/2027
- 1st and 2nd year - Advanced programming for AI, Computer Vision & deep learning, Statistical learning - delivered in 2025/2026 and in 2026/2027 2 courses among the following
- A.A. 2025/2026 Network Science not activated
- A.A. 2026/2027: Complex Systems not activated1 course among the followingLegend | Type of training activity (TTA)
TAF (Type of Educational Activity) All courses and activities are classified into different types of educational activities, indicated by a letter.
Planning & Reinforcement Learning (2025/2026)
Teaching code
4S014024
Credits
12
Coordinator
Language
English
Also offered in courses:
- Artificial intelligence of the course Master's degree in Computer Science and Engineering
- Artificial intelligence of the course Master's degree in Computer Science and Engineering
Courses Single
AuthorizedThe teaching is organized as follows:
Learning objectives
The course is composed of two modules: Planning and Reinforcement Learning. The Planning module presents a selection of symbolic methods for planning. The students learn how to design, apply, and evaluate algorithms, procedures, and strategies for state space search and planning problems in different scenarios. At the end of the course the students know how to choose the most appropriate solution techniques for different
problems, and will be prepared to continue their studies in Artificial
Intelligence. The Reinforcement learning module introduces students to reinforcement learning and planning under uncertainty. In particular, it is focused on the design of algorithms that enable machines to learn based on reinforcements, hence from partial, implicit and delayed feedback obtained by repeatedly interact with the environment or users. At the end of the course, students will have to demonstrate that they have acquired the ability to i) tackle sequential decision problems with reinforcement learning techniques, ii) identify and apply the most effective and efficient algorithms to solve specific sequential decision problems, iii) designing new reinforcement learning algorithms. In particular, the acquired knowledge concerns advanced techniques for the resolution of Markov Decision Process (eg, research with Monte Carlo methods), bandit problems, model-based and model-free reinforcement learning, Bayesian reinforcement learning, deep reinforcement learning (DQN, Reinforce, A2C, PPO, DDPG, SAC), and advanced reinforcement learning techniques (safe policy improvement, partially observable environments, hierarchical reinforcement learning, imitation-based learning, inverse reinforcement learning, and meta-learning).
Prerequisites and basic notions
Being a first year, first/second semester exam, there are no specific prerequisites other than those required for access to the degree course.
Bibliography
Criteria for the composition of the final grade
The final grade is represented by the arithmetic average of the grades of the two parts (Planning/RL) of the course.
