Machine learning for data science
Scientific Disciplinary Sector (SSD)
ING-INF/05 - INFORMATION PROCESSING SYSTEMS
Language of instruction
Second semester, from Mar 7, 2022 to Jun 10, 2022.
The course aims to provide the basic tools for machine learning, together with specific techniques for dealing with large amounts of data, such as deep learning. Theory and techniques will be geared specifically toward data science problems, with particular emphasis on data analysis.
At the end of the course, students must demonstrate that they have acquired the following skills:
● knowledge of the main types of data (e.g., binary data, text, audio, etc.)
● understanding of, and ability to use, the basic elements of descriptive statistics, elementary probability, and linear algebra, with elements of optimization and regularization
● knowledge of basic machine learning techniques (e.g., support vector machines, random forests, etc.)
● knowledge of basic deep learning techniques (e.g., convolutional neural networks, long short-term memory (LSTM) networks, etc.)
● knowledge of the basics of Natural Language Processing, applied for example to sentiment analysis
● knowledge of the basic issues in performance measurement and of regression measures, e.g., RMSE (Root Mean Square Error), MAE (Mean Absolute Error), R-squared and adjusted R-squared
● knowledge of the basic evaluation tools in supervised training, e.g., confusion matrix, accuracy, precision, recall, F1, precision-recall curve, ROC, average precision, CMC; for NLP: BLEU, SPICE
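
As an illustration of the regression and classification measures listed above, the following is a minimal sketch in plain Python, not part of the official course material. The function names `regression_metrics` and `classification_metrics` and the parameter `n_features` are hypothetical and chosen only for this example; it assumes non-constant targets and more samples than features plus one.

```python
import math


def regression_metrics(y_true, y_pred, n_features):
    """Return RMSE, MAE, R-squared and adjusted R-squared.

    Assumes len(y_true) > n_features + 1 and that y_true is not constant
    (otherwise the denominators below would be zero).
    """
    n = len(y_true)
    errors = [t - p for t, p in zip(y_true, y_pred)]
    ss_res = sum(e * e for e in errors)            # residual sum of squares
    mean_true = sum(y_true) / n
    ss_tot = sum((t - mean_true) ** 2 for t in y_true)  # total sum of squares
    r2 = 1 - ss_res / ss_tot
    return {
        "rmse": math.sqrt(ss_res / n),
        "mae": sum(abs(e) for e in errors) / n,
        "r2": r2,
        # Penalizes R-squared for the number of predictors used.
        "adj_r2": 1 - (1 - r2) * (n - 1) / (n - n_features - 1),
    }


def classification_metrics(y_true, y_pred, positive=1):
    """Return binary confusion-matrix counts and the measures derived from them."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    tn = len(pairs) - tp - fp - fn
    # Convention used here: a measure with an empty denominator is reported as 0.0.
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if precision + recall else 0.0)
    return {
        "tp": tp, "fp": fp, "fn": fn, "tn": tn,
        "accuracy": (tp + tn) / len(pairs),
        "precision": precision,
        "recall": recall,
        "f1": f1,
    }
```

For example, with targets [3, 5, 7, 9], predictions [2, 5, 8, 9] and one feature, `regression_metrics` gives an MAE of 0.5 and an R-squared of about 0.9. In practice a library such as scikit-learn would normally be used instead of hand-written functions.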