342SM-1 - Mod A: Probabilistic Machine Learning 2024
-
This is the Moodle page of the Probabilistic Machine Learning course of the Master's Program in Data Science and Artificial Intelligence.
To access the videos, first check whether you are already a member of the course team (likely not); otherwise, register for it using the code: p302pen
Here you will find the material of the course, including slides, videos, textbooks, the lab assignments, and the instructions for the final project.
Lectures will be held in person. Please attend them.
Recordings of the lectures will be made available, but I strongly discourage relying on them if you have not attended the lecture, as (a) recording may fail and (b) recordings are less effective than attending.
If you do not attend a lecture, use the recorded videos available on Moodle for the topics that have them. Unfortunately, due to issues with a migration, a significant portion of these videos has been lost.
Lectures will start on Monday, March 3rd, 2025.
Lecture timetable
Monday 12-14 - classroom 0B, H3 building
Tuesday 15-17 - classroom F, G building
Wednesday 09-11 - classroom A, C7 building
Notebooks of the labs will be available at a GitHub page [stay tuned]
-
Please write here your questions on theory and exercises, so that everybody can access the answers. Only personal questions will be answered by email.
-
PML notes (PDF file)
-
Textbooks containing the material treated in the course (and much more). In pdf, for your private use. Do not distribute.
-
Opened: Wednesday, 26 March 2025, 11:07
Due: Monday, 14 April 2025, 23:59
You can find the problems here:
https://github.com/r-doz/PML2025/blob/main/Homeworks/Homework_1.ipynb
You should write the solutions to the problems in Markdown/LaTeX (not handwritten), within a notebook (.ipynb), PDF, or HTML file. Some problems also require you to write some Python code.
For Problem 6, you should submit the notebook 04_exact_inference.ipynb with your additional code (so you should submit two files in total). The deadline for the submission is the 14th of April at 23:59.
-
Introduction to Bayesian statistics, and Bayesian linear regression and classification.
-
Due: Thursday, 19 June 2025, 22:00
You can find the problems here: https://github.com/DavideScassola/PML2024/blob/main/Homeworks/03_homework.ipynb
You should write the solutions to the problems in Markdown/LaTeX (not handwritten), within a notebook (.ipynb), PDF, or HTML file. This homework is not mandatory, but you must submit it to be eligible for the full 3 bonus points; otherwise the maximum is 2. If you do not submit it, the average of your previous homeworks is not affected.
The deadline for the submission is on the 20th of June at 23:59.
-
Pandas is the main library for manipulating and exploring datasets in Python. Here you will find Jupyter Notebook tutorials, and links to further tutorials, for learning Pandas. We will also cover some fundamentals of data visualization with matplotlib and seaborn. The last notebook is about scikit-learn, a comprehensive machine learning library for Python.
Within the notebooks, you will also find some exercises. These are not to be submitted, but they can help you learn.
I assume you have Python 3 up and running.
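If you want to verify your setup before opening the notebooks, a minimal sketch along these lines may help. It only checks that the interpreter is Python 3 and that the libraries named above can be found; the helper name `check_environment` is hypothetical and not part of the course material, and no specific library versions are assumed.

```python
import sys
from importlib.util import find_spec

# Import names of the libraries mentioned above
# (scikit-learn is imported under the name "sklearn").
REQUIRED = ["pandas", "matplotlib", "seaborn", "sklearn"]


def check_environment(required=REQUIRED):
    """Return (python_ok, missing) for the current interpreter.

    python_ok is True when running under Python 3;
    missing lists the required packages that cannot be located.
    """
    python_ok = sys.version_info.major == 3
    missing = [name for name in required if find_spec(name) is None]
    return python_ok, missing


if __name__ == "__main__":
    python_ok, missing = check_environment()
    print("Python version:", sys.version.split()[0])
    print("Python 3:", "yes" if python_ok else "no")
    print("Missing packages:", ", ".join(missing) if missing else "none")
```

Any package reported as missing can usually be installed with `pip install` followed by its distribution name (for `sklearn`, the distribution is called `scikit-learn`).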
-
The exam consists of two parts:
- a group project
- an individual oral exam on the topics seen during lectures.
These two parts can be taken on separate dates.
Group project: each group (2-3 people, formed by the students themselves) has to choose a topic and agree on it with the professor and the tutors (via email). Presentations should be submitted via email and via Moodle at least 5 days before the exam.
The topic can be a method related to PML and GenAI that is novel with respect to those presented in the course; it has to be presented and discussed. Each project must include an implementation part, in which the methodology is implemented and tested. Joint projects with Deep Learning require a more advanced (research-level) quality of results.
Each project will be presented jointly by the team in a presentation of at most 15 minutes (including questions). The target audience is the other students, so topics covered during the course are considered known and should not be presented in detail.
-
Here you can find useful links and other material to read.
-
Here you can find several datasets to experiment with.
-
Here you can find several datasets to experiment with classification techniques.