715SM - STATISTICA COMPUTAZIONALE 2016
Schema della sezione
-
This is the moodle page of the Computational Statistics Course. The page will be in maintained in English. Here you will find the material of the course, including slides, videos, textbooks, the lab assignments, and the instructions for the final project.
Lectures.
Monday 14.00-17.00
Tuesday 11.00-13.00
Wednesday 14.00-17.00
-
Course slides. There will be a non-annotatad copy and an annotated one (as during lectures).
-
Textbooks containing all the material treated in the course. And much more. In pdf, for your private use. Do not distribute.
-
Youtube link to lecture videos.
-
Data and instructions for the lab on linear regression.
-
Tasks File PDF
-
Data Sets File ZIP
-
-
Dataset and tasks
-
Data sets File ZIP
-
Task: use the implementation of SVM and of Gaussian Process regression and classification in scikit-learn to analyse the following datasets.
Datasets zip-files contain the data in csv format and a description of the dataset and of the supervised learning task.
-
Tasks for the lab on unsupervised learning.
TASK 1: Implement in Python a method for Parzen density estimation with Gaussian kernels (accepting a generic kernel function), and test it on 1d and 2d data. Use (cross) validation to set the lengthscale of the kernel.
TASK 2: Run clustering algorithms, exploring the scikit-learn library (kmeans, gaussian mixtures, hierarchical clustering, spectral clustering), on 2d density data and on any other dataset you like (for classification and regression datasets, ignore output features).
TASK 3: Experiment with PCA from scikit-learn on the dataset(s) available for the course.