Clustering, Data Analysis and Visualization of Complex Data
May 21-25, 2018, Catania (Italy)
The course is intended to achieve postgraduate training in special areas of statistics for both researchers and professional data analysts. The focus is on classification and clustering methods, in conjunction with related visualization techniques, with particular emphasis on modern high-dimensional data sets (MHDS). MHDS have recently emerged because of the fast improvement in data acquisition, storage and processing. The availability of massive data sets are of large interest also in machine learning, data science and computer science. It applies in many contexts such as biological experiments, financial markets, astronomy, etc. Classification and clustering play a key role in this new paradigm to discover the inhomogeneous structure often underlying these data, and become consequently even more emblematic methods of modern data analysis. Starting from basic concepts, the course will introduce the audience to novel techniques and software through extensive applications to real data.
Numerical applications will be performed through a variety of software, including some R packages and some cloud-computing platforms (SaaS, Software as a Service) issuing from research but targeting many kinds of practitioners