Statistical analysis and document mining

Degrees incorporating this pedagocial element :

Description

The aim of this course is to present the statistical approaches for analysing multivariate data. The information age has resulted in masses of multivariate data in many different field: finance, marketing, economy, biology, environmental sciences,…The theoretical and practical aspects of multivariate data analysis are given equal importance. This balance is achieved through practicals involving actual data analysis using the R software.

Recommended prerequisite

Elementary notions in probability theory (probability distribution, joint probability density function for random vectors, conditional distribution, expectation, variance, covariance, Gaussian distribution)

Elementary notions in mathematical statistics (estimator, confidence interval, statistical tests).

As a bonus: simple linear regression, linear algebra (matrix reductions), elementary notions in Rstudio and the R software.