UE Statistical analysis and document mining

  • Level of study

    Bac+4 (fourth year of higher education)

  • ECTS

    6 credits

  • ECTS credits (exchange students)

    6.0

  • Department

    UFR IM2AG (computer science, mathematics and applied mathematics)

  • Time of year

    Spring (January to April/May)

Description

The aim of this course is to present statistical approaches for analysing multivariate data. The information age has produced masses of multivariate data in many different fields: finance, marketing, economics, biology, environmental sciences, etc. The theoretical and practical aspects of multivariate data analysis are given equal importance. This balance is achieved through practicals involving actual data analysis using the R software.

Content

  1. Multiple linear regression: least squares, Gaussian linear model, tests of linear hypotheses, one-way analysis of variance.
  2. Principal Component Analysis (PCA).
  3. Classification: linear discriminant analysis, perceptron, Naive Bayes.
  4. Text mining: numeric representation of texts, connection with graph clustering.

Teaching hours

  • CM (lectures): 16.5h
  • TD (tutorials): 7.5h
  • TP (practicals): 25.5h

Recommended prerequisites

Elementary notions in probability theory (probability distribution, joint probability density function for random vectors, conditional distribution, expectation, variance, covariance, Gaussian distribution).

Elementary notions in mathematical statistics (estimator, confidence interval, statistical tests).

As a bonus: simple linear regression, linear algebra (matrix reductions), and elementary notions of RStudio and the R software.

Period

Semester 8

List of courses

  • Statistical analysis and document mining

  • Statistical analysis and document mining complementary