Massive datasets (M2)

Where and when ?

  • Monday 9h-12h. From 2021-02-08 to 2021-03-22.
  • ZOOM

References

  • Foundations of data science : Avrim Blum, John Hopcroft and Ravi Kannan.
  • Mining of Massive Datasets : Jure Leskovec, Anand Rajaraman , and Jeff Ullman. Cambridge University Press.
  • Statistics for High-Dimensional Data: Methods, Theory and Applications. Peter Bühlmann & S van de Geer. Springer.
  • An Introduction to Statistical Learning: with Applications in R Gareth James (Author), Daniela Witten (Author), Trevor Hastie (Author), Robert Tibshirani. Springer.
  • Python for Data Analysis: Data Wrangling with Pandas, NumPy, and IPython. Wes McKinney. O’Reilly.

Software

Evaluation

  • Streams
  • Neighbors
  • Robustness
Previous
Next