Massive datasets (M2)
Where and when ?
- Monday 9h-12h. From 2021-02-08 to 2021-03-22.
-
ZOOM
Roadmap
- Streaming data
- Nearest neighbor methods
- Robust estimation
References
- Foundations of data science : Avrim Blum, John Hopcroft and Ravi Kannan.
- Mining of Massive Datasets : Jure Leskovec, Anand Rajaraman , and Jeff Ullman. Cambridge University Press.
- Statistics for High-Dimensional Data: Methods, Theory and Applications. Peter Bühlmann & S van de Geer. Springer.
- An Introduction to Statistical Learning: with Applications in R Gareth James (Author), Daniela Witten (Author), Trevor Hastie (Author), Robert Tibshirani. Springer.
- Python for Data Analysis: Data Wrangling with Pandas, NumPy, and IPython. Wes McKinney. O’Reilly.
Software
Evaluation
- Streams
- Neighbors
- Robustness
Last updated on Jun 24, 2019