Modern data analysis relies on high-level languages such as Python or R for data handling and processing. However, behind standard libraries like Scikit-Learn are hidden implementations in low-level languages such as C or C++, for faster execution and optimized resource management or calculation.

References:

On data analysis:

  • Hastie, Tibshirani, Friedman: The Elements of Statistical Learning: Data Mining, Inference, and Prediction (2nd ed.). Springer, 2017.
  • Scott and Stain: Multi-dimensional Density Estimation. In Handbook of Statistics, Volume 23 (Data Mining and Computational Statistics), 2004.

On C++:

  • Stroustrup. The C++ Programming Language (4th ed.). Addison-Wesley, 2013.
  • Weiss: C++ for Java Programmers. Prentice Hall, 2003.