Systems for Big Data

This course covers the design principles and algorithmic foundation of influential software systems for Big Data Analytics. The course begins with the design of large enterprise data warehouses, query processing techniques for Online-Analytic Processing, and data mining over data warehouses. The course then examines fundamental architectural changes to scale data processing and analysis to a shared-nothing compute cluster, including parallel databases, MapReduce, column stores, and the support of batch processing, iterative algorithms, machine learning, and interactive analytics in this new context.

 

The prerequisite for this course is INF553. The coursework includes a series of written and programming assignments and a final exam.

The course will be taught in English.