Foundations in Data Engineering
- Prof. Dr. Thomas Neumann
- Alexander Beischl, M.Sc.
- Maximilian Reif, M.Sc.
For any questions regarding this lecture, please contact us using: [email protected]
Overview:
This website provides information on the course organization. Exercise and bonus project material will be provided via the Moodle course. You are automatically added to the Moodle course when you register for the lecture in TUMOnline. The Moodle course will be updated the week before the lecture starts. If you cannot access the Moodle course because you are not yet fully enrolled in your master's program, please email us at [email protected].
Lecture
Tuesdays, 4 - 6 p.m., Interims 1 Hörsaal 2 (5620.01.102), and
Thursdays, 4 - 6 p.m., Physik Hörsaal 2 (5101.EG.502)
Start: October 17, 2023
There is no live stream or recording.
Tutorial Session
There are TBA sessions available which cover the same topics.
Start: week of October 23, 2023
Please enroll via TUMOnline.
Exam
The exam will be held on-site at TUM and there will also be a retake exam (also on-site).
We will announce the exam dates as soon as we know them.
Intro slides for lecture and exercise
Lecture and exercise info slides
Bonus System
There will be three bonus projects that accompany this lecture.
All three projects are programming tasks.
If you complete all three projects, you will receive a grade bonus of 0.3 on your final exam grade.
The grade bonus is only applied on passed exams (4.0 or better).
The programming tasks will be done in C++ and Scala.
Prior C++ knowledge is expected; if you lack it, you can learn the language on your own during the semester.
The bonus projects will be published in the Moodle course.
C++ learning materials:
- Systems Programming in C++ (by our chair)
- Introduction to C++ standard library containers
- Topcoder: power up C++ with the standard template library part one
- If you prefer a well-written book: A Tour of C++
Slides
- Introduction
- Basic Building Blocks
- Advanced SQL
- SQL Query Unnesting
- Distributed Data Processing
- No-SQL Databases
- Other Data Models
- Neo Join
- RDF Query Optimization
- Additional Lecture: Introduction to C++
Literature
- Anand Rajaraman, Jeffrey David Ullman. Mining of Massive Datasets. Cambridge University Press B31.
- Maurice Herlihy, Nir Shavit. The Art of Multiprocessor Programming. Morgan Kaufmann, 2012.
- Garcia-Molina, Ullman, Widom. Database Systems: The Complete Book. Prentice Hall, 2000.
- Alfons Kemper, André Eickler. Datenbanksysteme. Eine Einführung. 10th updated and expanded edition, Oldenbourg Verlag, 2015.