Foundations in Data Engineering
For any questions regarding this lecture, please contact us using: [email protected]
Content:
This website provides preliminary information on the course organization. The lecture itself will be organized via this moodle course. You are automatically added to the moodle course when you register for the lecture in TUMOnline. The moodle course will be updated the week before the lecture starts.
For on-site participation please ensure:
- Your "3G proof" can be scanned with the "CovPass Check" app (https://www.digitaler-impfnachweis-app.de/en/covpasscheck-app) and
- bring your ID card/passport for identification!!!
Dates
- Lecture: Tuesdays, 4 - 6 p.m. & Thursdays, 4 - 6 p.m.
Start: October 19, 2021 - Tutorial Sessions: There are five sessions available which cover the same topics.
Please see the moodle course for details and how to enroll.
Start: week of October 27, 2021 - Moodle Course: If you cannot access the moodle course because you are not completely enrolled for your master yet, please mail the teaching assistants (Mr. Beischl & Mr. Reif) using [email protected].
Corona (COVID-19) Information
Due to the COVID-19 pandemic, the lecture will be taught in hybrid mode.
- Lecture: on-site and recordings (published on Moodle)
- Tutorials: on-site and online (Big Blue Button, link will be published in Moodle)
- Exam & Retake Exam: only on-site at TUM
Lecture
There are two lectures per week at TUM each Tuesday (4 - 6 p.m.) and Thursday (4 - 6 p.m.). Lecture recordings will be uploaded to the moodle course.
If the infection rate increases during the semester, we might need to adapt the lecture mode.
However, the lecture recordings are uploaded for students that can't visit the on-site lecture.
We will announce updates in Moodle.
Tutorial Session
The tutorial session will take place on-site but there will be alternative online live sessions using the tool Big Blue Button.
We offer online and on-site time slots, all covering the same content.
Thus, you only need to attend one tutorial session per week.
You can register for a slot via TUMOnline. This will be explained during the first lecture week.
After registration, we provide you a link to the tutorial session of your time slot.
Please always watch the lecture before participating in the tutorial session!
Exam
The exam will be held on-site (attendence in person required) at TUM and there will also be a retake exam (also on-site).
Bonus System
There will be three bonus projects that accompany this lecture.
All three projects are programming tasks.
If you complete all three projects, you will receive a grade bonus of 0.3 on your final exam grade.
The grade bonus is only applied on passed exams (4.0 or better).
The programming tasks will be done in C++ and Scala.
Prior C++ knowledge is required, but you can also learn it yourself during the semester.
The bonus projects will be published in the moodle course.
C++ learning materials:
- Systems Programming in C++ (by our chair)
- Introduction to C++ standard library containers
- Topcoder: power up C++ with the standard template library part one
- If you prefer a well written book: A Tour of C++
Slides
- Introduction
- Basic Building Blocks
- Advanced SQL
- SQL Query Unnesting
- Distributed Data Processing
- No-SQL Databases
- Other Data Models
- Neo Join
- RDF Query Optimization
Tutorial Sessions
Please find all exercise sheets, discussion and further material on the moodle course.
Literature
- Anand Rajaraman, Jeffrey David Ullman Mining of Massive DatasetsCambridge University Press B31
- Maurice Herlihy, Nir Shavit The Art of Multiprocessor Programming Morgan Kaufmann, 2012.
- Garcia-Molina, Ullman, Widom Database Systems: The Complete Book Prentice Hall, 2000.
- Alfons Kemper, André Eickler Datenbanksysteme. Eine Einführung 10., aktualisierte und erweiterte Auflage, Oldenbourg Verlag, 2015.