Distributed Data Mining Lab Course SoSe 19

 

Type:

Master Lab Course 6 P (IN2106, IN4176)

Ects:

10

Supervisors

Dr. Lothar Richter, Dmitrii Nechaev

Rotation:

weekly meeting of 2 hours, time slot: Wednesday 13 - 15, room 01.09.034

Rooms:

01.09.034 for the weekly meeting

Language:

English

Announcements:

There are two identical pre-meetings: Tue, Feb 5th, 4.30 pm and Thu, Feb 7th, 3.30 pm. Room FMI 01.09.034

Content

The character of this lab course will be highly explorative and technical oriented and covers the following (among others). Since the syllabus is continuously evolving and updating the mentioned topics might still change:

  • Hadoop File System
  • Exploration and Comparison of Hadoop, Spark and Dask
  • Installation/Configuration
  • Installation, Configuration and Application of the  MLlib framework
  • MapReduce
  • Simple applications

Prerequisites

  • Basic experience in Data Mining / Machine Learning
  • Sound Linux administration/ command line skills
  • Good command of at least one of these programming lanuages: Java, Scala, Python

Resources

Slides

Feb 5th / Feb 7th Premeeting