Data Mining Algorithms 1

Organization (WiSe 25/26)

Course
3+2 hours weekly (equals 6 ECTS)
Lecture:
Prof. Dr. Thomas Seidl
Assistant:
Dr. Gabriel Marques Tavares, Zhi-Cong Xian, Tanveer Hannan
Audience:
Master and Bachelor students in the programs of the Institute for Informatics
Course Material:
Prior Knowledge:
None
Course Language:
English

Content

The vast increase in data volume in almost every field results in increased difficulty or even impossibility for information analysis. Especially in areas such as biological measurement evaluation (e.g. gene sequencing, micro-array processes …) or data transaction in large telecommunications or network operators, using data without computational aid is inconceivable. The research area “Knowledge Discovery in Databases (KDD)” investigates solutions to these problems. It combines statistics, machine learning, database systems, and (semi-) automatic extraction methods for valid, new, and potentially useful knowledge from large databases. The term data mining in this context refers to the fundamental step in the KDD process, in which the actual analysis of the data is carried out. Data mining is often applied to large amounts of operational data that are managed separately in so-called data warehouses. The frequently used term Business Intelligence describes, among other things, the application of data mining algorithms to the information provided by a data warehouse in order to support targeted decision-making processes. The lecture gives an overview of the basics of the most important KDD techniques. Particularly: Classification, regression/trend detection, clustering, outlier detection, association rules, and process mining.

To deepen the lecture, exercises are offered in which the presented procedures are further explained and illustrated with practical examples.