Academic Catalog

STAT411 STATISTICAL DATA MINING

Course Code: 2460411
METU Credit (Theoretical-Laboratory hours/week): 4(3-2)
ECTS Credit: 8.0
Department: Statistics
Language of Instruction: English
Level of Study: Undergraduate
Course Coordinator:
Offered Semester: Fall and Spring Semesters.
Prerequisite: Set 1: 2460291 , 2460363
The course set above should be completed before taking STAT411 STATISTICAL DATA MINING .

Course Content

Descriptive and predictive mining. Data preprocessing: cleaning transformation. outlier detection, missing data imputation. Dimension reduction, Principal Component Analysis (PCA). Sampling, oversampling. Exploratory data analysis (EDA). Clustering methods: partitioning, hierarchical, density-based, model-based. Predictive modeling. Regression. Variable selection. Robust and nonlinear regression. Nonparametric regression. Classifiers. Logistic regression. Decision trees. Random Forest. Model evaluation and validation. Real-life applications using recent available software.