STAT411 STATISTICAL DATA MINING
Course Code: |
2460411 |
METU Credit (Theoretical-Laboratory hours/week): |
4(3-2) |
ECTS Credit: |
6.0 |
Department: |
Statistics |
Language of Instruction: |
English |
Level of Study: |
Undergraduate |
Course Coordinator: |
Prof. Dr. CEYLAN YOZGATLIGİL |
Offered Semester: |
Fall and Spring Semesters. |
Prerequisite: |
Set 1: 2460291, 2460363 |
The course set above should be completed before taking STAT411 STATISTICAL DATA MINING. |
Course Content
Descriptive and predictive mining. Data preprocessing: cleaning, transformation, outlier detection, missing data imputation. Dimension reduction, Principal Component Analysis (PCA). Sampling, oversampling. Exploratory data analysis (EDA). Clustering methods: partitioning, hierarchical, density-based, model-based. Predictive modeling. Regression. Variable selection. Robust and nonlinear regression. Nonparametric regression. Classifiers. Logistic regression. Decision trees. Random forests. Model evaluation and validation. Real-life applications using currently available software.