STAT411 STATISTICAL DATA MINING
| Course Code: |
2460411 |
| METU Credit (Theoretical-Laboratory hours/week): |
4(3-2) |
| ECTS Credit: |
6.0 |
| Department: |
Statistics |
| Language of Instruction: |
English |
| Level of Study: |
Undergraduate |
| Course Coordinator: |
Prof.Dr. CEYLAN YOZGATLIGÝL |
| Offered Semester: |
Fall and Spring Semesters. |
| Prerequisite: |
Set 1: 2460291
, 2460363 |
| The course set above should be completed before taking
STAT411 STATISTICAL DATA MINING. |
Course Content
Descriptive and predictive mining. Data preprocessing: cleaning transformation. outlier detection. missing data imputation. Dimension reduction. Principal Component Analysis (PCA). Sampling. oversampling. Exploratory data analysis (EDA). Clustering methods: partitioning. hierarchical. density-based. model-based. Predictive modeling. Regression. Variable selection. Robust and nonlinear regression. Nonparametric regression. Classifiers. Logistic regression. Decision trees. Random Forest. Model evaluation and validation. Real-life applications using recent available software.