Objectives:
Students will learn concepts, techniques and tools for data science practice, including data collection and integration, exploratory data analysis, predictive modeling, descriptive modeling. Apply basic machine learning algorithms (Linear Regression, k-Nearest Neighbors (k-NN), k-means, Naive Bayes) for predictive modeling.
Weekly Lectures (subject to change)
-
Exploratory Data Analysis
-
Distributions - Density Functions :Introduction to R
-
Estimation- Hypothesis Testing
-
Linear Least Squares
-
Linear Regression- Multiple Linear Regression
-
Classification: Logistic Regression, LDA,QDA,KNN
-
Midterm
-
Resampling Methods: Cross-Validation
-
Resampling Methods: The Bootstrap
-
Linear Model Selection and Regularization:Ridge, Lasso Regression
-
Tree- Based Methods
-
Support Vector Machines
Grading
- Assignments 20%
- Midterm 25%
- Project 20%
- Final Exam 35%