This course is designed to introduce statistical research methodology--the procedures by which statisticians go about approaching and analyzing data--to early undergraduates. Students will learn basic concepts of statistical learning--inference vs. prediction, supervised vs. unsupervised learning, regression vs. classification, etc.--and will reinforce this knowledge by applying, e.g., linear regression, random forest, principal components analysis, and/or hierarchical clustering and more to datasets provided by the instructor. Students will also practice disseminating the results of their analyses via oral presentations and posters. Analyses will primarily be carried out using the R
programming language, but with attention paid to how one would perform similar analyses using Python. Previous knowledge of R
is not required for this course. Space is very limited; there will be an application process. The course is currently open to sophomore statistics students only.
Fall 2019, Tu-Th 1:30 - 2:50 PM, Wean 4625
Week | Day | Topic |
---|---|---|
1 | Tu | introduction to R + pre-test assessment |
Th | R: vectors + lab | |
2 | Tu | R: dplyr + ggplot + lab |
Th | exploratory data analysis + lab | |
3 | Tu | statistical learning + K-means + hierarchical clustering + lab |
Th | PCA + lab | |
4 | Tu | regression model assessment + lab |
Th | classification model assessment + lab | |
5 | Tu | linear regression + lab |
Th | GLM + logistic regression + lab | |
6 | Tu | best subset selection + lab |
Th | penalized regression + lab | |
7 | Tu | ML + trees + lab |
Th | reserved for project dataset work | |
8 | Tu | random forest + lab |
Th | cancelled: mid-semester break | |
9 | Tu | boosting + lab |
Th | naive bayes + lab | |
10 | Tu | SVM + lab |
Th | KNN + lab | |
11 | Tu | TBD |
Th | TBD | |
12 | Tu | TBD |
Th | reserved for project dataset work | |
13 | Tu | hackathon preliminaries |
Th | hackathon presentations | |
14 | Tu | cancelled: Thanksgiving |
Th | cancelled: Thanksgiving | |
15 | Tu | oral presentations |
Th | post-test assessment + retrospection | |
F | Fr | final poster due |