-
Instructor: Randy Lai ([email protected])
-
Meeting time: 12:10 - 1:30 PM, TR
-
Location: WELLMN 216
-
Office hour: MSB 4111 Tuesday and Wednesday 2:00 - 3:00pm (or by appointment)
-
TA: Si Teng Hao ([email protected])
-
Meeting time: 9:00 - 9:50 AM, W or 11:00 - 11:50 AM, W
-
Office hour: MSB 1117 Friday 10am - 12pm
Date | Note | HTML | |
---|---|---|---|
01-07 | introduction | html | |
01-09 - 01-14 | tidy data | html | |
01-16 - 01-21 | functional programming | html | |
01-23 | conditions | html | |
debugging | html | ||
01-28 | profiling | html | |
Rcpp | html |
Tentative Schedule:
Week | Topic |
---|---|
1 | Version Control |
2 | Tidy data and Functional programming |
3 | Debugging and profiling |
4 | Writing C and C++ extensions |
5 | Parallel Computing |
6 | Databases and SQL |
7 | Writing an R package |
8 | Interoperate with python |
9 | Deep learning in R |
10 | Backups |
Category | Grade Percentage |
---|---|
Assignments | 70% |
Project | 25% |
Participation | 5% |
- There will be around 5/6 assignments
- Assignments must be turned in by the due date. No late assignments are accepted.
- Participation will be based on your involvement in class, discussion, or office hours. The most subjective way to earn participation points is to have some interactions on Piazza. (A+ will be only given to those students with high participation)
- H. Wickham, Advanced R (https://adv-r.hadley.nz/)
- G. Grolemund and H. Wickham, R for Data Science (https://r4ds.had.co.nz/)
- J. Bryan, Data wrangling, exploration, and analysis with R (https://stat545.com/)
- J. Bryan, the STAT 545 TAs, J. Hester, Happy Git and GitHub for the useR (https://happygitwithr.com/)
- R. Peng, S. Kross, and B. Anderson, Mastering Software Development in R (https://bookdown.org/rdpeng/RProgDA/)
I and TA will not respond to any emails about general questions about assignments and course materials. Please use piazza in regard to this matter. For private or sensitive questions you can do private posts on Piazza or email the instructor or TA.
Learn how to ask a question. Asking a question is an art, stackoverflow.com has some good tips.
Details of project will be announced later.
- J. Bryan, Data wrangling, exploration, and analysis with R (https://stat545.com/)
- J. Bryan, the STAT 545 TAs, J. Hester, Happy Git and GitHub for the useR (https://happygitwithr.com/)
- G. Grolemund and H. Wickhom, R for Data Science (https://r4ds.had.co.nz/)
- H. Wickham, Advanced R (https://adv-r.hadley.nz/)
- R. Peng, S. Kross, and B. Anderson, Mastering Software Development in R (https://bookdown.org/rdpeng/RProgDA/)
(Adapted from Nick Ulle and Clark Fitzgerald )
Point values and weights may differ among assignments. This is to indicate what the most important aspects are, so that you spend your time on those that matter most. Check the homework submission page on Canvas to see what the point values are for each assignment.
The grading criteria are correctness, code quality, and communication. The following describes what an excellent homework solution should look like:
The report does the following:
solves all the questions contained in the prompt makes conclusions that are supported by evidence in the data discusses efficiency and limitations of the computation cites any sources used The attached code runs without modification.
The code is idiomatic and efficient. Different steps of the data processing are logically organized into scripts and small, reusable functions. Variable names are descriptive. The style is consistent and easy to read.
Plots include titles, axis labels, and legends or special annotations where appropriate. Tables include only columns of interest, are clearly explained in the body of the report, and not too large. Numbers are reported in human readable terms, i.e. 31 billion rather than 31415926535. Writing is clear, correct English.
The report points out anomalies or notable aspects of the data discovered over the course of the analysis. It discusses assumptions in the overall approach and examines how credible they are. It mentions ideas for extending or improving the analysis or the computation.