Git Product home page Git Product logo

sta141c-lectures's Introduction

STA 141C Big Data & High Performance Statistical Computing

People

  • Instructor: Randy Lai ([email protected])

  • Meeting time: 12:10 - 1:30 PM, TR

  • Location: WELLMN 216

  • Office hour: MSB 4111 Tuesday and Wednesday 2:00 - 3:00pm (or by appointment)

  • TA: Si Teng Hao ([email protected])

  • Meeting time: 9:00 - 9:50 AM, W or 11:00 - 11:50 AM, W

  • Office hour: MSB 1117 Friday 10am - 12pm

Material

Date Note HTML PDF
01-07 introduction html pdf
01-09 - 01-14 tidy data html pdf
01-16 - 01-21 functional programming html pdf
01-23 conditions html pdf
debugging html pdf
01-28 profiling html
Rcpp html

Site

  • Canvas for grades
  • Piazza for discussion
  • GitHub for lecture notes

Tentative Schedule:

Week Topic
1 Version Control
2 Tidy data and Functional programming
3 Debugging and profiling
4 Writing C and C++ extensions
5 Parallel Computing
6 Databases and SQL
7 Writing an R package
8 Interoperate with python
9 Deep learning in R
10 Backups

Grading

Category Grade Percentage
Assignments 70%
Project 25%
Participation 5%
  • There will be around 5/6 assignments
  • Assignments must be turned in by the due date. No late assignments are accepted.
  • Participation will be based on your involvement in class, discussion, or office hours. The most subjective way to earn participation points is to have some interactions on Piazza. (A+ will be only given to those students with high participation)

Resources

How to ask questions

I and TA will not respond to any emails about general questions about assignments and course materials. Please use piazza in regard to this matter. For private or sensitive questions you can do private posts on Piazza or email the instructor or TA.

Learn how to ask a question. Asking a question is an art, stackoverflow.com has some good tips.

Final Project

Details of project will be announced later.

Resources

Assignment Rubric

(Adapted from Nick Ulle and Clark Fitzgerald )

Point values and weights may differ among assignments. This is to indicate what the most important aspects are, so that you spend your time on those that matter most. Check the homework submission page on Canvas to see what the point values are for each assignment.

The grading criteria are correctness, code quality, and communication. The following describes what an excellent homework solution should look like:

Correctness

The report does the following:

solves all the questions contained in the prompt makes conclusions that are supported by evidence in the data discusses efficiency and limitations of the computation cites any sources used The attached code runs without modification.

Code Quality

The code is idiomatic and efficient. Different steps of the data processing are logically organized into scripts and small, reusable functions. Variable names are descriptive. The style is consistent and easy to read.

Communication

Plots include titles, axis labels, and legends or special annotations where appropriate. Tables include only columns of interest, are clearly explained in the body of the report, and not too large. Numbers are reported in human readable terms, i.e. 31 billion rather than 31415926535. Writing is clear, correct English.

Inquisitiveness

The report points out anomalies or notable aspects of the data discovered over the course of the analysis. It discusses assumptions in the overall approach and examines how credible they are. It mentions ideas for extending or improving the analysis or the computation.

sta141c-lectures's People

Contributors

randy3k avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.