Git Product home page Git Product logo

activity05-data-summarization's Introduction

Activity 5 - Data Summarization

It is assumed that you have read Sections 5.5 - 5.6 from R4DS and completed the Derive Information with dplyr Primer.

In this activity, you will:

  • Produce numerical summaries of variables using {dplyr}.
  • Produce numerical summaries of variables by a grouping variable using {dplyr}.
  • Compute new variables in a dataset using {dplyr}.

☑️ Task 1: The Workflow

Remember that more detailed directions can be found in Task 1 of Activity 4.

fork Fork this repo and clone it to a new RStudio Project

pause

Planned Pause Point: If you have any questions, contact your instructor or another group. We will complete this Activity during our next class session

☑️ Task 2: Complete the RMarkdown File

The activity05-data-summarization.Rmd file contains the directions for this activity. For the rest of this class period, you will complete the RMarkdown document with your neighbor(s). Your instructor will be circling and be available to help when needed.

Note that each person is working in their own repo. We are not worrying about collaborating for the time being and instead will be working on being more comfortable with the workflow for working between RStudio and GitHub.

However, do not continue in this README document until you and your neighbor(s) have completed your .Rmd files.

Work Work Work

☑️ Task 3: Reflection

We now have a number of skills to help us explore datasets. Before we add too many more tools/skills, we should verify what we have currently learned. Look at the Course Objectives that we came up with at the beginning of this semester. Take 5 minutes to identify which of these you feel comfortable with. How could you demonstrate what you have learned?

Now, think back through the 5 Activities that we have completed. What is still not clear? What will you do to better understand these tricky items?

Next: Activity 6 will focus on restructuring data to be easier for humans to read or easier for computers to handle.

activity05-data-summarization's People

Contributors

dykesb avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.