Git Product home page Git Product logo

gettingcleaningdata's Introduction

GettingCleaningData repository

Introduction

This repository contains the course assignment for the "Getting and Cleaning Data" module of the John Hopkins Data Science specialization on Coursera.org.

As part of the assignment a raw data file of Samsung smart phone data tests was processed to create a tidy data set.

The repository contains:

  • an R script called run_analysis.R for reading the raw data and creating the tidy data set txt file
  • a code book which describes the prepared txt file containing the tidy data set

More information about the raw data and the tests it represents

More information about the data can be found at the following links. For information about how the data was produced: http://archive.ics.uci.edu/ml/datasets/Human+Activity+Recognition+Using+Smartphones

To download the raw data needed by the run_analysis.R programme: https://d396qusza40orc.cloudfront.net/getdata%2Fprojectfiles%2FUCI%20HAR%20Dataset.zip

Specification for run_analysis.R

  1. Merges the training and the test sets to create one data set.
  2. Extracts only the measurements on the mean and standard deviation for each measurement.
  3. Uses descriptive activity names to name the activities in the data set (e.g. WALKING, WALKING DOWNSTAIRS)
  4. Appropriately labels the data set with descriptive variable names.
  5. From the data set in step 4, creates a second, independent tidy data set with the average of each variable for each activity and each subject.

Running the programme

  1. Download the Raw data from the above link and unzip it to your R working directory.
  2. Execute run_analysis.R
  3. A tidy data set will be prepared in the working directory in a file called AggregatedTidySamsungData.txt

gettingcleaningdata's People

Contributors

domlloyd avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.