Getting and Cleaning Data
Course project
This repository contains a script to analyse the "Human Activity Recognition Using Smartphones Data Set". A full description is available online
The data set can be downloaded is available as zip file
R based analysis of the data set
All analysis can be done by sourcing the file run_analysis.R
The analysis contains the following steps:
- Merges the training and the test sets to create one data set.
- Extracts only the measurements on the mean and standard deviation for each measurement.
- Uses descriptive activity names to name the activities in the data set
- Appropriately labels the data set with descriptive variable names.
- From the data set in step 4, creates a second, independent tidy data set with the average of each variable for each activity and each subject.
As a result of the analyis a single file will be created
- tidy.csv holds the means of each variable for each activity and each subject
A code is available for the data set in the file CodeBook.md
Requirements
You need to have the following packages installed
- dplyr
- reshape2
If you don't have them installed run the following commands
install.packages("dplyr")
install.packages("reshape2")
How to run
- checkout this repository
- download the dataset
- unzip the dataset to the root folder of the repository
- open an R console in the repository root or set the working directory to the repository
- source the script run_analysis.R
What will happen when I run the script?
- the function createTable will be called and create the tidy dataset
- this second tidy dataset will be saved to the filed tidy.csv