GitHub - pinogl/getdata-project: Getting and Cleaning Data Course Project

#Getting and Cleaning Data Course Project

The goal of this project is to prepare tidy data that can be used for later analysis.

##Input data The raw input data for the project is the UCI HAR Dataset available from the course website https://d396qusza40orc.cloudfront.net/getdata%2Fprojectfiles%2FUCI%20HAR%20Dataset.zip

The files relevant to the project are:

##R script An R script run_analysis.R has been developed to implement the following tasks.

Merge the training and the test sets to create one data set data.raw.
Extract only the measurements on the mean and standard deviation for each measurement, by filtering the list of all features. Only the features statistics mean and standard eviation are kept.
Assign descriptive activity names by replacing the codes with the labels, after some data cleaing (word splitting and capitalization).
Label the data set with descriptive variable names, following the camelCase naming convention.
Create tidy data set with the average of each variable for each activity and each subject.

##In order to get the tidy dataset you have to:

put UCI HAR Dataset.zip and run_analysis.R files in the same directory.
With RStudio
1. Open run_analysis.R from the chosen directory
2. Session -> Set Working Directory -> to Source File Location
3. Code -> Run All
HAR-tidy.txt dataset is written into the data folder. It can be read using read.table("data/HAR-tidy.txt", header=TRUE)

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
Codebook.md		Codebook.md
HAR-tidy.txt		HAR-tidy.txt
README.md		README.md
run_analysis.R		run_analysis.R

Provide feedback