You should create one R script called run_analysis.R that does the following.
- Merges the training and the test sets to create one data set.
- Extracts only the measurements on the mean and standard deviation for each measurement.
- Uses descriptive activity names to name the activities in the data set.
- Appropriately labels the data set with descriptive activity names.
- Creates a second, independent tidy data set with the average of each variable for each activity and each subject.
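
The five steps above can be sketched on toy data. This is a minimal illustration, not the actual contents of `run_analysis.R`: the two small tables, their values, the two activity labels, and the column-name cleanup rule are all assumptions made up for the example. Only the `data.table` calls are real.

```r
library(data.table)

# Hypothetical stand-ins for the training and test sets.
train <- data.table(
  subject = c(1L, 1L),
  activity = c(1L, 2L),
  `tBodyAcc-mean()-X` = c(0.2, 0.4),
  `tBodyAcc-std()-X` = c(0.1, 0.3),
  `tBodyAcc-max()-X` = c(0.9, 0.8)
)
test <- data.table(
  subject = c(2L, 2L),
  activity = c(1L, 1L),
  `tBodyAcc-mean()-X` = c(0.6, 0.8),
  `tBodyAcc-std()-X` = c(0.5, 0.7),
  `tBodyAcc-max()-X` = c(0.7, 0.6)
)
# Hypothetical lookup: activity code i maps to activity_labels[i].
activity_labels <- c("WALKING", "WALKING_UPSTAIRS")

# 1. Merge the training and test sets into one data set.
merged <- rbind(train, test)

# 2. Keep only the mean() and std() measurements (plus the id columns).
keep <- grep("mean\\(\\)|std\\(\\)", names(merged), value = TRUE)
selected <- merged[, c("subject", "activity", keep), with = FALSE]

# 3. Replace activity codes with descriptive activity names.
selected[, activity := factor(activity,
                              levels = seq_along(activity_labels),
                              labels = activity_labels)]

# 4. Label the columns (assumed rule: drop parentheses and dashes).
setnames(selected, keep, gsub("[()-]", "", keep))

# 5. Average every remaining variable per subject and activity.
tidy <- selected[, lapply(.SD, mean), by = .(subject, activity)]
print(tidy)
```

In this sketch `tidy` has one row per subject and activity pair that occurs in the data, which is the shape the last step asks for.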
- Download the data source and extract it on your local drive. Note that a folder `UCI HAR Dataset` will be created.
- The script `run_analysis.R` should be placed in the parent folder of `UCI HAR Dataset`.
- Set the parent folder of `UCI HAR Dataset` as your working directory using the `setwd()` function.
- Run `source("run_analysis.R")`; it will generate a new file `tiny_data.txt` in your working directory.
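
Before running the script, it may help to confirm that the working directory has the layout described above. The following is a small sketch; the helper name `has_expected_layout` is hypothetical and is not part of `run_analysis.R`.

```r
# Hypothetical helper: TRUE when `dir` holds both the script and the
# extracted "UCI HAR Dataset" folder side by side.
has_expected_layout <- function(dir = getwd()) {
  file.exists(file.path(dir, "run_analysis.R")) &&
    dir.exists(file.path(dir, "UCI HAR Dataset"))
}

if (has_expected_layout()) {
  # Run the analysis, then report whether the output file appeared.
  source("run_analysis.R")
  print(file.exists("tiny_data.txt"))
} else {
  message("Use setwd() to point at the parent folder of 'UCI HAR Dataset'.")
}
```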
- `data.table`: a powerful library for fast aggregation of large data. You can install the package by executing `install.packages("data.table")`.
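
To avoid reinstalling the package on every run, the install can be made conditional. This is one possible sketch; the helper name `missing_packages` is an assumption for illustration.

```r
# Hypothetical helper: return the packages from `pkgs` that are not installed.
missing_packages <- function(pkgs) {
  installed <- vapply(pkgs, requireNamespace, logical(1), quietly = TRUE)
  pkgs[!installed]
}

# Install data.table only when it is not already available, then load it.
to_install <- missing_packages("data.table")
if (length(to_install) > 0) {
  install.packages(to_install)
}
library(data.table)
```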