The R script called run_analysis.R performs the following steps: 1 Merges the training and the test sets to create one data set. 2 Extracts only the measurements on the mean and standard deviation for each measurement. 3 Uses descriptive activity names to name the activities in the data set 4 Appropriately labels the data set with descriptive variable names. 5 From the data set in step 4, creates a second, independent tidy data set with the average of each variable for each activity and each subject.
Run source("run_analysis.R”):
-
It downloads the data source https://d396qusza40orc.cloudfront.net/getdata%2Fprojectfiles%2FUCI%20HAR%20Dataset.zip , unzips the file and and puts the content into a folder on your local drive. A UCI HAR Dataset folder will be created for the content
-
It installs and reads the libraries : data.table and dplyr
-
It reads the metadata in UCI HAR Dataset
-
It reads the Test and Train data
-
Both data sets are merged
-
The mean and standard deviation are extracted
-
Dataset activity names are renamed
-
Tidy data file is created (tiny_data.txt) in your working directory