Skip to content

Latest commit

 

History

History
48 lines (37 loc) · 1.23 KB

notebook_tidyr.md

File metadata and controls

48 lines (37 loc) · 1.23 KB

R Notebook - Data Wrangling - tidyr

tidyr agenda

--Basics
--gather()
--separate()
--arrange()
--mutate()
--summarize()
--group_by()


Prerequisites

library(dplyr)
library(tidyr)
library(ggplot2)

Basics

tibble <- tbl_df(diamonds) #Convert diamonds dataset into tbl_df
head(tibble) #View first 6 rows
## # A tibble: 6 × 10
##   carat       cut color clarity depth table price     x     y     z
##   <dbl>     <ord> <ord>   <ord> <dbl> <dbl> <int> <dbl> <dbl> <dbl>
## 1  0.23     Ideal     E     SI2  61.5    55   326  3.95  3.98  2.43
## 2  0.21   Premium     E     SI1  59.8    61   326  3.89  3.84  2.31
## 3  0.23      Good     E     VS1  56.9    65   327  4.05  4.07  2.31
## 4  0.29   Premium     I     VS2  62.4    58   334  4.20  4.23  2.63
## 5  0.31      Good     J     SI2  63.3    58   335  4.34  4.35  2.75
## 6  0.24 Very Good     J    VVS2  62.8    57   336  3.94  3.96  2.48
dim(tibble) #Check the dimensions of the dataset
## [1] 53940    10