Skip to content

Latest commit

 

History

History
114 lines (81 loc) · 5.9 KB

readme.md

File metadata and controls

114 lines (81 loc) · 5.9 KB

The Global Human Day

The data this week comes from the The Human Chronome Project an initiative based at McGill University in Montreal, from their paper The global human day in PNAS and the associated dataset on Zenodo.

The daily activities of ≈8 billion people occupy exactly 24 h per day, placing a strict physical limit on what changes can be achieved in the world. These activities form the basis of human behavior, and because of the global integration of societies and economies, many of these activities interact across national borders. This project estimates how all humans spend their time using a generalized, physical outcome–based categorization that facilitates the integration of data from hundreds of diverse datasets.

See their supplementary materials for details about their methods and additional visualizations.

The Zenodo dataset includes the input data and scripts used to create the datasets used in the paper. The datasets are from the outputData file "all_countries.csv", "global_human_day.csv", "global_economic_activity.csv" and inputData "country_regions.csv". The outputData files are aggregated output data from data collected, created from the scripts in the 'scripts' directory.

h/t Data is Plural 2023-07-13 newsletter for the dataset.

The Data

# Option 1: tidytuesdayR package 
## install.packages("tidytuesdayR")

tuesdata <- tidytuesdayR::tt_load('2023-09-12')
## OR
tuesdata <- tidytuesdayR::tt_load(2023, week = 37)

all_countries <- tuesdata$all_countries
country_regions <- tuesdata$country_regions
global_human_day <- tuesdata$global_human_day
global_economic_activity <- tuesdata$global_economic_activity

# Option 2: Read directly from GitHub

all_countries <- readr::read_csv('https://raw.githubusercontent.com/rfordatascience/tidytuesday/main/data/2023/2023-09-12/all_countries.csv')
country_regions <- readr::read_csv('https://raw.githubusercontent.com/rfordatascience/tidytuesday/main/data/2023/2023-09-12/country_regions.csv')
global_human_day <- readr::read_csv('https://raw.githubusercontent.com/rfordatascience/tidytuesday/main/data/2023/2023-09-12/global_human_day.csv')
global_economic_activity <- readr::read_csv('https://raw.githubusercontent.com/rfordatascience/tidytuesday/main/data/2023/2023-09-12/global_economic_activity.csv')

How to Participate

  • Explore the data, watching out for interesting relationships. We would like to emphasize that you should not draw conclusions about causation in the data. There are various moderating variables that affect all data, many of which might not have been captured in these datasets. As such, our suggestion is to use the data provided to practice your data tidying and plotting techniques, and to consider for yourself what nuances might underlie these relationships.
  • Create a visualization, a model, a shiny app, or some other piece of data-science-related output, using R or another programming language.
  • Share your output and the code used to generate it on social media with the #TidyTuesday hashtag.

Data Dictionary

all_countries.csv

variable class description
Category character M24 categories
Subcategory character M24 subcategories
country_iso3 character Country code in iso3
region_code character Region code
population double Population
hoursPerDayCombined double Hours per day combined for the country
uncertaintyCombined double Uncertainty combined. Uncertainty is in units variance.

country_regions.csv

variable class description
region_code character Region code
region_name character Region name
country_name character Country name
M49_code double M49 code
country_iso2 character Country code in iso2
country_iso3 character Country code in iso3
alt_country_name character Alternative country name
alt_country_name1 character Alternative country name 1
alt_country_name2 character Alternative country name 2
alt_country_name3 character Alternative country name 3
alt_country_name4 character Alternative country name 4
alt_country_name5 character Alternative country name 5
alt_country_name6 character Alternative country name 6
other_code1 character Other country code 1
other_code2 character Other country code 2

global_human_day.csv

variable class description
Subcategory character M24 subcategory
hoursPerDay double Hours per day for all countries
uncertainty double Uncertainty in units variance.

global_economic_activity.csv

variable class description
Subcategory character M24 subcategory
hoursPerDay double Hours per day for all countries.
uncertainty double Uncertainty in units variance.

Cleaning Script

library(tidyverse)

# Read in the data file all_countries.csv from https://zenodo.org/record/8040631

all_countries <- read_csv("all_countries.csv")

# Change variable name to be consistent between files

colnames(all_countries)[3] <- "country_iso3"

# Remove columns on data status

all_countries = subset(all_countries, select = -c(dataStatus,dataStatusEconomic))

# write out data
readr::write_csv(
  all_countries, "all_countries.csv")