Marble Racing

The data this week comes from Jelle's Marble Runs courtesy of Randy Olson.

Randy's blog post covers some additional analysis.

Jelle's Marble Runs started as a quirky YouTube channel back in 2006 and has refined the art of marble racing to the point that many — including sponsor John Oliver from Last Week Tonight — consider marble racing a legitimate contender for the national sports spotlight. Given that Jelle's Marble Runs just completed their popular Marbula One competition last month, I was curious to look at the race results to see if these races were anything more than chaos.

Do some marbles race better than others? Who would I put my money on in season 2 of Marbula One? ... If any of these questions interest you, read on and I'll answer some of them.

The first step to answering these questions was to get some data. Thankfully, all of the Marbula One videos are organized in a YouTube playlist available here. From every race, my marble racing analytics team recorded each marble racer's qualifier performance, total race time, average lap time, final rank, and some other statistics. That dataset is available for download on my website here.

Additional context is available from the fandom Wiki for Jelle's Marble Runs, along with a link to Season 1, courtesy of Georgios Karamanis.

Spotlight from John Oliver on Last Week Tonight, courtesy of Dennis Hammerschmidt.

Get the data here

# Get the Data

marbles <- readr::read_csv('https://raw.githubusercontent.com/rfordatascience/tidytuesday/main/data/2020/2020-06-02/marbles.csv')

# Or read in with the tidytuesdayR package (https://github.com/dslc-io/tidytuesdayR)
# Install via pak::pak("dslc-io/tidytuesdayR")

# Either an ISO-8601 date or year/week works; only one of the two calls below is needed.
tuesdata <- tidytuesdayR::tt_load('2020-06-02')
tuesdata <- tidytuesdayR::tt_load(2020, week = 23)

marbles <- tuesdata$marbles

Data Dictionary

`marbles.csv`

| variable | class | description |
|:---|:---|:---|
| date | character | Date of race |
| race | character | Race id |
| site | character | Site of race |
| source | character | YouTube URL |
| marble_name | character | Name of marble |
| team_name | character | Team name |
| time_s | double | Time in seconds |
| pole | character | Pole position |
| points | double | Points gained |
| track_length_m | double | Track length in meters |
| number_laps | double | Number of laps |
| avg_time_lap | double | Average lap time |
| host | character | Host of race |
| notes | character | Notes (very few, but some notes about potential errors) |
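Several columns in the dictionary above are stored as text even though they encode a position or a date. The following is a minimal sketch of how they might be converted, not part of the original cleaning script. It assumes that `pole` values look like `"P1"` and that `date` values look like `"15-Feb-20"`; both formats are assumptions, and the toy rows are made up for illustration. It also recomputes a per-lap time from `time_s` and `number_laps`, on the assumption that `avg_time_lap` is defined that way.

```r
# Toy rows for illustration only; the values are made up, not taken from marbles.csv.
toy <- data.frame(
  date = c("15-Feb-20", "16-Feb-20"),
  pole = c("P1", NA),
  time_s = c(28.11, 343.2),
  number_laps = c(1, 10),
  stringsAsFactors = FALSE
)

# Strip the assumed "P" prefix to get an integer position; NA stays NA.
toy$pole_position <- as.integer(sub("^P", "", toy$pole))

# Parse the assumed day-month-year format.
# Note: %b matches abbreviated month names in the current locale (English here).
toy$race_date <- as.Date(toy$date, format = "%d-%b-%y")

# Seconds per lap, recomputed from total time and lap count.
toy$lap_time_check <- toy$time_s / toy$number_laps

print(toy)
```

On the real data, comparing `lap_time_check` against `avg_time_lap` would be a quick way to test whether that assumption holds.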

`skimr`

── Data Summary ────────────────────────
                           Values 
Name                       marbles
Number of rows             256    
Number of columns          14     
_______________________           
Column type frequency:            
  character                9      
  numeric                  5      
________________________          
Group variables                   

── Variable type: character ────────────────────────────────────────────────────────
  skim_variable n_missing complete_rate   min   max empty n_unique whitespace
1 date                  0        1          8     9     0       16          0
2 race                  0        1          4     4     0       16          0
3 site                  0        1          7    15     0        8          0
4 source                0        1         34    34     0       16          0
5 marble_name           0        1          4     9     0       32          0
6 team_name             0        1          6    16     0       16          0
7 pole                128        0.5        2     3     0       16          0
8 host                  0        1          2     3     0        2          0
9 notes               249        0.0273    37   100     0        7          0

── Variable type: numeric ──────────────────────────────────────────────────────────
  skim_variable  n_missing complete_rate   mean      sd hist 
1 time_s                 3         0.988 191.   169.    ▇▁▁▇▁
2 points               128         0.5     6.45   7.74  ▇▂▂▁▁
3 track_length_m         0         1      13.2    0.952 ▅▅▂▁▇
4 number_laps            0         1       6.25   5.53  ▇▁▃▂▂
5 avg_time_lap           3         0.988  29.7    5.55  ▃▆▇▇▂
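The summary reports `pole` and `points` as each missing in 128 of 256 rows (a complete rate of 0.5). The `n_missing` and `complete_rate` columns can be reproduced in base R, as in the sketch below. The toy data frame and its `race` labels are hypothetical; the pattern it shows, where `pole` is filled in for some rows and `points` for the others, is only one possible reading of the summary and is not confirmed here.

```r
# Hypothetical toy data: two rows with a pole position, two rows with points.
toy <- data.frame(
  race = c("S1Q1", "S1Q1", "S1R1", "S1R1"),
  pole = c("P1", "P2", NA, NA),
  points = c(NA, NA, 25, 18),
  stringsAsFactors = FALSE
)

# Count missing values per column, and the share of non-missing values.
n_missing <- colSums(is.na(toy))
complete_rate <- colMeans(!is.na(toy))

missing_summary <- data.frame(
  variable = names(toy),
  n_missing = n_missing,
  complete_rate = complete_rate,
  row.names = NULL
)

print(missing_summary)
```

Running the same two calls on `marbles` should give the per-column counts that `skimr::skim()` reports.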

Cleaning Script

library(tidyverse)
library(skimr)
library(janitor)


marbles <- read_csv("2020/2020-06-02/Jelles-Marble-Racing-Marbula-One.csv") %>% 
  janitor::clean_names() %>% 
  select(-x14) %>% 
  rename(notes = x15)

skimr::skim(marbles)

marbles %>% 
  write_csv("2020/2020-06-02/marbles.csv")
