Skip to content

hudcostreets/path

Repository files navigation

PATH ridership stats

Cleaned + Plotted Port Authority data from https://www.panynj.gov/path/en/about/stats.html

Cleaned data

Jan 2012 – Oct 2023

PATH weekday ridership over time, stacked by station

Closer look at 2020-Present

PATH weekday ridership over time, stacked by station, 2020 to 2022-09; 275k in Jan/Feb 2020, large drop, almost to zero, in April 2020, steadily climbing back to 150k

Weekends only

PATH weekend ridership over time, stacked by station

Weekends (2020-Present)

PATH Saturday ridership over time, stacked by station, September 2022 has surpassed January/February 2020

Weekdays, Grouped by Month

Weekends, Grouped by Month

Weekdays vs. Weekends

Weekdays vs. Weekends, compared to 2019

As of September 2024:

  • weekday ridership was 69.2% of Sept '19 (pre-COVID)
  • weekend ridership was 99.5% of Sept '19 (pre-COVID)

Methods

PATH Monthly Data

1. Download "PATH Ridership Reports (By Month)"

From www.panynj.gov/path/en/about/stats.html, to data/:

y=2024
name=$y-PATH-Monthly-Ridership-Report.pdf
wget -O data/$name https://www.panynj.gov/content/dam/path/about/statistics/$name

name=$y-PATH-Hourly-Ridership-Report.pdf
wget -O data/$name https://www.panynj.gov/content/dam/path/about/statistics/$name

1b. Use Tabula to extract tables

This only has to be done once, the resulting templates are saved in templates/.

Selecting tables from a "PATH Ridership Report"

2. Process each year's data, output .pqts

See:

y=2024; juq papermill run -p year=$y monthly.ipynb out/monthly-$y.ipynb

3. Combine all years' data

Bridge & Tunnel Data

Merge per-year PDFs into one:

/opt/homebrew/bin/gs \
  -o merged.pdf \
  -sDEVICE=pdfwrite \
  -dPDFFitPage \
  -g12984x10033 \
  -dPDFSETTINGS=/prepress \ 
  traffic-e-zpass-usage-20*

cf. SO.

About

PATH train ridership stats

Resources

Stars

Watchers

Forks

Languages