Cleaned + Plotted Port Authority data from https://www.panynj.gov/path/en/about/stats.html
As of September 2024:
- weekday ridership was 69.2% of Sept '19 (pre-COVID)
- weekend ridership was 99.5% of Sept '19 (pre-COVID)
From www.panynj.gov/path/en/about/stats.html, to data/
:
y=2024
name=$y-PATH-Monthly-Ridership-Report.pdf
wget -O data/$name https://www.panynj.gov/content/dam/path/about/statistics/$name
name=$y-PATH-Hourly-Ridership-Report.pdf
wget -O data/$name https://www.panynj.gov/content/dam/path/about/statistics/$name
1b. Use Tabula to extract tables
This only has to be done once, the resulting templates are saved in templates/
.
See:
monthly.ipynb
- outputs in
data/*.pqt
y=2024; juq papermill run -p year=$y monthly.ipynb out/monthly-$y.ipynb
- See
months.ipynb
- Output
data/all.pqt
,data/all.xlsx
,img/weekdays.png
Merge per-year PDFs into one:
/opt/homebrew/bin/gs \
-o merged.pdf \
-sDEVICE=pdfwrite \
-dPDFFitPage \
-g12984x10033 \
-dPDFSETTINGS=/prepress \
traffic-e-zpass-usage-20*
cf. SO.