Replication Notes

Fork the repo in it's entirety
(If given access) load the CSS data in a parallel folder to the main project folder 311_bias, like this:
- 311_bias/
  - Data/
  - Writing/
  - Code/
  - etc.
- 311_protected_data/
  - KCMODF_ConsolidatedData_FY13_FY20.csv
  - KCMODF_ConsolidatedData_FY13_FY20.xlsx
To recreate the PDF, just run the .Rmd file and ensure the \extras folder is in the right place (Writing\extras\ ... )
To replicate the figures and analysis, do the following in order:

Run \Code\kc_data_pull.R
- Outputs: 311 data, prop viol data, census data (saved \Data\file_xyz.csv.gz)
- Requirements: Census API key for use with tidycensus
Run \Code\kc_data_clean.R
- Outputs: df_311_viol_full (keep this in namespace envir)
Run \Code\summ_stats.R
- Outputs: percentages and descriptive splits for 311-violation conversions
Run \Code\maps.R
- Outputs: The maps
- Note: Be careful with ggsave, this file takes a long time (1GB gg objects)
Run \Code\model.R
- Outputs: Tables and Figure with results (ROC, descr stat, vip)
- Requirements: Properly loaded the protected data
- Note: Can take a while... I ran it chunk by chunk

If replicating from scratch (just using datasets), this should set up all the appropriate tables and files so the Writing\311_bias_report.Rmd can be run.

That should be it ?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

replication_notes.md

replication_notes.md

Replication Notes

Files

replication_notes.md

Latest commit

History

replication_notes.md

File metadata and controls

Replication Notes