Resources - housing-data (provided folder) - data_dictionary_1-CA or data_dictionary_1-FL - CSV containing descriptions, sources, and years for information in data_1 - data_1-CA.csv + data_1-FL.csv - CSV containing geographically organized population data (both states have the same fields) - fields with null data: loan_amount, median_mortgage_amount, median_prop_value, median_sba504_loan_amount, median_sba7a_loan_amount, num_mortgage, num_mortgage_denials, num_mortgage_denials, num_mortgage_originated, number_of_sba504_loans, number_of_sba7a_loans - fields with untrustworthy data: some large value negatives - not yet investigated thoroughly - redundant fields: 'geoid_year','state','state_fips_code', 'county_fips_code' - Field Description Summary: Signals of economic difficulty - economic_distress_pop_agg: 20+% of households in the census tract are very low-income renters or owners who pay more than half their income for housing, population weighted - economic_distress_simple_agg: 20+% of households in the census tract are very low-income renters or owners who pay more than half their income for housing, NOT population weighted - investment_areas: geographic area with (1) population poverty rate of 20+%; (2) or unemployment rate of at least 1.5 times the national average; (3) or for a metropolitan area has a median family income (mfi) at or below 80 percent of the greater of either the metropolitan mfi or national metropolitan mfi; (4) or for a non-metropolitan area that has a mfi at or below 80 percent of the greater of either the statewide non-metropolitan mfi or national non-metropolitan mfi; (5) or is wholly located within an empowerment zone or enterprise community; (6) or has a county population loss greater than or equal to ten percent for metro areas or five percent for non-metro areas. - num_mortgage: number of home mortgage loans reported - num_mortgage_denials: number of home mortgage loans denied - num_mortgage_originated: number of home loans originated - opzone: census tracts identified in areas of economic distress to promote investment - expressed as boolean 1/0 - b19083_001e: Gini Index of Income Inequality - Estimate About the homes - median_mortgage_amount: median home mortgage amount - median_prop_value: median home property value amount About demographics - "B Series" (ex. b23025_002e) employment percentages (ex. total in labor force) (9 rows) - "s0101 Series" (ex. s0101_c01_032e) census data organizing population by age, race, sex, etc. (78 rows) - "dp05 Series" (ex. dp05_0035pe) census data organized by race (7 rows) - "s2001 Series" (ex. s2001_c01_002e) median earnings total vs split by sex (6 rows) - "s1903 Series" (ex. s1903_c03_001e) median income total vs split by race (28 rows) - "s1701 Series" (ex. s1701_c03_001e) percentage of people below poverty level total vs split by age, race (40 rows) - "s2701 Series" (ex. s2701_c03_001e) health insurance coverages split by race, sex, etc (94 rows) Other - SBA loan information (ex. number_of_sba504_loans) small business loan issued (4 rows) - loan_amount: original loan/investment amount (I'm unclear if this is for home loans) - data_dictionary_2-CA.csv (No FL equivalent provided) - CSV containing descriptions, sources, and years for information in data_2 - data_2-CA.csv (No FL equivalent provided) - CSV containing geographically organized information about health and natural hazard risks provided mostly by the EPA - The "hazard data" are primarily 2023 data from the EPA. The rest of the data are from more varied sources & time periods Housing/Demographics - "B Series": employment percentages, 4 rows, ex. b23025_003e - "S2503 Series" - occupied housing organized by total, rented, owned homes, 6 rows, ex. s2503_c01_024e - Energy Burden: percentage of gross household income spent on energy costs, 2 rows, ex. energy_burden Hazard Data - Air Quality - Air Cancer: air toxic cancer risk info, 6 rows, ex. p_d2_cancer - Respiratory Tract Toxins: air toxics respiratory risk info, 6 rows, ex. d2_resp - Air Toxins: toxic releases to air, 6 rows, ex. d2_rsei_air - diesel: Diesel particulate matter risk info, 6 rows, ex. d2_dslpm - Lead Paint: - EPA lead paint risk info, 5 rows, ex. d2_ldpnt - EJScreen lead paint, 2 rows, ex. pre1960 - ozone: ozone risk info, 6 rows, ex. d2_ozone - particular matter: particulate matter risk, 6 rows, ex. d2_pm25 - Superfund proximity: measures how close people might live to Superfund sites, 6 rows, ex. d2_pnpl - Superfund programs: 2014 program to identify, clean up, and return contaminated sites to productive use) - RMP facility proximity: measures how close people might live to RMP facilities, 6 rows, ex. d2_prmp - RMP facility: facilities that require a Risk Management Program because they use hazardous substances - Hazardous Waste: hazardous waste proximity, 6 rows, ex. d2_ptsdf - Traffic proximity: measures how close people live to high traffic areas, 6 rows, ex. d2_ptraf - Wastewater Discharge: wastewater discharge risk info, 6 rows, ex. d2_pwdis - Underground Storage Tanks: underground storage tank exposure, 6 rows, ex. d2_ust Expected Losses due to Natural Hazards: - Agricultural: 2 rows, ex. expected_agricultural_loss_rate_natural_hazards_risk_index - Building Loss: 2 rows, ex. expected_building_loss_rate_natural_hazards_risk_index - Population Loss: 2 rows, ex. expected_population_loss_rate_natural_hazards_risk_index - Fire: 2 rows, ex. share_of_properties_at_risk_of_fire_in_30_years - Flood: 2 rows, ex. share_of_properties_at_risk_of_flood_in_30_years Other: QCT - Qualified Census Tract Indictor, 1 row, 'qct' - just full of zeros pre1960 - associated with lead paint exposure, but could also be used to look at housing age (probably we have better sources for this)