Skip to content

Latest commit

 

History

History
executable file
·
36 lines (32 loc) · 3.47 KB

README.md

File metadata and controls

executable file
·
36 lines (32 loc) · 3.47 KB

README

The data folder hosts the course datasets. It misses Sessions 5–7 as well as little things scraped from Wikipedia (which are not saved) and extra class exercises like the Rheinart and Rogoff replication. Large files are marked with their sizes.

The data are downloaded and immediately saved to comma-separated values (.csv). A few files are compressed or saved in tab-separated values. Some files (marked ) required preprocessing or authentification to download, and are self-contained to the course.

name description Session
grades.2012 Anonymized student grades from an Autumn 2012 math course 2.2
nhis.2005 Data extract from the U.S. National Health Interview Survey, 2005 2.2
us.recessions.4807 BLS employment figures for NBER recession months, 1948-2013 3.1
browsers.0813 StatCounter browser usage shares 2008-2013, scraped from Wikipedia 3.3 (practice)
wdi.govdebt.0511 World Bank central governmental debt for 5 states, 2005-2011 4.0
dailykos.votes.0812 U.S. 2008-2012 county-level vote shares from Daily Kos 4.1
qog.cs.zip Quality of Government Standard cross-section, 15 May 2013 4.1, 5.3, 8.2, 10.0
fide.zip FIDE ratings for chess players, scraped from its website, 2013 4.1
htus8008.zip Bureau of Justice Statistics on U.S. homicide trends, 1980-2008 4.2
schiller.8712 Standard and Poor's Case-Schiller Index for U.S. cities, 1987-2012 4.2
bshor.congress.2012 Boris Shor's U.S. Congressional ideal estimated points, 2012 4.3 (practice)
dwnominate.zip DW-NOMINATE scores for all U.S. Congressional sessions (2.7MB) 4.3 (practice)
Sessions 5--7 are not listed.
olympics.2012 Olympics 100 m sprinters data from Markus Gessman 8.0
oecd.bli.2011.tsv OECD Better Life Index, 2011 8.1
bartels.presvote.4812 Larry Bartel's data for a U.S. presidential election plot 8.2
imf.weo.2012 IMF WEO 2012 replication data by Chris Giles 8.3 (practice)
icm.polls.8413 Guardian/ICM polls, 1984-2013 9.0
beijing.pollution Beijing pollution information by the U.S. Embassy on Twitter, 2013 9.1
geos.tww GEOS episode ratings for Aaron Sorkin's The West Wing 9.2
piketty.saez.2011.zip Piketty & Saez data on U.S. income inequality, 1913-2012 9.3 (practice)
wb.projects.zip World Bank geocoded aid projects in Africa (1.9MB) 10.1
wasserman.votes.0812 U.S. state-level 2008-2012 vote shares, from David Wasserman 10.2
ga.network Network of characters in Grey's Anatomy, by Gary Weissman 11.0
twitter.an.zip Twitter network data for French MPs in (early) 2013 11.1
assange.txt Speech by Julian Assange in 2012 11.2
doctorow.txt Speech by Corey Doctorow in 2011 11.2