ipumsr provides an R interface for handling IPUMS data, allowing users to:
-
Easily read files downloaded from the IPUMS extract system
-
Clean and prepare data using the contextual information contained in the variable-level metadata that is included with many IPUMS files
-
Submit requests for data and download files through the IPUMS API
IPUMS is the world’s largest publicly available individual-level population database, providing census and survey data from around the world integrated across time and space. IPUMS integration and documentation makes it easy to study change, conduct comparative research, merge information across data types, and analyze individuals within family and community context. Data and services are available free of charge.
IPUMS consists of multiple projects, or collections, that provide different data products. Microdata projects distribute data for individual survey units, like people or households. Aggregate data projects distribute aggregate statistics calculated from microdata for particular geographic units.
Project | Data Type | Description |
---|---|---|
IPUMS USA | Microdata | U.S. Census and American Community Survey microdata (1850-present) |
IPUMS CPS | Microdata | Current Population Survey microdata including basic monthly surveys and supplements (1962-present) |
IPUMS International | Microdata | Census microdata covering over 100 countries, contemporary and historical |
IPUMS Time Use | Microdata | Time use microdata from the U.S. (1930-present) and thirteen other countries (1965-present) |
IPUMS Health Surveys | Microdata | Microdata from the U.S. National Health Interview Survey (NHIS) (1963-present) and Medical Expenditure Panel Survey (MEPS) (1996-present) |
IPUMS Global Health | Microdata | Health survey microdata for low- and middle-income countries, including harmonized data collections for Demographic and Health Surveys (DHS) and Performance Monitoring for Action (PMA) surveys |
IPUMS Higher Ed | Microdata | Survey microdata on the science and engineering workforce in the U.S. from 1993 to 2013 |
IPUMS NHGIS | Aggregate Data | Tabular U.S. Census data and GIS boundary files (1790-present) |
IPUMS IHGIS | Aggregate Data | Tabular and GIS data from population, housing, and agricultural censuses around the world |
To install the package from CRAN, use
install.packages("ipumsr")
To install the development version of the package, use
remotes::install_github("ipums/ipumsr")
The package vignettes are a great place to learn more about ipumsr and IPUMS data:
The IPUMS support website also houses many project-specific exercises.
-
The survey and srvyr packages can help you incorporate IPUMS survey weights into your analysis for various survey designs.
-
See haven for more information about value labels and labelled vectors
-
hipread underlies the hierarchical file reading functions in ipumsr
We greatly appreciate feedback and development contributions. Please submit any bug reports, pull requests, or other suggestions on GitHub. Before contributing, please be sure to read the Contributing Guidelines and the Code of Conduct.
If you have general questions or concerns about IPUMS data, check out our user forum or send an email to ipums@umn.edu.