National Neighborhood Data Archive (NaNDA): Socioeconomic and Demographic Characteristics of Census Tracts and ZIP Code Tabulation Areas, United States, 2000-2017 (ICPSR 38528)
Version Date: Sep 15, 2022 View help for published
Principal Investigator(s): View help for Principal Investigator(s)
Robert Melendez, University of Michigan. Institute for Social Research;
Philippa Clarke, University of Michigan. Institute for Social Research;
Anam Khan, University of Michigan. Institute for Social Research;
Iris Gomez-Lopez, University of Michigan. Institute for Social Research;
Mao Li, University of Michigan. Institute for Social Research;
Megan Chenoweth, University of Michigan. Institute for Social Research
Series:
https://doi.org/10.3886/ICPSR38528.v1
Version V1 (see more versions)
You are currently viewing an older version of this data collection. A more recent version may be available by selecting (see more versions)
Additional information about this collection can be found in Version History.
2022-09-15 ICPSR data undergo a confidentiality review and are altered when necessary to limit the risk of disclosure. ICPSR also routinely creates ready-to-go data files along with setups in the major statistical software formats as well as standard codebooks to accompany the data. In addition to these procedures, ICPSR performed the following processing steps for this data collection:
- Checked for undocumented or out-of-range codes.
Summary View help for Summary
These datasets contain measures of socioeconomic and demographic characteristics by US census tract for the years 2000-2017 and ZIP code tabulation area (ZCTA) for the years 2008-2017. Example measures include population density; population distribution by race, ethnicity, age, and income; and proportion of population living below the poverty level, receiving public assistance, and female-headed families. The datasets also contain index variables representing neighborhood disadvantage and affluence.
Citation View help for Citation
Export Citation:
Funding View help for Funding
Subject Terms View help for Subject Terms
Geographic Coverage View help for Geographic Coverage
Smallest Geographic Unit View help for Smallest Geographic Unit
County Federal Information Processing System (FIPS)
Distributor(s) View help for Distributor(s)
Time Period(s) View help for Time Period(s)
Date of Collection View help for Date of Collection
Data Collection Notes View help for Data Collection Notes
-
The data and documentation for the Neighborhood Socioeconomic and Demographic Characteristics by Tract, United States, 2000-2010 Data were originally deposited in openICPSR here.
The data and documentation for the Socioeconomic Status and Demographic Characteristics of Census Tracts, United States, 2008-2017 Data were originally deposited in openICPSR here.
The data and documentation for the Socioeconomic Status and Demographic Characteristics of ZIP Code Tabulation Areas, United States, 2008-2017 Data were originally deposited in openICPSR here.
-
A ZIP code to ZCTA crosswalk must be used to combine the ZCTA dataset with ZIP code geocoded data. Such a crosswalk is available on the UDS Mapper website at https://udsmapper.org/zip-code-to-zcta-crosswalk/. Sample code for merging the UDS Mapper crosswalk with NaNDA datasets is available at http://doi.org/10.3886/E124461V2.
Users wanting to combine the 2000-2010 and 2008-2017 census tract-level measures should be aware of key differences between how measures are calculated across the two datasets. For more information, see the "Usage Note" section of the documentation.
- For additional information see the National Neighborhood Data Archive (NaNDA).
Study Purpose View help for Study Purpose
These datasets were created to measure the socioeconomic and demographic characteristics by US census tract for the years 2000-2017 and ZIP code tabulation area (ZCTA) for the years 2008-2017.
Study Design View help for Study Design
Neighborhood Socioeconomic and Demographic Characteristics by Tract, United States, 2000-2010 Data:
Construction of the Neighborhood Socioeconomic Disadvantage and Affluence Variables To construct a set of variables that would characterize the sociodemographic structure of census tracts over time the research team conducted a principal factor analysis with an orthogonal varimax rotation of 10 census indicators (log transformed to correct positive skew) in 2000. The aim of the research team was to derive a parsimonious set of factors that capture the shared variance of a broad spectrum of neighborhood structural characteristics. Results from the factor analysis indicated 3 separate factors:
The first factor, which the research team interprets as neighborhood disadvantage, is characterized by high levels of poverty, unemployment, female-headed families, households receiving public assistance income, and a high proportion of African Americans in a census tract.
The second factor represents a mix of characteristics associated with neighborhood affluence (concentrations of adults with a college education; with incomes>75K; and employed in managerial and professional occupations). Distinguished from other non-disadvantaged census tracts by their large share of high income, highly-educated, adults in professional occupations, affluent census tracts are likely to attract a set of institutions (e.g., food stores, places to exercise, well-maintained buildings and parks) that foster a set of norms (e.g., an emphasis on exercise and healthy diets) conducive to good health (Clarke, Morenoff, Debbink, et al., 2014). Distinct from simply being the absence of neighborhood disadvantage, neighborhood affluence is associated with higher levels of social control and leverage over local institutions that can foster social environments that facilitate health (Browning & Cagney, 2003).
The third factor represents ethnic and immigrant concentration, (higher values indicate more Hispanic and foreign born in the census tract).
Socioeconomic Status and Demographic Characteristics of Census Tracts, United States, 2008-2017 Data:
To construct this dataset, the research team extracted key census indicators related to race, ethnicity, age, income level, employment, poverty, and home ownership from the American Community Survey (ACS) 2012 five-year estimate (covering 2008-2012). The research team merged the variables with the same variables from the ACS 2017 five-year estimate (covering 2013-2017) and with each tract's land area from the 2010 TIGER/Line shapefiles for census tracts. The research team then used those variables to construct three indices as described below: neighborhood disadvantage, neighborhood affluence, and ethnic immigrant concentration.
Construction of the index variables was informed by previous work to construct a set of variables that would characterize the sociodemographic structure of census tracts over time. In 2000, the research team conducted a principal factor analysis with an orthogonal varimax rotation of ten census indicators (log transformed to correct positive skew) (Morenoff et al., 2007). The aim of the research team was to derive a parsimonious set of factors that capture the shared variance of a broad spectrum of neighborhood structural characteristics. Results from the factor analysis indicated three separate factors:
The first factor, which the research team interprets as neighborhood disadvantage, is characterized by high levels of poverty, unemployment, female-headed families, households receiving public assistance income, and a high proportion of African Americans in a census tract.
The second factor represents a mix of characteristics associated with neighborhood affluence (concentrations of adults with a college education; with incomes>75K; and employed in managerial and professional occupations). Distinguished from other non-disadvantaged census tracts by their large share of high income, highly-educated, adults in professional occupations, affluent census tracts are likely to attract a set of institutions (e.g., food stores, places to exercise, well-maintained buildings and parks) that foster a set of norms (e.g., an emphasis on exercise and healthy diets) conducive to good health (Clarke et al., 2014). Distinct from simply being the absence of neighborhood disadvantage, neighborhood affluence is associated with higher levels of social control and leverage over local institutions that can foster social environments that facilitate health (Browning & Cagney, 2003).
The third factor represents ethnic and immigrant concentration. Higher values indicate more Hispanic and foreign born in the census tract.
Socioeconomic Status and Demographic Characteristics of ZIP Code Tabulation Areas, United States, 2008-2017 Data:
To construct this dataset, the research team extracted key census indicators related to race, ethnicity, age, income level, employment, poverty, and home ownership from the ACS 2012 five-year estimate (covering 2008-2012). The research team merged the variables with the same variables from the ACS 2017 five-year estimate (covering 2013-2017) and with each ZIP code tabulation area (ZCTA)'s land area from the 2010 TIGER/Line shapefiles for ZIP code tabulation areas. The research team then used those variables to construct three indices as described below: neighborhood disadvantage, neighborhood affluence, and ethnic immigrant concentration.
Construction of the index variables was informed by previous work to construct a set of variables that would characterize the sociodemographic structure of census tracts over time. In 2000, the research team conducted a principal factor analysis with an orthogonal varimax rotation of ten census indicators (log transformed to correct positive skew) (Morenoff et al., 2007). The aim of the research team was to derive a parsimonious set of factors that capture the shared variance of a broad spectrum of neighborhood structural characteristics. Results from the factor analysis indicated three separate factors:
The first factor, which the research team interprets as neighborhood disadvantage, is characterized by high levels of poverty, unemployment, female-headed families, households receiving public assistance income, and a high proportion of African Americans in a census tract.
The second factor represents a mix of characteristics associated with neighborhood affluence (concentrations of adults with a college education; with incomes>75K; and employed in managerial and professional occupations). Distinguished from other non-disadvantaged census tracts by their large share of high income, highly-educated, adults in professional occupations, affluent census tracts are likely to attract a set of institutions (e.g., food stores, places to exercise, well-maintained buildings and parks) that foster a set of norms (e.g., an emphasis on exercise and healthy diets) conducive to good health (Clarke et al., 2014). Distinct from simply being the absence of neighborhood disadvantage, neighborhood affluence is associated with higher levels of social control and leverage over local institutions that can foster social environments that facilitate health (Browning & Cagney, 2003).
The third factor represents ethnic and immigrant concentration. Higher values indicate more Hispanic and foreign born in the census tract.
Time Method View help for Time Method
Universe View help for Universe
Census tracts and ZIP code tabulation areas in the United States, including Puerto Rico.
Unit(s) of Observation View help for Unit(s) of Observation
Data Source View help for Data Source
The data and documentation for the Neighborhood Socioeconomic and Demographic Characteristics by Tract, United States, 2000-2010 Data were originally deposited in openICPSR here.
The data and documentation for the Socioeconomic Status and Demographic Characteristics of Census Tracts, United States, 2008-2017 Data were originally deposited in openICPSR here.
The data and documentation for the Socioeconomic Status and Demographic Characteristics of ZIP Code Tabulation Areas, United States, 2008-2017 Data were originally deposited in openICPSR here.
Census tract-level measures, 2000-2010: Population values and percentage variables per census tract are from the 2000 Census of Population and Housing Summary and the American Community Survey five-year estimate for 2008-2012. Census tract boundaries were normalized to the 2010 tract boundaries using the Longitudinal Tract Data Base described in Logan, Xu, and Stults 2014 (http://dx.doi.org/10.1080/00330124.2014.905156).
ZCTA-level measures, 2008-2017: Population values and demographic proportions per ZCTA are from the American Community Survey five-year estimates for 2008-2012 and 2013-2017. ZCTA boundaries are based on the 2019 version of the US TIGER/Line shapefiles.
Census tract-level measures, 2008-2017: Population values and percentage variables per census tract are from the American Community Survey five-year estimates for 2008-2012 and 2013-2017. Both are based on census tract boundaries as of 2010. Land area for each census tract comes from the TIGER/Line shapefiles, 2010 version.
Data Type(s) View help for Data Type(s)
HideOriginal Release Date View help for Original Release Date
2022-09-15
Version History View help for Version History
2022-09-15 ICPSR data undergo a confidentiality review and are altered when necessary to limit the risk of disclosure. ICPSR also routinely creates ready-to-go data files along with setups in the major statistical software formats as well as standard codebooks to accompany the data. In addition to these procedures, ICPSR performed the following processing steps for this data collection:
- Checked for undocumented or out-of-range codes.