National Neighborhood Data Archive (NaNDA): Polluting Sites by Census Tract and ZIP Code Tabulation Area, United States, 1987-2021 (ICPSR 38597)

Version Date: Dec 4, 2023

Principal Investigator(s):
Jessica M. Finlay, University of Michigan. Institute for Social Research; Robert Melendez, University of Michigan. Institute for Social Research; Longrong Pan, University of Michigan. Institute for Social Research; Michael Esposito, Washington University in St. Louis; Anam Khan, University of Michigan. Institute for Social Research; Mao Li, University of Michigan. Institute for Social Research; Iris Gomez-Lopez, University of Michigan. Institute for Social Research; Philippa Clarke, University of Michigan. Institute for Social Research; Grace A. Noppert, University of Michigan. Institute for Social Research; Megan Chenoweth, University of Michigan. Institute for Social Research; Lindsay Gypin, University of Michigan. Institute for Social Research

https://doi.org/10.3886/ICPSR38597.v2

Version V2

  • V2 [2023-12-04]
  • V1 [2022-11-29] unpublished

This dataset contains yearly counts, from 1987 to 2021, of polluting sites in each United States census tract, plus a 0.5-mile buffer around the tract to capture spillover effects, and in each United States ZIP code tabulation area (ZCTA). Polluting sites are taken from the US Environmental Protection Agency's (EPA) Toxics Release Inventory. These facilities are typically larger and involved in manufacturing, metal mining, electric power generation, chemical manufacturing, and hazardous waste treatment.

Finlay, Jessica M., Melendez, Robert, Pan, Longrong, Esposito, Michael, Khan, Anam, Li, Mao, … Gypin, Lindsay. National Neighborhood Data Archive (NaNDA): Polluting Sites by Census Tract and ZIP Code Tabulation Area, United States, 1987-2021. Inter-university Consortium for Political and Social Research [distributor], 2023-12-04. https://doi.org/10.3886/ICPSR38597.v2

United States Department of Health and Human Services. Administration for Community Living. National Institute on Disability, Independent Living, and Rehabilitation Research (90RTHF0001), United States Department of Health and Human Services. National Institutes of Health. National Institute on Aging (RF1-AG-057540), United States Department of Health and Human Services. National Institutes of Health. National Institute of Nursing Research (U01NR020556), United States Department of Health and Human Services. National Institutes of Health. National Center on Minority Health and Health Disparities (U01NR020556)

census tract and ZIP code tabulation area

Inter-university Consortium for Political and Social Research

1987 -- 2021
2021 (Data file 1 (Polluting Sites by Census Tract Data (2010 Census)) and data file 2 (Polluting Sites by ZIP Code Tabulation Area Data (2010 Census))), 2023 (Data file 3 (Polluting Sites by Census Tract Data (2020 Census)) and data file 4 (Polluting Sites by ZIP Code Tabulation Area Data (2020 Census)))
  1. Data and documentation for data file 1 (Polluting Sites by Census Tract Data (2010 Census)) were originally deposited in openICPSR project 159961.

    Data and documentation for data file 2 (Polluting Sites by ZIP Code Tabulation Area Data (2010 Census)) were originally deposited in openICPSR project 159981.

  2. A ZIP code to ZCTA crosswalk must be used to combine this dataset with ZIP code geocoded data. A crosswalk and sample code for merging the crosswalk with National Neighborhood Data Archive (NaNDA) datasets are available in the ICPSR Linkage Library.
  3. For additional information please see the National Neighborhood Data Archive (NaNDA).
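
The crosswalk step in note 2 can be illustrated with a minimal sketch. It is not the sample code from the ICPSR Linkage Library: the field names (`zip`, `zcta`, `year`, `site_count`) and all values below are hypothetical placeholders, and the real crosswalk and NaNDA files may use different variable names.

```python
# Hypothetical sketch: attach ZCTA-level polluting-site counts to records
# that are geocoded by ZIP code, using a ZIP-to-ZCTA crosswalk.
# All field names and values are made up for illustration.

def attach_zcta_counts(records, zip_to_zcta, zcta_counts):
    """Return copies of `records` with a ZCTA and a site count attached.

    records     : list of dicts with "zip" and "year" keys
    zip_to_zcta : dict mapping ZIP code -> ZCTA code (the crosswalk)
    zcta_counts : dict mapping (ZCTA code, year) -> count of polluting sites
    Records whose ZIP is missing from the crosswalk, or whose ZCTA/year pair
    is missing from the counts, receive None rather than a guessed value.
    """
    merged = []
    for rec in records:
        zcta = zip_to_zcta.get(rec["zip"])
        count = None
        if zcta is not None:
            count = zcta_counts.get((zcta, rec["year"]))
        merged.append({**rec, "zcta": zcta, "site_count": count})
    return merged


# Placeholder data: two ZIP codes map to the same ZCTA, one ZIP is unmatched.
zip_to_zcta = {"00001": "00010", "00002": "00010"}
zcta_counts = {("00010", 2020): 3, ("00010", 2021): 4}
records = [
    {"id": 1, "zip": "00001", "year": 2020},
    {"id": 2, "zip": "00002", "year": 2021},
    {"id": 3, "zip": "99999", "year": 2020},  # not in the crosswalk
]

merged = attach_zcta_counts(records, zip_to_zcta, zcta_counts)
for row in merged:
    print(row)
```

Leaving unmatched records as `None` keeps crosswalk gaps visible so they are not mistaken for areas with zero polluting sites.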

The purpose of this study is to investigate the impact of disamenities, in this case polluting sites, on neighborhood walkability.

The Principal Investigators obtained the latitudes and longitudes of all polluting sites appearing in file type 1A (Facility, Chemical, Releases, and Other Waste Management Summary Information) of the EPA's Toxics Release Inventory (TRI) in reporting years 1987 through 2021. Facilities with ten or more employees that manufacture, process, or use chemicals from a list maintained by the EPA are required to self-report data for the TRI annually (U.S. Environmental Protection Agency, 2021).

ArcGIS Pro was used to create 0.5-mile buffers around all U.S. census tracts. The Principal Investigators then performed a spatial join to assign each polluting site's latitude and longitude to the census tracts (plus buffers) within which it falls. (Polluting locations for which census tracts could not be determined are excluded from this dataset.) A 0.5-mile buffer was used to approximate the effect of sites not only within a neighborhood itself, but also in nearby or adjoining neighborhoods. Finally, the Principal Investigators counted the total number of polluting sites within each census tract plus buffer in each year.
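
The buffer, join, and count steps can be sketched in a toy form. This is not the Principal Investigators' ArcGIS Pro workflow: it assumes tracts are axis-aligned rectangles in a projected coordinate system measured in miles, and the tract IDs and site coordinates are invented. A site is treated as falling in a tract plus buffer when its distance to the rectangle is at most 0.5 miles.

```python
from collections import Counter
from math import hypot

BUFFER_MILES = 0.5  # buffer width stated in the study description


def distance_to_rect(x, y, rect):
    """Distance from point (x, y) to an axis-aligned rectangle.

    rect is (xmin, ymin, xmax, ymax). The distance is 0 for points inside
    the rectangle or on its edge.
    """
    xmin, ymin, xmax, ymax = rect
    dx = max(xmin - x, 0.0, x - xmax)
    dy = max(ymin - y, 0.0, y - ymax)
    return hypot(dx, dy)


def count_sites(tracts, sites, buffer_miles=BUFFER_MILES):
    """Count sites per (tract, year), including sites within the buffer.

    tracts : dict mapping tract ID -> (xmin, ymin, xmax, ymax) in miles
    sites  : iterable of (year, x, y) tuples in the same coordinates
    A site near a shared boundary is counted for every tract whose buffered
    area contains it, which is how spillover is captured.
    """
    counts = Counter()
    for year, x, y in sites:
        for tract_id, rect in tracts.items():
            if distance_to_rect(x, y, rect) <= buffer_miles:
                counts[(tract_id, year)] += 1
    return counts


# Invented example: two adjacent one-mile-square tracts.
tracts = {"A": (0.0, 0.0, 1.0, 1.0), "B": (1.0, 0.0, 2.0, 1.0)}
sites = [
    (2020, 0.6, 0.5),  # inside A and 0.4 miles from B: counted for both
    (2020, 1.9, 0.5),  # inside B and 0.9 miles from A: counted for B only
    (2021, 2.3, 1.3),  # outside both, but within B's buffer near its corner
]

counts = count_sites(tracts, sites)
for key in sorted(counts):
    print(key, counts[key])
```

Real tract boundaries are irregular polygons in geographic coordinates, so an actual replication would need a GIS library or ArcGIS Pro rather than this rectangle shortcut.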

Cross-sectional

Census tracts and ZIP code tabulation areas in the United States, excluding U.S. island territories.

ZIP code tabulation area, census tract

Data file 3 (Polluting Sites by Census Tract Data (2020 Census)): United States Census Bureau. (2021). Index of TIGER/Line shapefiles: All lines.

Data file 2 (Polluting Sites by ZIP Code Tabulation Area Data (2010 Census)): United States Census Bureau (2010). TIGER/Line shapefiles, 2010 ZIP code tabulation areas (2019 version) [Data set].

Data file 4 (Polluting Sites by ZIP Code Tabulation Area Data (2020 Census)): United States Census Bureau. (2021, July 15). Index of TIGER/Line Shapefiles: ZCTA.

Data file 3 (Polluting Sites by Census Tract Data (2020 Census)) and data file 4 (Polluting Sites by ZIP Code Tabulation Area Data (2020 Census)): US EPA. (2023, May 25). TRI Basic Data Files: Calendar Years 1987-Present [Data and Tools].

Data file 1 (Polluting Sites by Census Tract Data (2010 Census)): United States Census Bureau (2010). TIGER/Line shapefiles, 2010 census tracts (2010 version) [Data set].

Data file 1 (Polluting Sites by Census Tract Data (2010 Census)) and data file 2 (Polluting Sites by ZIP Code Tabulation Area Data (2010 Census)): United States Environmental Protection Agency. Toxics Release Inventory Program - TRI Basic Plus Data Files, Calendar Years 1987-Present. File Type 1A: Facility, Chemical, Releases, and Other Waste Management Summary Information, 2018.


2022-11-29

2023-12-04

The years in the study title have been updated from "2000-2018" to "1987-2021".

The file names for data files 1 and 2 have been updated to include the census year.

Data files 3 (Polluting Sites by Census Tract Data (2020 Census)) and 4 (Polluting Sites by ZIP Code Tabulation Area Data (2020 Census)) have been added.

2022-11-29 ICPSR data undergo a confidentiality review and are altered when necessary to limit the risk of disclosure. ICPSR also routinely creates ready-to-go data files along with setups in the major statistical software formats as well as standard codebooks to accompany the data. In addition to these procedures, ICPSR performed the following processing steps for this data collection:

  • Checked for undocumented or out-of-range codes.


Notes

  • The public-use data files in this collection are available for access by the general public. Access does not require affiliation with an ICPSR member institution.