National Neighborhood Data Archive (NaNDA): Polluting Sites by Census Tract and ZIP Code Tabulation Area, United States, 1987-2021 (ICPSR 38597)
Version Date: Dec 4, 2023 View help for published
Principal Investigator(s): View help for Principal Investigator(s)
Jessica M. Finlay, University of Michigan. Institute for Social Research;
Robert Melendez, University of Michigan. Institute for Social Research;
Longrong Pan, University of Michigan. Institute for Social Research;
Michael Esposito, Washington University in St. Louis;
Anam Khan, University of Michigan. Institute for Social Research;
Mao Li, University of Michigan. Institute for Social Research;
Iris Gomez-Lopez, University of Michigan. Institute for Social Research;
Philippa Clarke, University of Michigan. Institute for Social Research;
Grace A. Noppert, University of Michigan. Institute for Social Research;
Megan Chenoweth, University of Michigan. Institute for Social Research;
Lindsay Gypin, University of Michigan. Institute for Social Research
Series:
https://doi.org/10.3886/ICPSR38597.v2
Version V2 (see more versions)
Summary View help for Summary
This dataset contains yearly counts from 1987 to 2021 of polluting sites in each United States census tract and within a 0.5-mile buffer to capture spillover effects and in each United States ZIP code tabulation area. Polluting sites are taken from the US Environmental Protection Agency's (EPA) Toxics Release Inventory. These facilities are typically larger and involved in manufacturing, metal mining, electric power generation, chemical manufacturing, and hazardous waste treatment.
Citation View help for Citation
Export Citation:
Funding View help for Funding
Subject Terms View help for Subject Terms
Geographic Coverage View help for Geographic Coverage
Smallest Geographic Unit View help for Smallest Geographic Unit
census tract and ZIP code tabulation area
Distributor(s) View help for Distributor(s)
Time Period(s) View help for Time Period(s)
Date of Collection View help for Date of Collection
Data Collection Notes View help for Data Collection Notes
-
Data and documentation for data file 1 (Polluting Sites by Census Tract Data (2010 Census)) were originally deposited in openICPSR project 159961.
Data and documentation for data file 2 (Polluting Sites by ZIP Code Tabulation Area Data (2010 Census)) were originally deposited in openICPSR project 159981.
- A ZIP code to ZCTA crosswalk must be used to combine this dataset with ZIP code geocoded data. A crosswalk and sample code for merging the crosswalk with National Neighborhood Data Archive (NaNDA) datasets are available in the ICPSR Linkage Library.
- For additional information please see the National Neighborhood Data Archive (NaNDA).
Study Purpose View help for Study Purpose
The purpose of this study is to investigate the impact of disamenities, in this case polluting sites, on neighborhood walkability.
Study Design View help for Study Design
The Principal Investigators obtained the latitudes and longitudes of all polluting sites appearing in file type 1A (Facility, Chemical, Releases, and Other Waste Management Summary Information) of the EPA's Toxics Release Inventory (TRI) in reporting years 1987 through 2021. Agencies with ten or more employees that manufacture, process, or use chemicals from a list maintained by the EPA are required to self-report data for the TRI annually (U.S. Environmental Protection Agency, 2021).
ArcGIS Pro was used to create 0.5-mile buffers around all U.S. census tracts. The Principal Investigators then performed a spatial join to assign each polluting site's latitude and longitude to the census tracts (plus buffers) within which it falls. (Polluting locations for which census tracts could not be determined are excluded from this dataset.) A 0.5-mile buffer was used to approximate the effect of sites not only within a neighborhood itself, but in nearby or adjoining neighborhoods. The last step was then to count the total number of polluting sites within each census tract plus buffer in each year.
Time Method View help for Time Method
Universe View help for Universe
Census tracts and ZIP code tabulation areas in the United States, excluding U.S. island territories.
Unit(s) of Observation View help for Unit(s) of Observation
Data Source View help for Data Source
Data file 3 (Polluting Sites by Census Tract Data (2020 Census)): United States Census Bureau. (2021). Index of TIGER/Line shapefiles: All lines.
Data file 2 (Polluting Sites by ZIP Code Tabulation Area Data (2010 Census)): United States Census Bureau (2010). TIGER/Line shapefiles, 2010 ZIP code tabulation areas (2019 version) [Data set].
Data file 4 (Polluting Sites by ZIP Code Tabulation Area Data (2020 Census)): United States Census Bureau. (2021, July 15). Index of Tiger/Line Shapefiles: ZCTA.
Data file 3 (Polluting Sites by Census Tract Data (2020 Census)) and data file 4 (Polluting Sites by ZIP Code Tabulation Area Data (2020 Census)): US EPA. (2023, May 25). TRI Basic Data Files: Calendar Years 1987-Present [Data and Tools].
Data file 1 (Polluting Sites by Census Tract Data (2010 Census)): United States Census Bureau (2010). TIGER/Line shapefiles, 2010 census tracts (2010 version) [Data set].
Data file 1 (Polluting Sites by Census Tract Data (2010 Census)) and data file 2 (Polluting Sites by ZIP Code Tabulation Area Data (2010 Census)): United States Environmental Protection Agency. Toxics Release Inventory Program - TRI Basic Plus Data Files, Calendar Years 1987-Present. File Type 1A: Facility, Chemical, Releases, and Other Waste Management Summary Information, 2018.
Data Type(s) View help for Data Type(s)
HideOriginal Release Date View help for Original Release Date
2022-11-29
Version History View help for Version History
2023-12-04
The years in the study title have been updated from "2000-2018" to "1987-2021".
The file names for data files 1 and 2 have been updated to include the census year.
Data files 3 (Polluting Sites by Census Tract Data (2020 Census)) and 4 (Polluting Sites by ZIP Code Tabulation Area Data (2020 Census)) have been added.
2022-11-29 ICPSR data undergo a confidentiality review and are altered when necessary to limit the risk of disclosure. ICPSR also routinely creates ready-to-go data files along with setups in the major statistical software formats as well as standard codebooks to accompany the data. In addition to these procedures, ICPSR performed the following processing steps for this data collection:
- Checked for undocumented or out-of-range codes.
Notes
The public-use data files in this collection are available for access by the general public. Access does not require affiliation with an ICPSR member institution.