National Neighborhood Data Archive (NaNDA): Home Mortgage Disclosure Act Longitudinal Dataset by Census Tract, United States, 1981-2021 (ICPSR 39093)

Version Date: May 15, 2024 View help for published

Principal Investigator(s): View help for Principal Investigator(s)
Jad Edlebi, National Community Reinvestment Coalition; Bruce Mitchell, National Community Reinvestment Coalition; Jason Richardson, National Community Reinvestment Coalition; Helen Meier, University of Michigan. Institute for Social Research; Liang Chen, University of Michigan. Institute for Social Research; Grace Noppert, University of Michigan. Institute for Social Research; Lindsay Gypin, University of Michigan. Institute for Social Research

Series:

https://doi.org/10.3886/ICPSR39093.v1

Version V1 ()

  • V2 [2024-09-12]
  • V1 [2024-05-15] unpublished

You are currently viewing an older version of this data collection. A more recent version may be available by selecting ()

Additional information about this collection can be found in Version History.

2024-05-15 ICPSR data undergo a confidentiality review and are altered when necessary to limit the risk of disclosure. ICPSR also routinely creates ready-to-go data files along with setups in the major statistical software formats as well as standard codebooks to accompany the data. In addition to these procedures, ICPSR performed the following processing steps for this data collection:

  • Checked for undocumented or out-of-range codes.

Slide tabs to view more

The Home Mortgage Disclosure Act (HMDA) database (Consumer Financial Protection Bureau, 2022) has compiled mortgage lending data since 1981, but the collection and dissemination methods have changed over time (Federal Financial Institutions Examination Council, 2018), creating barriers to conducting longitudinal analyses. This HMDA Longitudinal Dataset (HLD) organizes and standardizes information across different eras of HMDA data collection between 1981 and 2020, enabling such analysis. This collection contains two datasets: 1) HMDA aggregated data by census tract for each decade and 2) HMDA aggregated data by census tract for individual years. Items for analysis include borrower income values, mortgages by loan type (e.g., conventional, Federal Housing Administration (FHA), Veterans Affairs (VA), refinances), and mortgages by borrower race and gender.

Edlebi, Jad, Mitchell, Bruce, Richardson, Jason, Meier, Helen, Chen, Liang, Noppert, Grace, and Gypin, Lindsay. National Neighborhood Data Archive (NaNDA): Home Mortgage Disclosure Act Longitudinal Dataset by Census Tract, United States, 1981-2021. Inter-university Consortium for Political and Social Research [distributor], 2024-05-15. https://doi.org/10.3886/ICPSR39093.v1

Export Citation:

  • RIS (generic format for RefWorks, EndNote, etc.)
  • EndNote
United States Department of Health and Human Services. National Institutes of Health. National Institute on Aging (R01AG08401), United States Department of Health and Human Services. National Institutes of Health. National Institute of Nursing Research (U01NR020556), United States Department of Health and Human Services. National Institutes of Health. National Center on Minority Health and Health Disparities (U01NR020556)

Census tract

Inter-university Consortium for Political and Social Research
Hide

1981 -- 2020
1981 -- 2021
  1. For additional information on the National Neighborhood Data Archive (NaNDA), please visit the NaNDA website.
Hide

The purpose of this project was to organize and standardize data obtained from the Home Mortgage Disclosure Act (HMDA) database in order to conduct longitudinal analyses of home lending patterns in the United States.

Data covering the period from 1981 to 2006 were obtained from the National Archives, and data covering the period from 2007 to 2020 were obtained from the Consumer Financial Protection Bureau. Records without census tract identification were excluded from the dataset. Yearly-aggregated data were not adjusted for inflation, but decade-aggregated were adjusted for inflation (December 30, 2020 dollars). Originations included in the dataset are for mortgages made to owner-occupied, single-family homes that were site built within each tract. Originations made for speculation, second homes, vacation homes, and multi-family homes were excluded.

Multiple methods were used to normalize the data. To adjust borrower income at tract-level, percentiles of income at the national level were established for each year, loan counts for each percentile were calculated, and income breakpoints were established in $30,000 increments. Counts were normalized to 2010 census tract boundaries using the Longitudinal Tract Database (LTDB) crosswalk file. To normalize loan amounts to 2020 U.S. dollars, the Consumer Price Index for All Urban Consumers was used to obtain Consumer Price Indices (CPI) as of December 30 each year and divided each year's CPI by the CPI for December 30, 2020. Please refer to the study documentation for more information on dataset limitations and changes to data collection methodology over the years.

Longitudinal: Trend / Repeated Cross-section

Census tracts in the 50 U.S. states, including U.S. island territories.

Time Unit (Decade), Time Unit (Year), Geographic Unit (Census tract)

National Archives (Federal Reserve System, 2016)

Consumer Financial Protection Bureau (Consumer Financial Protection Bureau, 2022)

Data are structured at the tract-level (1 observation per tract) and are grouped by decade (DS1) and by individual year (DS2). Each dataset contains the following aggregated items:

  • Counts of reported borrower income values grouped by percentile and by $30,000 intervals
  • Total and percent of originations (number and U.S. dollar amount) to all borrowers, grand total and by lender type: conventional, government, Federal Housing Administration, Veterans Affairs, Rural Housing Service
  • Total and percent of originations (number and U.S. dollar amount) by loan type: home purchase, home improvement, refinance
  • Total and percent of mortgages (number and U.S. dollar amount) by borrower race and gender

Hide

2024-05-15

2024-05-15 ICPSR data undergo a confidentiality review and are altered when necessary to limit the risk of disclosure. ICPSR also routinely creates ready-to-go data files along with setups in the major statistical software formats as well as standard codebooks to accompany the data. In addition to these procedures, ICPSR performed the following processing steps for this data collection:

  • Checked for undocumented or out-of-range codes.

Hide