National Neighborhood Data Archive (NaNDA): Home Mortgage Disclosure Act Longitudinal Dataset by Census Tract, United States, 1981-2021 (ICPSR 39093)
Version Date: May 15, 2024 View help for published
Principal Investigator(s): View help for Principal Investigator(s)
Jad Edlebi, National Community Reinvestment Coalition;
Bruce Mitchell, National Community Reinvestment Coalition;
Jason Richardson, National Community Reinvestment Coalition;
Helen Meier, University of Michigan. Institute for Social Research;
Liang Chen, University of Michigan. Institute for Social Research;
Grace Noppert, University of Michigan. Institute for Social Research;
Lindsay Gypin, University of Michigan. Institute for Social Research
Series:
https://doi.org/10.3886/ICPSR39093.v1
Version V1 (see more versions)
You are currently viewing an older version of this data collection. A more recent version may be available by selecting (see more versions)
Additional information about this collection can be found in Version History.
2024-05-15 ICPSR data undergo a confidentiality review and are altered when necessary to limit the risk of disclosure. ICPSR also routinely creates ready-to-go data files along with setups in the major statistical software formats as well as standard codebooks to accompany the data. In addition to these procedures, ICPSR performed the following processing steps for this data collection:
- Checked for undocumented or out-of-range codes.
Summary View help for Summary
The Home Mortgage Disclosure Act (HMDA) database (Consumer Financial Protection Bureau, 2022) has compiled mortgage lending data since 1981, but the collection and dissemination methods have changed over time (Federal Financial Institutions Examination Council, 2018), creating barriers to conducting longitudinal analyses. This HMDA Longitudinal Dataset (HLD) organizes and standardizes information across different eras of HMDA data collection between 1981 and 2020, enabling such analysis. This collection contains two datasets: 1) HMDA aggregated data by census tract for each decade and 2) HMDA aggregated data by census tract for individual years. Items for analysis include borrower income values, mortgages by loan type (e.g., conventional, Federal Housing Administration (FHA), Veterans Affairs (VA), refinances), and mortgages by borrower race and gender.
Citation View help for Citation
Export Citation:
Funding View help for Funding
Subject Terms View help for Subject Terms
Geographic Coverage View help for Geographic Coverage
Smallest Geographic Unit View help for Smallest Geographic Unit
Census tract
Distributor(s) View help for Distributor(s)
Time Period(s) View help for Time Period(s)
Date of Collection View help for Date of Collection
Data Collection Notes View help for Data Collection Notes
- For additional information on the National Neighborhood Data Archive (NaNDA), please visit the NaNDA website.
Study Purpose View help for Study Purpose
The purpose of this project was to organize and standardize data obtained from the Home Mortgage Disclosure Act (HMDA) database in order to conduct longitudinal analyses of home lending patterns in the United States.
Study Design View help for Study Design
Data covering the period from 1981 to 2006 were obtained from the National Archives, and data covering the period from 2007 to 2020 were obtained from the Consumer Financial Protection Bureau. Records without census tract identification were excluded from the dataset. Yearly-aggregated data were not adjusted for inflation, but decade-aggregated were adjusted for inflation (December 30, 2020 dollars). Originations included in the dataset are for mortgages made to owner-occupied, single-family homes that were site built within each tract. Originations made for speculation, second homes, vacation homes, and multi-family homes were excluded.
Multiple methods were used to normalize the data. To adjust borrower income at tract-level, percentiles of income at the national level were established for each year, loan counts for each percentile were calculated, and income breakpoints were established in $30,000 increments. Counts were normalized to 2010 census tract boundaries using the Longitudinal Tract Database (LTDB) crosswalk file. To normalize loan amounts to 2020 U.S. dollars, the Consumer Price Index for All Urban Consumers was used to obtain Consumer Price Indices (CPI) as of December 30 each year and divided each year's CPI by the CPI for December 30, 2020. Please refer to the study documentation for more information on dataset limitations and changes to data collection methodology over the years.
Time Method View help for Time Method
Universe View help for Universe
Census tracts in the 50 U.S. states, including U.S. island territories.
Unit(s) of Observation View help for Unit(s) of Observation
Data Source View help for Data Source
National Archives (Federal Reserve System, 2016)
Consumer Financial Protection Bureau (Consumer Financial Protection Bureau, 2022)
Data Type(s) View help for Data Type(s)
Description of Variables View help for Description of Variables
Data are structured at the tract-level (1 observation per tract) and are grouped by decade (DS1) and by individual year (DS2). Each dataset contains the following aggregated items:
- Counts of reported borrower income values grouped by percentile and by $30,000 intervals
- Total and percent of originations (number and U.S. dollar amount) to all borrowers, grand total and by lender type: conventional, government, Federal Housing Administration, Veterans Affairs, Rural Housing Service
- Total and percent of originations (number and U.S. dollar amount) by loan type: home purchase, home improvement, refinance
- Total and percent of mortgages (number and U.S. dollar amount) by borrower race and gender
Original Release Date View help for Original Release Date
2024-05-15
Version History View help for Version History
2024-05-15 ICPSR data undergo a confidentiality review and are altered when necessary to limit the risk of disclosure. ICPSR also routinely creates ready-to-go data files along with setups in the major statistical software formats as well as standard codebooks to accompany the data. In addition to these procedures, ICPSR performed the following processing steps for this data collection:
- Checked for undocumented or out-of-range codes.