2010 Census Production Settings Demographic and Housing Characteristics (DHC) Demonstration Noisy Measurement File (ICPSR 38865)
The 2010 Census Production Settings Demographic and Housing Characteristics Demonstration Noisy Measurement File (2023-04-03) is an intermediate output of the 2020 Census Disclosure Avoidance System (DAS) TopDown Algorithm (TDA) (as described in Abowd, J. et al [2022], and implemented in DAS 2020 Redistricting Production Code). The NMF was produced using the official "production settings," the final set of algorithmic parameters and privacy-loss budget allocations, that were used to produce the 2020 Census Redistricting Data (P.L. 94-171) Summary File and the 2020 Census Demographic and Housing Characteristics File. The NMF consists of the full set of privacy-protected statistical queries (counts of individuals or housing units with particular combinations of characteristics) of confidential 2010 Census data relating to the 2010 Demonstration Data Products Suite - Redistricting and Demographic and Housing Characteristics File - Production Settings (2023-04-03). These statistical queries, called "noisy measurements" were produced under the zero-Concentrated Differential Privacy framework (Bun, M. and Steinke, T [2016]; see also Dwork C. and Roth, A. [2014]) implemented via the discrete Gaussian mechanism (Cannone C., et al., [2023]), which added positive or negative integer-valued noise to each of the resulting counts. The noisy measurements are an intermediate stage of the TDA prior to the post-processing the TDA then performs to ensure internal and hierarchical consistency within the resulting tables. The Census Bureau has released these 2010 Census demonstration data to enable data users to evaluate the expected impact of disclosure avoidance variability on 2020 Census data. The 2010 Census Production Settings Demographic and Housing Characteristics (DHC) Demonstration Noisy Measurement File (2023-04-03) has been cleared for public dissemination by the Census Bureau Disclosure Review Board (CBDRB-FY22-DSEP-004).
The 2010 Census Production Settings Demographic and Housing Characteristics Demonstration Noisy Measurement File (2023-04-03) includes zero-Concentrated Differentially Private (zCDP) (Bun, M. and Steinke, T [2016]) noisy measurements, implemented via the discrete Gaussian mechanism. These are estimated counts of individuals and housing units included in the 2010 Census Edited File (CEF), which includes confidential data initially collected in the 2010 Census of Population and Housing. The noisy measurements included in this file were subsequently post-processed by the TopDown Algorithm (TDA) to produce the 2010 Census Production Settings Privacy-Protected Microdata File - Redistricting (P.L. 94-171) and Demographic and Housing Characteristics File (2023-04-03) (Demonstration Data Products Suite/2023-04-03/). As these 2010 Census demonstration data are intended to support study of the design and expected impacts of the 2020 Disclosure Avoidance System, the 2010 CEF records were pre-processed before application of the zCDP framework. This pre-processing converted the 2010 CEF records into the input-file format, response codes, and tabulation categories used for the 2020 Census, which differ in substantive ways from the format, response codes, and tabulation categories originally used for the 2010 Census.
The NMF provides estimates of counts of persons in the CEF by various characteristics and combinations of characteristics including their reported race and ethnicity, whether they were of voting age, whether they resided in a housing unit or one of 7 group quarters types, and their census block of residence after the addition of discrete Gaussian noise (with the scale parameter determined by the privacy-loss budget allocation for that particular query under zCDP). Noisy measurements of the counts of occupied and vacant housing units by census block are also included. Lastly, data on constraints--information into which no noise was infused by the Disclosure Avoidance System (DAS) and used by the TDA to post-process the noisy measurements into the 2010 Census Production Settings Privacy-Protected Microdata File - Redistricting (P.L. 94-171) and Demographic and Housing Characteristics File (2023-04-03) --are provided.
These data are available for download (i.e. not restricted access). Due to their size, they must be downloaded through the link on this metadata page and not through the standard ICPSR download. The link will take you to the Globus site where these data are housed. A README file is located in the Globus repository. Please refer to that for pertinent information. The Globus holding site requires users to create an account to access these data. Accounts can be created through existing institutional access and by personal access. Please see the Globus "How to get Started" page for more information.
2010 Census Production Settings Redistricting Data (P.L. 94-171) Demonstration Noisy Measurement File (ICPSR 38777)
2020 Census Demographic and Housing Characteristics (DHC) Noisy Measurement File (NMF) (ICPSR 38937)
The 2020 Census Demographic and Housing Characteristics Noisy Measurement File is an intermediate output of the 2020 Census Disclosure Avoidance System (DAS) TopDown Algorithm (TDA) (as described in Abowd, J. et al [2022], and implemented in DAS_2020_DHC_Production_Code/das_decennial/programs/engine/primitives.py at main uscensusbureau/DAS_2020_DHC_Production_Code (github.com) The 2020 Census Demographic and Housing Characteristics Noisy Measurement File includes zero-Concentrated Differentially Private (zCDP) (Bun, M. and Steinke, T [2016]) noisy measurements, implemented via the discrete Gaussian mechanism (Cannone C., et al., [2023] ), which added positive or negative integer-valued noise to each of the resulting counts. These are estimated counts of individuals and housing units included in the 2020 Census Edited File (CEF), which includes confidential data collected in the 2020 Census of Population and Housing.
The noisy measurements included in this file were subsequently post-processed by the TopDown Algorithm (TDA) to produce the Census Demographic and Housing Characteristics Summary File. In addition to the noisy measurements, constraints based on invariant calculations --- counts computed without noise --- are also included (with the exception of the state-level total populations, which can be sourced separately from data.census.gov).
The Noisy Measurement File was produced using the official "production settings," the final set of algorithmic parameters and privacy-loss budget allocations that were used to produce the 2020 Census Redistricting Data (P.L. 94-171) Summary File and the 2020 Census Demographic and Housing Characteristics File.
The noisy measurements are produced in an early stage of the TDA. Afterward, these noisy measurements are post-processed to ensure internal and hierarchical consistency within the resulting tables. The Census Bureau has released these noisy measurements to enable data users to evaluate the impact of disclosure avoidance variability on 2020 Census data. The 2020 Census Demographic and Housing Characteristics (DHC) Noisy Measurement File has been cleared for public dissemination by the Census Bureau Disclosure Review Board (CBDRB-FY22-DSEP-004).
These data are available for download (i.e. not restricted access). Due to their size, they must be downloaded through the link on this metadata page and not through the standard ICPSR download. The link will take you to the Globus site where these data are housed. A README file is located in the Globus repository. Please refer to that for pertinent information.
The Globus holding site requires users to create an account to access these data. Accounts can be created through existing institutional access and by personal access.
Please see the Globus "How to get Started" page for more information.
2020 Census Redistricting Data (P.L. 94-171) Noisy Measurement File, United States (ICPSR 38855)
The 2020 Census Redistricting Data (P.L. 94-171) Noisy Measurement File is an intermediate output of the 2020 Census Disclosure Avoidance System (DAS) TopDown Algorithm (TDA) (as described in Abowd, J. et al [2022] https://doi.org/10.1162/99608f92.529e3cb9, and implemented in the DAS 2020 Redistricting Production Code). The 2020 Redistricting NMF was an intermediate output of the DAS during the execution of the algorithm to produce the 2020 Census Redistricting Data (P.L. 94-171) Summary File. The NMFs are intermediate privacy-protected outputs of the DAS; they were generated using the Census Bureau's implementation of the Discrete Gaussian Mechanism, calibrated to satisfy zero-Concentrated Differential Privacy with bounded neighbors. The NMF values, called "noisy measurements" are the output of applying the Discrete Gaussian Mechanism to counts from the 2020 Census Edited File (CEF). They are generally inconsistent with one another (for example, in a county composed of two tracts, the noisy measurement for the county's total population may not equal the sum of the noisy measurements of the two tracts' total population), and frequently negative (especially when the population being measured was small), but are integer-valued. The NMF was later post-processed as part of the DAS code to take the form of microdata and to satisfy various constraints. The NMF documented here contains both the noisy measurements themselves as well as the data needed to represent the DAS constraints; thus, the NMF could be used to reproduce the steps taken by the DAS code to produce microdata from the noisy measurements by applying the production code base.
The 2020 Census Redistricting Data (P.L. 94-171) Noisy Measurement File includes zero-Concentrated Differentially Private (zCDP) (Bun, M. and Steinke, T [2016]) noisy measurements, implemented via the discrete Gaussian mechanism. These are estimated counts of individuals and housing units included in the 2020 Census Edited File (CEF), which includes confidential data initially collected in the 2020 Census of Population and Housing. The noisy measurements included in this file were subsequently post-processed by the TopDown Algorithm (TDA) to produce the 2020 Census Redistricting Data (P.L. 94-171) Summary File.
The NMF provides estimates of counts of persons in the CEF by various characteristics and combinations of characteristics including their reported race and ethnicity, whether they were of voting age, whether they resided in a housing unit or one of 7 group quarters types, and their census block of residence after the addition of discrete Gaussian noise (with the scale parameter determined by the privacy-loss budget allocation for that particular query under zCDP). Noisy measurements of the counts of occupied and vacant housing units by census block are also included. Lastly, data on constraints--information into which no noise was infused by the Disclosure Avoidance System (DAS) and used by the TDA to post-process the noisy measurements into the 2020 Census Redistricting Data (P.L. 94-171) Summary File --are provided.
These data are available for download (i.e. not restricted access). Due to their size, they must be downloaded through the link on this metadata page and not through the standard ICPSR download. The link will take you to the Globus site where these data are housed. A README file is located in the Globus repository. Please refer to that for pertinent information.
The Globus holding site requires users to create an account to access these data. Accounts can be created through existing institutional access and by personal access.
Please see the Globus "How to get Started" page for more information.
American Community Survey (ACS): Public Use Microdata Sample (PUMS), 1996 (ICPSR 3885)
American Community Survey (ACS): Public Use Microdata Sample (PUMS), 1997 (ICPSR 3886)
American Community Survey (ACS): Public Use Microdata Sample (PUMS), 1998 (ICPSR 3888)
American Community Survey (ACS): Public Use Microdata Sample (PUMS), 2002 (ICPSR 3893)
American Community Survey (ACS): Public Use Microdata Sample (PUMS), 2003 (ICPSR 4117)
American Community Survey (ACS): Public Use Microdata Sample (PUMS), 2004 (ICPSR 4370)
American Community Survey (ACS): Public Use Microdata Sample (PUMS), 2005 (ICPSR 4587)
American Community Survey (ACS): Public Use Microdata Sample (PUMS), 2006 (ICPSR 22101)
American Community Survey (ACS): Public Use Microdata Sample (PUMS), 2007 (ICPSR 24503)
American Community Survey (ACS): Public Use Microdata Sample (PUMS), 2008 (ICPSR 29263)
American Community Survey (ACS): Public Use Microdata Sample (PUMS), 2009 (ICPSR 33802)
American Community Survey (ACS): Three-Year Public Use Microdata Sample (PUMS), 2005-2007 (ICPSR 25042)
American Housing Survey, 1984: MSA File (ICPSR 9092)
American Housing Survey, 1986: MSA Core and Supplement File (ICPSR 6129)
American Housing Survey, 1986: MSA File (ICPSR 9334)
American Housing Survey, 1988: MSA Core and Supplement File (ICPSR 6130)
American Housing Survey, 1988: MSA Core Questions File (ICPSR 9509)
American Housing Survey, 1990: MSA Core Questions File (ICPSR 6003)
American Housing Survey, 1992: MSA Core File (ICPSR 6464)
American Housing Survey, 1994: MSA Core and Supplement File (ICPSR 6954)
American Housing Survey, 1996: MSA Core and Supplement File (ICPSR 2369)
American Housing Survey, 1997: National Microdata (ICPSR 2912)
American Housing Survey, 1999: National Microdata (ICPSR 3204)
American Housing Survey, 2001: National Microdata (ICPSR 4588)
American Housing Survey, 2002: Metropolitan Microdata (ICPSR 4589)
American Housing Survey, 2003: National Microdata (ICPSR 4591)
American Housing Survey, 2004: Metropolitan Microdata (ICPSR 4592)
American Housing Survey, 2005: National Microdata (ICPSR 4593)
American Housing Survey 2007: Metropolitan Survey (ICPSR 24501)
American Housing Survey, 2007: National Microdata (ICPSR 23563)
American Housing Survey, 2009: National Microdata (ICPSR 30941)
This data collection provides information on the characteristics of a national sample of housing units, including apartments, single-family homes, mobile homes, and vacant housing units in 2009. The data are presented in eight separate parts: Part 1, Home Improvement Record, Part 2, Journey to Work Record, Part 3, Mortgages Recorded, Part 4, Housing Unit Record (Main Record), Recodes (One Record per Housing Unit), and Weights, Part 5, Manager and Owner of Rental Units Record, Part 6, Person Record, Part 7, High Burden Unit Record, and Part 8, Recent Mover Groups Record.
Part 1 data include questions about upgrades and remodeling, cost of alterations and repairs, as well as the household member who performed the alteration/repair. Part 2 data include journey to work or commuting information, such as method of transportation to work, length of trip, and miles traveled to work. Additional information collected covers number of hours worked at home, number of days worked at home, average time respondent leaves for work in the morning or evening, whether respondent drives to work alone or with others, and a few other questions pertaining to self-employment and work schedule. Part 3 data include mortgage information, such as type of mortgage obtained by respondent, amount and term of mortgages, as well as years needed to pay them off. Other items asked include monthly payment amount, reason mortgage was taken out, and who provided the mortgage. Part 4 data include household-level information, including demographic information, such as age, sex, race, marital status, income, and relationship to householder. The following topics are also included: data recodes, unit characteristics, and weighting information.
Part 5 data include information pertaining to owners of rental properties and whether the owner/resident manager lives on-site. Part 6 data include individual person level information, in which respondents were queried on basic demographic information (i.e. age, sex, race, marital status, income, and relationship to householder), as well as if they worked at all last week, month and year moved into residence, and their ability to perform everyday tasks and whether they have difficulty hearing, seeing, and concentrating or remembering things. Part 7 data include verification of income to cost when the ratio of income to cost is outside of certain tolerances. Respondents were asked whether they receive help or assistance with grocery bills, clothing and transportation expenses, child care payments, medical and utility bills, as well as with rent payments. Part 8 data include recent mover information, such as how many people were living in last unit before move, whether last residence was a condo or a co-op, as well as whether this residence was outside of the United States.
American Housing Survey, 2009: New Orleans Data (ICPSR 30943)
This data collection is part of the American Housing Metropolitan Survey (AHS-MS, or "metro") which is conducted in odd-numbered years. It cycles through a set of 21 metropolitan areas, surveying each one about once every six years. The metro survey, like the national survey, is longitudinal. This particular survey provides information on the characteristics of a New Orleans metropolitan sample of housing units, including apartments, single-family homes, mobile homes, and vacant housing units in 2009. The data are presented in eight separate parts: Part 1, Home Improvement Record, Part 2, Journey to Work Record, Part 3, Mortgages Recorded, Part 4, Housing Unit Record (Main Record), Recodes (One Record per Housing Unit), and Weights, Part 5, Manager and Owner of Rental Units Record, Part 6, Person Record, Part 7, High Burden Unit Record, and Part 8, Recent Mover Groups Record.
Part 1 data include questions about upgrades and remodeling, cost of alterations and repairs, as well as the household member who performed the alteration/repair. Part 2 data include journey to work or commuting information, such as method of transportation to work, length of trip, and miles traveled to work. Additional information collected covers number of hours worked at home, number of days worked at home, average time respondent leaves for work in the morning or evening, whether respondent drives to work alone or with others, and a few other questions pertaining to self-employment and work schedule. Part 3 data include mortgage information, such as type of mortgage obtained by respondent, amount and term of mortgages, as well as years needed to pay them off. Other items asked include monthly payment amount, reason mortgage was taken out, and who provided the mortgage. Part 4 data include household-level information, including demographic information, such as age, sex, race, marital status, income, and relationship to householder. The following topics are also included: data recodes, unit characteristics, and weighting information.
Part 5 data include information pertaining to owners of rental properties and whether the owner/resident manager lives on-site. Part 6 data include individual person level information, in which respondents were queried on basic demographic information (i.e. age, sex, race, marital status, income, and relationship to householder), as well as if they worked at all last week, month and year moved into residence, and their ability to perform everyday tasks and whether they have difficulty hearing, seeing, and concentrating or remembering things. Part 7 data include verification of income to cost when the ratio of income to cost is outside of certain tolerances. Respondents were asked whether they receive help or assistance with grocery bills, clothing and transportation expenses, child care payments, medical and utility bills, as well as with rent payments. Part 8 data include recent mover information, such as how many people were living in last unit before move, whether last residence was a condo or a co-op, as well as whether this residence was outside of the United States.
American Housing Survey, 2009: Seattle Data (ICPSR 30942)
This data collection is part of the American Housing Metropolitan Survey (AHS-MS, or "metro") which is conducted in odd-numbered years. It cycles through a set of 21 metropolitan areas, surveying each one about once every six years. The metro survey, like the national survey, is longitudinal. This particular survey provides information on the characteristics of a Seattle metropolitan sample of housing units, including apartments, single-family homes, mobile homes, and vacant housing units in 2009. The data are presented in eight separate parts: Part 1, Home Improvement Record, Part 2, Journey to Work Record, Part 3, Mortgages Recorded, Part 4, Housing Unit Record (Main Record), Recodes (One Record per Housing Unit), and Weights, Part 5, Manager and Owner of Rental Units Record, Part 6, Person Record, Part 7, High Burden Unit Record, and Part 8, Recent Mover Groups Record.
Part 1 data include questions about upgrades and remodeling, cost of alterations and repairs, as well as the household member who performed the alteration/repair. Part 2 data include journey to work or commuting information, such as method of transportation to work, length of trip, and miles traveled to work. Additional information collected covers number of hours worked at home, number of days worked at home, average time respondent leaves for work in the morning or evening, whether respondent drives to work alone or with others, and a few other questions pertaining to self-employment and work schedule. Part 3 data include mortgage information, such as type of mortgage obtained by respondent, amount and term of mortgages, as well as years needed to pay them off. Other items asked include monthly payment amount, reason mortgage was taken out, and who provided the mortgage. Part 4 data include household-level information, including demographic information, such as age, sex, race, marital status, income, and relationship to householder. The following topics are also included: data recodes, unit characteristics, and weighting information.
Part 5 data include information pertaining to owners of rental properties and whether the owner/resident manager lives on-site. Part 6 data include individual person level information, in which respondents were queried on basic demographic information (i.e. age, sex, race, marital status, income, and relationship to householder), as well as if they worked at all last week, month and year moved into residence, and their ability to perform everyday tasks and whether they have difficulty hearing, seeing, and concentrating or remembering things. Part 7 data include verification of income to cost when the ratio of income to cost is outside of certain tolerances. Respondents were asked whether they receive help or assistance with grocery bills, clothing and transportation expenses, child care payments, medical and utility bills, as well as with rent payments. Part 8 data include recent mover information, such as how many people were living in last unit before move, whether last residence was a condo or a co-op, as well as whether this residence was outside of the United States.
American Housing Survey (AHS) - Table Creator (ICPSR 36753)
The American Housing Survey (AHS), the most comprehensive housing survey in the U.S., provides up-to-date information on the size and composition of the housing stock in our country. This survey delivers information about the types of homes in which people are now living and the characteristics of these homes, as well as the costs of running and maintaining them. National data are collected every other year and metropolitan area data are collected on a rotating basis. The AHS is sponsored by the Department of Housing and Urban Development (HUD) and conducted by the U.S. Census Bureau.
The AHS Table Creator gives data users the ability to create customized tables from the AHS data without having to use the Public Use File (microdata).
Like the microdata, the AHS Table Creator provides current information on a wide range of housing subjects, including size and composition of the nation's housing inventory, vacancies, fuel usage, physical condition of housing units, characteristics of occupants, equipment breakdowns, home improvements, mortgages and other housing costs, people eligible for and beneficiaries of subsidized housing, home values, and characteristics of recent movers.
For the first time since 1985, the survey selected new national and metropolitan area longitudinal samples. In addition to the "core" data, the AHS collected "topical" data using a series of topical modules. The 2015 AHS includes topical supplements on 1) the presence of arts and cultural opportunities in the community, 2) health and safety hazards in the home, 3) food insecurity, and 4) the use of housing counseling services. Data users can also explore the new national and metropolitan area longitudinal samples as well as the topical supplements using the AHS Table Creator.
Policy analysts, program managers, budget analysts, and Congressional staff use the AHS data and table creator to monitor supply and demand, as well as changes in housing conditions and costs, in order to assess housing needs. Analyses based on the AHS are used to advise the executive and legislative branches in the development of housing policies. HUD uses the AHS to improve efficiency and effectiveness and design housing programs appropriate for different target groups, such as first-time home buyers and the elderly. Academic researchers and private organizations also use AHS data in efforts of specific interest and concern to their respective communities.
The AHS is conducted every two years from May and September in odd-numbered years. HUD sometimes adjusts this schedule and/or sample depending on budget constraints. Public use microdata and reports are released approximately 12 months after data collection.