American Community Survey, 2008-2012 [United States]: Public Use Microdata Sample: Artist Extract (ICPSR 35529)
The American Community Survey (ACS) is an ongoing statistical survey that samples a small percentage of the population every year -- giving communities the information they need to plan investments and services. The 5-year public use microdata sample (PUMS) for 2008-2012 is a subset of the 2008-2012 ACS sample. It contains the same sample as the combined PUMS 1-year files for 2008, 2009, 2010, 2011 and 2012. This data collection provides a person-level subset of 124,023 respondents whose occupations were coded as arts-related in the 2008-2012 ACS PUMS.
The 2008-2012 PUMS is the fourth 5-year file published by the ACS. This data collection contains five years of data for the population from households and the group quarters (GQ) population. The GQ population and population from households are all weighted to agree with the ACS counts which are an average over the five year period (2008-2012). The ACS sample was selected from all counties across the nation.
The ACS provides social, housing, and economic characteristics for demographic groups covering a broad spectrum of geographic areas in the United States and Puerto Rico. Demographic variables include sex, age, relationship of person to the selected respondent, race, and Hispanic origin. Social characteristics variables include school enrollment, educational attainment, marital status, fertility, grandparents caring for children, veteran status, type of disability, health insurance, place of birth, United States citizenship status, year of entry, year of naturalization, language spoken at home, and ancestry. Variables focusing on economic characteristics include employment status, commuting to work, occupation, industry, class of worker, income and benefits, and poverty status.
The public-use data files in this collection are available for access by the general public. Access does not require affiliation with an ICPSR member institution.
WARNING: This study is over 150MB in size and may take several minutes to download on a typical internet connection.
United States Department of Commerce. Bureau of the Census. American Community Survey, 2008-2012 [United States]: Public Use Microdata Sample: Artist Extract. ICPSR35529-v1. Ann Arbor, MI: Inter-university Consortium for Political and Social Research [distributor], 2014-12-23. https://doi.org/10.3886/ICPSR35529.v1
Persistent URL: https://doi.org/10.3886/ICPSR35529.v1
Scope of Study
Geographic Coverage: Alabama, Alaska, Arizona, Arkansas, California, Colorado, Connecticut, Delaware, District of Columbia, Florida, Georgia, Hawaii, Idaho, Illinois, Indiana, Iowa, Kansas, Kentucky, Louisiana, Maine, Maryland, Massachusetts, Michigan, Minnesota, Mississippi, Missouri, Montana, Nebraska, Nevada, New Hampshire, New Jersey, New Mexico, New York (state), North Carolina, North Dakota, Ohio, Oklahoma, Oregon, Pennsylvania, Rhode Island, South Carolina, South Dakota, Tennessee, Texas, United States, Utah, Vermont, Virginia, Washington, West Virginia, Wisconsin, Wyoming
This data collection does not contain records for people in Puerto Rico.
Users are strongly encouraged to read all documentation including sampling, sampling errors, weights, and imputation prior to analyzing the data. Documentation is available for download with this collection.
Occasionally, response categories used in the ACS data files change, which may cause a multi-year PUMS file to carry two or more variables to replace the original single variable seen in the 1-year PUMS. This happens because of changes in the classification systems used for analysis of the economy as well as changes in the detailed race, ancestry, place of birth codes and definitions of Public Use Microdata Areas (PUMAs). This 2008-2012 5-year PUMS file has sixteen sets of dual (or triple) variables (also described as vintage variables) since many detailed variables were revised for the 2012 data products. In order to obtain data for the entire PUMS sample, all of these vintages must be used for a given variable. A value of -9, -09, -009, or -0009 (depending on the variable's length) is assigned to cases for which the variable is not applicable due to the data year, except for occupation codes. For a complete list of the dual variable, see the ReadMe document for the 2008-2012 5-year PUMS and the PUMS Data Dictionary included with this data collection for more information on these variables.
The smallest geographic unit that is identified is the Public Use Microdata Area (PUMA), which is based on a population size of initially around 100,000 or more. There are two sets of PUMAs on the 2008-2012 ACS PUMS. PUMS records from 2008 through 2011 have PUMA codes which were created from the 2000 Census 5 percent PUMS data files. PUMS records from 2012 have PUMA codes based on the 2010 Census data. The older records do not have the 2010 PUMA codes, and the 2012 records do not have the 2000-based 5 percent PUMA codes.
Due to the limit in the number of allowable rows of 65,536 and allowable columns of 256 in Excel 97-2003 (file ending, xls), the Excel file being distributed with this collection is in the later version of Excel (file ending of xlsx).
Study Purpose: The data collection provides the artist extract of the 2008-2012 ACS PUMS. The ACS is an ongoing statistical survey that samples a small percentage of the population every year -- giving communities the information they need to plan investments and services.
Study Design: The 2008-2012 ACS PUMS sample is the same sample found in each of the 1-year PUMS files for the years 2008, 2009, 2010, 2011 and 2012. This data collection contains five percent of the housing units and five percent of the GQ persons plus some imputed GQ persons in the United States and District of Columbia weighted to represent the average population during five years. For this data collection, a subset of 124,023 respondents whose occupations were coded as arts-related in the 2008-2012 ACS PUMS was created.
Sample: The American Community Survey (ACS) and Puerto Rico Community Survey (PRCS) each consist of two separate samples: housing unit (HU) addresses and residents of group quarters (GQ) facilities. The Census Bureau derives the sampling frames from which it draws these samples from the Census Bureau's Master Address File (MAF). The MAF is the Census Bureau's official inventory of known living quarters and selected nonresidential units in the United States and Puerto Rico. The Census Bureau selects independent HU address samples for each of the 3,143 counties and county equivalents in the United States, including the District of Columbia, as well as for each of the 78 municipalities in Puerto Rico. Full-implementation samples of GQ facilities and persons are selected independently within each state, including the District of Columbia and Puerto Rico. Beginning in 2008, the Census Bureau increased the sampling rates in 16 states with small GQ populations to meet publication thresholds. This data collection does not have records for people in Puerto Rico.
The data contain weight variables:
- PWGTP: Person's weight for generating statistics on individuals (such as age). This weight reflects the probability of selection and is adjusted for interviewed households to account for noninterviews.
- PWGTP1-PWGTP80: Replicate weighting variables, used for generating the most accurate standard errors for individuals.
Description of Variables: The ACS provides social, housing, and economic characteristics for demographic groups covering a broad spectrum of geographic areas in the United States and Puerto Rico. Demographic variables include sex, age, relationship of person to the selected respondent, race, and Hispanic origin. Social characteristics variables include school enrollment, educational attainment, marital status, fertility, grandparents caring for children, veteran status, type of disability, health insurance, place of birth, United States citizenship status, year of entry, year of naturalization, language spoken at home, and ancestry. Variables focusing on economic characteristics include employment status, commuting to work, occupation, industry, class of worker, income and benefits, and poverty status.
Extent of Processing: ICPSR data undergo a confidentiality review and are altered when necessary to limit the risk of disclosure. ICPSR also routinely creates ready-to-go data files along with setups in the major statistical software formats as well as standard codebooks to accompany the data. In addition to these procedures, ICPSR performed the following processing steps for this data collection:
- Created variable labels and/or value labels.
- Standardized missing values.
- Created online analysis version with question text.
- Checked for undocumented or out-of-range codes.
Original ICPSR Release: 2014-12-23
- Citations exports are provided above.
Export Study-level metadata (does not include variable-level metadata)
If you're looking for collection-level metadata rather than an individual metadata record, please visit our Metadata Records page.