American Community Survey: Public Use Microdata Sample: Artist Extract, [United States], 2012-2016 (ICPSR 36998)
Version Date: Apr 12, 2018 View help for published
Principal Investigator(s): View help for Principal Investigator(s)
United States. Bureau of the Census
https://doi.org/10.3886/ICPSR36998.v1
Version V1
Summary View help for Summary
The American Community Survey (ACS) is an ongoing statistical survey that samples a small percentage of the population every year -- giving communities the information they need to plan investments and services. The 5-year public use microdata sample (PUMS) for 2012-2016 is a subset of the 2012-2012 ACS sample. It contains the same sample as the combined PUMS 1-year files for 2012, 2013, 2014, 2015 and 2016. This data collection provides a person-level subset of 133,781 respondents whose occupations were coded as arts-related in the 2011-2015 ACS PUMS.
The 2012-2016 PUMS is the seventh 5-year file published by the ACS. This data collection contains five years of data for the population from households and the group quarters (GQ) population. The GQ population and population from households are all weighted to agree with the ACS counts which are an average over the five year period (2012-2016). The ACS sample was selected from all counties across the nation.
The ACS provides social, housing, and economic characteristics for demographic groups covering a broad spectrum of geographic areas in the United States. For a more detailed list of variables of what these categories include please see the decriptions of variables section.
Citation View help for Citation
Export Citation:
Subject Terms View help for Subject Terms
Geographic Coverage View help for Geographic Coverage
Smallest Geographic Unit View help for Smallest Geographic Unit
Public Use Microdata Area (PUMA)
Restrictions View help for Restrictions
Users of the data must agree to the Terms of Use presented on the NADAC Website and available through the link in each codebook.
Distributor(s) View help for Distributor(s)
Time Period(s) View help for Time Period(s)
Date of Collection View help for Date of Collection
Data Collection Notes View help for Data Collection Notes
This data collection does not contain records for people in Puerto Rico.
Users are strongly encouraged to read all documentation including sampling, sampling errors, weights, and imputation prior to analyzing the data. Documentation is available for download with this collection.
Occasionally, response categories used in the ACS data files must change. This may cause a multi-year PUMS file to carry two or more variables to replace the original single variable seen in the 1-year PUMS. This happens because of changes in the classification systems used for analysis of the economy as well as changes in the detailed race, ancestry, place of birth codes and definitions of Public Use Microdata Areas (PUMAs).
In data year 2012, a set of dual vintage variables were introduced to the PUMS 5-year file. The 2012-2016 ACS 5-year PUMS dataset is the first 5-year product to contain five years of data with the 2012 vintage variables. On the 2012-2016 files, these variables have been renamed, with the year now removed from the variable name. For instance, variable POBP12 on the 2011-2015 PUMS 5-year dataset is named POBP on the 2012-2016 dataset.
The smallest geographic unit that is identified is the Public Use Microdata Area (PUMA). PUMAs are special non-overlapping areas that partition each state into contiguous geographic units containing no fewer than 100,000 people each. The 2012-2016 ACS 5-year PUMS files rely on PUMA boundaries that were drawn by state governments after the 2010 Census.
This data collection includes a verification file that provides estimates for selected characteristics provided to assist data users in determining if they are correctly using the weights to compute estimates. Some of these estimates may be different from the estimates for the same characteristics published in the American FactFinder. Users can find the verification file in the Technical User Guide. For an explanation of these differences, please refer to the "Accuracy of the Data" section of the Technical User Guide.
If PUMS users want to compare the 2012-2016 5-year PUMS estimates to 2011-2015 PUMS, or want to reuse a program written for 2011-2015 5-year PUMS data, review "Read Me" section in the Technical User Guide about variables with changes between 2015 and 2016. For more details on the changes, see both the 2011-2015 and 2012-2016 5-year PUMS Data Dictionaries and Code Lists.
Variables added since previous 5-year PUMS: RACNH and RACPI.
Variables deleted since the previous 5-year PUMS: ANC1P05, ANC2P05, CITWP05, LANP05, MARHYP05, MIGPUMA00 MIGSP05, OCCP10, POB05, POWPUMA00, POWSP05, PUMA00, RAC2P05, RAC3P05, RACNHPI, SOCP10, and YOEP05.
Variables with new or modified codes since the previous 5-year PUMS: ADJHSG, ADJINC, BUS, CITWP, CONP, LANP, MARHYP, PLM, SERIALNO, SVAL, TOIL, YBL, YOEP, FBUSP, and FTOILP.
Variables with cosmetic changes to variable labels or value labels: ANC1P, ANC2P, FER, LANP, MIGPUMA, MIGSP, OCCP, POBP, POWPUMA, POWSP, PUMA, RAC2P, RAC3P, SOCP, SRNT, TEL, and FTELP.
Due to the limit in the number of allowable rows of 65,536 and allowable columns of 256 in Excel 97-2003 (file ending, xls), the Excel file being distributed with this collection is in the later version of Excel (file ending of xlsx).
For further information please visit the American Community Survey (ACS) website.Study Purpose View help for Study Purpose
The data collection provides the artist extract of the 2012-2016 ACS PUMS. The ACS is an ongoing statistical survey that samples a small percentage of the population every year, giving communities the information they need to plan investments and services.
Study Design View help for Study Design
The American Community Survey (ACS) uses a series of monthly samples to produce annually updated estimates for the same small areas (census tracts and block groups) formerly surveyed via the decennial census long-form sample. The Public Use Microdata Sample (PUMS) contains a sample of actual responses to ACS. The PUMS dataset includes variables for nearly every question on the survey, as well as many new variables that were derived after the fact from multiple survey responses (such as poverty status). Each record in the file represents a single person, or in the household-level dataset, a single housing unit. PUMS files covering a five-year period, such as 2012-2016, contain data on approximately five percent of the United States population.
Sample View help for Sample
The American Community Survey (ACS) and consists of two separate samples: housing unit (HU) addresses and residents of group quarters (GQ) facilities. The Census Bureau derives the sampling frames from which it draws these samples from the Census Bureau's Master Address File (MAF). The MAF is the Census Bureau's official inventory of known living quarters and selected nonresidential units in the United States.
The Census Bureau selects independent HU address samples for each of the 3,143 counties and county equivalents in the United States, including the District of Columbia. Full-implementation samples of GQ facilities and persons are selected independently within each state, including the District of Columbia. In 2004, the bureau selected samples of HU addresses for every county and county equivalent for field data collection in 2005. Each year from 2005-2010, we selected approximately 2.9 million HU addresses in the U.S. Beginning in 2011, Census implemented the following changes to the ACS sample designs:
- Increased the HU sample in June 2011, bringing the size of the sample selected to 3.54 million addresses per year.
- Added several new HU sampling rates that better control the allocation of the sample and improve estimate reliability for small areas.
- Increased the follow-up sample to 100 percent in select geographic areas.
In addition, starting in 2013, the Census bureau restricted the assignment of the GQ sample for college dorms to the non-summer months (January-April, September-December).
Full-implementation samples of GQ facilities and persons are selected independently within each state, including the District of Columbia. This began in 2006. In 2006 and 2007, the ACS and the PRCS included approximately 2.5 percent of the expected number of residents in GQ facilities. Beginning in 2008, the bureau increased the sampling rates in 16 states with small GQ populations to meet publication thresholds. See Chapters 7 and 8 in the American Community Survey Design and Methodology Report for further details about the data collection methods.
Time Method View help for Time Method
Universe View help for Universe
All persons in households and group quarters in the United States with one or more persons having a detailed occupation code considered "artist" by National Endowment for the Arts.
Unit(s) of Observation View help for Unit(s) of Observation
Data Type(s) View help for Data Type(s)
Mode of Data Collection View help for Mode of Data Collection
Description of Variables View help for Description of Variables
The ACS provides social, housing, and economic characteristics for demographic groups covering a broad spectrum of geographic areas in the United States. Social characteristics variables include school enrollment, educational attainment, marital status, fertility, grandparents caring for children, veteran status, type of disability, health insurance, place of birth, United States citizenship status, year of entry, year of naturalization, language spoken at home, and ancestry. Variables focusing on economic characteristics include employment status, commuting to work, occupation, industry, class of worker, income and benefits, and poverty status. Demographic variables include sex, age, relationship of person to the selected respondent, race, and Hispanic origin.
HideOriginal Release Date View help for Original Release Date
2018-04-12
Version History View help for Version History
2018-04-12 ICPSR data undergo a confidentiality review and are altered when necessary to limit the risk of disclosure. ICPSR also routinely creates ready-to-go data files along with setups in the major statistical software formats as well as standard codebooks to accompany the data. In addition to these procedures, ICPSR performed the following processing steps for this data collection:
- Created variable labels and/or value labels.
- Standardized missing values.
- Created online analysis version with question text.
- Checked for undocumented or out-of-range codes.
Weight View help for Weight
The data contain weight variables:
- PWGTP: Person's weight for generating statistics on individuals (such as age). This weight reflects the probability of selection and is adjusted for interviewed households to account for noninterviews.
- PWGTP1-PWGTP80: Replicate weighting variables, used for generating the most accurate standard errors for individuals.
For additional details about the weight variable and replicate weights, users should refer to relevant sections of the Technical User Guide and the American Community Survey Design and Methodology Report.
HideNotes
The public-use data files in this collection are available for access by the general public. Access does not require affiliation with an ICPSR member institution.