Version Date: Oct 7, 2022 View help for published
Principal Investigator(s): View help for Principal Investigator(s)
United States Department of Health and Human Services. National Institutes of Health. National Institute on Drug Abuse;
United States Department of Health and Human Services. Food and Drug Administration. Center for Tobacco Products
Series:
https://doi.org/10.3886/ICPSR38008.v7
Version V7 (see more versions)
You are currently viewing an older version of this data collection. A more recent version may be available by selecting (see more versions)
Additional information about this collection can be found in Version History.
2022-10-07 Update to Public-Use Master Linkage Files (DS0001) to include variables for new files in the SCPUF collection (ICPSR 37786): Wave 5.5 Questionnaire data and weights (DS2001, DS2002, DS2111, DS2112, DS2121, DS2122, DS2221, and DS2222) and PATH-ATS data and weights (DS3001, DS3111, and DS3121). Also included is one new variable to reflect the addition of a new file in the PUF collection (ICPSR 36498): Wave 5 Ever/Never Reference (DS5503). The BAP variables for Waves 1 to 5 were updated to reflect current availability of biospecimens, including urine collected from youth in Waves 4 and 5. Updated BAP variables in the Restricted-Use Master Linkage Files (DS0002) to reflect current availability of biospecimens, including urine collected from youth in Waves 4 and 5.
2022-05-11 Update to Restricted-Use Master Linkage Files to include variables for new files in the BRUF collection (ICPSR 36840) including single-wave weights for the Wave 4 Biomarker Core.
2022-04-21 Update to Restricted-Use Master Linkage Files (DS0002) to include variables for new files in the SCRUF collection (ICPSR 37519): Wave 5.5 Questionnaire data and weights (DS2001, DS2002, DS2111, DS2112, DS2121, DS2122, DS2221, and DS2222), Wave 5.5 State Identifier data (DS2401 and DS2402), PATH-ATS data and weights (DS3001, DS3111, and DS3121), and PATH-ATS State Identifier data (DS3401). Also included is one new variable to reflect addition of a new file in the RUF collection (ICPSR 36231): Wave 5 Ever/Never Reference (DS5503). The BAP variables for Waves 1 to 4 were updated to reflect current availability of biospecimens. Updated BAP variables in the Public-Use Master Linkage Files (DS0001) to reflect current availability of biospecimens.
2021-12-16 Update to Restricted-Use Master Linkage Files to include variables for new files in the BRUF collection (ICPSR 36840): additional Wave 3 Urine Panel Assays and accompanying weights (DS3038, DS3023, and DS3024) and Wave 5 Urine Collection (DS5001), Urine Weights (DS5021 and DS5022), and Urine Panel Assays (DS5032, DS5033, DS5036, and DS5037).
2021-09-29 Update to Public-Use Master Linkage Files to include variables for Wave 5 (ICPSR 36498).
2021-06-03 Update to Restricted-Use Master Linkage Files to include new variables related to Biomarker Restricted-Use Files (ICPSR 36840) additional Wave 4 Urine Panel Assays (DS4035 and DS4037).
2021-04-27 ICPSR data undergo a confidentiality review and are altered when necessary to limit the risk of disclosure. ICPSR also routinely creates ready-to-go data files along with setups in the major statistical software formats as well as standard codebooks to accompany the data. In addition to these procedures, ICPSR performed the following processing steps for this data collection:
The PATH Study was launched in 2011 to inform the Food and Drug Administration's regulatory activities under the Family Smoking Prevention and Tobacco Control Act (TCA). The PATH Study is a collaboration between the National Institute on Drug Abuse (NIDA), National Institutes of Health (NIH), and the Center for Tobacco Products (CTP), Food and Drug Administration (FDA). The study sampled over 150,000 mailing addresses across the United States to create a national sample of tobacco users and non-users.
45,971 adults and youth constitute the first (baseline) wave, Wave 1, of data collected by this longitudinal cohort study. These 45,971 adults and youth along with 7,207 "shadow youth" (youth ages 9 to 11 sampled at Wave 1) make up the 53,178 participants that constitute the Wave 1 Cohort. Respondents are asked to complete an interview at each follow-up wave. Youth who turn 18 by the current wave of data collection are considered "aged-up adults" and are invited to complete the Adult Interview. Additionally, "shadow youth" are considered "aged-up youth" upon turning 12 years old, when they are asked to complete an interview after parental consent.
At Wave 4, a probability sample of 14,098 adults, youth, and shadow youth ages 10 to 11 was selected from the civilian, noninstitutionalized population at the time of Wave 4. This sample was recruited from residential addresses not selected for Wave 1 in the same sampled Primary Sampling Units (PSU-s) and segments using similar within-household sampling procedures. This "replenishment sample" was combined for estimation and analysis purposes with Wave 4 adult and youth respondents from the Wave 1 Cohort who were in the civilian, noninstitutionalized population at the time of Wave 4. This combined set of Wave 4 participants, 52,731 participants in total, forms the Wave 4 Cohort.
Please refer to the Restricted-Use Files User Guide that provides further details about children designated as "shadow youth" and the formation of the Wave 1 and Wave 4 Cohorts.
Dataset 0001 (DS0001) contains the data from the Public-Use Master Linkage File (PUF-MLF). This file contains 51 variables and 67,276 cases. The file provides a master list of every person's unique identification number and what type of respondent they were in each wave for data that are available in the Public-Use Files and Special Collection Public-Use Files.
Dataset 0002 (DS0002) contains the data from the Restricted-Use Master Linkage File (RUF-MLF). This file contains 122 variables and 67,276 cases. The file provides a master list of every person's unique identification number and what type of respondent they were in each wave for data that are available in the Restricted-Use Files, Special Collection Restricted-Use Files, and Biomarker Restricted-Use Files.
Export Citation:
None
Users are reminded that these data are to be used solely for statistical analysis and reporting of aggregated information, and not for the investigation of specific individuals or organizations.
Access to the RUF-MLF data is restricted. Users interested in obtaining these data must complete a Restricted Data Use Agreement. Data are provided via ICPSR's Virtual Data Enclave (VDE). Apply for access to these data through the ICPSR VDE portal. Information and instructions are available within the data portal. For further assistance please reference the VDE Guide to learn about the application process, about using the VDE, and how to request disclosure review of VDE output.
The PATH Study Data User Forum allows researchers using any PATH Study data files to communicate with each other to ask and answer questions. Announcements, data releases and updates, new publications, upcoming events, and other information for PATH Study data users will also be posted to the forum.
The PUF-MLF is available for access by the general public. For the RUF-MLF, data are provided via ICPSR's Virtual Data Enclave (VDE) where researchers will work with data stored on secure ICPSR servers. Researchers will not possess actual physical copies of the data; however, they may request permission to access selected output outside the virtual environment after review by ICPSR. See the Access Notes to apply for access. Researchers are also encouraged to read the VDE Guide.
The data files contain person-level (PERSONID) across waves of data collection. The PERSONID values are random and contain no direct or indirect personally identifiable information. Chapter 7 in the Public-Use Files User Guide contains information about linking data available for public-use. Appendix E in the Restricted-Use Files User Guide also contains information and programming code on linking files together. The files are sorted by the variable PERSONID.
The PUF-MLF includes indicator variables for the availability of interview data and weights for each participant. It also includes variables that indicate availability of biospecimens through the Biospecimen Access Program (BAP). The PUF-MLF can help analysts identify which Public-Use files contain data for a particular participant (or set of participants).
The RUF-MLF includes indicator variables for the availability of interview data, weights, state identifier data, tobacco Universal Product Code (UPC) data, and biomarker data for each participant. It also includes variables that indicate availability of biospecimens through the BAP. The RUF-MLF can help analysts identify which Restricted-Use files contain data for a particular participant (or set of participants).
The RUF-MLF will be extended as new data are released in the PATH Study RUF, Special Collection RUF, and Biomarker RUF collections. The PUF-MLF will be extended as new data are released in the PATH Study PUF and Special Collection PUF collections.
The PATH Study's documentation is available for your use and may be reproduced in whole or in part without permission from NIH's National Institute on Drug Abuse or FDA's Center for Tobacco Products. Citation of the source is appreciated.
Additional background information including answers to frequently asked questions for study participants and researchers can be found in the Researchers section of the PATH Study Series page.
There are a variety of user guides available that describe the PATH Study as well as the use of specific types of data. Researchers can access the user guides on the PATH Study Series page or through the various collections: Restricted-Use Files, Public-Use Files, Special Collection Restricted-Use Files, Special Collection Public-Use Files, or Biomarker Restricted-Use Files.
2021-04-27 Latest versions of RUF-MLF and PUF-MLF were added to the collection, consolidating the various MLFs that were in each collection: Restricted-Use Files, Public-Use Files, Special Collection Restricted-Use Files, Special Collection Public-Use Files, or Biomarker Restricted-Use Files.
The data for the PATH Study was collected and prepared by Westat. The contract numbers under which they performed their work are HHSN271201100027C and HHSN271201600001C.
The Population Assessment of Tobacco and Health (PATH) Study is a nationally representative longitudinal cohort study on tobacco use behavior, attitudes and beliefs, and tobacco-related health outcomes among adults and youth in the United States. The study's primary objectives are to:
At Wave 1, the study sampled over 150,000 mailing addresses which, using a four-staged stratified sampling design, yielded a sample of 45,971 respondents (32,320 adults / 13,651 youth) who completed a Wave 1 interview. Tobacco users and non-users who were at least 9 years old living in a civilian, non-institutionalized setting were considered for participation during Wave 1. Youth who turn 18 by the next wave of data collection are considered "aged-up adults" and are invited to complete the Adult Interview. Additionally, 7,207 "shadow youth" (youth ages 9 to 11 sampled at Wave 1) are considered "aged-up youth" upon turning 12 years old when they are asked to join the study. These 53,178 participants form the Wave 1 Cohort.
At Wave 4, a probability sample of 14,098 adults, youth, and shadow youth ages 10 to 11 was selected from the civilian, noninstitutionalized population at the time of Wave 4. This sample was recruited from close to 174,000 mailing addresses not selected for Wave 1, in the same sampled PSUs and segments using similar within-household sampling procedures. To meet the needs for the Wave 4 Cohort shadow sample, a randomly selected subset of the sampled addresses (115,500 or close to two-thirds of the addresses) were screened solely to identify shadow youth ages 10 to 11. The remaining addresses (close to 58,500) were screened for adults, youth, and shadow youth ages 10 to 11. These are referred to as the "SO" (shadow youth only) and "AYS" (adults, youth, and shadow youth) replenishment samples, respectively. This "replenishment sample" was combined for estimation and analysis purposes with Wave 4 adult and youth respondents from the Wave 1 Cohort who were in the civilian, noninstitutionalized population at the time of Wave 4. This combined set of Wave 4 participants, 52,731 participants in total, forms the Wave 4 Cohort.
A four-stage stratified area probability sample design was used in the PATH Study, with a two-phase design for sampling adults at the final stage. At the first stage, a stratified sample of geographical primary sampling units (PSUs) was selected, in which a PSU is a county or group of counties. For the second stage, within each selected PSU, smaller geographical segments were formed and then a sample of these segments was drawn. At the third stage, the sampling frame consisted of the residential addresses located in these segments. The fourth stage selected adults and youth from the sampled households identified at these addresses, with varying sampling rates for adults by age, race, and tobacco use status. Adults were sampled in two phases - Phase 1 sampling used information provided in the household screener and Phase 2 sampling used information provided by the adult in the Phase 2 screener at the beginning of the Adult instrument. Please consult the Public-Use Files User Guide or Restricted-Use Files User Guide for additional details about the sampling.
Users and non-users of tobacco products in the civilian, non-institutionalized household population of the United States aged 9 and older at the time of Wave 1 (Wave 1 Cohort); Users and non-users of tobacco products in the civilian, non-institutionalized household population of the United States aged 10 and older at the time of Wave 4 (Wave 4 Cohort)
In the PUF-MLF, indicator variables that identify the availability of interview data, weights, and biospecimens (through the BAP) for each participant (or set of participants) with Public-Use data.
In the RUF-MLF, indicator variables that identify the availability of interview data, weights, biomarker data, and biospecimens (through the BAP) for each participant (or set of participants) with Restricted-Use data.
Hide2021-04-27
2022-10-07 Update to Public-Use Master Linkage Files (DS0001) to include variables for new files in the SCPUF collection (ICPSR 37786): Wave 5.5 Questionnaire data and weights (DS2001, DS2002, DS2111, DS2112, DS2121, DS2122, DS2221, and DS2222) and PATH-ATS data and weights (DS3001, DS3111, and DS3121). Also included is one new variable to reflect the addition of a new file in the PUF collection (ICPSR 36498): Wave 5 Ever/Never Reference (DS5503). The BAP variables for Waves 1 to 5 were updated to reflect current availability of biospecimens, including urine collected from youth in Waves 4 and 5. Updated BAP variables in the Restricted-Use Master Linkage Files (DS0002) to reflect current availability of biospecimens, including urine collected from youth in Waves 4 and 5.
2022-05-11 Update to Restricted-Use Master Linkage Files to include variables for new files in the BRUF collection (ICPSR 36840) including single-wave weights for the Wave 4 Biomarker Core.
2022-04-21 Update to Restricted-Use Master Linkage Files (DS0002) to include variables for new files in the SCRUF collection (ICPSR 37519): Wave 5.5 Questionnaire data and weights (DS2001, DS2002, DS2111, DS2112, DS2121, DS2122, DS2221, and DS2222), Wave 5.5 State Identifier data (DS2401 and DS2402), PATH-ATS data and weights (DS3001, DS3111, and DS3121), and PATH-ATS State Identifier data (DS3401). Also included is one new variable to reflect addition of a new file in the RUF collection (ICPSR 36231): Wave 5 Ever/Never Reference (DS5503). The BAP variables for Waves 1 to 4 were updated to reflect current availability of biospecimens. Updated BAP variables in the Public-Use Master Linkage Files (DS0001) to reflect current availability of biospecimens.
2021-12-16 Update to Restricted-Use Master Linkage Files to include variables for new files in the BRUF collection (ICPSR 36840): additional Wave 3 Urine Panel Assays and accompanying weights (DS3038, DS3023, and DS3024) and Wave 5 Urine Collection (DS5001), Urine Weights (DS5021 and DS5022), and Urine Panel Assays (DS5032, DS5033, DS5036, and DS5037).
2021-09-29 Update to Public-Use Master Linkage Files to include variables for Wave 5 (ICPSR 36498).
2021-06-03 Update to Restricted-Use Master Linkage Files to include new variables related to Biomarker Restricted-Use Files (ICPSR 36840) additional Wave 4 Urine Panel Assays (DS4035 and DS4037).
2021-04-27 ICPSR data undergo a confidentiality review and are altered when necessary to limit the risk of disclosure. ICPSR also routinely creates ready-to-go data files along with setups in the major statistical software formats as well as standard codebooks to accompany the data. In addition to these procedures, ICPSR performed the following processing steps for this data collection: