Version Date: Oct 7, 2022 View help for published
Principal Investigator(s): View help for Principal Investigator(s)
United States Department of Health and Human Services. National Institutes of Health. National Institute on Drug Abuse;
United States Department of Health and Human Services. Food and Drug Administration. Center for Tobacco Products
Series:
https://doi.org/10.3886/ICPSR37786.v4
Version V4 (see more versions)
You are currently viewing an older version of this data collection. A more recent version may be available by selecting (see more versions)
Additional information about this collection can be found in Version History.
2022-10-13 2022-10-07 Wave 5.5 Adult and Youth/Parent Questionnaire and Weight data files were added to the collection. PATH-ATS Questionnaire and Weight files were also added to the collection. The Public-Use Files (PUF) User Guide was updated.
2021-09-29 Wave 4.5 All Participants - Ever/Never Reference Data (DS1503) was added to the study collection.
2021-08-17 Data and documentation related to the Master Linkage File were retired: please see the Master Linkage File Study (ICPSR 38008).
2020-09-15 ICPSR data undergo a confidentiality review and are altered when necessary to limit the risk of disclosure. ICPSR also routinely creates ready-to-go data files along with setups in the major statistical software formats as well as standard codebooks to accompany the data. In addition to these procedures, ICPSR performed the following processing steps for this data collection:
The PATH Study was launched in 2011 to inform the Food and Drug Administration's regulatory activities under the Family Smoking Prevention and Tobacco Control Act (TCA). The PATH Study is a collaboration between the National Institute on Drug Abuse (NIDA), National Institutes of Health (NIH), and the Center for Tobacco Products (CTP), Food and Drug Administration (FDA). The study sampled over 150,000 mailing addresses across the United States to create a national sample of tobacco users and non-users.
45,971 adults and youth constitute the first (baseline) wave, Wave 1, of data collected by this longitudinal cohort study. These 45,971 adults and youth along with 7,207 "shadow youth" (youth ages 9 to 11 sampled at Wave 1) make up the 53,178 participants that constitute the Wave 1 Cohort. Respondents are asked to complete an interview at each follow-up wave. Youth who turn 18 by the current wave of data collection are considered "aged-up adults" and are invited to complete the Adult Interview. Additionally, "shadow youth" are considered "aged-up youth" upon turning 12 years old, when they are asked to complete an interview after parental consent.
At Wave 4, a probability sample of 14,098 adults, youth, and shadow youth ages 10 to 11 was selected from the civilian, noninstitutionalized population at the time of Wave 4. This sample was recruited from residential addresses not selected for Wave 1 in the same sampled Primary Sampling Units (PSUs) and segments using similar within-household sampling procedures. This "replenishment sample" was combined for estimation and analysis purposes with Wave 4 adult and youth respondents from the Wave 1 Cohort who were in the civilian, noninstitutionalized population at the time of Wave 4. This combined set of Wave 4 participants, 52,731 participants in total, forms the Wave 4 Cohort. Please refer to the Public-Use Files User Guide that provides further details about children designated as "shadow youth" and the formation of the Wave 1 and Wave 4 Cohorts.
Wave 4.5 was a special data collection for youth only who were aged 12 to 17 at the time of the Wave 4.5 interview. Wave 4.5 was the fourth annual follow-up wave for those who were members of the Wave 1 Cohort. For those who were sampled at Wave 4, Wave 4.5 was the first annual follow-up wave. Wave 5.5, conducted in 2020, was a special data collection for Wave 4 Cohort youth and young adults ages 13 to 19 at the time of the Wave 5.5 interview. Also in 2020, a subsample of Wave 4 Cohort adults ages 20 and older were interviewed via the PATH Study Adult Telephone Survey (PATH-ATS).
Dataset 1002 (DS1002) contains the data from the Wave 4.5 Youth (and Parent) Questionnaire. This file contains 1,395 variables and 13,131 cases. Of these cases, 11,378 are continuing youth having completed a prior Youth Interview. The other 1,753 cases are "aged-up youth" having previously been sampled as "shadow youth."
Datasets 1112, 1212, and 1222, (DS1112, DS1212, and DS1222) are data files comprising the weight variables for Wave 4.5. The "all-waves" weight file contains weights for participants in the Wave 1 Cohort who completed a Wave 4.5 Youth Interview and completed interviews (if old enough to do so) or verified their information with the study (if not old enough to be interviewed) in Waves 1, 2, 3, and 4.
There are two separate files with "single wave" weights: one for the Wave 1 Cohort and one for the Wave 4 Cohort. The "single-wave" weight file for the Wave 1 Cohort contains weights for youth who completed an interview in Wave 1 and in Wave 4.5, regardless of their participation in the intervening waves. The "single-wave" weight file for the Wave 4 Cohort contains weights for all Wave 4.5 Youth Interview respondents in the Wave 4 Cohort.
Dataset 1503 (DS1503) contains data derived from responses to questionnaires in Wave 1, Wave 2, Wave 3, Wave 4, and Wave 4.5 indicating if participants had ever/never used various tobacco products as of the Wave 4.5 data collection period. This data file contains 26 variables for all 67,276 study participants as of the Wave 4.5 data collection. This file is provided for reference only to simplify the definitions of tobacco use variables in the Adult and Youth data files for subsequent waves.
Dataset 2001 (DS2001) contains the data from the Wave 5.5 Adult Questionnaire. This file contains 2,323 variables and 3,628 cases. Of these cases, 1,014 are continuing adults having completed a prior Adult Questionnaire. The other 2,614 cases are "aged-up adults" having previously completed a Youth Questionnaire.
Dataset 2002 (DS2002) contains the data from the Wave 5.5 Youth (and Parent) Questionnaire. This file contains 1,625 variables and 7,129 cases. Of these cases, 7,076 are continuing youth having completed a prior Youth Interview. The other 53 cases are "aged-up youth" having previously been sampled as "shadow youth."
Datasets 2111, 2112, 2121, 2122, 2221, and 2222 (DS2111, DS2112, DS2121, DS2122, DS2221, and DS2222) are data files comprising the weight variables for Wave 5.5. In Wave 5.5, the weight variables are in individual data files corresponding to the Wave 1 and Wave 4 Cohorts and different weight types.
There are two separate sets of files with "all-waves" weights: one for the Wave 1 Cohort and one for the Wave 4 Cohort. The "all-waves" weight file for the Wave 1 Cohort contains weights for participants who completed a Wave 5.5 interview and completed interviews (if old enough to do so) or verified their information (if not old enough to be interviewed) in Waves 1, 2, 3, 4, 4.5, and 5. The "all-waves" weight file for the Wave 4 Cohort contains weights for participants who completed a Wave 5.5 interview and completed interviews (if old enough to do so) or verified their information (if not old enough to be interviewed) in Waves 4, 4.5, and 5.
The "single-wave" weight file for the Wave 4 Cohort contains weights for all Wave 5.5 interview respondents.
Dataset 3001 (DS3001) contains the data from PATH-ATS. This file contains 908 variables and 8,874 cases, all of which are continuing adults having completed a prior Adult Questionnaire, with their most recent interview in Wave 5.
Datasets 3111 and 3121 (DS3111 and DS3121) are data files comprising weights for PATH-ATS. In PATH-ATS, weight variables are in individual files corresponding to the Wave 1 and Wave 4 Cohorts.
The "all-waves" weight file for the Wave 1 Cohort contains weights for participants who completed an interview in PATH-ATS and completed interviews in Waves 1, 2, 3, 4, and 5. The "all-waves" weight file for the Wave 4 Cohort contains weights for participants who completed an interview in PATH-ATS; all PATH-ATS respondents completed interviews in Wave 4 and Wave 5.
Export Citation:
None
Users are reminded that these data are to be used solely for statistical analysis and reporting of aggregated information, and not for the investigation of specific individuals or organizations.
The PATH Study Data User Forum allows researchers using any PATH Study data files to communicate with each other to ask and answer questions. Announcements, data releases and updates, new publications, upcoming events, and other information for PATH Study data users will also be posted to the forum.
The data files contain person-level (PERSONID) variables allowing linkage of people across waves of data collection. The values in this variable are random and contain no direct or indirect personally identifiable information. Please review Chapter 7 in the Public-Use Files User Guide for information on linking files together. The files are sorted by the variable PERSONID.
ICPSR attempted to duplicate all information contained in the questionnaires into the question text used in the codebooks. Some of the longer programming instructions were not incorporated into the question text. In these cases, the question text includes a note for the user to read the full programming instructions in the corresponding section of the questionnaire. Derived and imputed variables contain the algorithms used in the creation of these variables. Users are advised to refer to the Public-Use Files User Guide and annotated questionnaires when reviewing the codebooks.
Some variables were withheld to limit the release of information that is a potential risk for disclosure. These variables are listed in Appendix C in the Public-Use Files User Guide.
The Youth Interview and Parent Interview questionnaires were distinct and separate questionnaires used in data collection. However, both instruments have been combined into a single document since the responses to these instruments are also combined into a single data file.
The Youth questionnaires in Wave 4.5 includes several questions about tobacco brands and products the respondent usually uses and most recently used. For each question, a list of response options was displayed on the computer screen for the respondent to select. For many major brands and products, the displayed list included both a text label and a thumbnail image of the brand logo or product package. The displayed list was different for each of the tobacco product types with the brands and products listed being those that were known to exist for the specific tobacco product type. Wave 5.5 Adult, Wave 5.5 Youth and PATH-ATS questionnaires also include several questions about tobacco brands and products respondents typically used and in all three, verbal responses to these questions were coded by telephone interviewers. Because these lists are long, they are not provided in a frequency table for each variable in the codebook or in the annotated instrument. For convenience, both the Adult and Youth/Parent codebooks contain an appendix with a frequency table of the top 20 responses for each variable. The PATH Study Master Tobacco Brand and Product Code Guide is available as an Excel workbook file [Documentation.xlsx (Tobacco_Brand)]. The spreadsheets in this Excel workbook file are protected and may not be edited. However, the last spreadsheet contains filters to narrow the complete list. This spreadsheet is the master file of all brand and product responses for these questions from all waves, including any responses that were not in the list of options displayed to the respondent.
In the Parent Interview section, the same questions were asked of parents of all sampled youth except for the emancipated youth. In this section the cases for emancipated youth were coded as "Inapplicable". There are a small number of emancipated youth in Waves 4.5 and 5.5, but there are no individual questions asked exclusively of emancipated youth.
In both the Adult (Wave 5.5) and Youth/Parent data files (Waves 4.5 and 5.5), several groups of variables contain the word "RANDOM" in both the variable name and label. This indicates computerized randomization of the question order. These "RANDOM" variables detail the order in which the questions were asked of a particular respondent. No questions were randomized in PATH-ATS.
All Adult and Youth/Parent data files contain additional derived variables. These variables can be distinguished by the variable name starting with "X0#R" (Waves 4.5 and 5.5) or "T05R" (PATH-ATS) and contain the word "DERIVED" in the variable label. There are several variables for each tobacco category to identify certain classes of current and former tobacco users.
In accordance with the study's informed consent, information is suppressed about individuals who withdrew from the PATH Study. Their information was recoded to a special missing value, designated as -97777.
Consent forms provided to and signed by the respondents for the various types of interviews conducted and biological samples collected are included with Wave 1 and Wave 4 files (Informed Consent forms used for Wave 1 and the Wave 4 Informed Consent form is provided with the Wave 4 files). Participants provide consent at their initial interview and biological sample collection; consents remain in effect for all subsequent waves.
The Nonresponse Bias Analysis Reports for Wave 4.5 and Wave 5.5 detail the response rates and the potential for bias from nonresponse in each respective wave. The Nonresponse Bias Analysis Report for PATH-ATS is forthcoming.
The questionnaires in this collection are updated versions of the fielded questionnaires that were annotated for analytic purposes.
The PATH Study's documentation is available for your use and may be reproduced in whole or in part without permission from NIH's National Institute on Drug Abuse or FDA's Center for Tobacco Products. Citation of the source is appreciated.
Additional background information including answers to frequently asked questions for study participants and researchers can be found in the Researchers section of the PATH Study Series page.
The Public-Use Files User Guide provides an overview of the entire PATH Study. The guide covers topics such as sample design, data collection, weighting, response rates, and programming syntax to run common statistics and link the files together. Researchers should feel free to use the information in the User Guide for their publication and the guide should be cited as follows:
United States Department of Health and Human Services. National Institutes of Health. National Institute on Drug Abuse, and United States Department of Health and Human Services. Food and Drug Administration. Center for Tobacco Products. Population Assessment of Tobacco and Health (PATH) Study [United States] Special Collection Public-Use Files, User Guide. ICPSR37786-v2 Ann Arbor, MI: Inter-university Consortium for Political and Social Research [distributor], 2021-07-29. http://doi.org/10.3886/ICPSR37786.userguide
Work for Wave 4.5, Wave 5.5, and PATH-ATS was performed under contract number HHSN271201600001C.
The Population Assessment of Tobacco and Health (PATH) Study is a nationally representative longitudinal cohort study on tobacco use behavior, attitudes and beliefs, and tobacco-related health outcomes among adults and youth in the United States. The study's primary objectives are to:
At Wave 1, the study sampled over 150,000 mailing addresses which, using a four-staged stratified sampling design, yielded a sample of 45,971 respondents (32,230 adults/ 13,651 youth) who completed a Wave 1 interview. Tobacco users and non-users who were at least 9 years old living a civilian, non-institutionalized setting were considered for participation during Wave 1. Youth who turn 18 by the next wave of data collection are considered "aged-up adults" and are invited to complete the Adult Interview. Additionally, 7,207 "shadow youth" (youth ages 9 to 11 sampled at Wave 1) are considered "aged-up youth" upon turning 12 years old when they are asked to join the study. These 53,178 participants form the Wave 1 Cohort.
At Wave 4, a probability sample of 14,098 adults, youth, and shadow youth ages 10 to 11 was selected from the U.S. civilian, noninstitutionalized population (CNP) at the time of Wave 4. This sample was recruited from close to 174,000 mailing addresses not selected for Wave 1, in the same sampled PSUs and segments using similar within-household sampling procedures. To meet the needs for the Wave 4 Cohort shadow sample, a randomly selected subset of the sampled addresses (115,500 or close to two-thirds of the addresses) were screened solely to identify shadow youth ages 10 to 11. The remaining addresses (close to 58,800) were screened for adults, youth, and shadow youth ages 10 to 11. These are referred to as the "SO" (shadow youth only) and "AYS" (Adults, youth, and shadow youth) replenishment samples, respectively. This "replenishment sample" was combined for estimation and analysis purposes with Wave 4 adult and youth respondents from the Wave 1 Cohort who were in the U.S. CNP at the time of Wave 4. This combined set of Wave 4 participants, 52,371 participants in total, forms the Wave 4 Cohort.
The target population for the Wave 1 Cohort in Wave 4.5 is the resident population of the U.S. and ages 13 to 17 at the time of Wave 4.5 (other than those who were incarcerated) who were in the U.S. CNP at the time of Wave 1. The target population for the Wave 4 Cohort in Wave 4.5 is the resident population of the U.S. and ages 12 to 17 at the time of Wave 4.5 (other than those who were incarcerated) who were in the U.S. CNP at the time of Wave 4.
The target population for the Wave 1 Cohort in Wave 5.5 is the resident population of the U.S. and ages 15 to-17 in the latter portion of 2020 (other than those who were incarcerated) who were in the U.S. CNP at the time of Wave 1. The target population for the Wave 4 Cohort in Wave 5.5 is the resident population of the U.S. and ages 13 to-19 in the latter portion of 2020 (other than those who were incarcerated) who were in the U.S. CNP at the time of Wave 4. Note that data from Wave 5.5 and PATH-ATS can be combined to make cohort-specific estimates for the target populations 15 and older (Wave 1 Cohort) or 13 and older (Wave 4 Cohort).
The Adult files contain a single record for every adult who completed an interview in each relevant special collection. The Youth/Parent file contains a single record of every youth who completed an interview in each relevant special collection. Parents who provided permission for their child to complete the Youth Interview were asked to complete a brief Parent Interview that contained questions about parental supervision, school performance, and tobacco use by youth. The Parent Interview is primarily an interview about the child(ren), not the parent. Almost all youth respondents had a parent or guardian complete the Parent Interview (over 99.0 percent). When multiple youth from the same household were selected to be in the study, the parent(s) completed separate interviews about each youth. If one parent completed multiple interviews, then questions asked about him or her were only asked once and skipped in the other interview(s). The parent's responses were then duplicated for the other child or children.
A $2 incentive was mailed to all addresses sampled at Wave 1 and Wave 4 prior to screening. Adult respondents were paid $35 for their participation in Wave 1, Wave 2, Wave 3, and Wave 4. In Wave 1, Wave 2, Wave 3, Wave 4, and Wave 4.5, youth were paid $25 to complete the Youth Interview, and their parents were given $10 for each parent interview.
A four-stage stratified area probability sample design was used in the PATH Study, with a two-phase design for sampling adults at the final stage. At the first stage, a stratified sample of geographical primary sampling units was selected, in which a PSU is a county or group of counties. For the second stage, within each selected PSU, smaller geographical segments were formed and then a sample of these segments was drawn. At the third stage, the sampling frame consisted of the residential addresses located in these segments. The fourth stage selected adults and youth from the sampled households identified at these addresses, with varying sampling rates for adults by age, race, and tobacco use status. Adults were sampled in two phases - Phase 1 sampling used information provided in the household screener and Phase 2 sampling used information provided by the adult in the Phase 2 screener at the beginning of the Adult instrument. Please consult the Public-Use Files User Guide for additional details about the sampling. There was no additional sampling for Wave 4.5. Wave 4.5 is a special data collection in which PATH Study participants ages 12 to 17 at the time of Wave 4.5 were interviewed.
There was no additional sampling for Wave 5.5. Wave 5.5 was a special data collection in which PATH Study participants ages 13 to 19 at the time of Wave 5.5 were interviewed. Data collection began in December 2019 using the same in-person procedures as in previous PATH Study non-replenishment waves. However, in-person data collection was suspended on March 17, 2020 due to the COVID-19 pandemic. Data collection resumed via telephone on July 3, 2020, and continued until December 31, 2020. Because of potential changes in tobacco-use behaviors due to the COVID-19 pandemic and absence of data collection for an extended period, it was decided that the data collected in person would not made available to researchers. To overcome concerns of biased estimates, all participants who completed a Wave 5.5 interview in person on or before March 17, 2020, and were still age-eligible for Wave 5.5 (i.e., ages 13 to-19) were re-contacted for an interview by telephone starting August 3, 2020. Only data collected via telephone are available in the Wave 5.5 data files for producing Wave 5.5 estimates.
Participants eligible for the PATH-ATS sample were ages 20 and older on August 31, 2020, part of the Wave 4 Cohort, and respondents to the Wave 5 Adult Interview. A stratified random sample of 18,601 PATH Study participants was selected for PATH-ATS from a total of 31,343 eligible participants, with oversampling and undersampling based on age (ages 20 to 24, ages 25 and older), tobacco product use (electronic nicotine delivery systems (ENDS), cigarettes), and frequency of use (ever, past 12 months, past 30 days).
The resident population of the United States who were ages 13 to 17 at the time of Wave 4.5, ages 15 to 19 at the time of Wave 5.5, or age 20 and older at the time of PATH-ATS (other than those who were incarcerated) and part of the civilian, non-institutionalized household population of the United States at the time of Wave 1 (Wave 1 Cohort); the resident population of the United States who were ages 12 to 17 at the time of Wave 4.5, ages 13 to 19 at the time of Wave 5.5, or age 20 and older at the time of PATH-ATS (other than those who were incarcerated) and part of the civilian, non-institutionalized household population of the United States at the time of Wave 4 (Wave 4 Cohort).
Parents and youths were asked about the following types of tobacco products:
Although each section on tobacco products has some unique questions, most questions fit into one of the following categories:
Additional topics include:
Most questions asked in the questionnaires are categorical. Other questions ask, for example, the age at which something occurred or the person's body measurements. Responses to these questions are numerical.
The weighted Wave 4.5 youth interview response rate for the Wave 1 Cohort (conditional on Wave 1 participation) was 74.6 percent.
The response rates for the Wave 4 Cohort of the PATH Study special collections are shown below. Wave 4 Cohort response rates are conditional on interview response or shadow youth participation at Wave 4 (for replenishment sample members selected as shadow youth); the PATH-ATS response rates are conditional on selection into the PATH-ATS sample.
Wave 4.5 Youth Interview: 89.1 percent (weighted)
Wave 5.5 Adult Interview: 69.9 percent (weighted)
Wave 5.5 Youth Interview: 66.8 percent (weighted)
PATH-ATS: 55.6 percent (weighted)
Please consult the Public-Use Files User Guide for further information regarding response rates.
Hide2020-09-15
2022-10-13 2022-10-07 Wave 5.5 Adult and Youth/Parent Questionnaire and Weight data files were added to the collection. PATH-ATS Questionnaire and Weight files were also added to the collection. The Public-Use Files (PUF) User Guide was updated.
2021-09-29 Wave 4.5 All Participants - Ever/Never Reference Data (DS1503) was added to the study collection.
2021-08-17 Data and documentation related to the Master Linkage File were retired: please see the Master Linkage File Study (ICPSR 38008).
2020-09-15 ICPSR data undergo a confidentiality review and are altered when necessary to limit the risk of disclosure. ICPSR also routinely creates ready-to-go data files along with setups in the major statistical software formats as well as standard codebooks to accompany the data. In addition to these procedures, ICPSR performed the following processing steps for this data collection:
At Wave 4.5, only youth ages 12 to 17 were interviewed, along with parents. There are two longitudinal weights available for analysis of Wave 4.5 data for the Wave 1 Cohort: the all-waves weight and the single-wave weight. The "all-waves" weight file contains weights for those Wave 1 Cohort participants who completed a Wave 4.5 interview and completed interviews (if old enough to do so) or verified their information (if not old enough to be interviewed) in Waves 1, 2, 3, and 4. The Wave 4.5 single-wave weight was assigned to participants who completed an interview in Wave 1 and in Wave 4.5, regardless of their participation in the intervening waves. In addition, there is a single-wave weight for all Wave 4.5 Youth Interview respondents in the Wave 4 Cohort.
At Wave 5.5, youth ages 13 to 17 (and their parents) and young adults ages 18 and 19 were interviewed. There are two longitudinal "all waves" weights available for analysis of Wave 5.5 data: one for the Wave 1 Cohort and one for the Wave 4 Cohort. The Wave 1 Cohort "all waves" weight file contains weights for those participants who completed a Wave 5.5 interview and completed interviews (if old enough to do so) or verified their information (if not old enough to be interviewed) in Waves 1, 2, 3, 4, 4.5, and 5. The Wave 4 Cohort "all waves" weight file contains weights for those participants who completed a Wave 5.5 interview and completed interviews (if old enough to do so) or verified their information (if not old enough to be interviewed) in Waves 4, 4.5, and 5. The Wave 5.5 single-wave weight for the Wave 4 Cohort was assigned to participants who completed an interview in Wave 5.5.
In PATH-ATS, adults ages 20 and older were interviewed. There are two longitudinal "all waves" weights available for the analysis of PATH-ATS data: one for the Wave 1 Cohort and one for the Wave 4 Cohort. The Wave 1 Cohort "all waves" weight file contains weights for those participants who completed a PATH-ATS Interview and completed interviews (if old enough to do so) or verified their information (if not old enough to be interviewed) in Waves 1, 2, 3, 4, and 5. The Wave 4 Cohort "all waves" weight file contains weights for those participants who completed a PATH-ATS Interview. There are no "single-wave" weights associated with PATH-ATS.
For each weight mentioned above, there are also 100 replicate weights and design variables (VARPSU and VARSTRAT) for use in variance estimation. Detailed information on how these variables were created, and how and why they should be used is provided in the Public-Use Files User Guide.
Hide