Population Assessment of Tobacco and Health (PATH) Study [United States] Restricted-Use Files (ICPSR 36231)

Version Date: Jun 3, 2019 View help for published

Principal Investigator(s): View help for Principal Investigator(s)
United States Department of Health and Human Services. National Institutes of Health. National Institute on Drug Abuse; United States Department of Health and Human Services. Food and Drug Administration. Center for Tobacco Products

Series:

https://doi.org/10.3886/ICPSR36231.v19

Version V19 ()

  • V30 [2022-04-21]
  • V29 [2021-12-16] unpublished
  • V28 [2021-11-11] unpublished
  • V27 [2021-06-29] unpublished
  • V26 [2021-02-23] unpublished
  • V25 [2020-06-24] unpublished
  • V24 [2020-06-22] unpublished
  • V23 [2020-03-31] unpublished
  • V22 [2020-03-23] unpublished
  • V21 [2019-11-21] unpublished
  • V20 [2019-11-05] unpublished
  • V19 [2019-06-03] unpublished
  • V18 [2019-04-08] unpublished
  • V17 [2019-02-01] unpublished
  • V16 [2019-02-01] unpublished
  • V15 [2018-09-28] unpublished
  • V14 [2018-05-01] unpublished
  • V13 [2017-06-19] unpublished

You are currently viewing an older version of this data collection. A more recent version may be available by selecting ()

Additional information about this collection can be found in Version History.

2019-06-03 Wave 4 Informed Consent documentation was added.

2019-05-30 2019-04-08 Data and documentation for the Master linkage file was updated. Wave 4 Adult and Youth Questionnaire and Weight data files have been added.

2019-02-01 Updating to include public codebooks for Wave 1 Adult and Youth State Identifier data files and the Master linkage file and the public User Guide for the State Identifier Restricted-Use Files.

2019-02-01 Wave 1 Adult and Youth State Identifier data files were added to the collection. Data and documentation for the Master linkage file was updated.

2018-10-01 2018-09-28 Data and documentation for the Master linkage file was updated. The Nonresponse Bias Analysis Report is now included for Wave 3.

2018-05-01

Wave 3 Adult and Youth data files were added to the collection. Wave 1 and Wave 2 Adult and Youth data files were updated to improve the clarity and consistency of variable labels, especially in the Nicotine Dependence section.

A new variable was added to Wave 1 and Wave 2 Adult data - R0#_ND_DATA_ROUTE. A second variable was added to the Wave 2 Adult data - R02R_A_P12M_BLUNTONLY_GRILLO. An additional 18 derived variables in the Wave 2 Adult data were revised and replaced the original variables. The newly named variables possess the original name, but also contain "_REV" at the end of the variable name.

A skip error was identified in the Wave 2 Adult instrument, which resulted in some respondents being asked two questions when they should not have been. Therefore, the affected items, R02_AG0100CG and R02_AG0100FC, contain some extra data. Notes were added to the annotated instrument and codebook to describe the issue.

The User Guide and Questionnaires were also updated to improve understanding of the data files. A Nonresponse Bias Analysis report is now included for Wave 2.

2018-02-15 The citation of this study may have changed due to the new version control system that has been implemented. The previous citation was:

  • United States Department of Health and Human Services. National Institutes of Health. National Institute on Drug Abuse, and United States Department of Health and Human Services. Food and Drug Administration. Center for Tobacco Products. Population Assessment of Tobacco and Health (PATH) Study [United States] Restricted-Use Files. ICPSR36231-v19. Ann Arbor, MI: Inter-university Consortium for Political and Social Research [distributor], 2019-06-03. http://doi.org/10.3886/ICPSR36231.v19

2017-06-19 The Wave 1 and Wave 2 data files, for both Adults and Youth, were updated to correct minor errors along with the questionnaires to correct minor typos and clarify specifications.

2017-04-27 A minor revision was made to the both the English and Spanish versions of the Wave 1 Adult questionnaire. The User Guide was also updated. Two Excel crosswalks, one for Adults and one for Youth, were added to the available documentation to highlight the differences between the Wave 1 and Wave 2 files.

2017-04-03 An update was made to internal files to correct an issue with how missing values are displayed online through ICPSR's variables database.

2017-03-23 Minor revisions were made to the Missing Values Code table within the User Guide and both Codebooks for Wave 2.

2017-03-15 Data from Wave 2 of the study were added to the collection. The User Guide and Master Tobacco Brand and Product Code Guide were expanded to include information for Wave 2.

2017-01-31 The variable R01X_CB_REGION in both the Wave 1 Adult and Youth/Parent files was updated to correct an error in the value labels. The values for codes 2 and 3 had been inadvertently swapped. The data did not change; only the value labels for codes 2 and 3 have been corrected.

2016-11-28 An additional 40 derived variables were added to the end of the Wave 1 Youth / Parent file that are similar to those already in the Wave 1 Adult file. Information for individuals who withdrew from the study is denoted in the datasets by the special missing value -97777. Spanish versions of the annotated instruments are also now available.

2016-05-24 The study's title changed with the removal of the year range. The Informed Consent Document and Non-Response Bias Analysis Report were changed from being study level files to being a part of the Wave 1 (DS1001 and DS1002) specific documentation.

2016-04-22 An additional documentation file (Non-response_Report) was added to the collection.

2016-04-20 Updated the file names only for the two study level documentation files (Informed_Consent and Tobacco_Brand) so that each file was easily identified and distinguishable from the other. No change was made to the content in either file.

2016-04-18 Coding was updated for the sexual attraction variables. The questionnaires were revised to enhance the clarity of the ASK statements. The PDF codebooks now contain full question text from the questionnaires. Lastly, the PATH Study Master Tobacco Brand and Product Code Guide and an Informed Consent Document were also released.

2016-01-13 PDF codebooks were released without question text. The codebooks will be updated in the near future that includes question text.

2015-12-19 ICPSR data undergo a confidentiality review and are altered when necessary to limit the risk of disclosure. ICPSR also routinely creates ready-to-go data files along with setups in the major statistical software formats as well as standard codebooks to accompany the data. In addition to these procedures, ICPSR performed the following processing steps for this data collection:

  • Checked for undocumented or out-of-range codes.

PATH Study (RUF)

The PATH Study was launched in 2011 to inform the Food and Drug Administration's regulatory activities under the Family Smoking Prevention and Tobacco Control Act (TCA). The PATH Study is a collaboration between the National Institute on Drug Abuse (NIDA), National Institutes of Health (NIH), and the Center for Tobacco Products (CTP), Food and Drug Administration (FDA). The study sampled over 150,000 mailing addresses across the United States to create a national sample of tobacco users and non-users.

45,971 adults and youth constitute the first (baseline) wave of data collected by this longitudinal cohort study. Respondents are asked to complete an interview at each follow-up wave. Youth who turn 18 by the current wave of data collection are considered "aged-up adults" and are invited to complete the Adult Interview. Additionally, 7,207 "shadow youth" (youth ages 9 to 11 sampled at Wave 1) are considered "aged-up youth" upon turning 12 years old, when they are asked to complete an interview after parental consent. These 53,178 participants form the Wave 1 Cohort.

At Wave 4, the Wave 1 Cohort was replenished with a probability sample of 14,098 adults, youth, and shadow youth ages 10 to 11 selected from the civilian, noninstitutionalized population at the time of Wave 4. This sample was recruited from residential addresses not selected for Wave 1 in the same sampled PSUs and segments using similar within-household sampling procedures. This "replenishment sample" was combined for estimation and analysis purposes with Wave 4 adult and youth respondents from the Wave 1 Cohort who were in the civilian, noninstitutionalized population at the time of Wave 4. This combined set of Wave 4 participants, 52,731 participants in total, forms the Wave 4 Cohort.

Please refer to the Restricted-Use Files User Guide that provides further details about children designated as "shadow youth" and the formation of the Wave 1 and Wave 4 Cohorts.

Dataset 0001 (DS0001) contains the data from the Master Linkage file. This file contains 42 variables and 67,276 cases. The file provides a master list of every person's unique identification number and what type of respondent they were for each wave.

Dataset 1001 (DS1001) contains the data from the Wave 1 Adult Questionnaire. This data file contains 2,011 variables and 32,320 cases. Each of the cases represents a single, completed interview.

Dataset 1002 (DS1002) contains the data from the Wave 1 Youth (and Parent) Questionnaire. This file contains 1,430 variables and 13,651 cases.

Dataset 1401 (DS1401) contains the Wave 1 State Identifier data for Adults and has 5 variables and 32,320 cases. Dataset 1402 (DS1402) contains the Wave 1 State Identifier data for Youth (and Parents) and has 5 variables and 13,651 cases. The same 5 variables are in each State Identifier dataset, including PERSONID for linking the State Identifier to the questionnaire and biomarker data and 3 variables designating the state (state FIPS, state abbreviation, and full name of the state).

Dataset 2001 (DS2001) contains the data from the Wave 2 Adult Questionnaire. This data file contains 2,409 variables and 28,362 cases. Of these cases, 26,447 also completed a Wave 1 Adult Questionnaire. The other 1,915 cases are "aged-up adults" having previously completed a Wave 1 Youth Questionnaire.

Dataset 2002 (DS2002) contains the data from the Wave 2 Youth (and Parent) Questionnaire. This data file contains 1,588 variables and 12,172 cases. Of these cases, 10,081 also completed a Wave 1 Youth Questionnaire. The other 2,091 cases are "aged-up youth" having previously been sampled as "shadow youth."

Dataset 3001 (DS3001) contains the data from the Wave 3 Adult Questionnaire. This data file contains 2,338 variables and 28,148 cases. Of these cases, 26,241 are continuing adults having completed a prior Adult Questionnaire. The other 1,907 cases are "aged-up adults" having previously completed a Youth Questionnaire.

Dataset 3002 (DS3002) contains the data from the Wave 3 Youth (and Parent) Questionnaire. This data file contains 1,484 variables and 11,814 cases. Of these cases, 9,769 are continuing youth having completed a prior Youth Interview. The other 2,045 cases are "aged-up youth" having previously been sampled as "shadow youth."

Datasets 3101, 3102, 3201, and 3202 (DS3101, DS3102, DS3201, and DS3202) are data files comprising the weight variables for Wave 3. The weight variables for Wave 1 and Wave 2 are included in the main data files. However, starting with Wave 3, the weight variables have been separated into individual data files. The "all-waves" weight files contain weights for respondents who completed an interview for all waves in which they were old enough to do so or verified their information with the study for waves in which they were not old enough to be interviewed. The "single-wave" weight files contain weights for all respondents in Wave 3 regardless of their participation in previous waves.

Dataset 4001 (DS4001) contains the data from the Wave 4 Adult Questionnaire. This data file contains 2,504 variables and 33,822 cases. Of these cases, 25,857 are continuing adults having completed a prior Adult questionnaire, 1,900 are "aged-up adults" having previously completed a Youth questionnaire, and 6,065 are "replenishment sample adults" (also known as "new cohort adults" in the annotated instrument).

Dataset 4002 (DS4002) contains the data from the Wave 4 Youth (and Parent) Questionnaire. This data file contains 1,600 variables and 14,798 cases. Of these cases, 9,365 are continuing youth having completed a prior Youth interview, 1,694 cases are "aged-up youth" having previously been sampled as "shadow youth," and 3,739 are "replenishment sample youth" (also known as "new cohort youth" in the annotated instrument).

Datasets 4101, 4102, 4103, 4201, 4202, and 4203 (DS4101, DS4102, DS4103, DS4201, DS4202, and DS4203) are data files comprising the weight variables for Wave 4. In Wave 4, the weight variables have been separated into individual data files corresponding to the Wave 1 and Wave 4 Cohorts and different weight types. The "all-waves" weight files contain weights for those Wave 1 Cohort respondents who completed an interview for all waves in which they were old enough or verified their information for waves in which they were not old enough to be interviewed. The "single-wave" weight files contain weights for Wave 1 Cohort respondents at Wave 4 who completed an interview at Wave 1, regardless of their participation in previous waves. The "cross-sectional" weight files contain weights for all respondents in the Wave 4 Cohort.

Each case in an Adult data file represents a single, completed interview. Each case in a Youth data file represents one youth and his or her parent's responses about that youth. Parents who provided permission for their child to participate in a Youth interview were asked to complete a brief interview about their child. In both waves of data collection, less than 0.5 percent of the parents did not complete an interview. Most questions are asked about to the child.

When multiple youth from the same household were selected to be in the study, the parent(s) completed separate interviews about each youth. If one parent completed two or more interviews, that parent only answered questions about himself/herself once. Those questions were then skipped in the subsequent interview(s) for the other child(ren) and the responses duplicated in that child(ren)'s data file(s).

United States Department of Health and Human Services. National Institutes of Health. National Institute on Drug Abuse, and United States Department of Health and Human Services. Food and Drug Administration. Center for Tobacco Products. Population Assessment of Tobacco and Health (PATH) Study [United States] Restricted-Use Files. Inter-university Consortium for Political and Social Research [distributor], 2019-06-03. https://doi.org/10.3886/ICPSR36231.v19

Export Citation:

  • RIS (generic format for RefWorks, EndNote, etc.)
  • EndNote
United States Department of Health and Human Services. National Institutes of Health. National Institute on Drug Abuse, United States Department of Health and Human Services. Food and Drug Administration. Center for Tobacco Products

Census Region; Census Division

Users are reminded that these data are to be used solely for statistical analysis and reporting of aggregated information, and not for the investigation of specific individuals or organizations.

Access to these data is restricted. Users interested in obtaining these data must complete a Restricted Data Use Agreement. Data are provided via ICPSR's Virtual Data Enclave (VDE). Apply for access to these data through the ICPSR VDE portal. Information and instructions are available within the data portal. For further assistance please reference the VDE Guide to learn about the application process, about using the VDE, and how to request disclosure review of VDE output.

Inter-university Consortium for Political and Social Research
Hide
2013 -- 2014 (Wave 1), 2014 -- 2015 (Wave 2), 2015 -- 2016 (Wave 3), 2016 -- 2018 (Wave 4)
2013-09 -- 2014-12 (Wave 1), 2014-10 -- 2015-10 (Wave 2), 2015-10 -- 2016-10 (Wave 3), 2016-12 -- 2018-01 (Wave 4)
Hide

The Population Assessment of Tobacco and Health (PATH) Study is a longitudinal cohort study on tobacco use behavior, attitudes and beliefs, and tobacco-related health outcomes among approximately adults and youth in the United States. The study's primary objectives are to:

  • Objective 1: Identify and explain between-person differences and within-person changes in tobacco-use patterns, including the rate and length of use by specific product type and brand, product/brand switching over time, uptake of new products, and dual- and poly-use of tobacco products (i.e., use of multiple products within the same time period and switching between multiple products).
  • Objective 2: Identify between-person differences and within-person changes in risk perceptions regarding harmful and potentially harmful constituents, new and emerging tobacco products, filters and other design features of tobacco products, packaging, and labeling; and identify other factors that may affect use, such as social influences and individual preferences.
  • Objective 3: Characterize the natural history of tobacco dependence, cessation, and relapse, including readiness and self-efficacy to quit, motivations for quitting, the number and length of quit attempts, and the length of abstinence related to various tobacco products.
  • Objective 4: Update the comprehensive baseline and subsequent waves of data on tobacco-use behaviors and related health conditions, including markers of exposure and tobacco-related disease processes identified from the collection and analysis of biospecimens, to assess between-person differences and within-person changes over time in health conditions potentially related to tobacco use, particularly with use of new and different tobacco products, including modified-risk tobacco products.
  • Objective 5: Assess associations between TCA-specific actions and tobacco-product use, risk perceptions and attitudes, use patterns, cessation outcomes, and tobacco-related intermediate endpoints (e.g., biomarkers of exposure and biomarkers related to disease). Analyses will attempt to account for other potential factors, such as demographics, local tobacco-control policies, and social, familial, and economic factors, that may influence the observed patterns.
  • Objective 6: Assess between-person differences and within-person changes over time in attitudes, behaviors, exposure to tobacco products, and related biomarkers among and within population sub-groups identified by such characteristics as race-ethnicity, gender, and/or age, or by risk factors, such as pregnancy or co-occurring substance use or mental health disorders.
  • Objective 7: To the extent to which sample sizes are sufficient, assess and compare samples of former and never users of tobacco products for between-person differences and within-person changes in relapse and uptake, risk perceptions, and indicators of tobacco exposure and disease processes.
  • Objective 8: Use data from the PATH Study's baseline and follow-up waves on tobacco-use behaviors, attitudes, and related health conditions, including potential markers of exposure and related disease processes identified from the analysis of biospecimens, to screen and subsample respondents for participation in formative and/or nested studies conducted during and after the PATH Study's waves of data and biospecimen collection.

At Wave 1, the study sampled over 150,000 mailing addresses which, using a four-staged stratified sampling design, yielded a sample of 45,971 respondents (32,320 adults / 13,651 youth) who completed a Wave 1 interview. Tobacco users and non-users who were at least 9 years old living in a civilian, non-institutionalized setting were considered for participation during Wave 1. Youth who turn 18 by the next wave of data collection are considered "aged-up adults" and are invited to complete the Adult Interview. Additionally, 7,207 "shadow youth" (youth ages 9 to 11 sampled at Wave 1) are considered "aged-up youth" upon turning 12 years old when they are asked to join the study. These 53,178 participants form the Wave 1 Cohort.

At Wave 4, the Wave 1 Cohort was replenished with a probability sample of 14,098 adults, youth, and shadow youth ages 10 to 11 selected from the civilian, noninstitutionalized population at the time of Wave 4. This sample was recruited from close to 174,000 mailing addresses not selected for Wave 1, in the same sampled PSUs and segments using similar within-household sampling procedures. To meet the needs for the Wave 4 Cohort shadow sample, a randomly selected subset of the sampled addresses (115,500 or close to two-thirds of the addresses) were screened solely to identify shadow youth ages 10 to 11. The remaining addresses (close to 58,500) were screened for adults, youth, and shadow youth ages 10 to 11. These are referred to as the "SO" (shadow youth only) and "AYS" (adults, youth, and shadow youth) replenishment samples, respectively. This "replenishment sample" was combined for estimation and analysis purposes with Wave 4 adult and youth respondents from the Wave 1 Cohort who were in the civilian, noninstitutionalized population at the time of Wave 4. This combined set of Wave 4 participants, 52,731 participants in total, forms the Wave 4 Cohort.

The Adult files contain a single record for every adult who completed an interview in the wave. The Youth/Parent files contain a single record of every youth who completed an interview in a given wave. Parents who provided permission for their child to complete a Youth Interview were asked to complete a brief Parent Interview that contained questions about parental supervision, school performance, and tobacco use by youth. The Parent Interview is primarily an interview about the child(ren), not the parent. Almost all youth respondents had a parent or guardian complete the Parent Interview (over 99.0 percent). When multiple youth from the same household were selected to be in the study, the parent(s) completed separate interviews about each youth. If one parent completed multiple interviews, then questions asked about him or her were only asked once and skipped in the other interview(s). The parent's responses were then duplicated for the other child or children.

A $2 incentive was mailed to all addresses sampled at Wave 1 and Wave 4 prior to screening. Adult respondents were paid $35 for their participation. Youth were paid $25 to complete the Youth Interview, and their parents were given $10 for each Parent Interview.

A four-stage stratified area probability sample design was used in the PATH Study, with a two-phase design for sampling adults at the final stage. At the first stage, a stratified sample of geographical primary sampling units (PSUs) was selected, in which a PSU is a county or group of counties. For the second stage, within each selected PSU, smaller geographical segments were formed and then a sample of these segments was drawn. At the third stage, the sampling frame consisted of the residential addresses located in these segments. The fourth stage selected adults and youth from the sampled households identified at these addresses, with varying sampling rates for adults by age, race, and tobacco use status. Adults were sampled in two phases - Phase 1 sampling used information provided in the household screener and Phase 2 sampling used information provided by the adult in the Phase 2 screener at the beginning of the Adult instrument. Please consult the Restricted-Use Files User Guide for additional details about the sampling.

Longitudinal: Panel

Users and non-users of tobacco products in the civilian, non-institutionalized household population of the United States aged 9 and older at the time of Wave 1.

individual

In all four waves, adults and youth were asked about the following types of tobacco products:

  • Cigarettes
  • E-cigarettes/Electronic nicotine products
  • Cigars (traditional, cigarillos, filtered)
  • Pipe tobacco
  • Hookah
  • Smokeless tobacco (snus pouches and other forms of smokeless tobacco)
  • Dissolvable tobacco
  • Bidis and kreteks (youth only)

Although each section on tobacco products has some unique questions, most questions fit into one of the following categories:

  • Ever use
  • Recency of use
  • Frequency of use
  • Amount of use
  • Brands used
  • Purchase details
  • Use of flavored products
  • Harm and addictiveness
  • Reasons for use

Additional topics, in at least one wave, include:

  • Poly use
  • Nicotine dependence
  • Packaging and health warnings
  • Risk and harm perceptions
  • Secondhand smoke exposure
  • Marketing and advertising
  • Media use
  • Demographics
  • Health
  • Psychosocial and mental health
  • Substance use
  • Cessation
  • Peer and family influences

Most questions asked in the questionnaires are categorical. Other questions ask, for example, the age at which something occurred or the person's body measurements. Responses to these questions are numerical.

The response rates for the Wave 1 Cohort of the PATH Study are shown below. The Wave 1 interview rates are conditional on completion of the Wave 1 screener. The response rates for Waves 2, 3, and 4 are conditional on Wave 1 participation.

  • Wave 1 Household screener: 54.0 percent (weighted)
  • Wave 1 Adult Interview: 74.0 percent (weighted)
  • Wave 1 Youth Interview: 78.4 percent (weighted)
  • Wave 2 Adult Interview: 83.2 percent (weighted)
  • Wave 2 Youth Interview: 87.3 percent (weighted)
  • Wave 3 Adult Interview: 78.4 percent (weighted)
  • Wave 3 Youth Interview: 83.3 percent (weighted)
  • Wave 4 Adult Interview: 73.5 percent (weighted)
  • Wave 4 Youth INterview: 79.5 percent (weighted)

The response rates for the adults and youth in the Wave 4 replenishment sample are shown below. The Wave 4 interview rates for the adults and youth in this sample are conditional on completion of the Wave 4 screener.

  • Wave 4 Household screener (for households in which youth and adults were recruited): 52.8 percent (weighted)
  • Wave 4 Adult Interview: 68.0 percent (weighted)
  • Wave 4 Youth Interview: 70.6 percent (weighted)

Please consult the Restricted-Use Files User Guide for further information regarding response rates.

Hide

2015-12-19

2019-06-03 Wave 4 Informed Consent documentation was added.

2019-05-30 2019-04-08 Data and documentation for the Master linkage file was updated. Wave 4 Adult and Youth Questionnaire and Weight data files have been added.

2019-02-01 Updating to include public codebooks for Wave 1 Adult and Youth State Identifier data files and the Master linkage file and the public User Guide for the State Identifier Restricted-Use Files.

2019-02-01 Wave 1 Adult and Youth State Identifier data files were added to the collection. Data and documentation for the Master linkage file was updated.

2018-10-01 2018-09-28 Data and documentation for the Master linkage file was updated. The Nonresponse Bias Analysis Report is now included for Wave 3.

2018-05-01

Wave 3 Adult and Youth data files were added to the collection. Wave 1 and Wave 2 Adult and Youth data files were updated to improve the clarity and consistency of variable labels, especially in the Nicotine Dependence section.

A new variable was added to Wave 1 and Wave 2 Adult data - R0#_ND_DATA_ROUTE. A second variable was added to the Wave 2 Adult data - R02R_A_P12M_BLUNTONLY_GRILLO. An additional 18 derived variables in the Wave 2 Adult data were revised and replaced the original variables. The newly named variables possess the original name, but also contain "_REV" at the end of the variable name.

A skip error was identified in the Wave 2 Adult instrument, which resulted in some respondents being asked two questions when they should not have been. Therefore, the affected items, R02_AG0100CG and R02_AG0100FC, contain some extra data. Notes were added to the annotated instrument and codebook to describe the issue.

The User Guide and Questionnaires were also updated to improve understanding of the data files. A Nonresponse Bias Analysis report is now included for Wave 2.

2018-02-15 The citation of this study may have changed due to the new version control system that has been implemented. The previous citation was:
  • United States Department of Health and Human Services. National Institutes of Health. National Institute on Drug Abuse, and United States Department of Health and Human Services. Food and Drug Administration. Center for Tobacco Products. Population Assessment of Tobacco and Health (PATH) Study [United States] Restricted-Use Files. ICPSR36231-v19. Ann Arbor, MI: Inter-university Consortium for Political and Social Research [distributor], 2019-06-03. http://doi.org/10.3886/ICPSR36231.v19

2017-06-19 The Wave 1 and Wave 2 data files, for both Adults and Youth, were updated to correct minor errors along with the questionnaires to correct minor typos and clarify specifications.

2017-04-27 A minor revision was made to the both the English and Spanish versions of the Wave 1 Adult questionnaire. The User Guide was also updated. Two Excel crosswalks, one for Adults and one for Youth, were added to the available documentation to highlight the differences between the Wave 1 and Wave 2 files.

2017-04-03 An update was made to internal files to correct an issue with how missing values are displayed online through ICPSR's variables database.

2017-03-23 Minor revisions were made to the Missing Values Code table within the User Guide and both Codebooks for Wave 2.

2017-03-15 Data from Wave 2 of the study were added to the collection. The User Guide and Master Tobacco Brand and Product Code Guide were expanded to include information for Wave 2.

2017-01-31 The variable R01X_CB_REGION in both the Wave 1 Adult and Youth/Parent files was updated to correct an error in the value labels. The values for codes 2 and 3 had been inadvertently swapped. The data did not change; only the value labels for codes 2 and 3 have been corrected.

2016-11-28 An additional 40 derived variables were added to the end of the Wave 1 Youth / Parent file that are similar to those already in the Wave 1 Adult file. Information for individuals who withdrew from the study is denoted in the datasets by the special missing value -97777. Spanish versions of the annotated instruments are also now available.

2016-05-24 The study's title changed with the removal of the year range. The Informed Consent Document and Non-Response Bias Analysis Report were changed from being study level files to being a part of the Wave 1 (DS1001 and DS1002) specific documentation.

2016-04-22 An additional documentation file (Non-response_Report) was added to the collection.

2016-04-20 Updated the file names only for the two study level documentation files (Informed_Consent and Tobacco_Brand) so that each file was easily identified and distinguishable from the other. No change was made to the content in either file.

2016-04-18 Coding was updated for the sexual attraction variables. The questionnaires were revised to enhance the clarity of the ASK statements. The PDF codebooks now contain full question text from the questionnaires. Lastly, the PATH Study Master Tobacco Brand and Product Code Guide and an Informed Consent Document were also released.

2016-01-13 PDF codebooks were released without question text. The codebooks will be updated in the near future that includes question text.

2015-12-19 ICPSR data undergo a confidentiality review and are altered when necessary to limit the risk of disclosure. ICPSR also routinely creates ready-to-go data files along with setups in the major statistical software formats as well as standard codebooks to accompany the data. In addition to these procedures, ICPSR performed the following processing steps for this data collection:

  • Checked for undocumented or out-of-range codes.
Hide

Each data file for Wave 1 and Wave 2 contains weights for use in analyses of the data from the complex PATH Study sample design. The final full-sample person-level weight for Waves 1 and 2 on the Adult file is R0#_A_PWGT, and the final full-sample person-level weight for Waves 1 and 2 on the Youth / Parent file is R0#_Y_PWGT.

The weights for Wave 3 have been separated into two files:

  • The all-waves weights file is for respondents who completed an interview for all waves in which they were old enough to do so or verified their information with the study for waves in which they were not old enough to be interviewed. The Wave 3 all-waves weight is named R03_A_AWGT for adults and R03_Y_AWGT for youth.
  • The single-wave weights file is for all respondents in Wave 3. The Wave 3 single-wave weight is named R03_A_SWGT for adults and R03_Y_SWGT for youth.

The weights for Wave 4 have been separated into three files corresponding to the Wave 1 and Wave 4 Cohorts and different weight types:

  • The all-waves weights file is for Wave 1 Cohort respondents who either completed an interview for all waves in which they were old enough or verified their information for waves in which they were not old enough to be interviewed. The Wave 4 all-waves weight for the Wave 1 Cohort is named R04_A_A01WGT for adults and R04_Y_A01WGT for youth.
  • The single-wave weights file is for all Wave 1 Cohort respondents who completed interviews at Wave 1 and Wave 4. The Wave 4 single-wave weight for the Wave 1 Cohort is named R04_A_S01WGT for adults and R04_Y_S01WGT for youth.
  • The cross-sectional weights file is for Wave 4 Cohort respondents. The Wave 4 cross-sectional weight for the Wave 4 Cohort is named R04_A_C04WGT for adults and R04_Y_C04WGT for youth.

For each weight mentioned above, there are also 100 replicate weights and design variables (VARPSU and VARSTRAT) for use in variance estimation. Detailed information on how these variables were created, and how and why they should be used is provided in the Restricted-Use Files User Guide.

Note that the weighting procedures adjust for oversampling of specified population groups and nonresponse. ICPSR strongly recommends that researchers read and understand this section before analyzing the data to ensure correct use of these variables.

Hide