Version Date: Jul 17, 2024 View help for published
Principal Investigator(s): View help for Principal Investigator(s)
United States Department of Health and Human Services. National Institutes of Health. National Institute on Drug Abuse;
United States Department of Health and Human Services. Food and Drug Administration. Center for Tobacco Products
Series:
https://doi.org/10.3886/ICPSR36840.v22
Version V22 (see more versions)
The Population Assessment of Tobacco and Health (PATH) Study is a collaboration between the National Institute on Drug Abuse (NIDA), National Institutes of Health (NIH), and the Center for Tobacco Products (CTP), Food and Drug Administration (FDA). The study was launched in 2011 to inform the FDA's tobacco regulatory activities under the Family Smoking Prevention and Tobacco Control Act (TCA). For Wave 1 (baseline), the PATH Study sampled over 150,000 mailing addresses across the United States to create a national sample of people who use or do not use tobacco, yielding interviews with 45,971 adult and youth respondents.
45,971 adults and youth constitute the first (baseline) wave, Wave 1, of data collected by this longitudinal cohort study. These 45,971 adults and youth along with 7,207 "shadow youth" (youth ages 9 to 11 sampled at Wave 1) make up the 53,178 participants that constitute the Wave 1 Cohort. Respondents are asked to complete an interview at each follow-up wave. Youth who turn 18 by the current wave of data collection are considered "aged-up adults" and are invited to complete the Adult Interview. Additionally, "shadow youth" are considered "aged-up youth" upon turning 12 years old, when they are asked to complete an interview after parental consent.
At Wave 4, a probability sample of 14,098 adults, youth, and shadow youth ages 10 to 11 was selected from the civilian, noninstitutionalized population at the time of Wave 4. This sample was recruited from residential addresses not selected for Wave 1 in the same sampled PSUs and segments using similar within-household sampling procedures. This "replenishment sample" was combined for estimation and analysis purposes with Wave 4 adult and youth respondents from the Wave 1 Cohort who were in the civilian, noninstitutionalized population at the time of Wave 4. This combined set of Wave 4 participants, 52,731 participants in total, forms the Wave 4 Cohort.
Please refer to the Restricted-Use Files User Guide that provides further details about children designated as "shadow youth" and the formation of the Wave 1 and Wave 4 Cohorts.
Biospecimen Collection
Each adult respondent, who completed the interview at Wave 1, was asked to provide at least two biospecimens. Providing biospecimens was voluntary and was not a condition of participation. Respondents were asked to report their use of all nicotine-containing products during the 3-day period prior to the time of any biospecimen collection (Nicotine Exposure Questions (NEQs)) to facilitate interpretation of biomarker results.
Of the 32,320 respondents who completed the adult interview at Wave 1, 21,801 (67.4 percent) provided a urine specimen and 14,520 (44.9 percent) provided a blood specimen. A sample of 11,522 adults who provided sufficient urine for the planned analyses were selected from a diverse mix of six tobacco product use groups representing people who never used tobacco, currently use tobacco, and formerly used tobacco (within the last 12 months). This group constitutes the Wave 1 Biomarker Core. Of the 11,522 adults, 7,159 also provided a blood specimen. All urine and blood specimens provided by the Wave 1 Biomarker Core were sent for laboratory analysis.
Each youth who completed the Wave 4 interview was asked to provide a urine specimen. Each Wave 4 shadow youth (ages 10 and 11 at Wave 4) who completed the Wave 5 youth interview was also asked to provide a urine specimen. Providing this urine biospecimen was voluntary and was not a condition of participation.
Of the 14,798 respondents who completed the youth interview at Wave 4, 13,097 (88.5 percent) provided a urine specimen. A sample of 3,509 Wave 4 Cohort youth ages 12 to 17 who completed the Wave 4 youth interview and provided a sufficient amount of urine for the planned laboratory analyses was selected from a diverse mix of five tobacco product use and non-use groups. In addition, a sample of 528 Wave 4 shadow youth who completed a Wave 5 interview and provided a sufficient amount of urine for the planned laboratory analyses at Wave 5 was also selected. These 4,037 sampled youth and shadow youth constitute the Wave 4 Biomarker Core. All urine specimens provided by the Wave 4 Biomarker Core were sent for laboratory analysis.
Biomarker Restricted Use Files
Wave 1 Restricted-Use Biomarker Data Files (Biomarker RUF) consists of three different types of files for the Wave 1 Biomarker Core:
For Wave 2, urine biospecimens were requested from the Wave 1 Biomarker Core. Respondents were also asked to complete the NEQs prior to biospecimen collection.
The Wave 2 Biomarker RUF consists of three different types of files:
For Wave 3, urine biospecimens were requested from the Wave 1 Biomarker Core. Respondents were also asked to complete the NEQs prior to biospecimen collection.
The Wave 3 Biomarker RUF consists of three different types of files:
For Wave 4, urine biospecimens were requested from the Wave 1 Biomarker Core and all youth who completed the Wave 4 interview. Respondents were also asked to complete the NEQs prior to biospecimen collection.
The Wave 4 Biomarker RUF consists of three different types of files for each Biomarker Core:
Wave 1 Biomarker Core:
Wave 4 Biomarker Core:
For Wave 5, urine biospecimens were requested from the Wave 1 Biomarker Core and the Wave 4 Biomarker Core. Respondents were also asked to complete the NEQs prior to biospecimen collection. Note that Wave 5 specimens from the Wave 4 Biomarker Core were sent for laboratory analysis prior to the Wave 4 specimens.
The Wave 5 Biomarker RUF consists of three different types of files for each Biomarker Core:
Wave 1 Biomarker Core:
Wave 4 Biomarker Core:
Note that the initial release of 3 Urine Panels and Biomarker weights for the Wave 4 Biomarker Core only included records for those among the 3,509 members who responded in Wave 5 and provided urine specimens in sufficient quantities for laboratory analyses. As of version 20, the Wave 5 biomarker data files and weights include data for all Wave 4 Biomarker Core members who provided urine specimens at Wave 5 in sufficient quantities for laboratory analyses, including the Wave 4 shadow youth who completed their first interviews at Wave 5. This means that records were added to previously released urine panel data files (DS5051, DS5053, and DS5056) and biomarker weights (DS5042) to include data for the Wave 4 shadow youth (N=528) who completed their first interviews at Wave 5. All panels released in version 20 and beyond will include records for the complete Wave 4 Biomarker Core.
Also note that the Collection and NEQ file for Adult Urine (DS5001) includes data for both the Wave 1 Biomarker Core and Wave 4 Biomarker Core. Please refer to the PATH Study Biomarker Restricted-Use Files User Guide for additional information about the Wave 4 Biomarker Core.
References to the collection of biospecimens will be specified by the collected specimen, i.e., urine and (whole) blood. However, references to biomarker analyses and analytes will be specified by the type of matrix (serum, plasma, or urine) used for the analysis.
Export Citation:
United States of America
Users are reminded that these data are to be used solely for statistical analysis and reporting of aggregated information, and not for the investigation of specific individuals or organizations.
Access to these data is restricted. Users interested in obtaining these data must complete a Restricted Data Use Agreement. Data are provided via ICPSR's Virtual Data Enclave (VDE). Apply for access to these data through the ICPSR VDE portal. Information and instructions are available within the data portal. For further assistance please reference the VDE Guide to learn about the application process, using the VDE, and how to request disclosure review of VDE output.
Data are provided via ICPSR's Virtual Data Enclave (VDE) where researchers work with data stored on secure ICPSR servers. Researchers cannot possess physical copies of the data; however, upon review by ICPSR, they may request permission to access selected output outside the virtual environment. See the Access Notes to apply for access. Researchers are also encouraged to read the VDE Guide.
The data currently available are the Wave 1, Wave 2, Wave 3, Wave 4, and Wave 5 restricted-use biomarker data files.
Further information about the biospecimens collected in Wave 1 - Wave 5 is available in the PATH Study Biomarker Restricted-Use Files User Guide. Researchers interested in accessing PATH Study biospecimens for biospecimen research should refer to the PATH Study Biospecimen Access Program page. This page provides instructions on applying for access, the schedule for reviewing applications, and frequently asked questions about the program.
The Restricted-Use Master Linkage File (RUF-MLF) is available in the Master Linkage File collection (ICPSR 38008). The RUF-MLF (DS0002) includes indicator variables for the availability of interview data, weights, state identifier data, tobacco Universal Product Code (UPC) data, and biomarker data for each participant. The RUF-MLF can help analysts identify which Restricted-Use files contain data for a particular participant (or set of participants). The RUF-MLF and Public-Use Master Linkage File (PUF-MLF, DS0001 in ICPSR 38008) also include the variables that indicate availability of biospecimens through the Biospecimen Access Program (BAP).
The data files contain a person-level (PERSONID) variable to link participant IDs across files (e.g., linking Wave 1 adult questionnaire data with Wave 1 biomarker data) and across waves (e.g., linking Wave 1 and Wave 2 biomarker data). The values in this variable are random and contain no direct or indirect personally identifiable information. Please review Chapter 5 in the Biomarker Restricted-Use Files User Guide for information and programming code on linking files. The files are sorted by the variable PERSONID.
The questionnaires in this collection have been annotated for analytic purposes. Users are advised that most but not all information contained in the questionnaires is duplicated in the question text used in the codebooks. Some of the longer programming instructions were not incorporated into the question text. In these instances, the question text includes a note for the user to read the full programming instructions in the corresponding section of the questionnaire. Derived and imputed variables contain the algorithms used in the creation of these variables. Users are advised to refer to the Biomarker Restricted-Use Files User Guide and annotated questionnaires when reviewing the codebooks.
In accordance with the study's informed consent, information is suppressed for individuals who withdraw from the PATH Study. Their information was recoded to a special missing value, designated as -97777.
The PATH Study's documentation is available for your use and may be reproduced in whole or in part without permission from NIH's National Institute on Drug Abuse or FDA's Center for Tobacco Products. Citation of the source is appreciated.
Additional background information including answers to frequently asked questions can be found in the Researchers section of the PATH Study series page.
The Biomarker Restricted-Use Files User Guide provides an overview of the biospecimen collection process. The User Guide also covers such topics as subsampling of adults for biomarker analysis, weighting, and programming syntax to run common statistics and link the files. Researchers are encouraged to use the information in the User Guide for their publications, with the following citation:
The data for the PATH Study were collected and prepared by Westat. The contract numbers under which they performed their work are HHSN271201100027C and HHSN271201600001C.
The Population Assessment of Tobacco and Health (PATH) Study is a longitudinal cohort study on tobacco use behavior, attitudes and beliefs, and tobacco-related health outcomes among an estimated 46,000 adults and youth in the United States. The study's primary objectives are to:
The study used a four-staged stratified sampling design to sample over 150,000 mailing addresses, yielding a Wave 1 (baseline) sample of 45,971 respondents (32,320 adults and 13,651 youth). People who use or do not use tobacco, ages 12 years and older and in the civilian non-institutionalized household population were eligible for participation in the PATH Study.
Field interviewers invited each Wave 1 adult respondent to voluntarily provide biospecimens as part of the study. Respondents were invited to "self-collect" a urine sample. Consenting respondents received $25 for providing a urine sample. At Wave 1, adults consenting to provide a blood sample were scheduled for a blood draw appointment with a phlebotomist at the respondent's home. The respondent received $25 for their time during this activity.
At Wave 4, field interviewers invited each youth respondent to voluntarily provide urine specimens as part of the study after a parent granted permission. Respondents were invited to "self-collect" a urine sample. Consenting respondents received $25 for providing a urine sample.
Specific details about specimen collection and storage are contained in Chapter 2 of the Biomarker Restricted-Use Files User Guide.
A four-stage stratified area probability sample design was used in the PATH Study, with a two-phase design for sampling adults at the final stage. At the first stage, a stratified sample of geographical primary sampling units (PSUs) was selected, in which a PSU is a county or group of counties. For the second stage, within each selected PSU, smaller geographical segments were formed and then a sample of these segments was drawn. At the third stage, the sampling frame consisted of the residential addresses located in these segments. The fourth stage selected adults and youth from the sampled households identified at these addresses, with varying sampling rates for adults by age, race, and tobacco use status. Adults were sampled in two phases - Phase 1 sampling used information provided in the household screener and Phase 2 sampling used information provided by the adult in the Phase 2 screener at the beginning of the adult instrument.
Wave 1 Biomarker Core
A stratified probability subsample of 11,522 adults who completed the Wave 1 adult interview and provided sufficient urine for the planned analyses (independently of whether they provided a blood specimen) was selected from a diverse mix of six tobacco product use and non-use groups. This group constitutes the Wave 1 Biomarker Core. Of these 11,522 adults, 7,159 also provided a blood specimen. All urine and blood specimens provided by the Wave 1 Biomarker Core were sent for laboratory analysis. Please consult the Biomarker Restricted-Use Files User Guide for additional details about the subsampling. For additional details about the number of specimens shipped in each wave, and the number of results for each panel file see the Wave 1 Biomarker Core Laboratory Inventory File for more details.
Wave 2 was the first follow-up wave for participants in Wave 1. At Wave 2, the PATH Study completed 28,362 Adult Interviews, but only those in the Wave 1 Biomarker Core were asked to provide urine for laboratory analysis. Of those who provided urine samples, 9,012 were shipped for laboratory analysis.
Wave 3 was the second follow-up wave for participants in Wave 1. At Wave 3, the PATH Study completed 28,148 Adult Interviews, but only those in the Wave 1 Biomarker Core were asked to provide urine for laboratory analysis. Of those who provided urine samples, 8,643 were shipped for laboratory analysis.
Wave 4 was the third follow-up wave for participants in Wave 1. At Wave 4, the PATH Study completed 27,757 Adult interviews with Wave 1 Cohort respondents, but only those in the Wave 1 Biomarker Core were asked to provide urine for laboratory analysis. Of those who provided urine samples, 8,333 were shipped for laboratory analysis.
Wave 5 was the fourth follow-up wave for adult participants in Wave 1. At Wave 5, the PATH Study completed 28,970 Adult interviews with Wave 1 Cohort respondents, but only those in the Wave 1 Biomarker Core were asked to provide urine for laboratory analysis. Of those who provided urine samples, 7,868 were shipped for laboratory analysis.
Wave 4 Biomarker Core
Wave 4 was the first wave for participants recruited from the replenishment sample and the first wave in which youth were asked to provide urine specimens. At Wave 4, the PATH Study completed 14,793 Youth Interviews with respondents that became a part of the Wave 4 Cohort. A stratified probability sample of 3,509 Wave 4 Cohort youth ages 12 to 17 who completed the Wave 4 youth interview and provided a sufficient amount of urine for the planned laboratory analyses was selected from a diverse mix of five tobacco product use and non-use groups. This group constitutes the initial Wave 4 Biomarker Core. Please consult the Biomarker Restricted-Use Files User Guide for additional details about the subsampling. For additional details about the number of specimens shipped in each wave, and the number of results for each panel file see the Wave 4 Biomarker Core Laboratory File Inventory for more details.
At Wave 5, participants in the initial Wave 4 Biomarker Core were asked to provide urine for laboratory analysis. In addition, a stratified probability sample of 528 Wave 4 shadow youth respondents who completed a Wave 5 interview and provided a sufficient amount of urine for the planned laboratory analyses at Wave 5 was also selected into the Wave 4 Biomarker Core. Of those who provided urine samples, 2,958 were shipped for laboratory analysis. The initial release of Wave 5 data for the Wave 4 Biomarker Core did not include data from the Wave 4 shadow youth who completed their first interview at Wave 5. As of version 20, this data has been added to previously released panels.
Adult (age 18 and older) who never used tobacco, currently use tobacco, and formerly used tobacco (within the last 12 months) in the civilian, non-institutionalized household population of the United States at the time of Wave 1 (September 12, 2013 - December 14, 2014). Youth (ages 10 to 17) who used or did not use tobacco products in the civilian non-institutionalized household population of the United States at the time of Wave 4 (December 1, 2016 - January 3, 2018).
The collection files for Urine (Wave 1: DS1001, Wave 2: DS2001, Wave 3: DS3001, Wave 4: DS4001 and DS4011, Wave 5: DS5001 and DS5011) and Blood (Wave 1: DS1101, Wave 2, Wave 3, Wave 4, Wave 5: N/A), contain data for the NEQs. Specifically, respondents are asked about their use of seven types of tobacco products: cigarettes, e-cigarettes, cigars (traditional, cigarillos, filtered), pipes, hookah, smokeless tobacco (snus pouches and other forms of smokeless tobacco), and dissolvable tobacco. They are also asked about their use of nicotine replacement therapies (NRT) such as a patch, gum, inhaler, pill, etc., and prescription drugs used for smoking cessation. Each section contains the same basic four questions:
The Urine (Wave 1: DS1021, Wave 2: DS2021, Wave 3: DS3021 and DS3022, Wave 4: DS4021, DS4022, and DS4023, Wave 5: DS5021, DS5022 and DS5042), F2PG2a (Wave 2: DS2022, Wave 3: DS3023 and DS3024, Wave 4: DS4023 and DS4024), and Blood (Wave 1: DS1121, Wave 2 - Wave 5: N/A) Biomarker Weight files contain:
Each Assay Panel for Urine (Wave 1: DS1031-DS1038, Wave 2: DS2031-DS2037, Wave 3: DS3032 - DS3038, Wave 4: DS4032 - DS4058 Wave 5: DS5032 - DS5058), Serum (Wave 1: DS1131-DS1134, Wave 2 - Wave 5: N/A) and Plasma (Wave 1: DS1231, Wave 2 - Wave 5: N/A) contains the following information for each analyte included in the panel.
See Appendix A of the Wave 1 Biomarker Core Laboratory File Inventory and the Wave 4 Biomarker Core Laboratory File Inventory for a list of chemical compounds analyzed within each of the panel codebooks for Wave 1 to Wave 5 for each respective Biomarker Core.
Wave 1 Biomarker Core
A total of 32,320 adults completed an interview during Wave 1 of data collection. The collection rates of urine and blood provided by these 32,320 respondents were:
Not all adults who provided a blood sample provided a urine sample, and some adults refused to provide both of the requested specimens. The Biomarker Restricted-Use Files User Guide gives a breakdown of the various combinations of samples provided by the 32,320 adult respondents. For more information on the number of specimens shipped and the number of results for each assay panel see the Wave 1 Biomarker Core Laboratory Inventory File
Among Wave 2 Adult Interview respondents, a subsample of continuing adults who provided urine at Wave 1 were asked to provide a urine specimen. Among the Wave 1 Biomarker Core, 9,996 completed a Wave 2 interview. The collection rate of urine specimens provided by the Wave 1 Biomarker Core at Wave 2 was 96.4 percent (a total of 9,640 collected).
Among Wave 3 Adult Interview respondents, a subsample of continuing adults who provided urine at Wave 1 were asked to provide a urine specimen. Among the Wave 1 Biomarker Core, 9,462 completed a Wave 3 interview. The collection rate of urine specimens provided by the Wave 1 Biomarker Core at Wave 3 was 97.5 percent (a total of 9,227 collected).
Among Wave 4 Adult Interview respondents, a subsample of continuing adults who provided urine at Wave 1 were asked to provide a urine specimen. Among the Wave 1 Biomarker Core, 8,817 completed a Wave 4 interview. The collection rate of urine specimens provided by the Wave 1 Biomarker Core at Wave 4 was 98 percent (a total of 8,641 collected).
Among Wave 5 Adult Interview respondents, a subsample of continuing adults who provided urine at Wave 1 were asked to provide a urine specimen. Among the Wave 1 Biomarker Core, 8,167 completed a Wave 5 interview. The collection rate of urine specimens provided by the Wave 1 Biomarker Core at Wave 5 was 98 percent (a total of 7,988 collected).
Wave 4 Biomarker Core
A total of 14,798 youth completed an interview during Wave 4 of data collection. The collection rate of urine provided by these 14,798 respondents was:
The Biomarker Restricted-Use Files User Guide gives a breakdown of the number of samples provided by the 14,798 youth respondents. For more information on the number of specimens shipped and the number of results for each assay panel see the Wave 4 Biomarker Core Laboratory Inventory File.
Members of the Wave 4 Biomarker Core were also asked to provide a urine specimen at Wave 5 after completion of an Adult or Youth Interview. Among the Wave 4 Biomarker Core, 1,402 completed an Adult Interview at Wave 5 and 2,199 completed a Youth Interview (including 528 respondents sampled as Shadow Youth at Wave 4). The collection rate of urine specimens provided by the Wave 4 Biomarker Core at Wave 5 was 98 percent (a total of 3,534 collected). The collection rate of urine specimens provided by the Wave 4 Biomarker Core at Wave 5, excluding 528 respondents sampled as Shadow Youth at Wave 4, was 86.5 percent (a total of 3,006 collected).
Hide2017-08-08
2024-07-17 2024-06-21 2024-06-14 Wave 4 data and documentation for one additional Wave 4 Biomarker Core Urine Panel Assay (DS4058) have been added to the collection. Wave 5 data and documentation for one additional Wave 4 Biomarker Core Urine Panel Assay (DS5054) have been added to the collection. The Laboratory File Inventories for Wave 1 and Wave 4 Biomarker Cores were also updated.
2023-12-15 Wave 4 data and documentation for one additional Wave 1 Biomarker Core Urine Panel Assay (DS4038) and accompanying weights (DS4023 and DS4024) have been added to the collection. Wave 4 data and documentation for three additional Wave 4 Biomarker Core Urine Panel Assays (DS4054, DS4055, and DS4057) have been added to the collection. Wave 5 data and documentation for Wave 4 Biomarker Core files that were previously released (DS5042, DS5051, DS5053, DS5056) have been updated to include new records pertaining to Wave 4 shadow youth who completed their first youth interview at Wave 5 and provided sufficient urine for laboratory analyses. Wave 5 data and documentation for three additional Wave 4 Biomarker Core Urine Panel Assays (DS5055, DS5057, and DS5058) have been added to the collection. The Biomarker Restricted Use Files User Guide, Laboratory File Inventories for Wave 1 and Wave 4 Biomarker Cores, the PAH Laboratory Panel document, and the PAH Laboratory Procedures document were also updated.
2023-03-22 Replaced Lab File Inventory.
2022-12-16 Wave 4 data and documentation for the Wave 4 Biomarker Core were added to the collection. Wave 4 data and documentation for one additional Wave 1 Biomarker Core Urine Panel Assay (DS4034) was added to the collection. Wave 5 data and documentation for two additional Wave 1 Biomarker Core Urine Panel Assays (DS5035 and DS5038) and accompanying weights (DS5023 and DS5024) have been added to the collection. The Biomarker Restricted Use Files User Guide, Laboratory File Inventories for Wave 1 and Wave 4 Biomarker Cores, the Metals Laboratory Panel document, and the Metals Laboratory Procedures document were also updated.
2022-05-11 Wave 5 data and documentation for the Wave 4 Biomarker Core were added to the collection. The Biomarker Restricted Use Files User Guide was also updated.
2021-12-16 Wave 4 documentation updated to reflect recent title revisions. Wave 5 data and documentation for the Wave 1 Biomarker Core were added to the collection. Data and documentation for one additional Wave 3 Urine Panel Assay (DS3038) and accompanying weights (DS3023 and DS3024) have been added to the collection. Data and documentation for Volatile Organic Compounds Metabolites (VOCM) Urine Panel Assays in Wave 1 (DS1037), Wave 2 (DS2037), Wave 3 (DS3037), and Wave 4 (DS4037) were updated to reflect revised result values for one analyte (N-Acetyl-S-(3-hydroxypropyl-1-methyl)-L-cysteine [analyte code HPMM]). See Appendix B in the VOCM Lab Panel Documentation for more details.
2021-07-15 Study was updated to resolve issue of multiple user guide files.
2021-06-03 Data and documentation for two additional Wave 4 Urine Panel Assays (DS4035 and DS4037) have been added to the collection. Access to Master Linkage Files is now available via a separate study (ICPSR 38008).
2020-08-17 Data and documentation for one additional Wave 2 Urine Panel Assay (DS2038) and accompanying weights (DS2022), one additional Wave 3 Urine Panel Assay (DS3034) have been added to the collection. In addition, data and documentation for the Wave 4 Urine Collection and NEQ (DS4001), two Wave 4 Urine Panel Weights (DS4021 and DS4022), and three Wave 4 Urine Panel Assays (DS4032, DS4033, and DS4036) have been added to the collection. Data and documentation for the Master Linkage File has also been updated to reflect these new data files.
2020-06-22 Data and documentation for the Master Linkage File and two additional Wave 3 Urine Panel Assays (DS3035 and DS3037) have been added to the collection.
2020-03-19 Data and documentation for the Master Linkage File has been updated.
2020-03-05 Data and documentation for the Wave 3 Urine Collection and NEQ (DS3001), two Wave 3 Urine Panel Weights (DS3021 and DS3022), and three Wave 3 Urine Panel Assays (DS3032, DS3033, and DS3036) have been added to the collection.
2019-11-05Data and Documentation for the Wave 1 Urine Collection and NEQ (DS1001) and Wave 1 Blood Collection (DS1101) were updated to reflect withdrawn participants (indicated by Special Missing -97777). Documentation was updated for 508 compliance at this time.
Dataset and corresponding documentation number-schemes have changed for Wave 1.
2019-03-19 Data and documentation for the Wave 2 urine collection and NEQ (DS2001), urine biomarker weights (DS2021), and seven urine panel assays (DS2031 to DS2037) have been added to the collection. Additionally, the master linkage file (DS0001) has been updated.
2019-02-01 Data and documentation for the Wave 1 VOCM urine panel (DS1009) and the master linkage file (DS0001) were updated for the collection.
2018-10-01 2018-09-28 Data and documentation for five panel assays (F2PGa, hsCRP, IL6, siCAM, and Fibro) was added to the collection. Additionally, data and documentation for the master linkage file was updated. The respondent's local time zones were converted to Eastern Time and the variable R01R_A_URINE_LASTURINATION was recalculated. One case was corrected for variable R01R_A_URINE_NEQ_TDIFF.
2018-05-01 The eight panel assay files were updated due to a modification in the derivation algorithm for the variables "R01R_A_XXXX_IMPFLAG". The text for this variable and that of "R01R_A_XXXX_RESULT_IMP" were updated to include a more detailed algorithm.
2018-02-15 The citation of this study may have changed due to the new version control system that has been implemented. The previous citation was:2017-08-08 ICPSR data undergo a confidentiality review and are altered when necessary to limit the risk of disclosure. ICPSR also routinely creates ready-to-go data files along with setups in the major statistical software formats as well as standard codebooks to accompany the data. In addition to these procedures, ICPSR performed the following processing steps for this data collection:
Wave 1 Biomarker Core
For Wave 1, there are two files containing weights for use in the analyses of biomarker data that reflect the complex PATH Study sample design. The final full-sample person-level weight for Urine is R01_A_URINEPWGT and for Blood it is R01_A_BLOODWGHT. Also, contained in these two files are 100 replicate weights and design variables (VARPSU and VARSTRAT) for use in variance estimation.
For Wave 2, there are two files containing longitudinal weights: Urine weights and F2PG2a* weights. For use with all urine assays except F2PG2a, the final full-sample person-level weight is R02_A_URINEPWGT. Also, contained in this file are 100 replicate weights and design variables (VARPSU and VARSTRAT) for use in variance estimation. For F2PG2a data, separate weights were created where the final full-sample person level weight is R02_A_BC01_URF2WGT. Also contained in this file are the 100 replicate weights and design variables (VARPSU and VARSTRAT) for use in variance estimation.
*It is important to note that although the oxidative stress biomarker 8-isoprostane (F2PG2a) is a urine-based assay, only urine specimens from people who also provided blood specimens at Wave 1 were analyzed to ensure that sample sizes were roughly equivalent to blood-based assays. Therefore, the appropriate weight to use when analyzing the Wave 1 F2PG2a data is the blood weight. For follow-up waves in which F2PG2a data are available, separate biomarker weights are available for analysis of these data.
For Wave 3, there are two files containing weights: the all-waves urine weights and the single-wave urine weights. The Wave 3 urine weights are for use in analyses including the Wave 3 urine biomarker data. The Wave 3 all-waves urine weights file contains weights for members of the Wave 1 Biomarker Core who completed the Wave 3 interview, provided a sufficient amount of urine for the planned analyses at Wave 3, and were also assigned a Wave 2 urine weight. The Wave 3 single-wave urine weights file contains weights for members of the Wave 1 Biomarker Core who completed the Wave 3 interview and provided a sufficient amount of urine for the planned analyses, regardless of their level of participation in the study at Wave 2. The Wave 3 all-waves weight is named R03_A_BC01A_URWGT. The Wave 3 single-wave weight is named R03_A_BC01S_URWGT. Each Wave 3 urine weights file also contains 100 replicate weights and design variables (VARPSU and VARSTRAT) for use in variance estimation. For Wave 3 F2PG2a data, there are two separate files containing weights: the all-waves F2PG2a weights and the single-wave F2PG2a weights. The Wave 3 all-waves F2PG2a weights file contains weights for members of the Wave 1 Biomarker Core who completed the Wave 3 interview and provided a sufficient amount of urine for the planned analysis, and had F2PG2a data available at Wave 1 and Wave 2. The Wave 3 single-wave F2PG2a weights file contains weights for members of the Wave 1 Biomarker Core who completed the Wave 3 interview and provided a sufficient amount of urine for the planned analysis, and had F2PG2a data available at Wave 1, regardless of their level of participation in the study at Wave 2. The Wave 3 F2PG2a all-waves weight is named R03_A_BC01A_URF2WGT. The Wave 3 single-wave weight is named R03_A_BC01S_URF2WGT. Each Wave 3 F2PG2a weights file also contains 100 replicate weights and design variables (VARPSU and VARSTRAT) for use in variance estimation.
For Wave 4, there are two files containing weights: the all-waves urine weights and the single-wave urine weights. The Wave 4 urine weights are for use in analyses including the Wave 4 urine biomarker data. The Wave 4 all-waves urine weights file contains weights for members of the Wave 1 Biomarker Core who completed the Wave 4 interview, provided a sufficient amount of urine for the planned analyses at Wave 4, and were also assigned Wave 2 and Wave 3 urine weights. The Wave 4 single-wave urine weights file contains weights for members of the Wave 1 Biomarker Core who completed the Wave 4 interview and provided a sufficient amount of urine for the planned analyses, regardless of their level of participation in the study at Wave 2 or Wave 3. The Wave 4 all-waves weight is named R04_A_BC01A_URWGT. The Wave 4 single-wave weight is named R04_A_BC01S_URWGT. Each Wave 4 urine weights file also contains 100 replicate weights and design variables (VARPSU and VARSTRAT) for use in variance estimation.
For Wave 5, there are two files containing weights for the Wave 1 Biomarker Core: the all-waves urine weights and the single-wave urine weights. The Wave 5 urine weights are for use in analyses including the Wave 5 urine biomarker data. The Wave 5 all-waves urine weights file contains weights for members of the Wave 1 Biomarker Core who completed the Wave 5 interview, provided a sufficient amount of urine for the planned analyses at Wave 5, and were also assigned Wave 2, Wave 3, and Wave 4 urine weights. The Wave 5 single-wave urine weights file contains weights for members of the Wave 1 Biomarker Core who completed the Wave 5 Adult Interview and provided a sufficient amount of urine for the planned analyses, regardless of their level of participation in the study at Wave 2, Wave 3, or Wave 4. The Wave 5 all-waves weight is named R05_A_BC01A_URWGT. The Wave 5 single-wave weight is named R05_A_BC01S_URWGT. Each Wave 5 urine weights file also contains 100 replicate weights and design variables (VARPSU and VARSTRAT) for use in variance estimation.
For Wave 5 F2PG2a data, there are two separate files containing weights: the all-waves F2PG2a weights and the single-wave F2PG2a weights. The Wave 5 all-waves F2PG2a weights file contains weights for members of the Wave 1 Biomarker Core who completed the Wave 5 interview and provided a sufficient amount of urine for the planned analysis, and had F2PG2a data available at Wave 1, Wave 2, Wave 3, and Wave 4. The Wave 5 single-wave F2PG2a weights file contains weights for members of the Wave 1 Biomarker Core who completed the Wave 5 interview and provided a sufficient amount of urine for the planned analysis, and had F2PG2a data available at Wave 1, regardless of their participation in the study at Wave 2, Wave 3, or Wave 4. The Wave 5 F2PG2a all-waves weight is named R05_A_BC01A_URF2WGT. The Wave 5 single wave weight is named R05_A_BC01S_URF2WGT. Each Wave 5 F2PG2a weights file also contains 100 replicate weights and design variables (VARPSU and VARSTRAT) for use in variance estimation.
Due to the subsampling of adults to determine the biospecimens sent for laboratory analysis, using the weights provided with the Wave 1 Biomarker Core data makes the data nationally representative of people who do or do not use tobacco products in six tobacco product use groups only. These estimates can be described as representative of people who never used tobacco, currently use tobacco, and formerly used tobacco (within the last 12 months) in the U.S. civilian, noninstitutionalized adult population at the time of Wave 1.
At subsequent waves, weighted estimates of Wave # data are nationally representative of adults in the U.S. civilian, noninstitutionalized population who never used tobacco, currently use tobacco, or formerly used tobacco (within the last 12 months) at Wave 1 and who lived in the United States (and were not incarcerated) at Wave #.
Wave 4 Biomarker Core
For Wave 4, there is one file containing cross-sectional urine weights. These urine weights are for use in analysis including the Wave 4 urine biomarker data for the Wave 4 Biomarker Core. The wave 4 cross-sectional urine weights file contains weights for members of the Wave 4 Biomarker Core who completed the Wave 4 youth interview and provided a sufficient amount of urine for the planned analyses at Wave 4. The Wave 4 cross-sectional weight is named R04_BC04_URWGT. The Wave 4 urine weights file for the Wave 4 Biomarker Core also contains 100 replicate weights and design variables (VARPSU and VARSTRAT) for use in variance estimation.
The Wave 5 single-wave urine weights file contains weights for members of the Wave 4 Biomarker Core who completed the Wave 5 Youth or Adult Interview and provided a sufficient amount of urine for the planned analysis. The Wave 5 single-wave weight is named R05_BC04S_URWGT. The Wave 5 urine weights file also contains 100 replicate weights and design variables (VARPSU and VARSTRAT) for use in variance estimation.
Due to the subsampling of youth to determine the biospecimens sent for laboratory analysis, using the weights provided with the Wave 4 Biomarker Core data makes the data nationally representative of the U.S. civilian, noninstitutionalized population ages 12 to 17 at Wave 4. At subsequent waves, weighted estimates of Wave # data are nationally representative of those in the U.S. civilian, noninstitutionalized population ages 10 to 17 at Wave 4 and who lived in the United States (and were not incarcerated) at Wave #.
Detailed information on how all weight variables were created, and how and why they should be used is provided in the Biomarker Restricted-Use Files User Guide. The weighting procedures adjust for oversampling of population groups and non-response. ICPSR strongly recommends that users read and understand this section before analyzing the data to ensure correct use of these variables.
HideThe public-use data files in this collection are available for access by the general public. Access does not require affiliation with an ICPSR member institution.
One or more files in this data collection have special restrictions. Restricted data files are not available for direct download from the website; click on the Restricted Data button to learn more.