National Health and Nutrition Examination Survey (NHANES), 2001-2002 (ICPSR 25502)
Version Date: Feb 22, 2012 View help for published
Principal Investigator(s): View help for Principal Investigator(s)
United States Department of Health and Human Services. Centers for Disease Control and Prevention. National Center for Health Statistics
Summary View help for Summary
The National Health and Nutrition Examination Surveys (NHANES) is a program of studies designed to assess the health and nutritional status of adults and children in the United States. The NHANES combines personal interviews and physical examinations, which focus on different population groups or health topics. These surveys have been conducted by the National Center for Health Statistics (NCHS) on a periodic basis from 1971 to 1994. In 1999 the NHANES became a continuous program with a changing focus on a variety of health and nutrition measurements which were designed to meet current and emerging concerns. The surveys examine a nationally representative sample of approximately 5,000 persons each year. These persons are located in counties across the United States, 15 of which are visited each year. The 2001-2002 NHANES contains data for 11,039 individuals (and MEC examined sample size of 10,477) of all ages. Many questions that were asked in NHANES II, 1976-1980, Hispanic HANES 1982-1984, and NHANES III, 1988-1994, were combined with new questions in the NHANES 2001-2002. As in past health examination surveys, data were collected on the prevalence of chronic conditions in the population. Estimates for previously undiagnosed conditions, as well as those known to and reported by survey respondents, are produced through the survey. Risk factors, those aspects of a person's lifestyle, constitution, heredity, or environment that may increase the chances of developing a certain disease or condition, were examined. Data on smoking, alcohol consumption, sexual practices, drug use, physical fitness and activity, weight, and dietary intake were collected. Information on certain aspects of reproductive health, such as use of oral contraceptives and breastfeeding practices, were also collected. The diseases, medical conditions, and health indicators that were studied include: anemia, cardiovascular disease, diabetes and lower extremity disease, environmental exposures, equilibrium, hearing loss, infectious diseases and immunization, kidney disease, mental health and cognitive functioning, nutrition, obesity, oral health, osteoporosis, physical fitness and physical functioning, reproductive history and sexual behavior, respiratory disease (asthma, chronic bronchitis, emphysema), sexually transmitted diseases, skin diseases, and vision. The sample for the survey was selected to represent the United States population of all ages. Special emphasis in the 2001-2002 NHANES was on adolescent health and the health of older Americans. To produce reliable statistics for these groups, adolescents aged 15-19 years and persons aged 60 years and older were over-sampled for the survey. African Americans and Mexican Americans were also over-sampled to enable accurate estimates for these groups. Several important areas in adolescent health, including nutrition and fitness and other aspects of growth and development, were addressed. Since the United States has experienced dramatic growth in the number of older people during the twentieth century, the aging population has major implications for health care needs, public policy, and research priorities. NCHS is working with public health agencies to increase the knowledge of the health status of older Americans. NHANES has a primary role in this endeavor. In the examination, all participants visit the physician who takes their pulse or blood pressure. Dietary interviews and body measurements are included for everyone. All but the very young have a blood sample taken and see the dentist. Depending upon the age of the participant, the rest of the examination includes tests and procedures to assess the various aspects of health listed above. Usually, the older the individual, the more extensive the examination. Some persons who are unable to come to the examination center may be given a less extensive examination in their homes. Demographic data file variables are grouped into three broad categories: (1) Status Variables: provide core information on the survey participant. Examples of the core variables include interview status, examination status, and sequence number. (Sequence number is a unique ID assigned to each sample person and is required to match the information on this demographic file to the rest of the NHANES 2001-2002 data). (2) Recoded Demographic Variables: these variables include age (age in months for persons through age 19 years, 11 months; age in years for 1-84 year olds, and a top-coded age group of 85 years of age and older), gender, a race/ethnicity variable, current or highest grade of education completed, (less than high school, high school, and more than high school education), country of birth (United States, Mexico, or other foreign born), Poverty Income Ratio (PIR), income, and a pregnancy status variable (adjudicated from various pregnancy related variables). Some of the groupings were made due to limited sample sizes for the two-year data set. (3) Interview and Examination Sample Weight Variables: sample weights are available for analyzing NHANES 2001-2002 data. For a complete listing of survey contents for all years of the NHANES see the document -- Survey Content -- NHANES 1999-2010.
Citation View help for Citation
Subject Terms View help for Subject Terms
Geographic Coverage View help for Geographic Coverage
Distributor(s) View help for Distributor(s)
Time Period(s) View help for Time Period(s)
Date of Collection View help for Date of Collection
Data Collection Notes View help for Data Collection Notes
In preparing the data files for this collection, the National Center for Health Statistics (NCHS) has removed direct identifiers and characteristics that might lead to identification of data subjects. As an additional precaution NCHS requires, under Section 308(d) of the Public Health Service Act (42 U.S.C. 242m), that data collected by NCHS not be used for any purpose other than statistical analysis and reporting. NCHS further requires that analysts not use the data to learn the identity of any persons or establishments and that the director of NCHS be notified if any identities are inadvertently discovered. ICPSR member institutions and other users ordering data from ICPSR are expected to adhere to these restrictions.
Many variables that are listed in the Demographic questionnaire sections of the Household Interview were omitted (by NCHS) from this data release due to concerns about participant confidentiality. NCHS did not include confidential and administrative data in this release and further, some variables have been recoded or top-coded to protect the confidentiality of survey participants.
Many of the NHANES 2001-2002 questions were also asked in NHANES II, 1976-1980, Hispanic HANES 1982-1984, and NHANES III, 1988-1994. New questions were added to the survey based on recommendations from survey collaborators, NCHS staff, and other interagency work groups.
NHANES 2001-2002 survey design and demographic variables are found in Part 1 Demographics file in this release. All of the data files can be linked by using the common survey participant identification number (variable name: SEQN). Merging information from multiple NHANES 2001-2002 data files using SEQN ensures that the appropriate information for each survey participant is linked correctly. All data files should be sorted by SEQN.
The NHANES 2001-2002 data files do not have the same number of records in each file. For example, there are different numbers of subjects in the Interview and Examination samples of the survey. Additionally, the number of records in each data file varies depending on gender and age profiles for the specific component(s).
The sample person demographic file is composed of a limited set of core variables that are required to analyze NHANES 2001-2002 data.
Per agreement with NCHS, ICPSR distributes the data file(s) and text of the technical documentation for this collection as prepared by NCHS.
All data files have been merged with the demographics file with the exception of those parts that did not contain the linking variable SEQN which are as follows: Questionnaire: Dietary Supplement Use -- Supplement Information (Part 215), Questionnaire: Dietary Supplement Use -- Ingredient Information (Part 216), Questionnaire: Dietary Supplement Use -- Supplement Blend (Part 217), and Questionnaire: Drug Information (Part 237).
The user guides that are presently available are comprised of documentation from the NCHS. These user guides do not reflect the merging of each file with the demographics file, as this was done by ICPSR staff.
Part 13 Balance (Subsample) and part 14 Examination: Bioelectrical Impedance Analysis have been updated to correct an error in the data files.
Study Design View help for Study Design
The most recent series of data collection waves for NHANES began in 1999. Every year, approximately 7,000 individuals, of all ages, are interviewed in their homes and of these, approximately 5,000 complete the health examination component of the survey. A majority of the health examinations are conducted in mobile examination centers (MECs). The MECs provide an ideal setting for the collection of high quality data in a standardized environment. In addition to the MEC examinations, a small number of survey participants receive an abbreviated health examination in their homes if they are unable to come to the MEC. The NHANES target population is the civilian, noninstitutionalized United States population. NHANES 2001-2002 includes over-sampling of low-income persons, adolescents 12-19 years of age, persons 60 years of age and older, African Americans, and Mexican Americans. Initially, households are identified for inclusion in the NHANES sample and an advance letter is mailed to each address informing the occupant(s) that an NHANES interviewer will visit their home. The household interview component is comprised of Screener, Sample Person, and Family interviews, each of which has a separate questionnaire (please refer to the data file documentation). Trained household interviewers administer all of the questionnaires. In most cases, the interview setting was the survey participant's home. The interview data are recorded using the Blaise computer-assisted personal interview (CAPI) system. When the interviewer arrives at the home, he or she shows an official identification badge and briefly explains the purpose of the survey. If the occupant has not seen the advance letter, a copy is given to them to read. The interviewer requests that the occupant answer a brief questionnaire to determine if any household occupants are eligible to participate in NHANES. If eligible individuals are identified, the interviewer proceeds with efforts to recruit these individuals. Initially, the interviewer explains the household questionnaires to all eligible participants 16 years of age and older, informs the potential respondents of their rights, and provides assurances about the confidentiality of the survey data (reiterating what is stated in the advance letter). A majority of the household interviews are conducted during the first contact. If this is inconvenient for the survey participant, an appointment is made to administer the household interview questionnaires later. Household interviews for survey participants under 16 years of age are conducted with a proxy (usually their parent or guardian). If there is no one living in the household who is over 16, participants under 16 years of age are permitted to self-report. Respondents are asked to sign an Interview Consent Form agreeing to participate in the household interview portion of the survey. For participants 16-17 years of age a parent or guardian consents and the child gives his/her assent. After the household interview is completed, the interviewer reviews a second informed consent brochure with the participant. This brochure contains detailed information about the NHANES health examination component. All interviewed persons are asked to complete the health examination component. Those who agree to participate are asked to sign additional consent forms for the health examination component. The interviewer telephones the NHANES field office from the participant's home to schedule an appointment for the examination. The interviewer informs the participants that they will receive remuneration as well as reimbursement for transportation and childcare expenses, if necessary.
There are different target population groups for the topics within and between NHANES questionnaire sections. For example, in the Nutrition and Diet Behavior section, questions pertaining to infant nutrition and breast-feeding were asked of proxy respondents for children 6 years of age and younger, alcohol consumption frequency questions were asked of persons 20 years of age and older, and senior meal program participation questions were asked of respondents 60 years of age and older. Data users should review the survey questionnaire codebooks thoroughly to determine the target populations for each NHANES questionnaire section and sub-section.
The NHANES Health Examination Component
When a participant arrives at the MEC, the MEC Coordinator greets the participant and verifies all pertinent identifier information. The participant is given a disposable paper gown, a pair of slippers, and a urine specimen cup. MEC staff direct survey participants to the rooms where the examination components are conducted. Each MEC survey team consists of one physician, one dentist, two dietary interviewers, three certified medical technologists, five health technicians, one phlebotomist, two interviewers, and one computer data manager. Additionally, there is a person designated as the Coordinator who is responsible for managing the movement of participants between examinations, providing remuneration, and distributing a preliminary report of findings. Upon completion of the examination, each examinee is remunerated. Some of the medical findings of the examination are given to the examinees before they leave the MEC. The other reportable survey findings are mailed to participants after the laboratory assays and special tests are completed.
Three MECs are equipped for use in NHANES. Each MEC consists of four large, inter-connected trailer units. An advance team sets up the MECs prior to the start of the survey examinations. Water, sewer, electrical, and communication lines are connected during set-up. The MEC equipment and data collection systems must be checked and calibrated prior to the start of survey data collection. The MECs are open a total of five days per week, and the nonoperational days change on a rotating basis so that appointments can be scheduled on any day of the week. Two examination sessions are conducted daily. For the convenience of the survey participants, appointments can be scheduled during morning, afternoon, or evening hours. The examinations require up to three hours to complete. At any given time during the survey, examinations are conducted at two survey locations simultaneously. Staff vacations are scheduled for periods of about one month at New Years and about two weeks during the summer, leaving ten and one-half months to conduct examinations.
Second Day Examinations and Dietary Interviews
Second day (i.e., repeat) MEC examinations were pilot tested in 1999 and implemented in 2000. Approximately 5 percent of examined persons over 20 years of age from each survey location were asked to complete second day exams. Second day dietary interviews were completed on ten percent of all examined persons in 2000. The second day dietary interview is a primary data collection technique to improve the estimation of distributions of nutrient intakes. Second day examination and dietary data are not included in this data file.
Participants under 1 year of age or 60 years of age and older, who are unable or unwilling to come to the MEC for an examination, are eligible to receive a home examination. Home examination data are not included in this data file.
Sample View help for Sample
The NHANES survey design is a stratified, multistage probability sample of the civilian noninstitutionalized United States population. The stages of sample selection are: (1) selection of Primary Sampling Units (PSUs) which are counties or small groups of contiguous counties, (2) segments within PSUs (a block or group of blocks containing a cluster of households), (3) households within segments, and (4) one or more participants within households. A total of 15 PSUs are visited during a 12-month time period. Details of the design and content of each survey are available at the NHANES Web site.
Universe View help for Universe
The NHANES target population is the civilian, noninstitutionalized United States population.
Unit(s) of Observation View help for Unit(s) of Observation
Data Type(s) View help for Data Type(s)
Mode of Data Collection View help for Mode of Data Collection
Presence of Common Scales View help for Presence of Common Scales
DISC -- Predictive Scale
Original Release Date View help for Original Release Date
Version History View help for Version History
- United States Department of Health and Human Services. Centers for Disease Control and Prevention. National Center for Health Statistics. National Health and Nutrition Examination Survey (NHANES), 2001-2002. ICPSR25502-v5. Ann Arbor, MI: Inter-university Consortium for Political and Social Research [distributor], 2012-02-22. http://doi.org/10.3886/ICPSR25502.v5
2012-02-22 Data and documentation for the following parts have been updated: 13, 14, 110, 136. Additional updates will occur in the future as the NCHS revises the NHANES data collection frequently.
2011-05-24 Codebooks were updated for Part 13, Part 14, and Part 133
2011-05-10 Part 13 Balance (Subsample) and part 14 Examination: Bioelectrical Impedance Analysis have been updated to correct an error in the data files.
2011-05-03 The majority of data files in the collection were merged with the demographics file.
2010-01-25 ICPSR data undergo a confidentiality review and are altered when necessary to limit the risk of disclosure. ICPSR also routinely creates ready-to-go data files along with setups in the major statistical software formats as well as standard codebooks to accompany the data. In addition to these procedures, ICPSR performed the following processing steps for this data collection:
- Performed consistency checks.
- Created variable labels and/or value labels.
- Standardized missing values.
- Checked for undocumented or out-of-range codes.
Weight View help for Weight
Sample weights are available for analyzing NHANES 2001-2002 data. Most data analyses require either the interviewed sample weight (variable name: WTINT2YR) or examined sample weight (variable name: WTMEC2YR). The two-year sample weights (WTINT2YR, WTMEC2YR) should be used for NHANES 2001-2002 analyses and the four-year sample weights (WTINT4YR, WTMEC4YR) for combined analyses of NHANES 1999-2000 and NHANES 2001-2002 data. Use of the correct sample weight for NHANES analyses is extremely important and depends on the variables being used. A good rule of thumb is to use "the least common denominator" approach. With this approach, the analyst checks the variables of interest. The variable that was collected on the smallest number of persons is the least common denominator, and the sample weight that applies to that variable is the appropriate one to use for that particular analysis. Please refer to the NHANES 2001-2002 analytic guidelines provided with the data release files to determine the appropriate analytic methodology.
NCHS June 2004 Version--NHANES Analytic Guidelines
Beginning in 1999, the National Health and Nutrition Examination Survey (NHANES) became a continuous, annual survey rather than the periodic survey that it had been in the past. For a variety of reasons, including disclosure and reliability issues, the survey data are released on public use data files every two years. Thus, the data release cycle for the ongoing (and continuous) NHANES is described as NHANES 1999-2000, NHANES 2001-2002, NHANES 2003-2004, etc. In addition to the analysis of data from any two-year cycle, it is possible to combine two or more "cycles" (e.g., 1999-2000 and 2001-2002) to create NHANES 1999-2002, thus increasing sample size and analytic options. In order to produce estimates with greater statistical reliability, combining two or more two-year cycles of the continuous NHANES is encouraged and strongly recommended. To facilitate analysis of these first two cycles of the continuous NHANES, the appropriate four-year sample weights have been calculated and added to the demographic data files for both 1999-2000 and 2001-2002. Thus, users of the earlier release of the NHANES 1999-2000 demographic file must use the updated demographic file to appropriately analyze the combined four-year data. These four-year sample weights have the same variable name in each two-year demographic file. For example, for the sample persons for whom there are MEC data items, the variable name for the four year weight is WTMEC4YR. The procedure for variance estimation (sampling errors) was changed beginning with NHANES 1999-2000 to one that protects confidentiality and allows the use of "PSUs" and is the recommended approach for analysis on the ongoing and continuous NHANES. This method creates Masked Variance Units (MVUs) which can be used as if they were Pseudo PSUs to estimate sampling errors (similar to past NHANES). The Pseudo-PSUs on the data file are not the "true" design PSUs. They are a collection of secondary sampling units aggregated into groups called Masked Variance Units (MVUs) for the purpose of variance estimation. They produce variance estimates that closely approximate the variances that would have been estimated using the "true" design variance estimates. These MVUs have been created for both NHANES 1999-2000 and NHANES 2001-2002 and added to the demographic data files for both two-year periods. They can also be used for the combined four-year dataset. The stratum variable is SDMVSTRA and the PSU variable is SDMVPSU. Software such as SUDAAN, STATA, and SAS can be used to estimate sampling errors by the Taylor series (linearization) method. Typically, the dataset should first be sorted by SDMVSTRA and SDMVPSU. There are no replicate weights provided for NHANES 2001-2002. Replication techniques can still be used to estimate sampling errors if the software, such as WESVAR, computes its own set of replicate weights based on the nested PSU within stratum design. Some data components of NHANES are only available for 1999-2000 or 2001-2002. For these components, the two-year sample weights and the two-year MVUs can be used for analysis. For 2001-2002, the two-year weights and MVUs are provided on the demographic data file. For 1999-2000, the previously released demographic file has been updated to add the MVUs and four-year sample weights. At this time, the preferred approach for calculating sampling errors is to use the MVUs and to ignore the JK-1 technique utilized as an interim approach with the release of the NHANES 1999-2000 data. On occasion, there may be a particular issue that requires comparison of results from NHANES 1999-2000 with NHANES 2001-2002. For summary statistics such as means and proportions, the appropriate two-year sample weights and MVUs can be used for reasonably valid inferences (although caution should be used when producing estimates for any detailed population subgroup). Calculation of totals (e.g., estimates of the number of persons with a particular characteristic such as BMI greater than 30 or with impaired glucose tolerance) is not appropriate for NHANES 1999-2000 unless the numbers are ratio-adjusted to population counts based on year 2000 Census figures. This is not an issue for NHANES 2001-2002 which was linked to the 2000 Census counts. Users are encouraged to check the NHANES Web site on a regular basis to be aware of the latest version of these NHANES analytic guidelines.
The public-use data files in this collection are available for access by the general public. Access does not require affiliation with an ICPSR member institution.
- The citation of this study may have changed due to the new version control system that has been implemented. Please see version history for more details.
This study is maintained and distributed by the National Archive of Computerized Data on Aging (NACDA), the aging program within ICPSR. NACDA is sponsored by the National Institute on Aging (NIA) at the National Institutes of Heath (NIH).