Collaborative Psychiatric Epidemiology Surveys
The Collaborative Psychiatric Epidemiology Surveys (CPES) were created with support from the National Institute of Mental Health (NIMH). CPES joins together three nationally representative surveys: the National Comorbidity Survey Replication (NCS-R), the National Survey of American Life (NSAL), and the National Latino and Asian American Study (NLAAS).
More information on the CPES survey can be found on the CPES website.
Collaborative Psychiatric Epidemiology Surveys (CPES), 2001-2003 [United States] (ICPSR 20240)
Principal Investigator(s): Alegria, Margarita, Center for Multicultural Mental Health Research at Cambridge Health Alliance; Jackson, James S., University of Michigan. Institute for Social Research; Kessler, Ronald C., Harvard Medical School. Department of Health Care Policy; Takeuchi, David, University of Washington
One or more files in this study are not available for download due to special restrictions ; consult the restrictions note to learn more. You can apply online for access to the data. A login is required to apply for access. (How to apply.)
The restricted versions of this data collection may not be used for any purpose other than statistical reporting and analysis. Use of the restricted versions to learn the identity of any person or establishment is prohibited. To protect respondent privacy, the names and IDs are restricted from general dissemination. To obtain this file, researchers must agree to the terms and conditions of a Restricted Data Use Agreement in accordance with existing ICPSR servicing policies.
WARNING: This study is over 150MB in size and may take several minutes to download on a typical internet connection.
Alegria, Margarita, James S. Jackson, Ronald C. Kessler, and David Takeuchi. Collaborative Psychiatric Epidemiology Surveys (CPES), 2001-2003 [United States]. ICPSR20240-v6. Ann Arbor, MI: Inter-university Consortium for Political and Social Research [distributor], 2008-06-19. http://doi.org/10.3886/ICPSR20240.v6
Persistent URL: http://doi.org/10.3886/ICPSR20240.v6
Scope of Study
Subject Terms: African Americans, anxiety, Asian Americans, crosscultural differences, cultural influences, depression (psychology), health services utilization, Hispanic or Latino Americans, mental disorders, mental health, mental health services, minorities
Geographic Coverage: United States
Unit of Observation: individual
Universe: The CPES universe was defined by the union of the following survey populations for the three component surveys: The NCS-R universe included English-speaking adults aged 18 years and older residing in households located in the coterminous United States. The NSAL universe included adults in the three target groups: Black Americans of African descent, Black Americans of Caribbean descent, and White Americans, who were aged 18 years and older residing in households located in the coterminous United States. The NLAAS universe included Latino American, Asian American, and non-Latino, non-Asian White American adults aged 18 and older residing in households located in the coterminous United States and the state of Hawaii.
Data Types: survey data
Data Collection Notes:
The master CPES datasets are updated periodically as various, typically small, errors are detected. Diagnostic algorithms are periodically updated as well. This updating will continue in the future as needed and public users will be informed of these updates. It is important for public users to recognize that, because of these changes, it will not be possible to reproduce results reported in earlier publications.
The NATIONAL COMORBIDITY SURVEY REPLICATION (NCS-R) is supported by the National Institute of Mental Health (NIMH, U01-MH60220) with supplemental support from the National Institute of Drug Abuse (NIDA, R01-DA12058-05), the Substance Abuse and Mental Health Services Administration (SAMHSA), the Robert Wood Johnson Foundation (RWJF, grant 044780), and the John W. Alden Trust. Collaborating investigators include Ronald C. Kessler (Principal Investigator, Harvard Medical School), Kathleen Merikangas (Co-Principal Investigator, NIMH), James Anthony (Michigan State University), William Eaton (The Johns Hopkins University), Meyer Glantz (NIDA), Doreen Koretz (Harvard University), Jane McLeod (Indiana University), Mark Olfson (Columbia University College of Physicians and Surgeons), Harold Pincus (University of Pittsburgh), Greg Simon (Group Health Cooperative), Michael Von Korff (Group Health Cooperative), Philip Wang (Harvard Medical School), Kenneth Wells (UCLA), Elaine Wethington (Cornell University), and Hans-Ulrich Wittchen (Institute of Clinical Psychology, Technical University Dresden and Max Planck Institute of Psychiatry).
The NATIONAL SURVEY OF AMERICAN LIFE (NSAL) is supported by the National Institute of Mental Health (NIMH U01-MH57716), with supplemental support from the Office of Behavioral and Social Sciences Research (OBSSR) at the National Institute of Health (NIH), and the University of Michigan. Collaborating investigators include James S. Jackson (Principal Investigator, University of Michigan, Survey Research Center), Harold W. Neighbors (Co-Principal Investigator, University of Michigan, Research Center for Group Dynamics), David R. Williams (Co-Principal Investigator, University of Michigan, Survey Research Center), Robert J. Taylor (Co-Principal Investigator, University of Michigan, Research Center for Group Dynamics), Cleopatra H. Caldwell (Co-Investigator, University of Michigan, Research Center for Group Dynamics), Steven J. Trierweiler (Co-Investigator, University of Michigan, Research Center for Group Dynamics), and Randolph M. Nesse (Co-Investigator, University of Michigan, Research Center for Group Dynamics.
The NATIONAL LATINO AND ASIAN AMERICAN STUDY (NLAAS) is supported by the National Institute of Mental Health (NIMH, U01-MH062209, and U01-MH62207), with supplemental support from the Office of Behavioral and Social Sciences Research (OBSSR) at the National Institute of Health (NIH), the Substance Abuse and Mental Health Services Agency (SAMHSA), and the Latino Research Program Project (P01-MH059876). Collaborating investigators include Margarita Alegria (Principal Investigator, Center for Multicultural Mental Health Research at Cambridge Health Alliance), David Takeuchi (Principal Investigator, University of Washington), Sergio Aguilar-Gaxiola (Co-Investigator, California State University-Fresno), Glorisa Canino (Co-Investigator, University of Puerto Rico), Naihua Duan (Co-Investigator, University of California-Los Angeles), Javier Escobar (Co-Investigator, Robert Wood Johnson Medical School), Peter Guarnaccia (Co-Investigator, Rutgers University), Alexander Ortega (Co-Investigator, University of California-Los Angeles), Patrick Shrout (Co-Investigator, New York University), William Vega (Co-Investigator, Robert Wood Johnson Medical School), and Mildred Vera (Co-Investigator, University of Puerto Rico).
Other collaborators from the National Institute of Mental Health include Ellen Stover (Acting Director, Division of Adult Translational Research and Treatment Development), Wayne Fenton (Scientific Collaborator and previous Director, Division of Adult Translational Research and Treatment Development), Lisa Colpe (Project Officer, NIMH), and Karen Bourdon (Project Officer, NIMH).
Sample: The sample for all three surveys consisted of primary sampling units selected with probabilities proportional to size. Data collection for the three surveys was conducted in a total of 252 geographic areas or primary sampling units across the United States. Only 50 of these areas were shared by all three surveys, those representing the most densely populated areas of the country. These latter areas were selected into all three surveys with certainty. In addition, there were 52 areas unique to NSAL and 18 areas unique to NLAAS. These unique areas were introduced to reflect the particular racial and ethnic focus of those surveys. A more detailed discussion of the CPES sample design and that for the individual surveys can be found in the User Guide.
Weight: See the User Guide for the technical report outlining the method used for integrating the design-based analysis weights and variance estimation codes for the three individual surveys to permit analysts to approach analysis of the combined dataset as though it were a single, nationally-representative survey.
Mode of Data Collection: computer-assisted personal interview (CAPI), computer-assisted telephone interview (CATI), telephone interview
Response Rates: For NCS-R, a sample of 13,054 addresses was fielded. In 98.1 percent (11,222) of occupied housing units (11,443), interviewers were able to determine whether or not the household was eligible for inclusion in the study. A total of 10,622 addresses yielded an eligible household and 9,282 adult interviews were completed: 7,693 interviews with the main respondent and 1,589 interviews with a second adult in the household. An additional 554 interviews were collected using a shortened form of the instrument with a subsample of nonrespondents in an effort to assess nonresponse bias. The final weighted response rate for NCS-R, excluding the short form interviews, was 70.9 percent for primary respondents. The response rate for the second respondents was 80.4 percent. For NSAL, 11,634 eligible households were identified from 26,495 randomly sampled addresses. A total of 6,199 adult respondents were interviewed as part of NSAL (1,006 White respondents, 1,623 respondents of Caribbean descent, and 3,570 African American respondents) although the actual number of cases in the CPES data file is 6,082 (3,570 African American, 1,621 Afro-Caribbean, and 891 non-Hispanic White). An extremely small sample (n = 115) of White adults who were interviewed in households where the White subsample was less than 10 percent of the African American density stratum were excluded from the final dataset as well as two of the Afro-Caribbean interviews when it was later discovered that they were duplicate cases. The overall response rate for the core NSAL national sample was 71.5 percent. The Caribbean Supplement sample, which was designed to target areas with high concentrations of persons of Caribbean origin, yielded a weighted response rate of 76.4 percent. For NLAAS, from a total sample of 27,026 addresses, 4,345 eligible main respondents and 1,234 eligible second adult respondents were identified, and 3,620 main respondent and 1,029 second adult interviews were completed. The weighted response rate for NLAAS was 75.7 percent among main respondents (77.6 percent for Latinos, 69.3 percent for Asians). For second respondents, the final response rate was 80.3 percent (82.4 percent for Latinos, 73.7 percent for Asians).
Extent of Processing: ICPSR data undergo a confidentiality review and are altered when necessary to limit the risk of disclosure. ICPSR also routinely creates ready-to-go data files along with setups in the major statistical software formats as well as standard codebooks to accompany the data. In addition to these procedures, ICPSR performed the following processing steps for this data collection:
- Performed consistency checks.
- Standardized missing values.
- Checked for undocumented or out-of-range codes.
Original ICPSR Release: 2007-06-14
- 2014-04-23 2014-04-17 Internal records were updated.
- 2014-04-08 Internal records were updated.
- 2014-03-31 Internal records were updated.
- 2014-03-21 Internal records were updated.
- 2009-01-07 This second version of the restricted CPES files includes additional variables (that were not included in the original restricted files). An Excel file that lists the variables included in each file is also provided with this version.
- 2008-06-19 A supplementary file for NSAL public-use file has been added to the study. This file contains 10 variables along with 3 ID variables for merging with the full NSAL dataset. An Errata file has been also added to the PDF codebook.
- 2008-05-19 Restricted Versions of the National Comorbidity Survey Replication (NCS-R), National Survey of American Life (NSAL), and National Latino and Asian American Study (NLAAS) have been added to the collection.
- 2008-04-07 The data producers have provided updated datasets and CPES Processor Notes. They have resolved discrepancies, updated algorithms, and identified additional variables that can be harmonized. They have created new versions of the CPES files that contain thousands of these types of changes, many at the cell level. Many of these changes represent errors in the original data, therefore previous versions of the CPES file are obsolete and should not be used. The data producers were careful not to change any variable names so that you can use previously-developed code to run against these corrected and updated files to determine if any of these changes impact your analysis. As part of additional harmonization efforts across NCS-R, NLAAS, and NSAL, more variables have been recoded and dropped. The Recoded Variables and Dropped Variables sections of the Processor Notes provide detail on dropped and recoded variables. PS_FLAG (V09435) was added for NCS-R and linked to PS_FLAG for NLAAS and NSAL. CPESCASE (CPES Case ID) was regenerated, and VERSION (Version release number) was added. In the dataset released in July 2007, the Pharmacoepidemiology variables were not completely linked across NCS-R, NLAAS, and NSAL. This has been corrected. These variables include: Med1-reason stop taking: 1st to 6th mention, Med2-reason stop taking: 1st to 4th mention, Med3-reason stop taking: 1st and 2nd mention, Med1-problems took medicine for: 1st to 10th mention, Med2-problems took medicine for: 1st to 9th mention, Med3-problems took medicine for: 1st to 4th mention. Note that for "Med1-reason stop taking: 1st mention" the NCS-R variable (V03031) has not been linked to NLAAS and NSAL (V03011), this was overlooked in the harmonization, but users may link the NCS-R variable to the NLAAS and NSAL variables.
- 2007-12-18 The data producer provided an updated version of the Processor Notes including a new table called: "Constructed Variable Names: Differences between CPES Interactive Documentation and Datasets."
- 2007-08-15 Two sampling error variables, SECLUSTR and SESTRAT, were added to the individual project datasets and the onset/recency diagnostic variables (i.e. any variable that ended in RECI, RECD, ONI, or OND) were bottom-coded (4 or less). In addition, the following variables were added: To NCS-R: D_BIPOLAR130, D_BIPOLAR1130, D_BIPOLARSUB30, D_PTS30, GAD_ONI, GAD_RECI, I_ASA12, I_ASA30, I_ASAH12, I_ASAH30, I_GAD12, I_GAD30, I_GADH12, I_GADH30, I_PTS12, I_PTS30, ICD_ASA, ICD_ASAH, ICD_GAD, ICD_GADH, ICD_PTS, ICD_SAD, ICD_SADH, D_BIPOLARSUB30. To NLAAS: gad_reci, i_gad12, i_gadh12, i_pts12, icd_gad, icd_gadh, icd_pts.
- View publications for the study (~488)
Most Recent Publications
- Citations exports are provided above.
Export Study-level metadata (does not include variable-level metadata)
If you're looking for collection-level metadata rather than an individual metadata record, please visit our Metadata Records page.