National Archive of Criminal Justice Data

This dataset is maintained and distributed by the National Archive of Criminal Justice Data (NACJD), the criminal justice archive within ICPSR. NACJD is primarily sponsored by three agencies within the U.S. Department of Justice: the Bureau of Justice Statistics, the National Institute of Justice, and the Office of Juvenile Justice and Delinquency Prevention.

Recidivism in the National Longitudinal Survey of Youth 1997 - Standalone Data (Rounds 1 to 13) (ICPSR 34562)

Principal Investigator(s): United States Department of Justice. Office of Justice Programs. Bureau of Justice Statistics


The NLSY97 standalone data files are intended for crime researchers who want to run analyses without supplementing them from the main NLSY97 data set. The files contain age-based and calendar-year variables on arrests and incarcerations, self-reported criminal activity, substance use, demographic characteristics, and relevant variables from other domains, all created from the NLSY97 data. The main NLSY97 data are available for public use and can be accessed online at the NLS Investigator Web site and at the NACJD Web site (as ICPSR 3959). Questionnaires, user guides, and other documentation are available at the same links.

The National Longitudinal Survey of Youth 1997 (NLSY97) was designed by the United States Department of Labor as part of the National Longitudinal Survey (NLS) Series. Created to be representative of United States residents in 1997 who were born between 1980 and 1984, the NLSY97 documents the transition from school to work experienced by today's youths, using data collected from 1997 onward. The majority of the oldest cohort members (age 16 as of December 31, 1996) were still in school during the first survey round, and the youngest respondents (age 12) had not yet entered the labor market.

Access Notes

  • The public-use data files in this collection are available for access by the general public. Access does not require affiliation with an ICPSR member institution.


DS0:  Study-Level Files
DS1:  Self-reported Crimes Round 1 to 7 (44.428 MB)
DS2:  Self-reported Crimes Round 8 to 13 (17.411 MB)

DS1 and DS2 are each available in SAS, SPSS, Stata, R, ASCII, and Excel/TSV formats, with ASCII + SAS, SPSS, and Stata setup files.
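For readers working with the Excel/TSV version of a dataset, the following is a minimal sketch of reading a tab-separated file with Python's standard library. The column names (`RESPONDENT_ID`, `ARRESTS_1998`) and the sample rows are hypothetical placeholders, not actual variables or values from this collection; consult the codebook for the real variable names.

```python
import csv
import io


def read_tsv(handle):
    """Read a tab-separated file-like object into a list of dicts.

    Every value is returned as a string; convert types as needed
    after checking the codebook for each variable.
    """
    reader = csv.DictReader(handle, delimiter="\t")
    return list(reader)


# Hypothetical two-row sample standing in for a downloaded TSV file.
# These column names and values are invented for illustration only.
sample = io.StringIO(
    "RESPONDENT_ID\tARRESTS_1998\n"
    "1\t0\n"
    "2\t1\n"
)
rows = read_tsv(sample)
```

With a real download, the same function could be called on a file opened with `open(path, newline="")`, where `path` points to the extracted TSV file.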

Study Description


United States Department of Justice. Office of Justice Programs. Bureau of Justice Statistics. Recidivism in the National Longitudinal Survey of Youth 1997 - Standalone Data (Rounds 1 to 13). ICPSR34562-v1. Ann Arbor, MI: Inter-university Consortium for Political and Social Research [distributor], 2014-02-06.

Persistent URL:


Scope of Study

Subject Terms:    alcohol, arrests, crime, demographic characteristics, educational background, health, income, substance abuse, tobacco use, young adults, youths

Geographic Coverage:    United States

Time Period:    1997--2009

Date of Collection:    1997--2009

Unit of Observation:    individual

Universe:    All household residents aged 12 to 16 as of December 31, 1996, in the United States.
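The age window in the universe definition lines up with the 1980 to 1984 birth years mentioned in the summary. As a rough illustration, the sketch below computes a person's age on the reference date of December 31, 1996 and checks whether it falls in the 12 to 16 range. The function names are hypothetical and the check is only an arithmetic illustration; it is not the survey's actual screening procedure.

```python
from datetime import date

# Reference date from the universe definition.
REFERENCE_DATE = date(1996, 12, 31)


def age_on(birth_date, reference=REFERENCE_DATE):
    """Return completed years of age on the reference date."""
    years = reference.year - birth_date.year
    # Subtract one if the birthday has not yet occurred in the reference year.
    if (reference.month, reference.day) < (birth_date.month, birth_date.day):
        years -= 1
    return years


def in_universe(birth_date):
    """True if the person was aged 12 to 16 on December 31, 1996."""
    return 12 <= age_on(birth_date) <= 16


# Boundary cases: birth years 1980 through 1984 fall inside the window.
oldest = in_universe(date(1980, 1, 1))       # age 16 on the reference date
youngest = in_universe(date(1984, 12, 31))   # age 12 on the reference date
too_old = in_universe(date(1979, 12, 31))    # age 17 on the reference date
too_young = in_universe(date(1985, 1, 1))    # age 11 on the reference date
```

Because the reference date is the last day of the year, age on that date reduces to 1996 minus the birth year, which is why the 12 to 16 window corresponds exactly to births in 1980 through 1984.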

Data Type(s):    survey data

Data Collection Notes:

Additional information pertaining to the NLSY97 can be found at the Bureau of Labor Statistics Web site for NLSY97.


Sample:    The NLSY97 cohort comprises two independent probability samples: a cross-sectional sample and an oversample of black and/or Hispanic or Latino respondents. The cohort was selected using these two samples to meet the survey design requirement of providing sufficient numbers of black and Hispanic or Latino respondents for statistical analysis. The NLSY97 cohort was selected in two phases. In the first phase, a list of housing units for the cross-sectional sample and the oversample was derived from two independently selected, stratified multistage area probability samples. This ensured an accurate representation of different sections of the population defined by race, income, region, and other factors. In the second phase, subsamples of the eligible persons identified in the first phase were selected for interview.

Time Method:    Longitudinal: Cohort/ Event-based

Mode of Data Collection:    face-to-face interview, self-enumerated questionnaire

Extent of Processing:   ICPSR data undergo a confidentiality review and are altered when necessary to limit the risk of disclosure. ICPSR also routinely creates ready-to-go data files along with setups in the major statistical software formats as well as standard codebooks to accompany the data. In addition to these procedures, ICPSR performed the following processing steps for this data collection:

  • Created variable labels and/or value labels.
  • Standardized missing values.
  • Checked for undocumented or out-of-range codes.


Original ICPSR Release:   2014-02-06


