National Longitudinal Study of Adolescent to Adult Health (Add Health), 1994-2025 [Public Use] (ICPSR 21600)
Downloads of Add Health require submission of the following information, which is shared with the original producer of Add Health: supervisor name, supervisor email, and reason for download. A Data Guide for this study is available as a web page and for download.
The National Longitudinal Study of Adolescent to Adult Health (Add Health), 1994-2025 [Public Use] is a longitudinal study of a nationally representative sample of U.S. adolescents in grades 7 through 12 during the 1994-1995 school year. The Add Health cohort was followed into young adulthood with four in-home interviews, the fourth conducted in 2008-2009 when the sample was aged 24-32, and has since been followed into early midlife in Waves V and VI. Add Health combines longitudinal survey data on respondents' social, economic, psychological, and physical well-being with contextual data on the family, neighborhood, community, school, friendships, peer groups, and romantic relationships.
Add Health Wave I data collection took place between September 1994 and December 1995, and included both an in-school questionnaire and an in-home interview. The in-school questionnaire was administered to more than 90,000 students in grades 7 through 12, and gathered information on social and demographic characteristics of adolescent respondents, education and occupation of parents, household structure, expectations for the future, self-esteem, health status, risk behaviors, friendships, and school-year extracurricular activities. All students listed on a sample school's roster were eligible for selection into the core in-home interview sample. In-home interviews included topics such as health status, health-facility utilization, nutrition, peer networks, decision-making processes, family composition and dynamics, educational aspirations and expectations, employment experience, romantic and sexual partnerships, substance use, and criminal activities. A parent, preferably the resident mother, of each adolescent respondent interviewed in Wave I was also asked to complete an interviewer-assisted questionnaire covering topics such as inheritable health conditions, marriages and marriage-like relationships, neighborhood characteristics, involvement in volunteer, civic, and school activities, health-affecting behaviors, education and employment, household income and economic assistance, parent-adolescent communication and interaction, and the parent's familiarity with the adolescent's friends and friends' parents.
Add Health data collection recommenced for Wave II from April to August 1996, and included almost 15,000 follow-up in-home interviews with adolescents from Wave I. Interview questions were generally similar to Wave I, but also included questions about sun exposure and more detailed nutrition questions. Respondents were asked to report their height and weight during the course of the interview, and were also weighed and measured by the interviewer.
From August 2001 to April 2002, Wave III data were collected through in-home interviews with 15,170 Wave I respondents (now 18 to 26 years old), as well as interviews with their partners. Respondents were administered survey questions designed to obtain information about family, relationships, sexual experiences, childbearing, and educational histories, as well as labor force involvement, civic participation, religion and spirituality, mental health, health insurance, illness, delinquency and violence, gambling, substance abuse, and involvement with the criminal justice system. High School Transcript Release Forms were also collected at Wave III, and these data make up the Education Data component of the Add Health study.
Wave IV in-home interviews were conducted in 2008 and 2009 when the original Wave I respondents were 24 to 32 years old. Longitudinal survey data were collected on the social, economic, psychological, and health circumstances of respondents, as well as longitudinal geographic data. Survey questions were expanded on educational transitions, economic status and financial resources and strains, sleep patterns and sleep quality, eating habits and nutrition, illnesses and medications, physical activities, emotional content and quality of current or most recent romantic/cohabiting/marriage relationships, and maltreatment during childhood by caregivers. Dates and circumstances of key life events occurring in young adulthood were also recorded, including a complete marriage and cohabitation history, full pregnancy and fertility histories from both men and women, an educational history of dates of degrees and school attendance, contact with the criminal justice system, military service, and various employment events, including the dates of first and current jobs, with respective information on occupation, industry, wages, hours, and benefits. Finally, physical measurements and biospecimens were collected at Wave IV, including anthropometric measures of weight, height, and waist circumference; cardiovascular measures such as systolic blood pressure, diastolic blood pressure, and pulse; metabolic measures from dried blood spots assayed for lipids, glucose, and glycosylated hemoglobin (HbA1c); and measures of inflammation and immune function, including high-sensitivity C-reactive protein (hsCRP) and Epstein-Barr virus (EBV).
Wave V data collection took place from 2016 to 2018, when the original Wave I respondents were 33 to 43 years old. For the first time, a mixed-mode survey design was used. In addition, several experiments were embedded in early phases of the data collection to test response to various treatments. A similar range of data was collected on social, environmental, economic, behavioral, and health circumstances of respondents, with the addition of retrospective child health and socio-economic status questions. Physical measurements and biospecimens were again collected at Wave V, and included most of the same measures as at Wave IV.
The overall goal of Wave VI was to better understand life course trajectories, determinants, and consequences of critical dimensions of aging, health, and health disparities among U.S. early midlife adults. Data collection took place from 2022 to 2025, when participants were between the ages of 39 and 51, with an average age of 44. Beyond longitudinal survey measures, newly added questions included those on cumulative stress, discrimination, despair, work-life balance, memory, physical limitations, and caregiving. Continuing from previous waves, home exams collected physical measurements and biospecimens with most of the same measures as Wave V.
Schools and Families Educating (SAFE) Children Study [Chicago, IL]: 1997-2008 (ICPSR 34368)
The Schools and Families Educating (SAFE) Children Study was a randomized controlled trial designed to test the efficacy of a family-based comprehensive preventive intervention, delivered to children living in inner-city Chicago and entering the 1st grade, for effects on key risk markers for later drug and other substance use.
A total of 11 waves of data were collected over the course of three phases and approximately 13 years. In the spring of 1997, 424 kindergarten students and their primary caregivers were recruited to participate in this study. Wave 1 began while the children were in 1st grade. These data contain survey responses for students, their primary caregivers, and their teachers across 27 datasets.
Phase I of the study assessed the intervention provided in the 1st grade. Half of the families were randomly selected to receive the intervention. The other half were assigned to the control group. Phase II of the study was set up to give half of the intervention group a booster, a second intervention training. Lastly, Phase III sought to assess the long-term effects of the initial and booster interventions.
The first dataset (DS1) provides an overview of the study which includes variables for the study design and survey administration. This first file contains 38 variables.
Survey responses were obtained from students nine times beginning in 1st grade and ending in 12th grade. Children were not surveyed in waves 3 and 7. The student survey response data are in DS2 through DS10. The datasets for waves 1, 2, 4, and 5 contain only 50 variables. Waves 6, 8, and 9 contain 424 variables. Waves 10 and 11 contain 1,394 variables. Each of the three phases contains almost identical variables within its respective waves.
The children's primary caregivers were also surveyed nine times over the survey period. Primary caregivers were not surveyed in waves 3 and 7. These data are contained in DS11 through DS19. The primary caregiver files vary in the number and content of variables. On average, each wave contains about 1,060 variables, ranging from a low of 470 to a high of 1,435.
Teachers were surveyed during each of the first eight waves of the study. The teacher data are in DS20 through DS27. Waves 1 and 2 contain just over 120 variables. Waves 3, 4, and 5 contain 145 variables. Waves 6, 7, and 8 contain 173 variables. Each of the three phases contains almost identical variables within its respective waves.
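For readers working with the 27 SAFE Children files, the layout described above can be sketched as a lookup from dataset number to respondent type and wave. This is only an illustrative sketch: the names `SURVEYED_WAVES`, `TEACHER_WAVES`, and `build_dataset_index` are hypothetical, and the mapping assumes that dataset numbers follow wave order within each respondent group, which the description implies but does not state outright.

```python
# Illustrative index of the SAFE Children datasets (DS1 through DS27).
# ASSUMPTION: within each respondent group, DS numbers follow wave order.
# All names here are hypothetical and not part of the ICPSR distribution.

# Students and primary caregivers were not surveyed in waves 3 and 7.
SURVEYED_WAVES = [1, 2, 4, 5, 6, 8, 9, 10, 11]
# Teachers were surveyed in each of the first eight waves.
TEACHER_WAVES = list(range(1, 9))


def build_dataset_index():
    """Return {dataset_number: (respondent, wave)}; wave is None for DS1."""
    index = {1: ("overview", None)}  # DS1: study design and administration
    for offset, wave in enumerate(SURVEYED_WAVES):
        index[2 + offset] = ("student", wave)     # DS2 through DS10
        index[11 + offset] = ("caregiver", wave)  # DS11 through DS19
    for offset, wave in enumerate(TEACHER_WAVES):
        index[20 + offset] = ("teacher", wave)    # DS20 through DS27
    return index


DATASET_INDEX = build_dataset_index()
```

Under that ordering assumption, `DATASET_INDEX[10]` would be the wave 11 student file and `DATASET_INDEX[27]` the wave 8 teacher file; the mapping should be checked against the study's own documentation before use.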