The Future of Families and Child Wellbeing Study (FFCWS), Public Use, United States, 1998-2024 (ICPSR 31622)
The Future of Families and Child Wellbeing Study (FFCWS, formerly known as the Fragile Families and Child Wellbeing Study) follows a cohort of nearly 5,000 children born in large U.S. cities between 1998 and 2000. The study oversampled births to unmarried couples, and, when weighted, the data are representative of births in large U.S. cities at the turn of the century. The FFCWS was originally designed to address four questions of great interest to researchers and policy makers:
- What are the conditions and capabilities of unmarried parents, especially fathers?
- What is the nature of the relationships between unmarried parents?
- How do children born into these families fare?
- How do policies and environmental conditions affect families and children?
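Because births to unmarried couples were oversampled, unweighted summaries overstate their share, and the weights are what restore representativeness. The following is a minimal sketch of that idea in Python. The records, weight values, and names (`unmarried`, `weight`, `weighted_mean`) are invented for illustration and are not actual FFCWS variables; the real weight variables are described in the FFCWS documentation.

```python
from typing import Optional, Sequence


def weighted_mean(values: Sequence[Optional[float]],
                  weights: Sequence[Optional[float]]) -> float:
    """Weighted mean of values, skipping records with a missing value
    or a missing or non-positive weight."""
    numerator = 0.0
    denominator = 0.0
    for value, weight in zip(values, weights):
        if value is None or weight is None or weight <= 0:
            continue
        numerator += value * weight
        denominator += weight
    if denominator == 0:
        raise ValueError("no records with usable weights")
    return numerator / denominator


# Toy data (hypothetical): 1 = birth to unmarried parents, 0 = married.
# The oversampled unmarried births each carry a smaller weight.
unmarried = [1, 1, 1, 0]
weight = [10.0, 10.0, 10.0, 70.0]

unweighted_share = sum(unmarried) / len(unmarried)  # share within the sample
weighted_share = weighted_mean(unmarried, weight)   # estimate for the population

print(f"unweighted: {unweighted_share:.2f}, weighted: {weighted_share:.2f}")
```

In this toy sample, three of four records are unmarried births, but after weighting they account for 30 of 100 weighted births.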
The FFCWS consists of interviews with mothers, fathers, and/or primary caregivers at birth and again when children are ages 1, 3, 5, 9, 15, and 22. The parent interviews collected information on attitudes, relationships, parenting behavior, demographic characteristics, health (mental and physical), economic and employment status, neighborhood characteristics, and program participation. Beginning at age 9, children were interviewed directly (either during the home visit or on the telephone). The direct child interviews collected data on family relationships, home routines, schools, peers, and physical and mental health, as well as health behaviors.
A collaborative study of the FFCWS, the In-Home Longitudinal Study of Pre-School Aged Children (In-Home Study) collected data from a subset of the FFCWS Core respondents at the Year 3 and 5 follow-ups to ask how parental resources in the form of parental presence or absence, time, and money influence children under the age of 5. The In-Home Study collected information on a variety of domains of the child's environment, including the physical environment (quality of housing, nutrition and food security, health care, adequacy of clothing and supervision) and parenting (parental discipline, parental attachment, and cognitive stimulation). In addition, the In-Home Study collected information on several important child outcomes, including anthropometrics, child behaviors, and cognitive ability. This information was collected through interviews with the child's primary caregiver and direct observation of the child's home environment and the child's interactions with his or her caregiver.
Similar activities were conducted during the Year 9 follow-up. At the Year 15 follow-up, a condensed set of home visit activities was conducted with a subsample of approximately 1,000 teens. Teens who participated in the In-Home Study were also invited to participate in a Sleep Study and were asked to wear an accelerometer on their non-dominant wrist for seven consecutive days to track their sleep (Sleep Actigraphy Data) and to report on each day's behaviors and mood (Daily Sleep Actigraphy and Diary Survey Data).
An additional collaborative study collected data from the child care provider (Year 3) and teacher (Years 9 and 15) through mail-based surveys. Saliva samples were collected at Years 9 and 15 (Biomarker file and Polygenic Scores). The Study of Adolescent Neural Development (SAND) COVID Study began data collection in May 2020 following the onset of the COVID-19 pandemic. It included online surveys with the young adult and their primary caregiver.
The FFCWS began its seventh wave of data collection in October 2020, around the focal child's 22nd birthday. Data collection and interviews continued through January 2024. The Year 22 wave included a young adult (YA) survey with the original focal child and a primary caregiver (PCG) survey. Data were also collected on the children of the original focal child (referred to as Generation 3, or G3).
In 2017, the FFCWS team announced the Fragile Families (FF) Challenge, a collaborative effort in which participants were tasked with using machine learning methods and FFCWS data (Baseline to Year 9) to build a model that would predict six key outcomes at Year 15. Materials used in the FF Challenge have been archived in this collection.
Documentation for these files is available on the FFCWS website under Data and Documentation. For details of updates made to the FFCWS data files, please see the project's Data Alerts page.
Data collection for the Future of Families and Child Wellbeing Study was supported by the Eunice Kennedy Shriver National Institute of Child Health and Human Development (NICHD) of the National Institutes of Health under award numbers R01HD36916, R01HD39135, and R01HD40421, as well as a consortium of private foundations.
Below is the citation for use of the FFCWS data accessed through ICPSR. For information on additional citation requirements when using FFCWS in publications, please refer to this FAQ on the FFCWS project site.
National Survey of Adolescents, 2004: Uganda (ICPSR 22411)
Tsogolo La Thanzi (TLT): Baseline Wave, Malawi, 2009-2012 [Healthy Futures] (ICPSR 36863)
The Tsogolo La Thanzi (TLT): Baseline Wave collection contains data collected as part of the Tsogolo la Thanzi (TLT) Study. TLT is a longitudinal study in Balaka, Malawi designed to examine how young people navigate reproduction in an AIDS epidemic. Tsogolo la Thanzi means "Healthy Futures" in Chichewa, Malawi's most widely spoken language. New data is being collected to develop better understandings of the reproductive goals and behavior of young adults in Malawi -- the first cohort to never have experienced life without AIDS. To understand these patterns of family formation in a rapidly changing setting, TLT used the following approach: an intensive longitudinal design where respondents are interviewed every four months at TLT's centralized research center. Data collection began in May of 2009 and was completed in June of 2012. To assess changes on a longer time-horizon, a follow-up survey we refer to as Tsogolo la Thanzi 2 (TLT-2) was fielded between July and October of 2015.
The Women dataset (dataset 1) contains variables that pertain to pregnancy, family composition, partners and relationships, mental health, marriage, sex and protection, sexually transmitted diseases, goods purchases, and diet.
The Male Partners dataset (dataset 2) contains variables that pertain to relationships, religion, politics, family composition, mental health, sex and protection, pregnancy, marriage, sexually transmitted diseases, goods purchases, and diet.
The Random Men dataset (dataset 3) asked respondents about their mental health, partners and relationships, sexually transmitted diseases, sex and protection, family composition, goods purchases, and diet.
The Male Partners at Alternative Waves dataset (dataset 4) includes baseline data collected for male partners who began participating in the study between Wave 2 and Wave 8. If male partners entered the study at Wave 2 or later, their first interview was the baseline questionnaire (Wave 1), and at the next round of data collection they received the current wave's questionnaire. This dataset includes variables that pertain to relationships, religion, mental and physical health, family composition, sex and protection, fatherhood, marriage, sexually transmitted diseases, goods purchases, and diet.
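The questionnaire rule for late-entering male partners can be sketched as a small Python function. This is only an illustration of the rule as described above; the function name and its parameters are hypothetical and do not correspond to variables in the TLT files.

```python
def questionnaire_for(entry_wave: int, current_wave: int) -> int:
    """Return the wave number of the questionnaire a male partner receives.

    entry_wave:   the round of data collection at which he joined the study
    current_wave: the round of data collection being fielded
    """
    if current_wave < entry_wave:
        raise ValueError("partner had not yet entered the study")
    if current_wave == entry_wave:
        # First interview is always the baseline (Wave 1) questionnaire.
        return 1
    # From the next round onward he receives the current wave's questionnaire.
    return current_wave


# A partner who joined at Wave 4 answers the baseline questionnaire first,
# then the Wave 5 questionnaire at the following round.
print(questionnaire_for(entry_wave=4, current_wave=4))
print(questionnaire_for(entry_wave=4, current_wave=5))
```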
Demographic variables in each dataset include age, tribe, language, and education.
Tsogolo La Thanzi (TLT): Biomarker Data, Malawi, 2009-2012, 2015 [Healthy Futures] (ICPSR 37200)
The Tsogolo La Thanzi (TLT): Biomarker collection contains data collected as part of the Tsogolo la Thanzi (TLT) Study. TLT is a longitudinal study in Balaka, Malawi designed to examine how young people navigate reproduction in an AIDS epidemic. Tsogolo la Thanzi means "Healthy Futures" in Chichewa, Malawi's most widely spoken language. New data is being collected to develop better understandings of the reproductive goals and behavior of young adults in Malawi -- the first cohort to never have experienced life without AIDS. To understand these patterns of family formation in a rapidly changing setting, TLT used the following approach: an intensive longitudinal design where respondents are interviewed every four months at TLT's centralized research center. Data collection began in May of 2009 and was completed in June of 2012. To assess changes on a longer time-horizon, a follow-up survey referred to as Tsogolo la Thanzi 2 (TLT-2) was fielded between June and August of 2016.
The biomarker data collection contains the results of HIV testing and pregnancy testing. These data sets include respondents from all waves.
Tsogolo La Thanzi (TLT): Eighth Wave, Malawi, 2011 [Healthy Futures] (ICPSR 38005)
Tsogolo la Thanzi (TLT) is a longitudinal study in Balaka, Malawi designed to examine how young people navigate reproduction in an AIDS epidemic. Tsogolo la Thanzi means "Healthy Futures" in Chichewa, Malawi's most widely spoken language. New data is being collected to develop better understandings of the reproductive goals and behavior of young adults in Malawi -- the first cohort to never have experienced life without AIDS. To understand these patterns of family formation in a rapidly changing setting, TLT used the following approach: an intensive longitudinal design where respondents are interviewed every four months at TLT's centralized research center. Data collection began in May of 2009 and was completed in June of 2012. To assess changes on a longer time-horizon, a follow-up survey referred to as Tsogolo la Thanzi 2 (TLT-2) was fielded between June and August of 2016.
This study contains data collected from the eighth wave of the multi-wave study.
Each of waves 1-8 is comprised of three data files. The Women dataset (dataset 1) is a random sample of women aged 15-25 in 2009 (N=1,505 at wave 1) drawn from a census of the area. Likewise, the Random Men dataset (dataset 3) is a random sample of men aged 15-25 in 2009 (N=574 at wave 1) drawn from a census of the area. The Male Partners dataset (dataset 2) contains survey data from sexual and romantic partners who were referred into the study by respondents in the women's file; this is a non-random sample of male partners, so analysts should be especially cautious with inferences.
Topics covered across all waves include relationships, religion, HIV/AIDS, politics, family composition, mental health, sex and protection, pregnancy, marriage, sexually transmitted diseases, future expectations, school enrollment status, goods purchased/received, and diet.
Modules specific to wave 8 include: health services, travel, treatment optimism, and parent information.
Additional demographic variables in each dataset include age and education.
Tsogolo La Thanzi (TLT): Fifth Wave, Malawi, 2010 [Healthy Futures] (ICPSR 37832)
Tsogolo la Thanzi (TLT) is a longitudinal study in Balaka, Malawi designed to examine how young people navigate reproduction in an AIDS epidemic. Tsogolo la Thanzi means "Healthy Futures" in Chichewa, Malawi's most widely spoken language. New data is being collected to develop better understandings of the reproductive goals and behavior of young adults in Malawi -- the first cohort to never have experienced life without AIDS. To understand these patterns of family formation in a rapidly changing setting, TLT used the following approach: an intensive longitudinal design where respondents are interviewed every four months at TLT's centralized research center. Data collection began in May of 2009 and was completed in June of 2012. To assess changes on a longer time-horizon, a follow-up survey referred to as Tsogolo la Thanzi 2 (TLT-2) was fielded between June and August of 2016.
This study contains data collected from the fifth wave of the multi-wave study.
Each wave is comprised of three data files. The Women dataset (dataset 1) is a random sample of women aged 15-25 in 2009 (N=1,505 at wave 1) drawn from a census of the area. Likewise, the Random Men dataset (dataset 3) is a random sample of men aged 15-25 in 2009 (N=574 at wave 1) drawn from a census of the area. The Male Partners dataset (dataset 2) contains survey data from sexual and romantic partners who were referred into the study by respondents in the women's file; this is a non-random sample of male partners, so analysts should be especially cautious with inferences.
Topics covered across all waves include relationships, religion, HIV/AIDS, politics, family composition, mental health, sex and protection, pregnancy, marriage, sexually transmitted diseases, future expectations, school enrollment status, goods purchased/received, and diet.
Modules specific to wave 5 include: best friend characteristics, health services, relationship power, relationship scripts, treatment optimism and travel.
Additional demographic variables in each dataset include age and education.
Tsogolo La Thanzi (TLT): Fourth Wave, Malawi, 2010 [Healthy Futures] (ICPSR 37460)
Tsogolo la Thanzi (TLT) is a longitudinal study in Balaka, Malawi designed to examine how young people navigate reproduction in an AIDS epidemic. Tsogolo la Thanzi means "Healthy Futures" in Chichewa, Malawi's most widely spoken language. New data is being collected to develop better understandings of the reproductive goals and behavior of young adults in Malawi -- the first cohort to never have experienced life without AIDS. To understand these patterns of family formation in a rapidly changing setting, TLT used the following approach: an intensive longitudinal design where respondents are interviewed every four months at TLT's centralized research center. Data collection began in May of 2009 and was completed in June of 2012. To assess changes on a longer time-horizon, a follow-up survey referred to as Tsogolo la Thanzi 2 (TLT-2) was fielded between June and August of 2016.
This study contains data collected from the fourth wave of the multi-wave study.
Each wave is comprised of three data files. The Women dataset (dataset 1) is a random sample of women aged 15-25 in 2009 (N=1,505 at wave 1) drawn from a census of the area. Likewise, the Random Men dataset (dataset 3) is a random sample of men aged 15-25 in 2009 (N=574 at wave 1) drawn from a census of the area. The Male Partners dataset (dataset 2) contains survey data from sexual and romantic partners who were referred into the study by respondents in the women's file; this is a non-random sample of male partners, so analysts should be especially cautious with inferences.
Topics covered across all waves include relationships, religion, HIV/AIDS, politics, family composition, mental health, sex and protection, pregnancy, marriage, sexually transmitted diseases, future expectations, school enrollment status, goods purchased/received, and diet.
Additional demographic variables in each dataset include age and education.
Tsogolo La Thanzi (TLT): Ninth Wave, Malawi, 2012 [Healthy Futures] (ICPSR 38029)
Tsogolo la Thanzi (TLT) is a longitudinal study in Balaka, Malawi designed by Jenny Trinitapoli and Sara Yeatman to examine how young people navigate reproduction in an AIDS epidemic. Tsogolo la Thanzi means "Healthy Futures" in Chichewa, Malawi's most widely spoken language. The TLT research team has collected data to better understand the reproductive goals and behavior of young adults in Malawi. This is the first cohort to never have experienced life without AIDS. To understand these patterns of family formation in a rapidly changing setting, TLT used the following approach: an intensive longitudinal design where respondents are interviewed every fourth month at TLT's centralized research center. Data collection began in May of 2009 and was completed in December 2011 (waves 1-8).
In addition, a Refresher Sample (wave 9) was fielded in early 2012, both to address study attrition and to make it possible to assess the "treatment" effect of survey participation on respondents who participated in waves 1-8.
The Refresher Sample includes 315 women who were sampled but not enrolled at wave 1 (baseline), and thus only entered the study in 2012. Furthermore, to assess changes on a longer time-horizon, a follow-up survey referred to as TLT-2 was fielded between June and August of 2016, which includes all baseline and comparison sample women, plus all men ever interviewed for the study.
Each of waves 1-8 is comprised of three data files: women, random men, and male partners. However, wave 9 includes only a sample of women who did not enroll in baseline (N=315).
Topics covered across all waves include relationships, religion, HIV/AIDS, politics, family composition, mental health, sex and protection, pregnancy, marriage, sexually transmitted diseases, future expectations, school enrollment status, goods purchased/received, and diet. Of the occasional modules, those included at wave 9 [Refresher Sample] are: background, residency and migration, travel, and parent information. Otherwise, the comparison sample more closely resembles the baseline wave than other rounds of data collection.
Tsogolo La Thanzi (TLT): Second Wave, Malawi, 2009 [Healthy Futures] (ICPSR 37146)
Tsogolo la Thanzi (TLT) is a longitudinal study in Balaka, Malawi designed to examine how young people navigate reproduction in an AIDS epidemic. Tsogolo la Thanzi means "Healthy Futures" in Chichewa, Malawi's most widely spoken language. This data was collected to develop better understandings of the reproductive goals and behavior of young adults in Malawi -- the first cohort to never have experienced life without AIDS. To understand these patterns of family formation in a rapidly changing setting, TLT used the following approach: an intensive longitudinal design where respondents are interviewed every four months at TLT's centralized research center. Data collection began in May of 2009 and was completed in June of 2012. To assess changes on a longer time horizon, a follow-up survey we refer to as Tsogolo la Thanzi 2 (TLT-2) was fielded between June and August of 2016.
This study contains data collected from the second wave of the multi-wave study.
Each wave is comprised of three data files. The Women dataset (dataset 1) is a random sample of women aged 15-25 in 2009 (N=1,505 at wave 1) drawn from a census of the area. Likewise, the Random Men dataset (dataset 3) is a random sample of men aged 15-25 in 2009 (N=574 at wave 1) drawn from a census of the area. The Male Partners dataset (dataset 2) contains survey data from sexual and romantic partners who were referred into the study by respondents in the women's file; this is a non-random sample of male partners, so analysts should be especially cautious with inferences.
Topics covered across all waves include relationships, religion, HIV/AIDS, politics, family composition, mental health, sex and protection, pregnancy, marriage, sexually transmitted diseases, future expectations, school enrollment status, goods purchased/received, and diet.
Modules specific to wave 2 include: two-year future expectations. Additionally, the child roster, household roster, and travel for interview sections begin at wave 2.
Additional demographic variables in each dataset include age and education.
Tsogolo La Thanzi (TLT): Sixth Wave, Malawi, 2011 [Healthy Futures] (ICPSR 37828)
Tsogolo la Thanzi (TLT) is a longitudinal study in Balaka, Malawi designed to examine how young people navigate reproduction in an AIDS epidemic. Tsogolo la Thanzi means "Healthy Futures" in Chichewa, Malawi's most widely spoken language. New data is being collected to develop better understandings of the reproductive goals and behavior of young adults in Malawi -- the first cohort to never have experienced life without AIDS. To understand these patterns of family formation in a rapidly changing setting, TLT used the following approach: an intensive longitudinal design where respondents are interviewed every four months at TLT's centralized research center. Data collection began in May of 2009 and was completed in June of 2012. To assess changes on a longer time-horizon, a follow-up survey referred to as Tsogolo la Thanzi 2 (TLT-2) was fielded between June and August of 2016.
This study contains data collected from the sixth wave of the multi-wave study.
Each wave is comprised of three data files. The Women dataset (dataset 1) is a random sample of women aged 15-25 in 2009 (N=1,505 at wave 1) drawn from a census of the area. Likewise, the Random Men dataset (dataset 3) is a random sample of men aged 15-25 in 2009 (N=574 at wave 1) drawn from a census of the area. The Male Partners dataset (dataset 2) contains survey data from sexual and romantic partners who were referred into the study by respondents in the women's file; this is a non-random sample of male partners, so analysts should be especially cautious with inferences.
Topics covered across all waves include relationships, religion, HIV/AIDS, politics, family composition, mental health, sex and protection, pregnancy, marriage, sexually transmitted diseases, future expectations, school enrollment status, goods purchased/received, and diet.
Modules specific to wave 6 include: best friend characteristics, treatment optimism, travel, and health services.
Additional demographic variables in each dataset include age and education.
Tsogolo La Thanzi (TLT): Third Wave, Malawi, 2010 [Healthy Futures] (ICPSR 37204)
Tsogolo la Thanzi (TLT) is a longitudinal study in Balaka, Malawi designed to examine how young people navigate reproduction in an AIDS epidemic. Tsogolo la Thanzi means "Healthy Futures" in Chichewa, Malawi's most widely spoken language. New data is being collected to develop better understandings of the reproductive goals and behavior of young adults in Malawi -- the first cohort to never have experienced life without AIDS. To understand these patterns of family formation in a rapidly changing setting, TLT used the following approach: an intensive longitudinal design where respondents are interviewed every four months at TLT's centralized research center. Data collection began in May of 2009 and was completed in June of 2012. To assess changes on a longer time-horizon, a follow-up survey referred to as Tsogolo la Thanzi 2 (TLT-2) was fielded between June and August of 2016.
This study contains data collected from the third wave of the multi-wave study.
Each wave is comprised of three data files. The Women dataset (dataset 1) is a random sample of women aged 15-25 in 2009 (N=1,505 at wave 1) drawn from a census of the area. Likewise, the Random Men dataset (dataset 3) is a random sample of men aged 15-25 in 2009 (N=574 at wave 1) drawn from a census of the area. The Male Partners dataset (dataset 2) contains survey data from sexual and romantic partners who were referred into the study by respondents in the women's file; this is a non-random sample of male partners, so analysts should be especially cautious with inferences.
Topics covered across all waves include relationships, religion, HIV/AIDS, politics, family composition, mental health, sex and protection, pregnancy, marriage, sexually transmitted diseases, future expectations, school enrollment status, goods purchased/received, and diet.
Modules specific to wave 3 include: relationship power.
Additional demographic variables in each dataset include age and education.