PASS Survey Data Linked to Administrative Data of the IAB, 1975-2020 (ICPSR 37160)
Latino National Survey (LNS) Focus Group Data, 2006 (ICPSR 29601)
National Medical Expenditure Survey, 1987: Household Survey, Dental Visit Data [Public Use Tape 14.3] (ICPSR 9814)
Annual Survey of Jails: Individual Reporting-Level Data, 2007 (ICPSR 24641)
Los Angeles Family and Neighborhood Survey (L.A.FANS), Restricted Neighborhood Observations Data, 2000-2001 (ICPSR 37272)
This study is a restricted data file of data from the L.A.FANS Neighborhood Observation Study, an in-person observational by trained L.A.FANS interviewers of the census blocks on which L.A.FANS respondents lived during L.A.FANS Wave 1. Interviewers were trained to walk each block face and record social and physical observations on precoded check sheets. Each block face was observed by several different interviewers working independently at different times of the day and week. These data are designed to be used with L.A.FANS Wave-1 survey interview data restricted versions 2.5 or 3 to provide data on the census block and census tract in which individual respondents lived.
Users who apply for these restricted data must also be approved for using restricted version 2.5 or 3. Please note that L.A. FANS restricted data may only be accessed within the ICPSR Virtual Data Enclave (VDE) and must be merged with the L.A. FANS public data prior to beginning any analysis.
The study is described in detail in the L.A.FANS Neighborhood Observations Codebook.
Further information is available in:
Jones, M., Pebley, A. R., and Sastry, N. (2011). Eyes on the block: Measuring urban physical disorder through in-person observation. Social Science Research, 40(2), 523-537.
Los Angeles Family and Neighborhood Survey (L.A.FANS), Wave 2, Public Data, 2006-2008 (ICPSR 37278)
This study includes public user data files of two waves of interviews with L.A.FANS respondents. There often are multiple respondents in L.A.FANS households and Wave 2 includes both panel respondents and a new sample. Users' Guides which explain the design and how to use the sample are available for Wave 1 and Wave 2 at the RAND website.
The Los Angeles Family and Neighborhood Survey (L.A.FANS) is a two-wave study of adults and children in Los Angeles County and of the neighborhoods in which they live. The first wave (L.A.FANS-1), which was fielded between April 2000 and January 2002, interviewed adults and children living in 3,085 households in a stratified probability sample of 65 neighborhoods throughout Los Angeles County. The samples of neighborhoods and individuals were representative of neighborhoods and residents of Los Angeles County. Poorer neighborhoods and households with children were oversampled. In Wave 2 of L.A.FANS (L.A.FANS-2), Wave 1 respondents living in Los Angeles County were reinterviewed and updated information was collected on Wave 1 respondents who had moved away from Los Angeles County. A sample of individuals who moved into each sampled neighborhood between Waves 1 and 2 was also interviewed, for a total of 2,319 adults and 1,382 children (ages less than 18 years). Additional information on the project is available at the RAND website.
Additional information on the project, survey design, sample, and variables are available from:
- Sastry, Narayan, Bonnie Ghosh-Dastidar, John Adams, and Anne R. Pebley (2006). The Design of a Multilevel Survey of Children, Families, and Communities: The Los Angeles Family and Neighborhood Survey, Social Science Research, Volume 35, Number 4, Pages 1000-1024
- The Users' Guides (Wave 1 and Wave 2)
- RAND Documentation Reports page
Los Angeles Family and Neighborhood Survey (L.A.FANS), Wave 1, Public Data, 2000-2001 (ICPSR 37279)
This study includes public user data files of two waves of interviews with L.A.FANS respondents. There often are multiple respondents in L.A.FANS households and Wave 2 includes both panel respondents and a new sample. Users' Guides which explain the design and how to use the sample are available for Wave 1 and Wave 2 at the RAND website.
The Los Angeles Family and Neighborhood Survey (L.A.FANS) is a two-wave study of adults and children in Los Angeles County and of the neighborhoods in which they live. The first wave (L.A.FANS-1), which was fielded between April 2000 and January 2002, interviewed adults and children living in 3,085 households in a stratified probability sample of 65 neighborhoods throughout Los Angeles County. The samples of neighborhoods and individuals were representative of neighborhoods and residents of Los Angeles County. Poorer neighborhoods and households with children were oversampled. In Wave 2 of L.A.FANS (L.A.FANS-2), Wave 1 respondents living in Los Angeles County were reinterviewed and updated information was collected on Wave 1 respondents who had moved away from Los Angeles County. A sample of individuals who moved into each sampled neighborhood between Waves 1 and 2 was also interviewed, for a total of 2,319 adults and 1,382 children (ages less than 18 years). Additional information on the project is available at the RAND website.
Additional information on the project, survey design, sample, and variables are available from:
- Sastry, Narayan, Bonnie Ghosh-Dastidar, John Adams, and Anne R. Pebley (2006). The Design of a Multilevel Survey of Children, Families, and Communities: The Los Angeles Family and Neighborhood Survey, Social Science Research, Volume 35, Number 4, Pages 1000-1024
- The Users' Guides (Wave 1 and Wave 2)
- RAND Documentation Reports page
Los Angeles Family and Neighborhood Survey (L.A.FANS), Wave 2, Biomarker Data, 2006-2008 (ICPSR 38683)
The Los Angeles Family and Neighborhood Survey (L.A.FANS) is a two-wave study of adults and children in Los Angeles County and of the neighborhoods in which they live. The first wave (L.A.FANS-1), which was fielded between April 2000 and January 2002, interviewed adults and children living in 3,085 households in a stratified probability sample of 65 neighborhoods throughout Los Angeles County. The samples of neighborhoods and individuals were representative of neighborhoods and residents of Los Angeles County. Poorer neighborhoods and households with children were oversampled. In Wave 2 of L.A.FANS (L.A.FANS-2), Wave 1 respondents living in Los Angeles County were reinterviewed and updated information was collected on Wave 1 respondents who had moved away from Los Angeles County. A sample of individuals who moved into each sampled neighborhood between Waves 1 and 2 was also interviewed, for a total of 2,319 adults and 1,382 children (ages less than 18 years).
A new feature in L.A.FANS-2 was the collection of physiological health measures. Interviewers collected blood pressure, pulse, height, weight, hip and waist circumference, and a pulmonary function assessment (spirometry) from respondents. Saliva samples were collected from child respondents. 50% of L.A.FANS households were selected at random and respondents were asked to provide dried blood spot (DBS) samples. The initial release of L.A.FANS-2 data included only the blood pressure and anthropometry data. This collection contains the spirometry, saliva, and DBS data. The saliva and DBS datasets each have one record per respondent who completed the health module, while the spirometry dataset has one record for each trial.
Data in this study are designed for use with the public use data files for L.A.FANS, Wave 1 (study 1) and Wave 2 (study 2). Additional information on the project is available on the L.A.FANS series page and the RAND website.
Additional information on the project, survey design, sample, and variables are available from:
- Sastry, Narayan, Bonnie Ghosh-Dastidar, John Adams, and Anne R. Pebley (2006). The Design of a Multilevel Survey of Children, Families, and Communities: The Los Angeles Family and Neighborhood Survey, Social Science Research, Volume 35, Number 4, Pages 1000-1024
- The Users' Guides (Wave 1 and Wave 2)
- RAND Documentation Reports page
Chitwan Valley Family Study: Labour Outmigration, Agricultural Productivity and Food Security, Nepal, 2015-2017 (ICPSR 36755)
The Chitwan Valley [Nepal] Family Study: Labor Outmigration, Agricultural Productivity and Food Security is a three year project with the aim to investigate the consequences of labor outmigration on agricultural productivity in a poor agricultural country persistently facing food security problems. A Data Guide for this study is available as a web page and for download.
This project's data collection is made up of twenty-five datasets:
Datasets 1-6: The Household Agriculture and Migration Survey includes information on household agricultural practices and remittances received by the household. Face-to-face interviews were conducted to collect data from household members who previously participated in the Chitwan Valley Family Study (ICPSR 4538).
Topics of the survey include crop production and farm technology use, wealth, assets, income, consumption, food security and information about each household member currently away from home. The survey also collected information on gender, ethnicity, and age.
Datasets 7-16: Measured yields of major crops grown by farm households that previously participated in the Chitwan Valley Family Study (ICPSR 4538).
Dataset 17: A monthly demographic event registry administered to all households that previously participated in the Chitwan Valley Family Study (ICPSR 4538).
Datasets 18-23: The Women's Time Use Survey was designed and administered to married, Nepalese women to collect information on changes in their time and involvement in agriculture and other activities. Face-to-face interviews and telephone interviews were conducted to collect data from women who previously participated in the Chitwan Valley Family Study (ICPSR 4538).
The collection covered a range of topics including farm work, hygiene, finances, health, and religion. Further, respondents were queried concerning socialization and assisting children and the elderly.
Datasets 24-25: The Women's Time Use Survey was designed and administered to married, Nepalese women to collect information on changes in their time and involvement in agriculture and other activities. Face-to-face interviews and telephone interviews were conducted to collect data from women who previously participated in the Chitwan Valley Family Study (ICPSR 4538).
First Baby Study (FBS), Pennsylvania, 2009-2014 (ICPSR 38778)
The First Baby Study (FBS) was a prospective cohort study designed to investigate the association between mode of delivery at first childbirth (cesarean or vaginal) and subsequent fecundity and fertility over the course of a 3-year follow-up period. Women were enrolled during pregnancy and interviewed by telephone in their third trimester. Enrolled participants were followed-up with and surveyed at 1, 6, 12, 18, 24, 30 and 36 months postpartum. Participants were enrolled in 2009 to 2011 and the last interview was conducted in 2014.
Active for Life: Translation of Physical Activity Programs for Mid-Life and Older Adults, 2003-2007 [United States] (ICPSR 24723)
Sponsored by the Robert Wood Johnson Foundation, the Active for Life (AFL) initiative investigated how two physical activity programs for adults aged 50 and older, Active Choices (AC) and Active Living Every Day (ALED), worked in community settings. Created by researchers at Stanford University, Active Choices used lifestyle counseling and personalized telephone support to encourage older adults to be physically active. In AFL, this was a 6-month program delivered through one face-to-face meeting followed by up to eight one-on-one telephone counseling calls. Active Living Every Day, which was created by the Cooper Institute and Human Kinetics Inc., also provided lifestyle counseling to promote physical activity, but in a classroom and workbook format. During the first three years of the four-year AFL initiative, ALED was delivered as a 20-week program where participants attended weekly small group meetings, but in the last year it was shortened to 12 weekly meetings. Nine organizations received AFL grants to implement the programs during 2003-2006. Four grantees implemented the one-on-one AC model, while five implemented the group-based ALED model.
Data were collected from the AC and ALED sites for both a process and outcomes evaluation. The primary aims of the process evaluation were to (1) monitor the extent to which the grantees demonstrated fidelity to the AC and ALED models in their program implementation, (2) assess staff experiences implementing the programs, and (3) assess participants' impressions of the programs. A quasi-experimental, pre-post study design was used to assess outcomes. Primary aims of the outcomes evaluation were to evaluate the impact of AC and ALED on self-reported physical activity, and to evaluate the impact of the programs on self-reported stress, depressive symptoms, and satisfaction with body function and appearance. Secondary aims of the outcome evaluation were to (1) evaluate the impact of the programs on measures of functional fitness, (2) examine whether changes in self-reported physical activity and functional fitness were moderated by participant characteristics, including age, gender, race, baseline physical activity self-efficacy, and baseline physical activity social support, and (3) examine whether changes in self-reported physical activity were consistent with a mediation model for physical activity self-efficacy and physical activity social support.
The collection has 14 data files (datasets). Datasets 1-7 constitute the process evaluation data, and Datasets 8-14 the outcomes evaluation data:
Dataset 1 (AC Initial Face-to-Face Sessions Data) contains information about the initial face-to-face AC session: the format, date, and length of the session, whether the 8 steps required in the face-to-face session were completed, what was discussed between the health educator and the participant related to physical activity plans, interests, benefits, and barriers, and the health educator's progress notes. The file contains one record for each AC participant.
Dataset 2 (AC Completed Calls Data) comprises information about the completed AC calls, but does not cover the topics discussed on the calls. Recorded information about each call includes the date and length of the call, the health educator's progress notes, and whether the participant was assessed for injury, light activity, moderate activity, exercise goals, or exercise intentions. Each call is represented by a separate record in the data file and, typically, there are multiple records per participant.
Dataset 3 (AC Topics Discussed on Completed Calls ) contains information about the topics discussed on each completed AC call, e.g., exercise barriers/benefits, previous exercise experiences, goal setting, long term goals, injury prevention, rewards/reinforcement, social support, progress tracking, and relapse prevention. Each record in the file represents one topic and there are often multiple records per call for each participant.
Dataset 4 (AC Aggregate Call Data) aggregates the call data across calls for each AC participant. For example, for a given participant, this dataset shows the total number of calls completed, the number of calls where injury/health problems were assessed, etc. The file contains one record per participant.
Dataset 5 (ALED Sessions Data) contains information about each class session for every ALED group, including the session date, start time, and end time, learning activities covered in the session, participant evaluations of the session and the facilitator, facilitator progress notes, the number of participants who were in various stages of readiness for moderate exercise, and the number of participants who tracked physical activity and thoughts about physical activity. This file has one record for each session of every ALED group.
Dataset 6 (ALED Attendance and Tracking Data (Years 2-4)) consists of participant-level attendance and tracking data for every ALED session during the second to fourth years of the evaluation, including the participant's attendance at the session, whether the participant's stage of readiness was assessed, and whether the participant tracked thoughts about physical activity or actual physical activity. There is no participant-level ALED data for the first year. Each participant has a separate record for each session. Thus, the file contains 20 records per participant in the years 2-3, and 12 records per participant in year 4.
Dataset 7 (ALED Aggregate Attendance and Tracking Data (Years 2-4)) contains ALED attendance and tracking data for each participant in years 2-4, aggregated across the sessions. The data file has one record for each participant.
Dataset 8 (Demographics) comprises program information (e.g., program status, start date, end date, site, etc.), demographic information (e.g., age, gender, race, Hispanic origin, employment status, income, and the participant's state and ZIP code of residence), and responses to the Physical Activity Readiness Questionnaire (PAR-Q), a screening tool that was used to assess possible risks of exercising based on answers to specific health history questions. The file contains one record for each AFL participant, except for those with a status of "nonstarter" or "repeater."
Datasets 9 (Pretest Survey Data) and 10 (Posttest Survey Data) contain data from the Pretest and Posttest Surveys. The Pretest Survey was administered at the beginning of the AC and ALED programs, while the Posttest Survey was administered at their end. Topics covered by the surveys include social and recreational activities, activities undertaken for exercise, perceived stress, depressive symptoms, satisfaction with body appearance and function, social support for physical activity, self-efficacy for physical activity, neighborhood environment, health conditions, health-related quality of life, caregiving, and self-reported height and weight. Both surveys included items from the Community Health Activities Model Program for Seniors Physical Activity Questionnaire (CHAMPS), the Center for Epidemiological Studies Depression Questionnaire (CES-D), the Behavioral Risk Factor Surveillance System Questionnaires (BRFSS), and the International Physical Activity Prevalence Study Environmental Module. These data files each have one record for each participant who submitted a questionnaire.
Dataset 11 (ALED Week 12 Survey Data (Year 4)) contains responses to the ALED Week 12 Posttest Survey, which was used to evaluate the 12-week adaptation of ALED in Year 4. (In Year 4, ALED participants completed both a 12- and 20-week posttest survey). There is one record for each participant who returned this survey.
Dataset 12 (Six-Month Posttest Follow-Up Survey Data (Years 3-4)) comprises data from a special 6-month follow-up survey which was administered in years 3-4 in six of the ALED sites and one of the AC sites. Participants were questioned about their current physical activities, weight, health-related quality of life, satisfaction with bodily function, and other topics. As with Datasets 9-11, the data file contains one record for each participant who returned a questionnaire.
Dataset 13 (Functional Fitness Tests Data) contains the results of pretest and posttest functional fitness tests which were administered by one ALED grantee. Four tests were adminstered: (1) the 30-Foot Walk Test, (2) the 30-Second Chair Stand, (3) 8-Foot Up and Go, and (4) the Chair Sit and Reach Test. This participant-level data file also includes pretest height measurements plus pretest and posttest weight measurements.
Dataset 14 (Participants' Impressions of the Programs (Years 1, 3, and 4)) contains data collected by the last sections of the Posttest Survey, ALED Week 12 survey, and 6-Month Follow-up Survey. The topics it covers include the participants' impressions of the programs, participation in physical activities, and changes (compared to before they started the AFL program) in motivation to be physically active, actual level of physical activity, medical and health conditions, overall pain, flexibility/limberness, level of stress, happiness, and enjoyment of life. The file has a separate record for each survey completed by the participants. Thus, there are 1-3 records per participant.
Los Angeles Family and Neighborhood Survey (L.A.FANS), Wave 2, Restricted Data Version 2, 2006-2008 (ICPSR 37265)
This study includes a restricted data file, version 2, for Wave 2 of the L.A.FANS data. To compare L.A.FANS restricted data, version 2 with other restricted data versions, see the table on the series page for the L.A.FANS data here. Data in this study are designed for use with the public use data files for L.A.FANS, Wave 2 (study 2). This file adds only a few variables to the L.A.FANS, Wave 2 public use files. Specifically, it adds the census tract number for the tract each respondent lives in. It also includes certain variables, thought to be sensitive, which are not available in the public use data. These variables are identified in the L.A.FANS Wave 2 Users Guide and Codebook. Finally, some distance variables and individual characteristics which are treated in the public use data to make it harder to identify individuals are provided in an untreated form in the Version 2 restricted data file. Please note that L.A. FANS restricted data may only be accessed within the ICPSR Virtual Data Enclave (VDE) and must be merged with the L.A. FANS public data prior to beginning any analysis.
A Users' Guide which explains the design and how to use the samples are available for Wave 2 at the RAND website.
Additional information on the project, survey design, sample, and variables are available from:
- Sastry, Narayan, Bonnie Ghosh-Dastidar, John Adams, and Anne R. Pebley (2006). The Design of a Multilevel Survey of Children, Families, and Communities: The Los Angeles Family and Neighborhood Survey, Social Science Research, Volume 35, Number 4, Pages 1000-1024
- The Users' Guides (Wave 1 and Wave 2)
- RAND Documentation Reports page
Los Angeles Family and Neighborhood Survey (L.A.FANS), Wave 2, Restricted Data Version 2.5, 2006-2008 (ICPSR 37266)
This study includes a restricted data file, version 2.5, for Wave 2 of the L.A.FANS data. To compare L.A.FANS restricted data, version 2.5 with other restricted data versions, see the table on the series page for the L.A.FANS data here. Data in this study are designed for use with the public use data files for L.A.FANS, Wave 2 (study 2). This file adds only a few variables to the L.A.FANS, Wave 2 public use files. Specifically, it adds the census tract and block number for the tract each respondent lives in. It also includes certain variables, thought to be sensitive, which are not available in the public use data. These variables are identified in the L.A.FANS Wave 2 Users Guide and Codebook. Finally, some distance variables and individual characteristics which are treated in the public use data to make it harder to identify individuals are provided in an untreated form in the Version 2.5 restricted data file. Please note that L.A. FANS restricted data may only be accessed within the ICPSR Virtual Data Enclave (VDE) and must be merged with the L.A. FANS public data prior to beginning any analysis.
A Users' Guide which explains the design and how to use the samples are available for Wave 2 at the RAND website.
Additional information on the project, survey design, sample, and variables are available from:
- Sastry, Narayan, Bonnie Ghosh-Dastidar, John Adams, and Anne R. Pebley (2006). The Design of a Multilevel Survey of Children, Families, and Communities: The Los Angeles Family and Neighborhood Survey, Social Science Research, Volume 35, Number 4, Pages 1000-1024
- The Users' Guides (Wave 1 and Wave 2)
- RAND Documentation Reports page
Los Angeles Family and Neighborhood Survey (L.A.FANS), Wave 1, Restricted Data Version 2, 2000-2001 (ICPSR 37269)
This study includes restricted data file, version 2, for Wave 1 of the L.A.FANS data. To compare L.A.FANS restricted data, version 2 with other restricted data versions, see the table on the series page for the L.A.FANS data here. Data in this study are designed for use with the public use data files for L.A.FANS, Wave 1 (study 1). This file adds only a few variables to the L.A.FANS, Wave 1 public use files. Specifically, it adds the census tract number for the tract each respondent lives in. It also includes certain variables, thought to be sensitive, which are not available in the public use data. These variables are identified in the L.A.FANS Wave 1 Users Guide and Codebook. Finally, some distance variables and individual characteristics which are treated in the public use data to make it harder to identify individuals are provided in an untreated form in the Version 2 restricted data file. Please note that L.A. FANS restricted data may only be accessed within the ICPSR Virtual Data Enclave (VDE) and must be merged with the L.A. FANS public data prior to beginning any analysis.
A Users' Guide which explains the design and how to use the samples are available for Wave 1 at the RAND website.
Additional information on the project, survey design, sample, and variables are available from:
- Sastry, Narayan, Bonnie Ghosh-Dastidar, John Adams, and Anne R. Pebley (2006). The Design of a Multilevel Survey of Children, Families, and Communities: The Los Angeles Family and Neighborhood Survey, Social Science Research, Volume 35, Number 4, Pages 1000-1024
- The Users' Guides (Wave 1 and Wave 2)
- RAND Documentation Reports page
Los Angeles Family and Neighborhood Survey (L.A.FANS), Wave 1, Restricted Data Version 2.5, 2000-2001 (ICPSR 37270)
This study includes restricted data version 2.5, for Wave 1 of the L.A.FANS data. To compare L.A.FANS restricted data, version 2.5 with other restricted data versions, see the table on the series page for the L.A.FANS data here. Data in this study are designed for use with the public use data files for L.A.FANS, Wave 1 (study 1). This file adds only a few variables to the L.A.FANS, Wave 1 public use files. Specifically, it adds the census tract and block number for the tract each respondent lives in. It also includes certain variables, thought to be sensitive, which are not available in the public use data. These variables are identified in the L.A.FANS Wave 1 Users Guide and Codebook. Finally, some distance variables and individual characteristics which are treated in the public use data to make it harder to identify individuals are provided in an untreated form in the Version 2.5 restricted data file. Please note that L.A. FANS restricted data may only be accessed within the ICPSR Virtual Data Enclave (VDE) and must be merged with the L.A. FANS public data prior to beginning any analysis.
A Users' Guide which explains the design and how to use the samples are available for Wave 1 at the RAND website.
Additional information on the project, survey design, sample, and variables are available from:
- Sastry, Narayan, Bonnie Ghosh-Dastidar, John Adams, and Anne R. Pebley (2006). The Design of a Multilevel Survey of Children, Families, and Communities: The Los Angeles Family and Neighborhood Survey, Social Science Research, Volume 35, Number 4, Pages 1000-1024
- The Users' Guides (Wave 1 and Wave 2)
- RAND Documentation Reports page
Los Angeles Family and Neighborhood Survey (L.A.FANS), Wave 2, Restricted Data Version 3, 2006-2008 (ICPSR 37267)
This study includes a restricted data file, version 3, for Wave 2 of the L.A.FANS data. To compare L.A.FANS restricted data, version 3 with other restricted data versions, see the table on the series page for the L.A.FANS data here. Data in this study are designed for use with the public use data files for L.A.FANS, Wave 2 (study 2). This file adds only a few variables to the L.A.FANS, Wave 2 public use files. Specifically, it adds the census tract and block number for the tract each respondent lives in and geographic coordinates data for a number of locations reported by the respondent (including home, grocery store, place of work, place of worship, schools, etc.). It also includes certain variables, thought to be sensitive, which are not available in the public use data. These variables are identified in the L.A.FANS Wave 2 Users Guide and Codebook. Finally, some distance variables and individual characteristics which are treated in the public use data to make it harder to identify individuals are provided in an untreated form in the Version 3 restricted data file. Please note that L.A. FANS restricted data may only be accessed within the ICPSR Virtual Data Enclave (VDE) and must be merged with the L.A. FANS public data prior to beginning any analysis.
A Users' Guide which explains the design and how to use the samples are available for Wave 2 at the RAND website.
Additional information on the project, survey design, sample, and variables are available from:
- Sastry, Narayan, Bonnie Ghosh-Dastidar, John Adams, and Anne R. Pebley (2006). The Design of a Multilevel Survey of Children, Families, and Communities: The Los Angeles Family and Neighborhood Survey, Social Science Research, Volume 35, Number 4, Pages 1000-1024
- The Users' Guides (Wave 1 and Wave 2)
- RAND Documentation Reports page
Los Angeles Family and Neighborhood Survey (L.A.FANS), Wave 1, Restricted Data Version 3, 2000-2001 (ICPSR 37271)
This study includes restricted data version 3, for Wave 1 of the L.A.FANS data. To compare L.A.FANS restricted data, version 3 with other restricted data versions, see the table on the series page for the L.A.FANS data here. Data in this study are designed for use with the public use data files for L.A.FANS, Wave 1 (study 1). This file adds only a few variables to the L.A.FANS, Wave 1 public use files. Specifically, it adds the census tract and block number for the tract each respondent lives in and geographic coordinates data for a number of locations reported by the respondent (including home, grocery store, place of work, place of worship, schools, etc.). It also includes certain variables, thought to be sensitive, which are not available in the public use data. These variables are identified in the L.A.FANS Wave 1 Users Guide and Codebook. Finally, some distance variables and individual characteristics which are treated in the public use data to make it harder to identify individuals are provided in an untreated form in the Version 3 restricted data file. Please note that L.A. FANS restricted data may only be accessed within the ICPSR Virtual Data Enclave (VDE) and must be merged with the L.A. FANS public data prior to beginning any analysis.
A Users' Guide which explains the design and how to use the samples are available for Wave 1 at the RAND website.
Additional information on the project, survey design, sample, and variables are available from:
- Sastry, Narayan, Bonnie Ghosh-Dastidar, John Adams, and Anne R. Pebley (2006). The Design of a Multilevel Survey of Children, Families, and Communities: The Los Angeles Family and Neighborhood Survey, Social Science Research, Volume 35, Number 4, Pages 1000-1024
- The Users' Guides (Wave 1 and Wave 2)
- RAND Documentation Reports page
Los Angeles Family and Neighborhood Survey (L.A.FANS), Wave 1, Restricted Data Version 1, 2000-2001 (ICPSR 37242)
This study includes a restricted data file for Wave 1 of the L.A.FANS data. To compare L.A.FANS restricted data, version 1 with other restricted data versions, see the table on the series page for the L.A.FANS data here. Data in this study are designed for use with the public use data files for L.A.FANS, Wave 1 (study 1). This file adds only a few variables to the L.A.FANS, Wave 1 public use files. Specifically, it adds a "pseudo-tract ID" which is a number from 1 to 65, randomly assigned to each census tract (neighborhood) in the study. It is not possible to link pseudo-tract IDs in any way to real tract IDs or other neighborhood characteristics. However, pseudo-tract IDs permit users to conduct analyses which take into account the clustered sample design in which neighborhoods (tracts) were selected first and then individuals were sampled within neighborhoods. Pseudo-tract IDs do so because they identify which respondents live in the same neighborhood. It also includes certain variables, thought to be sensitive, which are not available in the public use data. These variables are identified in the L.A.FANS Wave 1 Users Guide and Codebook. Finally, some distance variables and individual characteristics which are treated in the public use data to make it harder to identify individuals are provided in an untreated form in the Version 1 restricted data file. Please note that L.A. FANS restricted data may only be accessed within the ICPSR Virtual Data Enclave (VDE) and must be merged with the L.A. FANS public data prior to beginning any analysis.
A Users' Guide which explains the design and how to use the samples are available for Wave 1 at the RAND website.
Additional information on the project, survey design, sample, and variables are available from:
- Sastry, Narayan, Bonnie Ghosh-Dastidar, John Adams, and Anne R. Pebley (2006). The Design of a Multilevel Survey of Children, Families, and Communities: The Los Angeles Family and Neighborhood Survey, Social Science Research, Volume 35, Number 4, Pages 1000-1024
- The Users' Guides (Wave 1 and Wave 2)
- RAND Documentation Reports page
Los Angeles Family and Neighborhood Survey (L.A.FANS), Wave 2, Restricted Data Version 1, 2006-2008 (ICPSR 37259)
This study includes a restricted data file for Wave 2 of the L.A.FANS data. To compare L.A.FANS restricted data, version 1 with other restricted data versions, see the table on the series page for the L.A.FANS data here. Data in this study are designed for use with the public use data files for L.A.FANS, Wave 2 (study 2). This file adds only a few variables to the L.A.FANS, Wave 2 public use files. Specifically, it adds a "pseudo-tract ID" which is a number from 1 to 65, randomly assigned to each census tract (neighborhood) in the study. It is not possible to link pseudo-tract IDs in any way to real tract IDs or other neighborhood characteristics. However, pseudo-tract IDs permit users to conduct analyses which take into account the clustered sample design in which neighborhoods (tracts) were selected first and then individuals were sampled within neighborhoods. Pseudo-tract IDs do so because they identify which respondents live in the same neighborhood. It also includes certain variables, thought to be sensitive, which are not available in the public use data. These variables are identified in the L.A.FANS Wave 2 Users Guide and Codebook. Finally, some distance variables and individual characteristics which are treated in the public use data to make it harder to identify individuals are provided in an untreated form in the Version 1 restricted data file. Please note that L.A. FANS restricted data may only be accessed within the ICPSR Virtual Data Enclave (VDE) and must be merged with the L.A. FANS public data prior to beginning any analysis.
A Users' Guide which explains the design and how to use the samples are available for Wave 2 at the RAND website.
Additional information on the project, survey design, sample, and variables are available from:
- Sastry, Narayan, Bonnie Ghosh-Dastidar, John Adams, and Anne R. Pebley (2006). The Design of a Multilevel Survey of Children, Families, and Communities: The Los Angeles Family and Neighborhood Survey, Social Science Research, Volume 35, Number 4, Pages 1000-1024
- The Users' Guides (Wave 1 and Wave 2)
- RAND Documentation Reports page
National Couples' Health and Time Study (NCHAT), United States, 2020-2022 (ICPSR 38417)
Iowa Youth and Families Project, 1989-1992 (ICPSR 26721)
This data collection contains the first four waves of the Iowa Youth and Families Project (IYFP), conducted in 1989, 1990, 1991, and 1992. The Iowa Youth and Families Project was developed from an initial sample of 451 7th graders from two-parent families in rural Iowa. The study was merged with the Iowa Single Parent Project (ISPP) to form the Iowa Family Transitions Project in 1994, when the target youth were seniors in high school. Survey data were collected from the target child (7th grader), a sibling within four years of age of the target child, and both parents. Field interviewers visited families at their homes on several occasions to administer questionnaires and videotape interaction tasks including family discussion tasks, family problem-solving tasks, sibling interaction tasks, and marital interaction tasks.
The Household Data files contain information about the family's financial situation, involvement in farming, and demographic information about household members.
The Parent and the Child Survey Data files contain responses to survey questions about the quality and stability of family relationships, emotional, physical, and behavioral problems of individual family members, parent-child conflict, family problem-solving skills, social and financial support from outside the home, traumatic life experiences, alcohol, drug, and tobacco use, and opinions on topics such as abortion, parenting, and gender roles. In addition, the Child Survey Data files include responses collected from the target child and his or her sibling in the study about experiences with puberty, dating, sexual activity, and risk-taking behavior.
The Problem-Solving Data files contain survey data collected from respondents about the family interactions tasks.
The Observational Data files contain the interviewers' observations collected during these tasks.
Demographic variables include sex, age, employment status, occupation, income, home ownership, religious preference, frequency of religious attendance, as well as the ages and sex of all household members and their relationship to the head of household. Demographic information collected on the parents also includes their birth order within their family, the ages and political philosophy of their parents, the sex, age, education level, and occupation of their siblings, and the country of origin of their ancestors.
Provision of Sexuality Education in Secondary Schools, Ghana, Guatemala, Kenya, and Peru, 2015 (ICPSR 38440)
Los Angeles Family and Neighborhood Survey (L.A.FANS), Wave 3, Public Data | Mixed Income Project (MIP), 2011-2013 (ICPSR 37845)
This study includes one public use data file of follow-up interviews, conducted between 2011 and 2013, with respondents to Wave 2 of L.A.FANS (Los Angeles Family and Neighborhood Survey). This follow-up data collection effort (hereafter called L.A.FANS-3 or Wave 3) was part of the broader Mixed Income Project (MIP), which was designed to allow for detailed examination of neighborhood context, residential mobility, and mixed-income housing in Los Angeles and Chicago. The two anchor studies for the MIP are L.A.FANS and the Project on Human Development in Chicago Neighborhoods (PHDCN).
Wave 3 targeted a random probability sample of approximately 1,000 randomly selected adults and children from the prior wave of L.A.FANS, which was fielded between 2006 and 2008, who still resided within Los Angeles County. The Los Angeles field operation first assigned selected respondents to a telephone survey center for interviews. Cases that were not interviewed by telephone were transferred to experienced field interviewers in the Los Angeles area. The final response rate was 75 percent of eligible participants (i.e., residents who still resided in Los Angeles County and who were not institutionalized, incapacitated, or deceased) for a combined sample of 1,032. Two-hundred and two (202) of these respondents were reached during a preliminary Field Test in 2011, after which point the survey was slightly revised. After making these revisions, 830 respondents were reached during the Main Study. For more details on sampling procedures for the Field Test and Main Study, see Methodology section below.
For context, the L.A.FANS is a study of adults and children in Los Angeles County, and of the neighborhoods in which they live. The first wave (L.A.FANS-1 or Wave 1), which was fielded between April 2000 and January 2002, interviewed adults and children living in 3,085 households in a stratified probability sample of 65 neighborhoods throughout Los Angeles County. The samples of neighborhoods and individuals were representative of neighborhoods and residents of Los Angeles County. Poorer neighborhoods and households with children were oversampled. In Wave 2 of L.A.FANS (L.A.FANS-2), Wave 1 respondents still living in Los Angeles County were re-interviewed, while updated information was collected on Wave 1 respondents who had moved away from Los Angeles County. A sample of individuals who moved into each sampled neighborhood between Waves 1 and 2 was also interviewed, for a total of 2,319 adults and 1,382 children (ages less than 18 years). Additional information on the project is available at the RAND website.
Head Start Family and Child Experiences Survey (FACES 2014) Contextual Variables Data File, United States, 2014-2017 (ICPSR 38861)
FACES provides national information about Head Start programs and participants. Beginning in 1997, a series of nationally representative samples of Head Start children and their families, classrooms, and programs has described the population served by Head Start; staff qualifications, credentials, and opinions; Head Start classroom practices and quality measures; and the experiences and well-being of children and families. FACES studies have included assessments that measure children's cognitive skills, social-emotional skills, and physical status; observations of classroom quality; and surveys of children's parents, teachers, center directors, and program directors.
The Family and Child Experiences Survey (FACES 2014) Contextual Variables Data File contains 28 contextual, community-level variables about 399 Head Start centers included in the FACES 2014 study sample. It does not contain data collected as part of the FACES 2014 study; instead, it contains information from publicly available data sources and is designed to merge with other FACES 2014 data files to enhance the understanding of Head Start center communities. The contextual variables data describe characteristics of the census tract or block group in which Head Start centers are located.
The contextual variables include three index variables constructed by research institutions, 24 demographic and socioeconomic variables derived from the American Community Survey (ACS), and a measure of rural/urban status from the U.S. Department of Agriculture. The FACES 2014 Contextual Variables Data File is intended to be used with the other FACES 2014 data files. For example, in conjunction with the other FACES 2014 data, these data could be used to:
- describe the characteristics of neighborhoods where children attend Head Start,
- describe how children's experiences or Head Start quality differ by neighborhood characteristics, or
- explore associations among neighborhoods, Head Start experiences, and child and family well-being.
Mother and Infant Home Visiting Program Evaluation (MIHOPE), United States, 2012-2019 (ICPSR 37848)
In 2010, the United States Congress authorized the Maternal, Infant, and Early Childhood Home Visiting (MIECHV) program, which started a major expansion of evidence-based home visiting programs for families living in at-risk communities. MIECHV is administered by the Health Resources and Services Administration (HRSA) in collaboration with the Administration for Children and Families (ACF) within the U.S. Department of Health and Human Services (HHS). The authorizing legislation required an evaluation of the program, which became the Mother and Infant Home Visiting Program Evaluation (MIHOPE). The evaluation is being conducted for HHS by MDRC with James Bell Associates, Johns Hopkins University, Mathematica, the University of Georgia, and Columbia University.
MIHOPE was designed to learn whether families benefit from MIECHV-funded early childhood home visiting programs, and if so, how. The study included the four evidence-based models that 10 or more states chose in their initial MIECHV plans in fiscal year 2010-2011: Early Head Start - Home-based option, Healthy Families America, Nurse-Family Partnership, and Parents as Teachers. MIHOPE was the first study to include all of these four evidence-based models.
To provide rigorous evidence on the MIECHV-funded programs' effects, the study randomly assigned more than 4,200 families to receive either MIECHV-funded home visiting or information on community services. As is the standard method in studies that use random assignment, the primary analytical strategy in MIHOPE was to compare the outcomes of the entire program group with those of the entire control group.
As per the authorizing legislation, the study measured early effects on family and child outcomes in the areas listed below, with the exception of school readiness and academic achievement (which were not included at this point because children were too young to measure those outcomes):
- Prenatal, maternal, and newborn health
- Child health and development, including child maltreatment
- Parenting skills
- School readiness and child academic achievement
- Crime and domestic violence
- Family economic self-sufficiency
- Referrals and service coordination
Videos and Video Metadata: Two sets of videos are included in the MIHOPE restricted access files. They include:
- Mother-home visitor interactions at 387 home visits and
- Interactions between child and mother using the "Three Bags" and "Clean-Up" tasks with 2,832 families.
The mother-home visitor interaction videos were recorded only for treatment group families at two points in time: the first was, on average, about eight weeks after the family's first home visit and the second was about eight months after the family's first home visit. Overall, 264 families are included in the mother-home visitor interaction videos in total, with 123 of these families recorded at both points in time.
The mother-child interaction videos, during which the child and mother play with toys contained in three bags and place the toys back in the bags (the "Three Bags" and "Clean-Up" tasks), were recorded when the 15-month in-home assessments were conducted and are available for 2,832 families in the treatment and control groups.
The videos are only linkable to a few pieces of metadata (home visiting model, video ID, treatment status, and variables indicating whether the family appears in the home visit videos, the three-bag task videos, or both). The videos in the restricted access data are not linkable to any other data included in the restricted access files. Additionally, the videos may only be viewed at the Inter-university Consortium for Political and Social Research's on-site Physical Data Enclave in Ann Arbor, Michigan.
The Women's Employment Study, Genesee County, Michigan, 1997-2004 (ICPSR 37077)
The Women's Employment Study, or "WES" combined the insights of poverty researchers, epidemiologists, and social workers by analyzing the ways in which a broad range of labor market, mental health, physical health, and family problems affect a welfare recipient's ability to obtain and retain employment over time.
WES initially began in the Fall of 1997, and five waves of the survey were completed by early 2004, with a high proportion of the original panel sample retained throughout the study.
The unique aspects of WES rest largely with the broad range of barriers it measures, and the longitudinal nature of the study that allows researchers to look at employment and welfare experiences over time.
Relationship Dynamics and Social Life (RDSL) Study [Genesee County, Michigan], 2008-2012 [Public and Highly Restricted-Use] (ICPSR 34626)
The Relationship Dynamics and Social Life (RDSL) Study aims to investigate the types of romantic relationships that produce early and/or unintended pregnancies. The study is based on a representative sample of 1,003 women aged 18 to 22 residing in Genesee County, Michigan. The research team focused on women ages 18 to 22 because these ages are characterized by the highest rates of unintended pregnancy, as well as significant instability and change in the dynamic determinants of unintended pregnancy.
Data collection for the baseline survey was conducted March 2008 through July 2009, and consisted of a 60-minute face-to-face interview to gather information on respondent attitudes and behaviors, intimate and familial relationships, contraceptive use, reproductive history, self-reported height and weight, and socio-demographic characteristics.
The baseline survey was followed by a series of three supplemental surveys administered over a two-and-a-half year period between May 2009 and August 2011. These surveys covered a wide range of topics, including family living arrangements, socioeconomic status, employment, media consumption, mental health, violence, personality traits, assumptions and knowledge regarding various forms of contraception, and attitudes and opinions about social life. The second major component of the RDSL features journal data collected concurrently with the supplemental surveys. The focus of the journal data collection was to gather dynamic, prospective measurements of pregnancy desires and contraceptive use, as well as relationship attributes such as commitment, sexual intimacy, and decision-making regarding contraception. Please consult the crosswalk to determine which level of restriction is required for research.
Demographic information collected includes respondent age, race, ethnicity, religious affiliation, marital status, education, employment status, income, and household size and composition.
Healthy Schools Program Evaluation, 2006-2014 (ICPSR 33541)
These data were collected as part of the evaluation of the Healthy School Program (HSP), a program that provides support to elementary, middle, and high schools in the United States as they work to create healthy school environments that promote physical activity and healthy eating for students and staff. HSP was created in 2006 by the Alliance for a Healthier Generation with funding from the Robert Wood Johnson Foundation. The HSP evaluation addressed both process and impact outcomes:
Is the HSP technical assistance and training model effective in increasing the implementation of policies and programs that promote and provide access to healthier foods and more physical activity before, during and after school?
Are there distinctive or common school-level characteristics that hasten or hinder school-level implementation of policies and programs that promote and provide access to healthy foods and physical activity in the school setting in HSP schools?
Does participation in HSP contribute to an increase in healthy eating behaviors and physical activity participation among students? Does participation in HSP contribute to a decrease in body mass index (BMI) among students?
The evaluation used a mixed-method design incorporating both quantitative and qualitative components. The quantitative component of the evaluation was a longitudinal design that measured student changes in eating and physical activity behaviors and BMI and schools' implementation of policies and practices promoted by HSP. For the qualitative component the evaluation team conducted site visits in a sample of HSP schools.
Nine data files constitute this data collection:
HSP Participation and Inventory Data File, 2006-2011 (originally called the Inventory Data File)
Pilot Student Survey Data File
Pilot Student Height and Weight Measurements Data File
Survey of Students in Boston and Miami-Dade Public Schools Data File
HSP Participation and Inventory Data File, 2006-2014
Arizona, Prince George's County and Nevada Healthy Schools Youth Survey Data File
Arizona and Prince George's County Youth Height and Weight Measurements Data File
Arizona Academic Achievement Data File
Prince George's County School Wellness Coordinator Survey Data File
Dataset 1 contains data on school characteristics, HSP engagement indicators, baseline and follow-up responses to the Healthy Schools Inventory, and indices derived from the Inventory for all HSP schools as of August 2011. The Inventory collected information about each school's adherence to the Healthy Schools Program Framework, a set of best practice guidelines that promote physical activity and healthy eating among students and staff.
Datasets 2, 4 and 6 contain data from baseline and follow-up administrations of the Healthy Schools Youth Survey questionnaire in three samples of HSP schools: students in grades 5-12 in the initial pilot cohort of HSP schools; students in grades 5, 8 and 10 in the 2007-2008 cohort of HSP schools in Boston, Massachusetts and Miami-Dade County, Florida; and students in grades 5, 8 and 10 or 11 in HSP schools in Arizona, Nevada and Prince George's County, Maryland. Topics covered by the Healthy Schools Youth Survey questionnaire include eating and physical activity habits, attitudes about healthy eating and physical activity, health knowledge, and school food environments.
Datasets 3 and 7 contain baseline and follow-up height and weight measurements and derived BMIs, the former for students in grades 4-12 in schools sampled by the Pilot Student Survey and the latter for students in grades 5, 8, and 10 in Arizona and grades 1-12 in Prince George's County in schools sampled by the Arizona, Prince George's County and Nevada Healthy Schools Youth Survey.
Dataset 5 is an update to Dataset 1. Like Dataset 1 it contains data on HSP participation and engagement and school characteristics. Dataset 5 covers 8,500 schools that participated in HSP through fall 2014. It includes 4,028 of the 4,542 schools in Dataset 1.
Dataset 8 contains average math, reading and language scores for grades in HSP and comparable non-HSP schools in Arizona. Every record in the data file represents a grade (one or more of the grades 2-9) within a school (150 schools) for a given school year (up to seven years 2007-2008 to 2013-2014).
Dataset 9 contains data from a survey of HSP school coordinators in Prince Georges County. The coordinators were interviewed about the implementation of HSP in their schools.
ICPSR did not receive the site visit data.
Mother and Infant Home Visiting Program Evaluation-Strong Start (MIHOPE-Strong Start), United States, 2012-2017 (ICPSR 37847)
Mother and Infant Home Visiting Program Evaluation-Strong Start (MIHOPE-Strong Start) was a large-scale evaluation that rigorously tested the effectiveness of evidence-based home visiting in improving birth and health outcomes during pregnancy and in the year after birth. Local programs included in the study's analysis implemented one of two evidence-based models: Healthy Families America (HFA) or Nurse-Family Partnership (NFP). These models were chosen because earlier evaluations found some evidence of their having positive impacts on birth outcomes.
The Office of Planning, Research, and Evaluation (OPRE) of the Administration for Children and Families (ACF) partnered with the Center for Medicare and Medicaid Innovation (CMMI) of the Centers for Medicare and Medicaid Services (CMS) and the Maternal and Child Health Bureau (MCHB) of the Health Resources and Services Administration (HRSA) to sponsor the study. MIHOPE-Strong Start was part of the CMMI's Strong Start for Mothers and Newborns Initiative, which evaluated whether enhanced, nonmedical prenatal interventions, when provided in addition to routine medical care, have the potential to improve birth outcomes and reduce health care costs for women enrolled in Medicaid or the Children's Health Insurance Program (CHIP). Under contract with OPRE, MDRC conducted MIHOPE-Strong Start in collaboration with James Bell Associates, Johns Hopkins University, Mathematica, and New York University.
The analysis for MIHOPE-Strong Start included 2,899 women and 66 local programs (37 HFA and 29 NFP programs) operating across 17 states: California, Georgia, Illinois, Indiana, Iowa, Kansas, Massachusetts, Michigan, Nevada, New Jersey, New York, North Carolina, Pennsylvania, South Carolina, Tennessee, Washington, and Wisconsin. Women were eligible for MIHOPE-Strong Start if they were pregnant and at least 8 weeks from their due date.
The MIHOPE-Strong Start analysis included a subset of families and local programs that were recruited for MIHOPE, the national evaluation of the Maternal, Infant, and Early Childhood Home Visiting (MIECHV) program. Specifically, the MIHOPE-Strong Start impact analysis included information on 46 local home visiting programs and 1,845 families that were initially recruited for MIHOPE but met the MIHOPE-Strong Start eligibility criteria. An important distinction between MIHOPE-Strong Start and MIHOPE is that MIHOPE included only programs receiving MIECHV funding, while MIHOPE-Strong Start included both MIECHV and non-MIECHV-funded programs.
In both studies, families were randomly assigned either to an evidence-based home visiting program or to a control group who was given information on other services available in the community. The random assignment design was intended to create program and control groups that were similar when women entered the study, so that systematic differences in the outcomes of interest observed between the two groups can be attributed to the home visiting services rather than to the preexisting characteristics of the women.
Los Angeles Family and Neighborhood Survey (L.A.FANS), Los Angeles Neighborhood Services and Characteristics Data (L.A.NSC), 1980-2010 (ICPSR 37277)
This study, known as the Los Angeles Neighborhood Services and Characteristics data (L.A.NSC) includes data from governmental and business data sources on the characteristics of census tracts in Los Angeles County from 1980 to 2010. The unit of analysis is 1990 census tract (census tracts using 1990 census tract boundaries). The data also include crosswalks for census boundaries across census years. It is public use data.
The L.A.NSC data were assembled by the L.A.FANS projects to be used with L.A.FANS survey data restricted data versions 2, 2.5, and 3 all of which contain 1990 census tract numbers which can be linked with data in this file. However, the L.A.NSC files includes ALL census tracts in Los Angeles County, not just the 65 tracts sampled by L.A.FANS. Thus, the L.A.NSC data can be analyzed on their own without linkage to L.A.FANS survey data and provide longitudinal data on the characteristics of Los Angeles census tracts over a 30 year period.
Additional information on the data is available at the RAND website. See also:
- The L.A.NSC Database Users' Guides (The Los Angeles Neighborhood Services and Characteristics Database: Codebook and 2010 Neighborhood Services and Characteristics Database: NSC2).
Undiagnosed Trichomoniasis Infection in the Population of Baltimore, MD: Data from the 2006 - 2009 Monitoring STIs Survey Program (ICPSR 35066)
Child Care and Children with Special Needs: Challenges for Low Income Families, Maine, United States, 2002-2005 (ICPSR 27001)
This project was a mixed-method, multi-level study of low income families of children with special needs and the system which served them, focusing primarily on child care, employment, and balancing work and family. This approach included an analysis of existing national and state-level data sets, statewide surveys of parents and child care providers, and a field study to look at these issues at the local level in three selected communities in the state of Maine: Portland, Lewiston/Auburn, and Presque Isle. While the primary focus was on access to child care, this project also looked at the related issues of welfare reform, the impact of work force participation on having a child with special needs, and the issue of coordination of early intervention services with the child care system. The goal was to understand better the issues facing low income families with special needs children across the programs and policies affecting their employment, access to child care, and meeting the special needs of their children. In the first year of the study, qualitative research was conducted to learn directly from parents about their experiences. In the second and third years, a field study of three communities was conducted as well as statewide surveys and analysis of national data bases to supplement the data collected in the first year. This data collection is comprised of the two quantitative data files produced during the second and third years of the study which are described in more detail below.
Child Care Provider Survey: The Child Care Provider Survey was a statewide survey of child care providers selected at random from the list of licensed providers in Maine given by the state licensing agency. Questions focused on the perspective of child care providers on the issues of access and inclusion that parents raised.
Parent Survey: The Parent Survey was a statewide survey of parents and children aged 0-18 years with diagnosed special needs (enrolled in Maine Care - Katie Beckett and Title V eligibility groups - and Child Development Services early intervention caseloads). Questions focused on child care utilization and work experiences in relation to children with special needs.
Researchers interested in information about the qualitative data should contact the Child Care and Children with Special Needs Project Web site.
Chitwan Valley Family Study: Changing Social Contexts and Family Formation, Nepal, 1995-2019 (ICPSR 4538)
The Chitwan Valley Family Study (CVFS) is a comprehensive family panel study of individuals, households, and communities in the Chitwan Valley of Nepal. The study was initially designed to investigate the influence of changing community and household contexts on population outcomes such as marital and childbearing processes. Over time, the goals of the study expanded to investigate family dynamics, intergenerational influences, child health, migration, labor force participation, attitudes and beliefs, mental health, agricultural production, environmental change, and many other topics. The data include full life histories for more than 10,000 individuals, tracking and interviews with all migrants, continuous measurement of community change, over 25 years of demographic event registry, and many other data collections. For additional information regarding the Chitwan Valley Family Study, please visit the Chitwan Valley Family Study Website. A Data Guide for this study is available as a web page and for download.
Principal Investigators
- William G. Axinn, University of Michigan
- Dirgha Ghimire, University of Michigan
- Jordan Smoller, Massachusetts General Hospital
Bicol Multipurpose Survey (BMS), 1978: [Philippines] (ICPSR 6878)
Bicol Community Survey (BCS), 1981: [Philippines] (ICPSR 6888)
The Future of Families and Child Wellbeing Study (FFCWS), Public Use, United States, 1998-2024 (ICPSR 31622)
The Future of Families and Child Wellbeing Study (FFCWS, formerly known as the Fragile Families and Child Wellbeing Study) follows a cohort of nearly 5,000 children born in large, U.S. cities between 1998 and 2000. The study oversampled births to unmarried couples; and, when weighted, the data are representative of births in large U.S. cities at the turn of the century. The FFCWS was originally designed to address four questions of great interest to researchers and policy makers:
- What are the conditions and capabilities of unmarried parents, especially fathers?
- What is the nature of the relationships between unmarried parents?
- How do children born into these families fare?
- How do policies and environmental conditions affect families and children?
The FFCWS consists of interviews with mothers, fathers, and/or primary caregivers at birth and again when children are ages 1, 3, 5, 9, 15, and 22. The parent interviews collected information on attitudes, relationships, parenting behavior, demographic characteristics, health (mental and physical), economic and employment status, neighborhood characteristics, and program participation. Beginning at age 9, children were interviewed directly (either during the home visit or on the telephone). The direct child interviews collected data on family relationships, home routines, schools, peers, and physical and mental health, as well as health behaviors.
A collaborative study of the FFCWS, the In-Home Longitudinal Study of Pre-School Aged Children (In-Home Study) collected data from a subset of the FFCWS Core respondents at the Year 3 and 5 follow-ups to ask how parental resources in the form of parental presence or absence, time, and money influence children under the age of 5. The In-Home Study collected information on a variety of domains of the child's environment, including: the physical environment (quality of housing, nutrition and food security, health care, adequacy of clothing and supervision) and parenting (parental discipline, parental attachment, and cognitive stimulation). In addition, the In-Home Study also collected information on several important child outcomes, including anthropometrics, child behaviors, and cognitive ability. This information was collected through interviews with the child's primary caregiver, and direct observation of the child's home environment and the child's interactions with his or her caregiver.
Similar activities were conducted during the Year 9 follow-up. At the Year 15 follow-up, a condensed set of home visit activities were conducted with a subsample of approximately 1,000 teens. Teens who participated in the In-Home Study were also invited to participate in a Sleep Study and were asked to wear an accelerometer on their non-dominant wrist for seven consecutive days to track their sleep (Sleep Actigraphy Data) and that day's behaviors and mood (Daily Sleep Actigraphy and Diary Survey Data).
An additional collaborative study collected data from the child care provider (Year 3) and teacher (Years 9 and 15) through mail-based surveys. Saliva samples were collected at Year 9 and 15 (Biomarker file and Polygenic Scores). The Study of Adolescent Neural Development (SAND) COVID Study began data collection in May 2020 following the onset of the COVID-19 pandemic. It included online surveys with the young adult and their primary caregiver.
The FFCWS began its seventh wave of data collection in October 2020, around the focal child's 22nd birthday. Data collection and interviews continued through January 2024. The Year 22 wave included a young adult (YA) survey with the original focal child and a primary caregiver (PCG) survey. Data were also collected on the children of the original focal child (referred to as Generation 3, or G3).
In 2017, the FFCWS team announced the Fragile Families (FF) Challenge, a collaborative effort in which participants were tasked with using machine learning methods and FFCWS data (Baseline to Year 9) to build a model that would predict six key outcomes at Year 15. Materials used in the FF Challenge have been archived in this collection.
Documentation for these files is available on the FFCWS website under Data and Documentation. For details of updates made to the FFCWS data files, please see the project's Data Alerts page.
Data collection for the Future of Families and Child Wellbeing Study was supported by the Eunice Kennedy Shriver National Institute of Child Health and Human Development (NICHD) of the National Institutes of Health under award numbers R01HD36916, R01HD39135, and R01HD40421, as well as a consortium of private foundations.
Below is the citation for use of the FFCWS data accessed through ICPSR. For information on additional citation requirements when using FFCWS in publications, please refer to this FAQ on the FFCWS project site.
Harmonizing Two NICHD-funded Datasets to Study Youths' Behavioral Health, United States, 1986-2016 (ICPSR 38297)
Malawi Longitudinal Study of Families and Health (MLSFH), 1998-2021 (ICPSR 20840)
The Malawi Longitudinal Study of Families and Health (MLSFH) is one of very few long-standing longitudinal cohort studies in a poor Sub-Saharan African (SSA) context. It provides a record of more than 25 years of demographic, socioeconomic, and health conditions in one of the world's poorest countries. Initial data collection began in 1998 under the Malawi Diffusion and Ideational Change Project (MDICP) to examine social networks and fertility decisions among married women and their husbands. While this initial study population is still followed, the scope of the project and population expanded to a broader focus on social and contextual determinants of health across the lifecourse in Malawi.
This collection includes Rounds 1 through 9 of the MLSFH, as well as supplemental data collections from Sexual Diaries, Migration Follow-Ups (MHM), a Biomarker Survey, Adverse Childhood Experiences (ACE), and a Benefits of Knowledge Intervention Survey. The MLSFH Data web page contains additional information and cohort profiles for all MLSFH data collections, including those not made available through ICPSR-DSDR.
Effects of High-poverty Neighborhoods on Youth (Continuation-Revised) (ICPSR 35999)
Combined Generations Wave 1 and TransPop surveys, United States, 2016-2018 (ICPSR 38421)
This collection includes a combined dataset of the Generations study wave 1 (baseline) survey and the TransPop study transgender survey. The two studies have many overlapping variables, and they examined topics such as respondents' health outcomes and behaviors, experiences with discrimination, identity, and transition-related experiences. Data from these studies were merged to allow for analysis of the combined LGBT populations. This dataset has also been reweighted to be representative of these populations.
The complete Generations study data (baseline, wave 2, and wave 3 survey data) can be found under study number 37166, and the complete TransPop study data (transgender and cisgender survey data) can be found under study number 37938. For detailed information on the Generations and TransPop studies, including related publications, please refer to their respective DSDR/ICPSR study pages.