Annual Health Survey (AHS), India, 2007-2012 (ICPSR 38097)
Version Date: Apr 13, 2022 View help for published
Principal Investigator(s): View help for Principal Investigator(s)
Diane Coffey, The University of Texas at Austin;
r.i.c.e., a research institute for compassionate economics.;
Nathan Franz, The University of Texas at Austin;
r.i.c.e., a research institute for compassionate economics.;
Samuel Arenberg, The University of Texas at Austin;
Minle Xu, The University of Texas at Austin;
Sangita Vyas, The University of Texas at Austin;
r.i.c.e., a research institute for compassionate economics.
https://doi.org/10.3886/ICPSR38097.v2
Version V2 (see more versions)
Summary View help for Summary
The Annual Health Survey (AHS), conducted by the Government of India between July 2010 and May 2013, investigates maternal and child health in nine states: Assam, Bihar, Chhattisgarh, Jharkhand, Madhya Pradesh, Orissa, Rajasthan, Uttarakhand, and Uttar Pradesh. These states constitute about 70 percent of neonatal deaths in India and about one-in-five neonatal deaths globally. The AHS consists of a three-round panel that interviewed over 4 million households in each round, as well as a one-time Clinical, Anthropometric, and Bio-Chemical Survey (CAB). The data were originally released to the public in 2015 as a set of 45 .csv files. The .csv files are included in a restricted-use zipped package as part of the ICPSR release (see dataset 21).
The survey focused on topics such as household composition, caste, fertility, family planning, pre- and post-natal care, breastfeeding, infant mortality, illness, disease, disability, and health care practices. Demographic information includes sex, age, education, occupation, marital status, household size, and religion. The CAB files contain biometric data including but not limited to height, weight, blood pressure, hemoglobin, pulse, and blood glucose.
Potential data users should note that the public-use and restricted-use versions of the datasets are the same except for the masking of day component variables for certain dates in the public-use versions of the files (please see the Description of Variables section for full details). Therefore, only researchers with a limited set of research questions that require full birth, marriage, and death dates will need to apply for the restricted-use versions of the data files.
Additionally, because the final data files are very large and potentially very time consuming to analyze on personal computers, researchers have the option to download ten-percent samples of each file (see datasets 3, 4, 7, 8, 11, 12, 15, 16, 19, and 20). These samples contain the same variables as the original files but only ten percent of the records. The samples were determined by taking a randomly selected ten percent of households in each district. P.I. codebooks were not produced for these samples. Please note that the ten-percent samples for each dataset were selected independently, so it is not advised to merge across datasets within the AHS using these samples, as the match rates will be very low.
Citation View help for Citation
Export Citation:
Funding View help for Funding
Subject Terms View help for Subject Terms
Geographic Coverage View help for Geographic Coverage
Smallest Geographic Unit View help for Smallest Geographic Unit
District
Restrictions View help for Restrictions
These data may not be used for any purpose other than statistical reporting and analysis. Use of these data to learn the identity of any person or establishment is strictly prohibited. To protect respondent privacy, certain files within this data collection are restricted from general dissemination. To obtain these files, researchers must agree to the terms and conditions of a Restricted Data Use Agreement in accordance with existing ICPSR servicing policies.
Distributor(s) View help for Distributor(s)
Time Period(s) View help for Time Period(s)
Date of Collection View help for Date of Collection
Data Collection Notes View help for Data Collection Notes
-
The Annual Health Survey was conducted by the Government of India between July of 2010 and May of 2013. The data were originally released to the public in 2015 as a set of 45 .csv files. These .csv files are included in a restricted-use zipped package as part of the ICPSR release (see dataset 21).
The University of Texas at Austin assembled the publicly available microdata, constructed the necessary identifiers, and cleaned the data. The user-friendly versions of the constituent datasets are provided by ICPSR as public and restricted-use datasets 1 through 20. For ease of use, ICPSR has retained the folder structure of the deposited .csv files and the Stata (.do) syntax files used to create the user-friendly versions of the files; therefore, the .csv files and .do files are provided together in the dataset 21 restricted-use zipped package. For users working with the public-use datasets, only the Stata (.do) syntax files are available for reference in the dataset 21 zipped package.
The Users' Guide describes how the final datasets were constructed from the original files released by the Government of India's Ministry of Health and Family Welfare, what information the final datasets contain, how to use the panel structure of the data, how to link the different datasets together, as well as some of the limitations of the data. Please see the Users' Guide for additional information about the user-friendly datasets, as well as detailed guidance regarding use of the Stata (.do) syntax and .csv files.
-
The public-use and restricted-use versions of the datasets are the same except for the masking of day component variables for certain dates in the public-use versions of the files (please see the Description of Variables section for full details). Therefore, only researchers with a limited set of research questions that require full birth, marriage, and death dates will need to apply for the restricted-use versions of the data files.
Study Purpose View help for Study Purpose
The purpose of this study was to measure the impact of the National Rural Health Mission, which launched in 2005 and featured several changes to the provision of maternal and child health in India.
Study Design View help for Study Design
The Annual Health Survey (AHS) was conducted for three years between 2010-2013. For the AHS study, population listing, sample selection, and field protocols were used to collect samples. For this survey, data were collected using questionnaires from three rounds. In the first round (known as the baseline study), household that were selected in the survey were asked to complete four schedules (questionnaires), which are the Household Schedule, Household-Listing Schedule, Woman Schedule, and Mortality Schedule.
The Clinical, Anthropometric, and Bio-Chemical Survey (CAB) is a cross sectional survey that visited different households than the ones that were interviewed in the AHS Panel. The CAB collected anthropometric data, as well as blood pressure and blood samples to measure glucose and hemoglobin. The CAB was collected in 2014.
Sample View help for Sample
The Sample design adopted for Annual Health Survey is a uni-stage stratified simple random sample without replacement except in case of larger villages in rural areas (population more than or equal to 2,000 as per the 2001 Census), wherein a two stage stratified sampling has been applied. The sample units are Census Enumeration Blocks (CEBs) in urban areas and villages in rural areas. In rural areas, the villages have been divided into two strata.
Stratum I comprises villages with population less than 2,000 and Stratum II contains villages with population 2,000 or more. Smaller villages with population less than 200 were excluded from the sampling frame in such a manner that the total population of villages so excluded did not exceed 2 per cent of the total population of the district. In case of Stratum I, the entire village is the sample unit. In case of Stratum II, the village has been divided into mutually exclusive (non-overlapping) and geographically contiguous units comprising group of EBs called segments of more or less equal size and population not exceeding 2,000 in any case. One segment from the frame of segments thus prepared was selected in a random manner to represent the selected village at the second stage of sampling.
The number of sample villages in each district was allocated between the two strata proportionally to their size (population). The villages within each size stratum were further ordered by the female literacy rate based on the Census 2001 data, and three disjoint and equal size substrata were established. The sample villages within each substratum were selected by simple random sampling without replacement. Similarly, in urban areas, the CEBs within a district were ordered by the female literacy rate based on the Census 2001 data, and three disjoint and equal size substrata were established. The sample CEBs within each substratum were selected by simple random sampling without replacement. Thus, female literacy which has a direct bearing on the fertility behavior was used for implicit stratification.
Time Method View help for Time Method
Universe View help for Universe
Individuals and households in the Indian states of Assam, Bihar, Chhattisgarh, Jharkhand, Madhya Pradesh, Odisha, Rajasthan, Uttar Pradesh, and Uttarakhand.
Unit(s) of Observation View help for Unit(s) of Observation
Data Type(s) View help for Data Type(s)
Mode of Data Collection View help for Mode of Data Collection
Description of Variables View help for Description of Variables
Public-use and restricted-use versions of the COMB, WOMAN, WPS, MORT, and CAB datasets are the same except for the masking of day component variables for full dates in the public-use versions of the files:
- DS1 and DS3: Variables DATE_OF_BIRTH (day) and DATE_OF_MARRIAGE (day) are masked in the public-use COMB files.
- DS5 and DS7: Variables DATE_OF_BIRTH (day) and DATE_OF_MARRIAGE (day) are masked in the public-use WOMAN files.
- DS9 and DS11: Variables DOB (day-child), DATE_OF_BIRTH (day-mother), and DATE_OF_MARRIAGE (day) are masked in the public-use WPS files.
- DS13 and DS15: Variables DATE_OF_DEATH (day), DATE_OF_BIRTH (day), and DATE_OF_MARRIAGE (day) are masked in the public-use MORT files.
- DS17 and DS19: Variable DATE_OF_BIRTH (day) is masked in the public-use CAB files.
Original Release Date View help for Original Release Date
2022-04-06
Version History View help for Version History
2022-04-13 Updated P.I. information in public and restricted-use documentation files. Updated DS1 and DS2 to include R data files.
2022-04-06 ICPSR data undergo a confidentiality review and are altered when necessary to limit the risk of disclosure. ICPSR also routinely creates ready-to-go data files along with setups in the major statistical software formats as well as standard codebooks to accompany the data. In addition to these procedures, ICPSR performed the following processing steps for this data collection:
- Checked for undocumented or out-of-range codes.
Weight View help for Weight
The three strata in the AHS panel are large villages (population greater than or equal to 2,000 as per the 2001 Census), small villages (population less than 2,000 as per the 2001 Census), and urban areas, where Census Enumeration Blocks acted as primary sampling units. Within each district, weights are stratum-specific; that is, observations in a given district have one of three unique weights that correspond to whether the observation is from a large village, a small village or an urban area. These weights should be applied when computing district- or region-level statistics.
HideNotes
The public-use data files in this collection are available for access by the general public. Access does not require affiliation with an ICPSR member institution.
One or more files in this data collection have special restrictions. Restricted data files are not available for direct download from the website; click on the Restricted Data button to learn more.
This study was originally processed, archived, and disseminated by Data Sharing for Demographic Research (DSDR), a project funded by the Eunice Kennedy Shriver National Institute of Child Health and Human Development (NICHD).