New Approach to Evaluating Supplementary Homicide Report (SHR) Data Imputation, 1990-1995 (ICPSR 20060)
Version Date: Dec 18, 2007 View help for published
Principal Investigator(s): View help for Principal Investigator(s)
Tim Wadsworth, University of New Mexico. Department of Sociology;
John M. Roberts, University of New Mexico. Department of Sociology
https://doi.org/10.3886/ICPSR20060.v1
Version V1
Summary View help for Summary
The purpose of the project was to learn more about patterns of homicide in the United States by strengthening the ability to make imputations for Supplementary Homicide Report (SHR) data with missing values. Supplementary Homicide Reports (SHR) and local police data from Chicago, Illinois, St. Louis, Missouri, Philadelphia, Pennsylvania, and Phoenix, Arizona, for 1990 to 1995 were merged to create a master file by linking on overlapping information on victim and incident characteristics. Through this process, 96 percent of the cases in the SHR were matched with cases in the police files. The data contain variables for three types of cases: complete in SHR, missing offender and incident information in SHR but known in police report, and missing offender and incident information in both. The merged file allows estimation of similarities and differences between the cases with known offender characteristics in the SHR and those in the other two categories. The accuracy of existing data imputation methods can be assessed by comparing imputed values in an "incomplete" dataset (the SHR), generated by the three imputation strategies discussed in the literature, with the actual values in a known "complete" dataset (combined SHR and police data). Variables from both the Supplemental Homicide Reports and the additional police report offense data include incident date, victim characteristics, offender characteristics, incident details, geographic information, as well as variables regarding the matching procedure.
Citation View help for Citation
Export Citation:
Funding View help for Funding
Subject Terms View help for Subject Terms
Geographic Coverage View help for Geographic Coverage
Smallest Geographic Unit View help for Smallest Geographic Unit
jurisdiction
Distributor(s) View help for Distributor(s)
Time Period(s) View help for Time Period(s)
Date of Collection View help for Date of Collection
Data Collection Notes View help for Data Collection Notes
-
Users are encouraged to refer to the project final report for more information about imputation methods used and the processes the researchers used to assess them.
Study Purpose View help for Study Purpose
The purpose of the project was to learn more about patterns of homicide in the United States by strengthening the ability to make imputations for Supplementary Homicide Report (SHR) data with missing values. Several strategies for imputing missing homicide data have been suggested in recent years. However, aside from assessing their logic and the degree to which findings based on imputed data are in accordance with what is known broadly about homicide, such strategies are difficult to evaluate for the reason that the "true" values for cases with missing data cannot be observed. By combining different datasets available through the National Archive of Criminal Justice Data, this project offered a strategy for both testing imputation approaches that have been suggested in the literature and for using these results to advance the development of new methods for imputing SHR data.
Study Design View help for Study Design
In order to better understand why data are missing and how missing and nonmissing data differ, Supplementary Homicide Reports (SHR) and local police data from Chicago, Illinois, St. Louis, Missouri, Philadelphia, Pennsylvania, and Phoenix, Arizona, for 1990 to 1995 were merged to create a master file. The researchers used overlapping information on victim and incident characteristics from the SHR and police data from fields for which the data are very rarely missing even if the offender is unknown (age, sex, and race of victims, month of incident, and weapon) to link the two types of data sources. While this process became challenging when the characteristics of victims and incidents were identical across multiple incidents in a given month, this type of situation was extremely rare as usually at least one characteristic varied across incidents. Through this process, 96 percent of the cases in the SHR were matched with cases in the police files. The data contain variables for three types of cases: complete in SHR, missing offender and incident information in SHR but known in police report, and missing offender and incident information in both. The merged file allows estimation of similarities and differences between the cases with known offender characteristics in the SHR and those in the other two categories. The accuracy of existing data imputation methods can be assessed by comparing imputed values in an "incomplete" dataset (the SHR), generated by the three imputation strategies discussed in the literature, with the actual values in a known "complete" dataset (combined SHR and police data), for both case-by-case imputation as well as distributions at the aggregate level.
Sample View help for Sample
Not applicable.
Universe View help for Universe
Homicides known to the police in Chicago, Illinois, Philadelphia, Pennsylvania, Phoenix, Arizona, St. Louis, Missouri, from 1990 to 1995.
Unit(s) of Observation View help for Unit(s) of Observation
Data Source View help for Data Source
CHANGING PATTERNS IN SOCIAL POLICY IN PHILADELPHIA, PHOENIX, AND ST. LOUIS, 1980-1994 (ICPSR 2729). Data for St. Louis were supplied by the St. Louis Homicide Project (Scott Decker and Richard Rosenfeld, principal investigators).
HOMICIDES IN CHICAGO, 1965-1995 (ICPSR 6399)
UNIFORM CRIME REPORTS [UNITED STATES]: SUPPLEMENTARY HOMICIDE REPORTS, 1976-1999 (ICPSR 3180)
Data Type(s) View help for Data Type(s)
Mode of Data Collection View help for Mode of Data Collection
Description of Variables View help for Description of Variables
Variables from both the Supplemental Homicide Reports and the police report offense data include incident date (year and month), victim characteristics (age, sex, and race), offender characteristics (age, sex, race, and relationship to victim), and incident details (weapon used, circumstance, and situation of offense). Data also contain geographic variables (UCR state code, agency, population group, region, population, county, and SMSA), as well as matching information, such as from which source the offender data were missing, the number of victims in the SHR data, the number of offenders in the SHR data, the researchers' confidence in the case match, their comments, and several aggregated and derived variables.
Response Rates View help for Response Rates
Not applicable.
Presence of Common Scales View help for Presence of Common Scales
None
HideOriginal Release Date View help for Original Release Date
2007-12-18
Version History View help for Version History
- Wadsworth, Tim, and John M. Roberts. New Approach to Evaluating Supplementary Homicide Report (SHR) Data Imputation, 1990-1995. ICPSR20060-v1. Ann Arbor, MI: Inter-university Consortium for Political and Social Research [distributor], 2007-12-18. http://doi.org/10.3886/ICPSR20060.v1
2007-12-18 ICPSR data undergo a confidentiality review and are altered when necessary to limit the risk of disclosure. ICPSR also routinely creates ready-to-go data files along with setups in the major statistical software formats as well as standard codebooks to accompany the data. In addition to these procedures, ICPSR performed the following processing steps for this data collection:
- Checked for undocumented or out-of-range codes.
Weight View help for Weight
Data contain three weight variables associated with the SHR data: WTNONE, WTUS, and WTST.
HideNotes
The public-use data files in this collection are available for access by the general public. Access does not require affiliation with an ICPSR member institution.