Assessment of Sexual Assault Kit (SAK) Evidence Selection Leading to Development of SAK Evidence Machine-Learning Model (SAK-ML Model), California, Idaho, Utah, 2010-2022 (ICPSR 39161)
Few studies have explored aggregated DNA analysis findings from sexual assault kits (SAKs) and predictive features of developing useful DNA information related to the foreign contributor(s). Information gleaned from evaluating DNA analysis findings has significant practice and policy implications for both forensic medical examiners/sexual assault nurse examiners and forensic scientists. Results from this innovative study were obtained by tracking SAKs from evidence collection, data from sexual assault medical forensic examinations, through DNA analysis results, and data from publicly funded laboratories.
This study does not include data files. It includes 13 Python files used for statistical analysis.
Children's Out-of-Court Statements: Effects of Hearsay on Jurors' Decisions in Sacramento County, California, and Maricopa County, Arizona, 1994-1997 (ICPSR 2791)
Collecting DNA at Arrest: Policies, Practices, and Implications, in 28 States, 2005-2012 (ICPSR 34682)
These data are part of NACJD's Fast Track Release and are distributed as they were received from the data depositor. The files have been zipped by NACJD for release, but not checked or processed except for the removal of direct identifiers. Users should refer to the accompanying readme file for a brief description of the files available with this collection and consult the investigator(s) if further information is needed.
This study examined arrestee DNA laws (laws that allowed testing of arrestees' DNA pre-adjudication), their implementation in the field, and their subsequent effects on agency operations, as well as their success in aiding investigations in the 28 states that have these laws. The study investigated six specific questions:
- What states have passed legislation authorizing the collection of DNA from arrestees?
- How do the laws and policies regarding collecting DNA from arrestees differ by state?
- How have the courts ruled on these new laws?
- How have arrestee DNA laws been implemented in each state?
- What has been the impact of requiring DNA collection from arrestees on state crime laboratories and other involved agencies?
- What evidence is available to determine the effects of collecting DNA from arrestees on public safety or other criminal justice outcomes?
To answer these questions, researchers used a mixed methods data collection plan, including reviewing relevant statutes and case law, interviewing state and federal Combined DNA Index System (CODIS) laboratory staff and other forensic experts, and collecting descriptive data from state laboratories.
Detection and Identification of Hair Dyes by Surface-Enhanced Raman Spectroscopy (SERS), 2021-2026 (ICPSR 39772)
The primary goal of this project is to develop and deploy Surface-Enhanced Raman Spectroscopy (SERS) as a confirmatory, rapid, and minimally destructive tool for the forensic analysis of colorants directly on human hair at a crime scene.
There are several datasets associated with this study that are available for download.
Development of Microscopical Methods for the Systematic Analysis of Chemically Reacted, Improvised Low Explosives and Related Residues, Chicago, Illinois, 2020-2023 (ICPSR 39116)
This 2020 study was funded by the National Institute of Justice to advance knowledge about the microscopical methods used to examine materials commonly found in commercial and improvised low explosives. To achieve this, researchers developed reference documentation and an "Atlas of Unburned, Partially Burned, and Fully Burned Low Explosive and Related Materials" for the characterization, comparison, and identification of such materials. This data collection includes 57 files with images and descriptive captions documenting methods of microscopical analysis for a variety of chemically reacted, improvised low explosives and related residues. Details on the optical and physical properties, information regarding chemical solubility, recrystallization, microcrystal and microchemical spot tests, melting points, potential decomposition products, references, and photomicrographs of these materials are included as a PDF table. Additional information on this research can be found on the McCrone Research Institute website.
Efficiency in Processing Sexual Assault Kits in Crime Laboratories and Law Enforcement Agencies, United States, 2013-2014 (ICPSR 36747)
These data are part of NACJD's Fast Track Release and are distributed as they were received from the data depositor. The files have been zipped by NACJD for release, but not checked or processed except for the removal of direct identifiers. Users should refer to the accompanying readme file for a brief description of the files available with this collection and consult the investigator(s) if further information is needed.
This study presents a research-informed approach to identify the most efficient practices for addressing un-submitted sexual assault kits (SAKs) that accrue in U.S. law enforcement agencies (LEAs) as well as untested SAKs pending analysis in crime laboratories. The study examined intra- and interagency dynamics associated with SAK processing efficiency in a linked sample of crime laboratories. SAK outputs and inputs were assessed for laboratories that conduct biological forensic analysis and LEAs that submit SAK evidence to these laboratories. Production functions were estimated to examine effects of labor and capital inputs, in addition to policies, management systems, and cross-agency coordination on efficiency. Six jurisdictions were recruited for site visits, and qualitative methods were used to understand how LEAs, laboratories, and prosecutors implement practices that affect efficiency.
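As an illustration of what estimating a production function can involve, the following minimal sketch fits a log-linear Cobb-Douglas form, Q = A * L^alpha * K^beta, by ordinary least squares. The Cobb-Douglas form, the function names, and the numbers are all assumptions made for illustration; the study description does not state which functional form or software was used, and the data below are synthetic, not drawn from the study files.

```python
import math

def solve_linear_system(a, b):
    """Solve a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [rhs] for row, rhs in zip(a, b)]  # augmented matrix
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    x = [0.0] * n
    for i in reversed(range(n)):
        s = sum(m[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (m[i][n] - s) / m[i][i]
    return x

def fit_cobb_douglas(output, labor, capital):
    """OLS fit of log(Q) = log(A) + alpha*log(L) + beta*log(K)."""
    rows = [[1.0, math.log(l), math.log(k)] for l, k in zip(labor, capital)]
    y = [math.log(q) for q in output]
    # Normal equations: (X^T X) coef = X^T y
    xtx = [[sum(r[i] * r[j] for r in rows) for j in range(3)] for i in range(3)]
    xty = [sum(r[i] * yi for r, yi in zip(rows, y)) for i in range(3)]
    log_a, alpha, beta = solve_linear_system(xtx, xty)
    return math.exp(log_a), alpha, beta

# Hypothetical, noise-free inputs generated from Q = 2 * L^0.6 * K^0.3
# (e.g. L = analyst staffing, K = equipment; both are made-up values).
labor = [2, 4, 6, 8, 10, 12]
capital = [1, 3, 2, 5, 4, 7]
output = [2 * l**0.6 * k**0.3 for l, k in zip(labor, capital)]

scale, alpha, beta = fit_cobb_douglas(output, labor, capital)
print(f"A={scale:.3f}, labor elasticity={alpha:.3f}, capital elasticity={beta:.3f}")
```

In such a specification, the fitted exponents are read as output elasticities of labor and capital; policy or coordination indicators could be added as further regressors in the same way.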
This study contains 7 data files including:
- Crime Lab_Raw.dta (n=147; variables=242)
- Crosswalk File.dta (n=2337; variables=2)
- lab_analysis_sample_2017-04-06.dta (n=132; variables=92)
- LEA Communication LCAs.dta (n=321; variables=15)
- merged_analysis_file_JH2017-04-30.dta (n=273; variables=117)
- policy Class probabilities_LABS.dta (n=139; variables=19)
- SAK LAB COMMUNICATION LCA.dta (n=134; variables=15)
Estimating the Prevalence of Wrongful Convictions, Virginia, 1973-1987 (ICPSR 36836)
Evaluating the Impact of a Specialized Domestic Violence Police Unit in Charlotte, North Carolina, 2003-2005 (ICPSR 20461)
Evaluation of Digital Evidence Processing Efficiencies in Publicly Funded Crime Laboratories, United States, 2020-2023 (ICPSR 38998)
This study investigated the use and value of digital evidence (DE) to create an evidence base for more efficient and effective DE management and processing. Researchers aimed to assess existing laboratory protocols and better understand the scientific rationale underpinning existing laboratory processes with the ultimate goal of assisting law enforcement agencies and crime laboratories in eliminating backlogs, optimizing available resources, and decreasing justice delays. The data deposited with ICPSR include the Crime Laboratories Survey Data, which describe the processes used by crime laboratories to manage and analyze DE, and the Law Enforcement Agencies Survey Data, which includes information on methods and processes for DE seizure, management, and analysis, and data about coordination between law enforcement agencies and associated crime laboratories.
Evaluation of the Bureau of Justice Assistance Sexual Assault Kit Initiative, United States, 2018 (ICPSR 37897)
Since 2015, the Bureau of Justice Assistance (BJA) has funded sites to engage in reforms intended to improve the national response to sexual assault cases. The goals of this initiative are to (1) create a coordinated community response that ensures just resolution to unsubmitted sexual assault kit (SAK) cases through a victim-centered approach and (2) build jurisdictions' capacity to prevent the development of conditions that lead to high numbers of unsubmitted sexual assault kits. Site efforts to address these issues include agencies such as law enforcement, prosecution, forensic laboratories, and victim advocacy service organizations. Westat was awarded a contract by the National Institute of Justice (NIJ) to assess components of BJA's Sexual Assault Kit Initiative (SAKI). The study includes (1) an evaluability assessment of 17 sites to determine their readiness for an evaluation, (2) a process evaluation and system reform assessment of the 17 sites, (3) a feasibility assessment of using case level data for an outcome evaluation, and analysis of a subset of unsubmitted SAK cases to identify how characteristics of incidents, offenders, and victims are associated with case processing decisions and outcomes, and (4) development of a long-term outcome evaluation plan.
Two sources of data are archived with NACJD: (1) coded qualitative data from primarily on-site interviews the Westat Team conducted in 2018 with stakeholders from 17 of the fiscal year (FY) 2015 SAKI grantees and 2 private lab facilities and (2) quantitative case-level data from the 2 FY 2015 SAKI grantees on SAKI cases associated with previously unsubmitted sexual assault kits that were determined to contain foreign DNA or biological evidence through laboratory testing. The interview data file contains coded data from 172 interviews the research team conducted with one or more agency representatives regarding their organization's goals, strategies, and activities for processing sexual assault kits, and associated lessons learned, challenges, and expected outcomes. The quantitative case-level data file includes case-level information on 576 sexual assault kits determined to have DNA and associated cases included in the 2 sites' SAKI inventories. The case-level data capture case- or offense-level information (e.g., date of offense, date offense reported to police, number of victims and suspects involved, investigation and prosecution activities), victim-level information (e.g., victim age, sex, race, participation in investigation), and suspect-level information (e.g., suspect's age, race, sex, criminal history).
Evaluation of the Phoenix, Arizona, Homicide Clearance Initiative, 2003-2005 (ICPSR 26081)
Eyewitness Identification: A Systematic Investigation of Lineup Composition and Fairness, United States, 2019-2022 (ICPSR 38761)
The major objective of this project was to investigate photo array composition in order to improve eyewitness identification procedures. Photo array composition involves the fillers, or known-innocent individuals that police add to a photo array so that the perpetrator/suspect (referred to as the "target" in experimental design) does not stand out. An unbiased (fair) photo array contains fillers that match the description of the suspect provided by one or more eyewitnesses. In contrast, in a biased photo array, the suspect stands out from the fillers. Another popular procedure used by police is the showup, when the suspect is presented without any fillers. This project involved all three of these procedures.
This collection contains raw and aggregated data from 12 sets of experiments that investigated different aspects of eyewitness identification, including fair vs. biased lineups, lineup size, distinctive facial features, target-filler similarity, impact of sleep on eyewitness accuracy, memory strength, number of suspects presented, impact of courtroom instructions and expert expertise, and speed and confidence of eyewitness identification. Each experiment set followed a similar general design, with variations based on the purpose and hypotheses of the specific study. United States-based adult participants recruited via SurveyMonkey were asked to complete an online experiment in which they would be presented with a crime vignette and a suspect facial image (created from a faces database), given a distractor task, and then asked to select the suspect from a lineup and rate the confidence level of their decision.
The data were provided to ICPSR in Excel workbook format (41 data files, 3 codebooks) and are available for download as a zipped package. ICPSR has not modified the files from the format in which they were supplied. Data files are organized into subfolders that are named with a short content descriptor and citation of the relevant publication. Unless noted, data files contain a "codes" sheet that explains the variables and experimental condition groups. Articles and theses/dissertations that used each dataset are available under Data-related Publications. Please refer to the ICPSR README for more information.
Forensic Evidence and Criminal Justice Outcomes in Sexual Assault Cases in Massachusetts, 2008-2012 (ICPSR 35205)
These data are part of NACJD's Fast Track Release and are distributed as they were received from the data depositor. The files have been zipped by NACJD for release, but not checked or processed except for the removal of direct identifiers. Users should refer to the accompanying readme file for a brief description of the files available with this collection and consult the investigator(s) if further information is needed.
This project had three goals. The first was to provide a more detailed description of injury evidence and biological evidence in sexual assault cases, including their timing relative to arrests. The second was to examine the relationship of forensic evidence to arrests. The third was to examine injury evidence and biological evidence in certain types of cases in which it may have had greater impact. To achieve these goals, the researchers created analysis data files that merged data from the Massachusetts Provider Sexual Crime Report, forensic evidence data from the two crime laboratories serving the state, and data on arrests and criminal charges from 140 different police agencies.
Forensic Evidence and the Police, 1976-1980 (ICPSR 8186)
Forensic Evidence in Homicide Investigations, Cleveland, Ohio, 2008-2011 (ICPSR 36202)
These data are part of NACJD's Fast Track Release and are distributed as they were received from the data depositor. The files have been zipped by NACJD for release, but not checked or processed except for the removal of direct identifiers. Users should refer to the accompanying readme file for a brief description of the files available with this collection and consult the investigator(s) if further information is needed.
The objective of this study was to determine how homicide investigators use evidence during the course of their investigations. Data on 294 homicide cases (315 victims) that occurred in Cleveland between 2008 and 2011 were collected from investigative reports, forensic analysis reports, prosecutors, and homicide investigators, provided by the Cleveland, Ohio, Police Department, Cuyahoga County Medical Examiner's Office, and Cuyahoga County Clerk of Courts.
The study collection includes 1 Stata data file (NIJ_Cleveland_Homicides.dta, n=294, 109 variables).
Impact of Forensic Evidence on Arrest and Prosecution (IFEAP) in Connecticut, United States, 2006-2009 (ICPSR 36695)
These data are part of NACJD's Fast Track Release and are distributed as they were received from the data depositor. The files have been zipped by NACJD for release, but not checked or processed except for the removal of direct identifiers. Users should refer to the accompanying readme file for a brief description of the files available with this collection and consult the investigator(s) if further information is needed.
This research was conducted in two phases. Phase one analyzed a random sample of approximately 2,000 case files from 2006 through 2009 that contain forensic analyses from the Connecticut State Forensic Science Laboratory, along with corresponding police and court case file data. As with Peterson, et al. (2010), this research had four objectives: 1) estimate the percentage of cases in which crime scene evidence is collected; 2) discover what kinds of forensic evidence are being collected; 3) track such evidence through the criminal justice system; and 4) identify which forms of forensic evidence are most efficacious given the crime investigated.
Phase two consisted of a survey administered to detectives within the State of Connecticut regarding their comparative assessments of the utility of forensic evidence. These surveys further advance our understanding of how the success of forensic evidence in achieving arrests and convictions matches with detective opinion.
Impact of Forensic Evidence on the Criminal Justice Process in Five Sites in the United States, 2003-2006 (ICPSR 29203)
Improving Evidence Collection Through Police-Prosecutor Coordination in Baltimore, 1984-1985 (ICPSR 9290)
Improving the Investigation of Homicide and the Apprehension Rate of Murderers in Washington State, 1981-1986 (ICPSR 6134)
Improving the Production and Use of Forensic Science, 5 U.S. counties, 2006-2009 (ICPSR 36727)
This study collection sought to thoroughly understand the creation, testing, and use of forensic science in five jurisdictions across the country. A random sample was selected of recent criminal cases in the following jurisdictions and tracked from investigation to adjudication to understand how forensic evidence functions:
- Sacramento County, CA: 990 cases
- Sedgwick County, KS: 936 cases
- Allegheny County, PA: 978 cases
- Bexar County (San Antonio), TX: 936 cases
- King County, WA: 892 cases
The Principal Investigator sought answers to the following seven primary research questions:
- How often is forensic evidence collected and analyzed and how is it used pre-arrest?
- What are the outcomes of forensic evidence testing?
- What is the effect of forensic evidence on arrest and charging?
- How does forensic evidence affect the plea-bargaining process?
- What effect does forensic evidence have on conviction and sentencing outcomes?
- Does the turnaround time for analysis of forensic evidence have any impact on case disposition?
- Does the institutional configuration of the crime laboratory have any effect on its productivity?
Data for the following types of forensic testing are included in this data collection: hair, fibers, glass, paint, gas chromatography / mass spectrometry (GC/MS), Fourier transform infrared spectroscopy (FTIR), scanning electron microscopy / energy dispersive x-ray spectroscopy (SEM/EDX), physical match, drug identification, toxicology, serology, combined DNA index system (CODIS), DNA short tandem repeat (Y-STR), blood pattern, test fire, and comparison scope.
Influence of Eyewitness Memory Factors on Plea Bargaining Decisions by Prosecution and Defense Attorneys in California, 2010-2011 (ICPSR 32181)
Injury Evidence, Forensic Evidence and the Prosecution of Sexual Assault, United States, 2005-2011 (ICPSR 36608)
These data are part of NACJD's Fast Track Release and are distributed as they were received from the data depositor. The files have been zipped by NACJD for release, but not checked or processed except for the removal of direct identifiers. Users should refer to the accompanying readme file for a brief description of the files available with this collection and consult the investigator(s) if further information is needed.
This project explored the use and impact of injury evidence and biological evidence through a study of the role of these forms of evidence in prosecuting sexual assault in an urban district attorney's office in a metropolitan area in the eastern United States. The research questions addressed in this summary overview were as follows:
- How frequent were different forms of injury evidence and biological evidence in the sample?
- Is the presence of injury evidence and biological evidence correlated with the presence of other forms of evidence?
- Which types of cases and case circumstances are more likely to yield injury evidence and biological evidence?
- Do the presence of injury evidence and biological evidence predict criminal justice outcomes, taking into account the effects of other predictors?
- In what ways do prosecutors use injury evidence and biological evidence and what is their appraisal of their impact on case outcomes?
The collection contains 1 SPSS data file, DataArchiveFile_InjuryEvidenceForensicEvidenceandthe ProsecutionofSexualAssault4-7-17.sav (n=257; 417 variables).
The qualitative data files were excluded from deposit with ICPSR and are not available as part of this data collection at this time.
Jurors' Judgments About Forensic Identification Evidence, Arizona, 2011-2014 (ICPSR 36169)
This data file describes three different experiments that were designed to examine how differences in the way forensic scientific evidence is communicated affect jurors.
In each experiment, participants consisted of jury-eligible community members in Maricopa County, Arizona. Groups of participants attended a research session in which they were shown a 35-40-minute videotape of one of two mock criminal trials (one, a rape case, centers around bitemark evidence, and the other, an attempted murder, centers around fingerprint evidence). Within each trial the content of a forensic scientist's testimony was manipulated. These manipulations involved: 1) whether the technique used by the forensic scientist was "high tech" or "low tech," 2) the amount of experience possessed by the forensic scientist, 3) whether the technique used by the forensic scientist had been scientifically validated, 4) whether the forensic scientist conceded that an error was possible, and 5) whether any exculpatory evidence was present at the crime scene.
Immediately following the trial, each individual participant completed a questionnaire in which they gave their individual impressions of the strength of the case. Following that, the group of participants would deliberate and attempt to reach a unanimous verdict. Finally, each individual participant completed an additional questionnaire that again measured perceptions of the case along with individual difference measures and demographics.
National Prosecutors Survey, 1990 (ICPSR 9579)
National Prosecutors Survey, 1992 (ICPSR 6273)
National Prosecutors Survey, 1994 (ICPSR 6785)
National Prosecutors Survey, 1996 (ICPSR 2433)
National Prosecutors Survey, 2005 (ICPSR 4600)
National Prosecutors Survey [Census], 2001 (ICPSR 3418)
National Prosecutors Survey [Census], 2007 (ICPSR 33202)
National Survey of Eyewitness Identification Procedure in Law Enforcement Agencies, 1994-2012 (ICPSR 34274)
Nature and Patterns of Homicide in Eight American Cities, 1978 (ICPSR 8936)
Pilot Study of State and Federal Digital Evidence Laboratories, [United States], 2014 (ICPSR 37055)
The Pilot Study of State and Federal Digital Evidence Laboratories data collection contains data collected in 2015 as part of the Census of Publicly Funded Forensic Crime Laboratories (CPFFCL). The CPFFCL examined the forensic services provided by publicly funded crime labs across the nation and the resources devoted to completing the work.
To capture more information about an emerging forensic science discipline known as digital evidence, the Bureau of Justice Statistics (BJS) expanded the scope of the 2014 CPFFCL from previous data collections to include a separate pilot study of state and federal agencies that solely analyzed digital evidence in support of criminal investigations and prosecutions. These agencies obtained digital and multimedia evidence in various formats, including audio, video, and graphical images from computers, cell phones, cameras, and other electronic devices. The traditional CPFFCL definition of a crime lab limited the information collected about digital evidence since some agencies only handle this type of evidence and employ forensic experts with training in computer science or information technology as opposed to natural sciences such as chemistry and biology.
The census collected detailed information on laboratory staff, budgets, workloads, and backlogs in requests for forensic services. The census also provides data on lab accreditations, proficiency tests, and other quality assurances.
Post-Conviction DNA Testing and Wrongful Conviction Data from Sexual Assault and Homicide Cases in Virginia, 1973-1988 (ICPSR 34306)
Prosecution and Defense Strategies in Domestic Violence Felonies in Iowa, 1989-1995 (ICPSR 2811)
The Role and Impact of Forensic Evidence on the Criminal Justice System, 2004-2008 [United States] (ICPSR 33462)
These data are part of NACJD's Fast Track Release and are distributed as they were received from the data depositor. The files have been zipped by NACJD for release, but not checked or processed except for the removal of direct identifiers. Users should refer to the accompanying readme file for a brief description of the files available with this collection and consult the investigator(s) if further information is needed.
This collection includes data gathered through three separate study designs. The first study called for tracking cases and forensic evidence through local criminal justice processes for five offenses: homicide, sexual assault, aggravated assault, robbery and burglary. Two sites, Denver, Colorado, and San Diego, California, participated in the study. Demographic data were collected on victims (Victim Data n = 7,583) and defendants (Defendant Data n = 2,318). Data on forensic evidence collected at crime scenes included DNA material (DNA Evidence Data n = 1,894), firearms evidence (Ballistics Evidence Data n = 488), latent prints (Latent Print Evidence Data n = 766), trace evidence (Other Impressions Evidence Data n = 49), and drug evidence (Drug Evidence Data n = 43). Comparisons were then made between open and closed cases from the participating sites. Two smaller studies were conducted as part of this grant. The second study was an analysis of an experiment in the Miami-Dade, Florida, Police Department (Miami-Dade County Data n = 1,421) to determine whether clearance rates for no-suspect property crimes could be improved through faster processing of DNA evidence. The third study was a survey of 75 police departments across the nation (Crime Labs Survey Data) to obtain information on the organizational placement, staffing and responsibilities of crime lab units.
Search and Seizure Data, 1963 (ICPSR 7539)
Sexual Assault Kit Backlog Study, Los Angeles, California, 1982-2010 (ICPSR 33841)
Testing the Efficacy of the SANE-SART Programs in Kansas, Massachusetts, and New Jersey, 1997-2001 (ICPSR 20341)
Towards Commercialization: Preliminary Developmental Validation of a High Resolution Melt Curve Mixture Prediction Assay and SVM Tool, Virginia, 2020-2022 (ICPSR 39133)
In the current study, roughly 170 single source samples and 32 two-person mixture samples were tested using both the integrated Quantiplex®-high resolution melt (HRM) assay and Quantifiler™ Trio-HRM assay, then the entire HRM datasets were exported for prediction modeling using both linear discriminant analysis (LDA) and support vector machine (SVM) algorithms in RStudio software. For proof-of-concept, only 8 different genotypes, including a genotype of "mixture", were represented (for each locus) in testing. A portion of the samples tested were used to "train" the software and the remaining sample data were used as unknown (or "validation") samples for prediction. When samples were tested in the Quantiplex®-HRM assay, an overall accuracy of 87.88 percent was exhibited, correctly classifying 87.5 percent of single source samples as such and 90 percent of mixture samples. Similarly, when samples were tested in the Quantifiler™ Trio-HRM assay, an overall accuracy of 79.2 percent was exhibited, with 89.2 percent of single source samples and 43.8 percent of mixtures classified accurately. Additionally, quantification values obtained from the integrated assays, as well as quality metrics such as the slope, R², and y-intercept, were not significantly different from those obtained in the standard assays.
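The gap between overall and per-class accuracy reported above (for example, a much lower accuracy for mixtures than for single source samples) arises because overall accuracy is weighted by how many samples fall in each class. The sketch below shows the calculation on a made-up validation set; the function name, labels, and counts are hypothetical and are not the study's data.

```python
from collections import Counter

def accuracy_report(true_labels, predicted_labels):
    """Return overall accuracy and a dict of per-class accuracy."""
    totals = Counter(true_labels)
    correct = Counter(
        t for t, p in zip(true_labels, predicted_labels) if t == p
    )
    overall = sum(correct.values()) / len(true_labels)
    per_class = {label: correct[label] / n for label, n in totals.items()}
    return overall, per_class

# Hypothetical validation set: 8 single source samples and 2 mixtures.
true_labels = ["single"] * 8 + ["mixture"] * 2
# Hypothetical predictions: one single source sample and one mixture are wrong.
predicted = ["single"] * 7 + ["mixture"] + ["mixture", "single"]

overall, per_class = accuracy_report(true_labels, predicted)
print(f"overall={overall:.3f}")
for label, acc in per_class.items():
    print(f"{label}: {acc:.3f}")
```

Because single source samples dominate this toy set, the overall figure stays close to the single source accuracy even though only half of the mixtures are classified correctly, which is why per-class figures are worth reporting alongside the overall one.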
Verification and Evaluation of a miRNA Panel for Body Fluid Identification Using DNA Extracts, United States, 2019-2021 (ICPSR 38391)
Although human identification through DNA analysis has reached a level of maturity in the forensic science field with regard to the sophistication of the techniques and confidence in the results, the equally important question of body fluid identification has lagged behind, and could still be considered to be in a rudimentary state. Current crime scene and in-laboratory methods utilize detection methods that exploit the properties of each biological fluid (e.g., phenolphthalin or TMB testing for blood, amylase detection for saliva, and urease tests for urine), but validated confirmatory techniques are largely limited to microscopic methods (i.e., identification of spermatozoa) or immunological methods, as seen in the widely used immunochromatographic commercial tests for blood, semen, and other biological fluids.
Thus, while there is widespread confidence in the DNA profile generated, there is often significantly less assurance in the identity of the body fluid that the DNA profile was developed from. It is common during trials for attorneys to categorically accept the STR analysis, but probe the forensic scientist on the source of the DNA that generated the profile. Because of this dichotomy, significant efforts have been made over the past fifteen years in order to develop forensic serological techniques of a more discriminatory nature.
Of late, there has been some work in the forensic science field with regard to exploring microRNAs (miRNAs) for a molecular-based, forensic body fluid identification method. MiRNAs are small structures that are 19-23 nucleotides long and regulate cellular processes through interactions with mRNA by regulating gene expression through translational suppression or cleavage of a targeted mRNA. MiRNAs are highly conserved among organisms, indicating their importance in regulating biological processes. As such, some miRNAs can be consistently expressed in all human tissues, and others can be tissue-specific. Because of the potential for tissue specificity, their small size, and consequent inherent stability, miRNAs have been the subject of recent research interest as a potential forensic body fluid identification technique. They are found in extracellular fluids, and thus the application of unique miRNAs for forensically relevant body fluids is a distinct possibility.