Search results

Showing 1 – 2 of 2 results.
Curated

Incremental Privacy-Preserving Record Linkage (iPPRL) to Reduce Barriers to Data Sharing and Improve Data Quality [Methods Study], Colorado, 2011-2022 (ICPSR 39738)

Released/updated on: 2026-03-23
Geographic coverage: United States, Colorado
Time period: 2011-01-01--2022-01-01

Researchers often have trouble collecting complete information on patient health, as patients may receive care at different places. Linking patient records from different places may help researchers get a more complete picture.

One way to link records is through personal information, such as names and birth dates. But this method increases risks to patient privacy. Another way, known as privacy-preserving record linkage, or PPRL, masks personal information. But current PPRL methods only work when linking entire sets of patient data, including data that have already been shared and linked. Linking entire data sets takes a long time. Also, sharing the same records multiple times increases data privacy risks.

In this study, the research team developed and tested a new PPRL method called incremental PPRL. This method links only new or updated data rather than re-linking entire data sets.

Curated

Statistical Methods and Designs for Addressing Correlated Errors in Outcomes and Covariates in Studies Using Electronic Health Records Data [Methods Study], Tennessee, 2016-2021 (ICPSR 39726)

Released/updated on: 2026-03-12
Geographic coverage: United States, Tennessee
Time period: 2016-01-01--2021-01-01

Electronic health records, or EHRs, have data on patient traits, health problems, and treatments. Researchers can use EHR data to study how treatments work or which patient traits affect health outcomes. But EHR data can have errors.

The best way to get accurate EHR data is to closely review patients' original records. But reviewing all patient records isn't possible when many patients are in a study. In such cases, researchers can review and correct records for a few patients and use the revised records to adjust data for all patients. But existing methods for using revised records don't address some kinds of errors, such as errors that are related. For example, errors in a treatment starting date can lead to mistakes in the data on length of treatment.

In this project, the research team created and tested new methods to improve the accuracy of EHR data. The new methods corrected records from some patients. Then the team used the corrections to address related errors for all patients.

To access the methods and software, please visit the MeasurementErrorMethods GitHub repository.