Link Infectious Disease Cases to Vaccination and Hospitalization Records
starling: Link Infectious Disease Cases to Vaccination and Hospitaliza...
Molt: De-identify a Dataset with Hash-based Relinking
Link case, hospitalization, or vaccination datasets
Identify Chronic Conditions Using ICD-10-AM U-Codes
Prettification of infectious disease datasets
Calculate Age if Missing
Clean datasets and establish a common variable naming convention
Create Comprehensive Age Categories
Find Column by Pattern Matching
Homing: Relink De-identified Data Using Lookup Table
Examine and summarize variables in a dataset
Facilitates probabilistic record linkage between infectious disease surveillance datasets (notifiable disease registers, outbreak line-lists), vaccination registries, and hospitalization records using methods based on Fellegi and Sunter (1969) <doi:10.1080/01621459.1969.10501049> and Sayers et al. (2016) <doi:10.1093/ije/dyv322>. The package provides core functions for data preparation, linkage, and analysis: clean_the_nest() standardizes variable names and formats across heterogeneous datasets; murmuration() performs machine learning-based record linkage using blocking variables and similarity metrics; molting() de-identifies datasets for secure sharing; homing() re-identifies previously de-identified datasets; plumage() identifies and categorizes comorbidities; and preening() creates analysis-ready variables including age categories and temporal groupings. Designed for epidemiological research linking acute and post-acute disease outcomes to vaccination status and healthcare utilization. Supports multiple linkage scenarios including case-to-vaccination, case-to-hospitalization, and event-based vaccination status determination (e.g., outbreak attendees, flight passengers, exposure site visitors).
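As a rough illustration of the Fellegi and Sunter idea referenced above, the following sketch scores candidate record pairs with summed log2 likelihood ratios after blocking on one variable. It is written in Python for brevity and is not the package's API: the function names (match_weight, link), the field names, the m- and u-probabilities, and the threshold are all hypothetical values chosen for the example, and exact string equality stands in for the similarity metrics a real linkage would use.

```python
import math
from collections import defaultdict

# Hypothetical per-field probabilities (illustrative values, not estimated from data):
#   m = P(field agrees | the two records refer to the same person)
#   u = P(field agrees | the two records refer to different people)
FIELD_PARAMS = {
    "surname": (0.95, 0.01),
    "given_name": (0.90, 0.02),
    "date_of_birth": (0.98, 0.005),
}


def match_weight(record_a, record_b, params=FIELD_PARAMS):
    """Return the summed log2 likelihood ratio over the compared fields."""
    total = 0.0
    for field, (m, u) in params.items():
        if record_a.get(field) == record_b.get(field):
            total += math.log2(m / u)              # agreement weight (positive)
        else:
            total += math.log2((1 - m) / (1 - u))  # disagreement weight (negative)
    return total


def link(cases, vaccinations, block_key="postcode", threshold=5.0):
    """Compare only pairs that share the blocking value; keep pairs at or above the threshold."""
    blocks = defaultdict(list)
    for vax in vaccinations:
        blocks[vax[block_key]].append(vax)

    links = []
    for case in cases:
        for vax in blocks.get(case[block_key], []):
            weight = match_weight(case, vax)
            if weight >= threshold:
                links.append((case["id"], vax["id"], weight))
    return links


# Made-up example records.
cases = [
    {"id": "c1", "surname": "nguyen", "given_name": "anh",
     "date_of_birth": "1990-04-02", "postcode": "3000"},
]
vaccinations = [
    # Same person with a transposed given name: surname and date of birth still agree.
    {"id": "v1", "surname": "nguyen", "given_name": "ahn",
     "date_of_birth": "1990-04-02", "postcode": "3000"},
    # Different person in the same block: only the given name agrees.
    {"id": "v2", "surname": "smith", "given_name": "anh",
     "date_of_birth": "1985-01-01", "postcode": "3000"},
    # Identical identifiers but a different postcode: never compared because of blocking.
    {"id": "v3", "surname": "nguyen", "given_name": "anh",
     "date_of_birth": "1990-04-02", "postcode": "4000"},
]

links = link(cases, vaccinations)
```

The third vaccination record shows the trade-off that blocking introduces: it reduces the number of comparisons, but a true match whose blocking value differs (for example, after a change of address) is never scored. Choosing blocking variables that are stable and well recorded is therefore part of the linkage design.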