Explicit Key Assumptions for Flat-File Data
Add identity column
Bind data frames with ID handling
Bind rows of keyed data frames
Check for drift from committed snapshot
Check IDs are disjoint across datasets
Check ID integrity
Clear all snapshots from cache
Clear snapshot for a data frame
Commit a keyed data frame as reference
Compare IDs between data frames
Compare key values between two data frames
Compare structure of two data frames
Diagnose a join before executing
Extend IDs to new rows
Find duplicate keys
Get ID column
Get key column names
Check if data frame has IDs
Check if data frame has a key
Check if the key is still valid
Get key status summary
Define a key for a data frame
keyed: Explicit Key Assumptions for Flat-File Data
List all snapshots in cache
Assert that data is complete (no missing values anywhere)
Assert minimum coverage of values
Assert that columns have no missing values
Assert row count within expected range
Assert that columns are unique
Create ID from columns
Remove ID column
Summary method for keyed data frames
Remove key from a data frame
Makes implicit data assumptions explicit by attaching keys to flat-file data; a keyed data frame raises an error when those assumptions are violated. Designed for CSV-first workflows that lack database infrastructure or version control. Provides key definition, assumption checks, join diagnostics, and optional drift detection against reference snapshots.
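To make the idea of an explicit key assumption concrete, here is a minimal base-R sketch of two of the checks described above: asserting that a set of columns uniquely identifies rows, and summarising key overlap before a join. It does not use the package. The helper names `assert_unique_key()` and `summarise_join()` and the example data are hypothetical, invented for illustration, and the package's own functions may behave differently.

```r
# Illustrative base-R sketch. These helpers are hypothetical and are NOT
# functions exported by the keyed package.

# Stop with an error unless the columns in `cols` uniquely identify rows.
assert_unique_key <- function(df, cols) {
  missing_cols <- setdiff(cols, names(df))
  if (length(missing_cols) > 0) {
    stop("Key column(s) not found: ", paste(missing_cols, collapse = ", "))
  }
  key_values <- df[, cols, drop = FALSE]
  if (any(is.na(key_values))) {
    stop("Key column(s) contain missing values")
  }
  dup <- duplicated(key_values)
  if (any(dup)) {
    stop(sum(dup), " row(s) repeat an earlier key value")
  }
  invisible(df)
}

# Summarise how the key values of two data frames overlap before joining.
summarise_join <- function(x, y, by) {
  # Collapse each row's key columns into one string so rows can be compared.
  kx <- do.call(paste, c(x[, by, drop = FALSE], sep = "\r"))
  ky <- do.call(paste, c(y[, by, drop = FALSE], sep = "\r"))
  list(
    x_key_unique     = anyDuplicated(kx) == 0L,
    y_key_unique     = anyDuplicated(ky) == 0L,
    x_rows_unmatched = sum(!(kx %in% ky)),
    y_rows_unmatched = sum(!(ky %in% kx))
  )
}

# Made-up example data.
customers <- data.frame(
  customer_id = c(1, 2, 3),
  name = c("Ana", "Ben", "Cy"),
  stringsAsFactors = FALSE
)
orders <- data.frame(
  order_id = c(10, 11, 12, 13),
  customer_id = c(1, 1, 2, 4)
)

# customer_id is a valid key for `customers`, so this passes silently.
assert_unique_key(customers, "customer_id")

# customer_id repeats in `orders`, so the same assertion raises an error;
# tryCatch() captures the message instead of halting the script.
result <- tryCatch(
  assert_unique_key(orders, "customer_id"),
  error = function(e) conditionMessage(e)
)
print(result)

# Inspect key overlap before merging the two tables.
join_summary <- summarise_join(customers, orders, by = "customer_id")
str(join_summary)
```

Running the uniqueness check before the join means that a one-to-many relationship, or rows that will not match, is noticed before the merged row count changes unexpectedly.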