Automatic Database Normalisation for Data Frames
Relational data attribute order
Relational data attributes
Database-style normalisation for data.frames
Create a normalised database from a data frame
Add foreign key references to a normalised database
Create instance of a schema
Database schemas
Databases
Decompose a data frame based on given normalised dependencies
Dependants
Determinant sets
Determine Duplicate Elements
Test data frames for equivalence under row reordering
Combine R Objects by Rows or Columns
Dependency discovery with DFD
Functional dependency vectors
Generate Graphviz input text to plot a data frame
Generate Graphviz input text to plot database schemas
Generate Graphviz input text to plot databases
Generate Graphviz input text to plot objects
Generate Graphviz input text to plot relation schemas
Generate Graphviz input text to plot relations
Insert data
Relational data keys
Merge relation schemas with empty keys
Merge relation schemas in given pairs
Create normalised database schemas from functional dependencies
Relational data records
Remove database schema relations not linked to the given relations
Remove database relations not linked to the main relations
Remove relations not linked to the main relations
Schema references
Join a database into a data frame
Relation schema vectors
Relation vectors
Rename relational data attributes
Database subrelations
Schema subschemas
Synthesise relation schemas from functional dependencies
Automatic normalisation of a data frame to third normal form, with the intention of easing the process of data cleaning. (Usage to design your actual database for you is not advised.) Originally inspired by the 'AutoNormalize' library for 'Python' by 'Alteryx' (<https://github.com/alteryx/autonormalize>), with various changes and improvements. Automatic discovery of functional or approximate dependencies, normalisation based on those, and plotting of the resulting "database" via 'Graphviz', with options to exclude some attributes at discovery time, or remove discovered dependencies at normalisation time.
Useful links