autodb3.0.0 package

Automatic Database Normalisation for Data Frames

attrs_order

Relational data attribute order

attrs

Relational data attributes

autodb-package

Database-style normalisation for data.frames

autodb

Create a normalised database from a data frame

autoref

Add foreign key references to a normalised database

create

Create instance of a schema

database_schema

Database schemas

database

Databases

decompose

Decompose a data frame based on given normalised dependencies

dependant

Dependants

detset

Determinant sets

df_duplicated

Determine Duplicate Elements

df_equiv

Test data frames for equivalence under row reordering

df_rbind

Combine R Objects by Rows or Columns

discover

Dependency discovery with DFD

functional_dependency

Functional dependency vectors

gv.data.frame

Generate Graphviz input text to plot a data frame

gv.database_schema

Generate Graphviz input text to plot database schemas

gv.database

Generate Graphviz input text to plot databases

gv

Generate Graphviz input text to plot objects

gv.relation_schema

Generate Graphviz input text to plot relation schemas

gv.relation

Generate Graphviz input text to plot relations

insert

Insert data

keys

Relational data keys

merge_empty_keys

Merge relation schemas with empty keys

merge_schemas

Merge relation schemas in given pairs

normalise

Create normalised database schemas from functional dependencies

records

Relational data records

reduce.database_schema

Remove database schema relations not linked to the given relations

reduce.database

Remove database relations not linked to the main relations

reduce

Remove relations not linked to the main relations

references

Schema references

rejoin

Join a database into a data frame

relation_schema

Relation schema vectors

relation

Relation vectors

rename_attrs

Rename relational data attributes

subrelations

Database subrelations

subschemas

Schema subschemas

synthesise

Synthesise relation schemas from functional dependencies

Automatic normalisation of a data frame to third normal form, with the intention of easing the process of data cleaning. (Usage to design your actual database for you is not advised.) Originally inspired by the 'AutoNormalize' library for 'Python' by 'Alteryx' (<https://github.com/alteryx/autonormalize>), with various changes and improvements. Automatic discovery of functional or approximate dependencies, normalisation based on those, and plotting of the resulting "database" via 'Graphviz', with options to exclude some attributes at discovery time, or remove discovered dependencies at normalisation time.

  • Maintainer: Mark Webster
  • License: BSD_3_clause + file LICENSE
  • Last published: 2025-06-25