Store and Retrieve Data.frames in a Git Repository
Re-exported Function From git2r
Create a Data Package for a directory of CSV files
Display metadata for a git2rdata object
git2rdata: Store and Retrieve Data.frames in a Git Repository
Check Whether a Git2rdata Object is Valid.
Check Whether a Git2rdata Object Has Valid Metadata.
List Available Git2rdata Files Containing Data
Optimize an Object for Storage as Plain Text and Add Metadata
Print method for git2rdata objects.
Prune Metadata Files
Re-exported Function From git2r
Re-exported Function From git2r
Read a Git2rdata Object from Disk
Retrieve the Most Recent File Change
Relabel Factor Levels by Updating the Metadata
Rename a Variable
Re-exported Function From git2r
Remove Data Files From Git2rdata Objects
Re-exported Function From git2r
Summary method for git2rdata objects.
Update the description of a git2rdata object
Upgrade Files to the New Version
Read a file an verify the presence of variables
Store a Data.Frame as a Git2rdata Object on Disk
The git2rdata package is an R package for writing and reading dataframes as plain text files. A metadata file stores important information. 1) Storing metadata allows to maintain the classes of variables. By default, git2rdata optimizes the data for file storage. The optimization is most effective on data containing factors. The optimization makes the data less human readable. The user can turn this off when they prefer a human readable format over smaller files. Details on the implementation are available in vignette("plain_text", package = "git2rdata"). 2) Storing metadata also allows smaller row based diffs between two consecutive commits. This is a useful feature when storing data as plain text files under version control. Details on this part of the implementation are available in vignette("version_control", package = "git2rdata"). Although we envisioned git2rdata with a git workflow in mind, you can use it in combination with other version control systems like subversion or mercurial. 3) git2rdata is a useful tool in a reproducible and traceable workflow. vignette("workflow", package = "git2rdata") gives a toy example. 4) vignette("efficiency", package = "git2rdata") provides some insight into the efficiency of file storage, git repository size and speed for writing and reading.
Useful links