Construct Reproducible Analytic Data Sets as R Packages
Assert that a data version in a data package matches an expectation.
Construct a datapackager.yml configuration
Get the DataVersion for a package
Create a Data Package skeleton for use with DataPackageR.
Read an object created in a previously run processing script.
Options consulted by DataPackageR
Defunct functions in package DataPackageR.
DataPackageR
Build documentation for a data package using DataPackageR.
Pre-process, document and build a data package
Get DataPackageR data path
Get DataPackageR extdata path
Get DataPackageR Project Root Path
Add a data object to a data package.
Ignore specific files by git and R build.
Add a processing script to a data package.
Add a raw data set to inst/extdata
Edit DataPackageR yaml configuration
A framework to help construct R data packages in a reproducible manner. Potentially time consuming processing of raw data sets into analysis ready data sets is done in a reproducible manner and decoupled from the usual 'R CMD build' process so that data sets can be processed into R objects in the data package and the data package can then be shared, built, and installed by others without the need to repeat computationally costly data processing. The package maintains data provenance by turning the data processing scripts into package vignettes, as well as enforcing documentation and version checking of included data objects. Data packages can be version controlled on 'GitHub', and used to share data for manuscripts, collaboration and reproducible research.
Useful links