compare_datasets function

Compare data sets

Checks whether data sets are identical.

compare_datasets(dataset_1 = NULL, dataset_2 = NULL, dataset_list = NULL)

Arguments

  • dataset_1: a data object (a data frame or a data.table)
  • dataset_2: another data object (a data frame or a data.table)
  • dataset_list: a list of data objects (data frames or data.tables)

Returns

A data.table showing the differences between the data sets.

Examples

# catch differences in class attributes of the data sets
compare_datasets(
  dataset_1 = data.frame(a = 1:2, b = 3:4),
  dataset_2 = data.table::data.table(a = 1:2, b = 3:4))

# catch differences in number of columns
compare_datasets(
  dataset_1 = data.frame(a = 1:2, b = 3:4, c = 5:6),
  dataset_2 = data.frame(a = 1:2, b = 3:4))

# catch differences in number of rows
compare_datasets(
  dataset_1 = data.frame(a = 1:2, b = 3:4),
  dataset_2 = data.frame(a = 1:10, b = 11:20))

# catch differences in column names
compare_datasets(
  dataset_1 = data.frame(A = 1:2, B = 3:4),
  dataset_2 = data.frame(a = 1:2, b = 3:4))

# catch differences in values within corresponding columns
compare_datasets(
  dataset_1 = data.frame(a = 1:2, b = c(3, 400)),
  dataset_2 = data.frame(a = 1:2, b = 3:4))
compare_datasets(
  dataset_1 = data.frame(a = 1:2, b = 3:4, c = 5:6),
  dataset_2 = data.frame(a = 1:2, b = c(3, 4), c = c(5, 6)))

# check if data sets in a list are identical
compare_datasets(
  dataset_list = list(
    dt1 = data.frame(a = 1:2, b = 3:4, c = 5:6),
    dt2 = data.frame(a = 1:2, b = 3:4),
    dt3 = data.frame(a = 1:2, b = 3:4, c = 5:6)))
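
For readers who want to see what these kinds of checks look like in base R, the following is a minimal sketch. It is not the implementation of compare_datasets, and it does not reproduce its data.table output. The helper name describe_differences and the wording of its messages are hypothetical, chosen only for illustration. It returns a character vector naming each kind of difference found, or an empty vector when the two objects are identical.

```r
# Hypothetical helper, not part of the documented package: a base R
# approximation of the kinds of checks the examples exercise.
describe_differences <- function(x, y) {
  notes <- character(0)
  # Class attributes (e.g. data.frame vs. data.table)
  if (!identical(class(x), class(y))) {
    notes <- c(notes, "class attributes differ")
  }
  # Dimensions
  if (ncol(x) != ncol(y)) {
    notes <- c(notes, "number of columns differs")
  }
  if (nrow(x) != nrow(y)) {
    notes <- c(notes, "number of rows differs")
  }
  # Column names (case-sensitive, order-sensitive)
  if (!identical(names(x), names(y))) {
    notes <- c(notes, "column names differ")
  }
  # Compare contents only when everything above matched.
  # identical() is strict: integer 3:4 and double c(3, 4) do not match.
  if (length(notes) == 0 && !identical(x, y)) {
    notes <- c(notes, "values or column types differ")
  }
  notes
}

# Same values, but column b is integer in one and double in the other
describe_differences(
  data.frame(a = 1:2, b = 3:4),
  data.frame(a = 1:2, b = c(3, 4)))
```

The strict identical() comparison is a deliberate choice in this sketch: it flags type differences that a tolerance-based comparison such as all.equal() might treat as equal.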