dataset_1: a data object (a data frame or a data.table)
dataset_2: another data object (a data frame or a data.table)
dataset_list: a list of data objects (data frames or data.tables)
Returns
A data.table showing the differences between the data sets.
Examples
# catch differences in class attributes of the data sets
compare_datasets(dataset_1 = data.frame(a = 1:2, b = 3:4),
                 dataset_2 = data.table::data.table(a = 1:2, b = 3:4))

# catch differences in number of columns
compare_datasets(dataset_1 = data.frame(a = 1:2, b = 3:4, c = 5:6),
                 dataset_2 = data.frame(a = 1:2, b = 3:4))

# catch differences in number of rows
compare_datasets(dataset_1 = data.frame(a = 1:2, b = 3:4),
                 dataset_2 = data.frame(a = 1:10, b = 11:20))

# catch differences in column names
compare_datasets(dataset_1 = data.frame(A = 1:2, B = 3:4),
                 dataset_2 = data.frame(a = 1:2, b = 3:4))

# catch differences in values within corresponding columns
compare_datasets(dataset_1 = data.frame(a = 1:2, b = c(3, 400)),
                 dataset_2 = data.frame(a = 1:2, b = 3:4))
compare_datasets(dataset_1 = data.frame(a = 1:2, b = 3:4, c = 5:6),
                 dataset_2 = data.frame(a = 1:2, b = c(3, 4), c = c(5, 6)))

# check if data sets in a list are identical
compare_datasets(dataset_list = list(dt1 = data.frame(a = 1:2, b = 3:4, c = 5:6),
                                     dt2 = data.frame(a = 1:2, b = 3:4),
                                     dt3 = data.frame(a = 1:2, b = 3:4, c = 5:6)))