Data Fusion using Optimal Transportation Theory
avg_dist_closest()
verif_OT()
ham()
imput_cov()
indiv_grp_closest()
indiv_grp_optimal()
merge_dbs()
OT_joint()
OT_outcome()
power_set()
proxim_dist()
select_pred()
transfo_dist()
transfo_quali()
transfo_target()
compare_lists()
error_group()
In the context of data fusion, the package provides a set of functions dedicated to the solving of 'recoding problems' using optimal transportation theory (Gares, Guernec, Savy (2019) <doi:10.1515/ijb-2018-0106> and Gares, Omer (2020) <doi:10.1080/01621459.2020.1775615>). From two databases with no overlapping part except a subset of shared variables, the functions of the package assist users until obtaining a unique synthetic database, where the missing information is fully completed.