get_data.ds_generalization function

A datasource (DS) method to generate training and test sets