prep function

Prepares a dataset by applying a pre-processing pipeline to it.
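
The description does not give the signature of prep, so the following is only a minimal sketch of one way such a function could look. It assumes the dataset is an iterable of dict records and the pipeline is an ordered list of callables that each take a record and return a new one. The names Record, Step, strip_text and lowercase_text are hypothetical and used for illustration only.

```python
from typing import Any, Callable, Dict, Iterable, List, Sequence

# Assumed shapes (not taken from the original description):
# a record is a plain dict, and a step maps one record to a new record.
Record = Dict[str, Any]
Step = Callable[[Record], Record]


def prep(dataset: Iterable[Record], pipeline: Sequence[Step]) -> List[Record]:
    """Apply every step of the pipeline, in order, to each record."""
    prepared: List[Record] = []
    for record in dataset:
        for step in pipeline:
            record = step(record)
        prepared.append(record)
    return prepared


# Two hypothetical pre-processing steps. Each returns a new dict,
# so the input records are left unmodified.
def strip_text(record: Record) -> Record:
    return {**record, "text": record["text"].strip()}


def lowercase_text(record: Record) -> Record:
    return {**record, "text": record["text"].lower()}


raw = [{"text": "  Hello World "}, {"text": "FOO"}]
clean = prep(raw, [strip_text, lowercase_text])
```

In this sketch the steps run in list order, so changing the order of the pipeline can change the result. Because each step builds a new record, raw can still be inspected after the call.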