prepsources function

Filter and aggregate the raw source dataset