load_datafile function

Reads a raw data file from the downloaded data or from the GitHub repo.
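
The description above does not give the signature or the lookup order, so the following is only a minimal sketch of how such a loader might behave, not the actual implementation. It assumes that a local downloaded copy is preferred and that the GitHub repo is used as a fallback. The parameter names `name`, `local_dir`, and `repo_base_url` are hypothetical, as is the choice to return raw bytes.

```python
from pathlib import Path
from urllib.request import urlopen


def load_datafile(name, local_dir="data", repo_base_url=None):
    """Return the raw bytes of the data file `name`.

    Assumed behavior (not confirmed by the source): look in the local
    download directory first, then fall back to fetching from a repo URL.
    """
    local_path = Path(local_dir) / name
    if local_path.is_file():
        # A downloaded copy exists, so read it directly from disk.
        return local_path.read_bytes()

    if repo_base_url is None:
        # No local copy and no remote location to try.
        raise FileNotFoundError(
            f"{name!r} not found in {local_dir!r} and no repo URL was given"
        )

    # Hypothetical fallback: fetch the raw file from the repository.
    url = f"{repo_base_url.rstrip('/')}/{name}"
    with urlopen(url) as response:
        return response.read()
```

Calling `load_datafile("example.csv", local_dir="downloads")` would read `downloads/example.csv` if it exists. Passing `repo_base_url` only matters when the local copy is missing.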