evaluate_models function

Evaluate model runs for calibration