Add plots to the evaluation script and compare the forecasts against the truth.
It might also be worth seeing what difference #112 makes.
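
A rough sketch of what the plotting could look like, assuming the evaluation script can produce the truth and the forecast as two equal-length sequences of numbers. The names `forecast_errors` and `plot_truth_vs_forecast`, the default output path, and the use of matplotlib are all placeholders for illustration, not anything that exists in the repo yet.

```python
from __future__ import annotations

from typing import Sequence


def forecast_errors(truth: Sequence[float], forecast: Sequence[float]) -> list[float]:
    """Return forecast minus truth for each time step (hypothetical helper)."""
    if len(truth) != len(forecast):
        raise ValueError("truth and forecast must have the same length")
    return [f - t for t, f in zip(truth, forecast)]


def plot_truth_vs_forecast(
    truth: Sequence[float],
    forecast: Sequence[float],
    path: str = "truth_vs_forecast.png",  # assumed output location
) -> str:
    """Save a two-panel figure: truth and forecast on top, their difference below."""
    # Imported lazily so forecast_errors stays usable without matplotlib.
    import matplotlib

    matplotlib.use("Agg")  # write to a file; no display needed
    import matplotlib.pyplot as plt

    errors = forecast_errors(truth, forecast)
    steps = list(range(len(truth)))

    fig, (ax_top, ax_bottom) = plt.subplots(2, 1, sharex=True, figsize=(8, 6))

    # Top panel: the two series overlaid.
    ax_top.plot(steps, truth, label="truth")
    ax_top.plot(steps, forecast, label="forecast")
    ax_top.set_ylabel("value")
    ax_top.legend()

    # Bottom panel: signed error, with a zero line for reference.
    ax_bottom.plot(steps, errors, color="tab:red")
    ax_bottom.axhline(0.0, color="black", linewidth=0.5)
    ax_bottom.set_xlabel("time step")
    ax_bottom.set_ylabel("forecast - truth")

    fig.savefig(path)
    plt.close(fig)
    return path
```

To see the effect of #112, one option would be to run this once on each branch with a different `path` and compare the two images side by side.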