In some cases, some input images may not cover the full field of view (FOV) of the average.
Assuming the masking is handled properly (so that edges are not unduly deformed), we can weight the average by the FOV of each input image, so that areas covered by fewer images are not unduly downweighted.
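
As a minimal sketch of what FOV weighting could look like, assume that every input image is already resampled onto the grid of the average and comes with a mask that is 1 inside its FOV and 0 outside. The function name `fov_weighted_mean` and the `eps` parameter are hypothetical and chosen only for illustration. Each voxel is divided by the number of images that actually cover it rather than by the total number of images:

```python
import numpy as np

def fov_weighted_mean(images, masks, eps=1e-8):
    """Average images voxel-wise, weighting each one by its FOV mask.

    images, masks: array-likes of shape (n_images, ...), already on a
    common grid (an assumption of this sketch). Masks are 1 inside the
    FOV and 0 outside; fractional weights also work.
    """
    images = np.asarray(images, dtype=float)
    masks = np.asarray(masks, dtype=float)

    weighted_sum = (images * masks).sum(axis=0)  # data inside each FOV only
    coverage = masks.sum(axis=0)                 # how many images cover each voxel

    # Divide only where at least one image contributes; leave 0 elsewhere.
    out = np.zeros_like(weighted_sum)
    np.divide(weighted_sum, coverage, out=out, where=coverage > eps)
    return out

# Two 1-D "images"; the second one does not cover the last sample.
images = [[2.0, 4.0, 6.0], [4.0, 8.0, 0.0]]
masks = [[1, 1, 1], [1, 1, 0]]

weighted = fov_weighted_mean(images, masks)
naive = np.mean(images, axis=0)  # plain mean divides by 2 everywhere
```

In this toy example the plain mean halves the last sample, because the missing data is counted as zero, while the FOV-weighted mean keeps the value of the one image that covers it. Voxels that no image covers are left at zero.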