I'm noticing a slowdown when running the one-degree global ocean sea-ice example code (with appropriate function name updates) on ClimaOcean v0.9.0 with Oceananigans v0.104.2, compared to v0.8.7 with v0.100.7, respectively.
This test was run on TACC Vista using Julia 1.12.4 with both configurations on CUDA. The log files are attached, which indicate that the older version ran 2-3x faster. Is it because a patch to WindowedTimeAverage was pushed that works concurrently with checkpointing?
older_versions.txt
newer_versions.txt