AWICM3 changes to coupling, restart file writing and DMOC #15
1. The standard σ2 bins are chosen according to Megan (2018) (72 levels for a good representation of deep and bottom waters) and augmented with density levels to match those presented in Xu et al. (2018). Altogether we use 85 density bins spanning the range 30.0 < σ2 < 37.2 kg m−3.
2. New fields that allow computing the surface buoyancy flux etc. have been added.
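To illustrate what binning into σ2 classes involves, here is a minimal sketch. It assumes 85 uniform bins between 30.0 and 37.2 kg m−3, which is a simplification: the bins described above are non-uniform (72 levels plus additional ones), and their exact edges are not given in this thread. The function names and the sample values are hypothetical and are not taken from the FESOM2 source.

```python
from bisect import bisect_right

# Assumption: uniform bin edges. The real bins in this PR are non-uniform.
N_BINS = 85
SIGMA2_MIN, SIGMA2_MAX = 30.0, 37.2
edges = [SIGMA2_MIN + i * (SIGMA2_MAX - SIGMA2_MIN) / N_BINS
         for i in range(N_BINS + 1)]


def density_bin(sigma2):
    """Return the 0-based bin index for sigma2, or None if outside the range."""
    if not (SIGMA2_MIN < sigma2 < SIGMA2_MAX):
        return None
    # bisect_right finds the first edge strictly greater than sigma2;
    # the clamp guards against rounding in the last computed edge.
    return min(bisect_right(edges, sigma2) - 1, N_BINS - 1)


def bin_transport(samples):
    """Sum hypothetical (sigma2, transport) pairs into per-bin totals."""
    totals = [0.0] * N_BINS
    for sigma2, transport in samples:
        idx = density_bin(sigma2)
        if idx is not None:
            totals[idx] += transport
    return totals
```

Values outside the open interval are ignored here; a real diagnostic may instead need to route them into the first or last bin.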
I'm a little unsure as to what happened in src/gen_modules_diag.F90. I think we can just take the master branch here. Comments? |
We are not bitwise identical:
Any idea which parts of the merge would cause this? |
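For readers unfamiliar with the term, "bitwise identical" is stricter than "numerically close". The following sketch shows the difference for double-precision values. The check names and numbers are made up for illustration and are not the values used by the pi-mesh test.

```python
import math
import struct


def bitwise_identical(a, b):
    """True only if both floats have the same 64-bit IEEE 754 pattern."""
    return struct.pack("<d", a) == struct.pack("<d", b)


def compare_checks(reference, result, rel_tol=1e-12):
    """Classify each named value as 'identical', 'close' or 'different'."""
    report = {}
    for name, ref in reference.items():
        val = result[name]
        if bitwise_identical(ref, val):
            report[name] = "identical"
        elif math.isclose(ref, val, rel_tol=rel_tol):
            report[name] = "close"
        else:
            report[name] = "different"
    return report


# Hypothetical reference and result values
reference = {"sst": 0.3, "salt": 35.0, "temp": 1.0}
result = {"sst": 0.1 + 0.2, "salt": 35.0, "temp": 1.5}
report = compare_checks(reference, result)
```

A changed order of floating-point operations, for example from a different boundary treatment, is enough to move a value from "identical" to "close".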
@JanStreffing Last time I changed the values for pi-mesh check is when new advection schemes were merged. Although they are technically the same, there are some differences related to treatment of boundaries, so results of the check were different. Make sure you have everything consistent in your |
@koldunovn Does that mean also the master branch before the merge would not complete the tests? |
No, the master is fine :) https://github.com/FESOM/fesom2/runs/1116986615 |
I see nice green check mark next to the PR 👍 🍺 |
Last question before pulling the trigger, does the automated test also cover restarting? I'm asking since I made some changes to the way restart files are written. |
Unfortunately they don't include it; that's planned. I can test it manually if you wish. |
Hey Nikolay, |
Will try to do it tomorrow and let you know. Anything in particular I should pay attention to? |
No, it should just work bit-identically on ollie or mistral. On juwels it fixes the chunking in the POSIX layer, making the IO speed go from 6.46 MiB/s to 1480.39 MiB/s according to darshan. More info on the darshan logs is in the old gitlab issue #19. I think the NetCDF installations on other HPCs are somehow configured to do this automatically, while on juwels we had to do this manually and correctly in FESOM2. My only worry is that the options I now set in the code might conflict with the preconfigured options of the NetCDF installation on other machines. Hopefully not. Good night! |
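To put the two darshan figures in perspective, a short calculation follows. The two rates are taken from the comment above. The 1024 MiB file size is an assumption chosen only for illustration and is not a measured FESOM2 restart size.

```python
OLD_MIB_S = 6.46      # rate reported before the chunking fix
NEW_MIB_S = 1480.39   # rate reported after the chunking fix


def speedup(old_rate, new_rate):
    """Ratio of the new to the old throughput."""
    return new_rate / old_rate


def write_seconds(size_mib, rate_mib_s):
    """Time to write size_mib at a constant rate, ignoring latency."""
    return size_mib / rate_mib_s


factor = speedup(OLD_MIB_S, NEW_MIB_S)          # roughly 229
HYPOTHETICAL_FILE_MIB = 1024                    # assumption, not measured
before = write_seconds(HYPOTHETICAL_FILE_MIB, OLD_MIB_S)
after = write_seconds(HYPOTHETICAL_FILE_MIB, NEW_MIB_S)
print(f"speedup: {factor:.0f}x, {before:.0f} s -> {after:.2f} s")
```

At the old rate a file of that size would take more than two and a half minutes to write; at the new rate it takes under a second.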
I have tested restarts, and the model can run from a restart created by the master version of the model. But there are still a lot of
In the output, would be nice to get rid of them :) |
Good to know. I'll hide those behind the debug output switch and then merge. |
I think @dsidoren should merge :) |
You may want to change some permissions around then, cause after the test it's giving me the option to merge it myself. |
Yes, I forgot to add branch rules, should be blocked now :) |
I resolved the conflict in io_meandata by preserving @hegish io-improvements. |
Currently fails on ollie and mistral with:
initializing I/O file for sst
associating mean I/O file /scratch/a/a270088/output_mistral_sanity/sst.fesom.1948.nc
sst: current mean I/O counter = 1
writing mean record for sst; rec. count = 1
initializing I/O file for a_ice
associating mean I/O file /scratch/a/a270088/output_mistral_sanity/a_ice.fesom.1948.nc
a_ice: current mean I/O counter = 1
writing mean record for a_ice; rec. count = 1
initializing I/O file for temp
associating mean I/O file /scratch/a/a270088/output_mistral_sanity/temp.fesom.1948.nc
temp: current mean I/O counter = 1
writing mean record for temp; rec. count = 1
initializing I/O file for salt
associating mean I/O file /scratch/a/a270088/output_mistral_sanity/salt.fesom.1948.nc
salt: current mean I/O counter = 1
writing mean record for salt; rec. count = 1
Do output (netCDF, restart) ...
associating restart file /scratch/a/a270088/output_mistral_sanity/fesom.1948.oce.restart.nc
initializing restart file /scratch/a/a270088/output_mistral_sanity/fesom.1948.oce.restart.nc
current restart counter = 1
error counter= 22
Error: NetCDF: Invalid argument
Run finished unexpectedly!
No problems on juwels. |
do not check for errors if NF_CHUNKED does not exist in the netcdf library, otherwise crashes on several machines (e.g. ollie)
Now it works on all three machines. |
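The pattern behind that fix can be sketched as follows: the return status of the chunking call is only treated as an error when the library actually supports chunking. This is an illustrative Python stand-in, not the Fortran code from the PR. `FakeNetcdf`, `def_var_chunking` and `has_chunked` are hypothetical names and do not belong to any real NetCDF binding.

```python
class FakeNetcdf:
    """Hypothetical stand-in for a NetCDF library; not a real API."""

    def __init__(self, has_chunked):
        self.has_chunked = has_chunked

    def def_var_chunking(self, chunk_sizes):
        # Mimic a build without chunking support by returning a nonzero status
        return 0 if self.has_chunked else -1


def define_chunking(lib, chunk_sizes):
    """Request chunking; only check the status if chunking is supported."""
    status = lib.def_var_chunking(chunk_sizes)
    if not lib.has_chunked:
        # Without chunking support the status is not meaningful, so skip it
        return "skipped"
    if status != 0:
        raise RuntimeError(f"chunking failed with status {status}")
    return "ok"
```

With this guard, a library without chunking support no longer aborts the run, while a genuine failure on a library that does support it is still reported.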
AWICM3 changes to coupling, restart file writing and DMOC
Testing for now, don't merge yet.