Exact restart fail on test with update to fates_s1.4.1_a3.0.0 #315

ekluzek · 2018-01-12T18:15:48Z

An exact restart test fails when I updated the clm fates branch to fates_s1.4.1_a3.0.0. The test is ERS_D_Mmpi-serial_Ld5.1x1_brazil.I2000Clm50FatesGs.yellowstone_pgi.clm-fates. This same test works fine with either intel or nag compiler on hobart, but also fails with the pgi compiler on hobart. A lot of fields are different from the baseline to restart...

RMS AREA_PLANT 3.4742E-02 RMS AREA_TREES 3.4742E-02 RMS BTRAN 1.1822E-02 RMS GPP 1.0954E-08 RMS H2OSOI 1.6319E-05 RMS LITTER_IN 4.5475E-13 RMS LITTER_OUT 5.7231E-11 RMS NBP 1.4486E-08 RMS NEP 1.4486E-08 RMS NPP 1.3843E-08 RMS TLAI 1.5066E-03 RMS TOTLITC 3.4925E-10 RMS T_SCALAR 3.7401E-05 RMS AREA_PLANT 3.5480E-02 RMS AREA_TREES 3.5480E-02 RMS BTRAN 1.1685E-02 RMS GPP 9.6497E-09 RMS H2OSOI 6.3526E-05 RMS LITTER_IN 1.0232E-12 RMS LITTER_OUT 8.2481E-11 RMS NBP 3.9211E-08 RMS NEP 3.9211E-08 RMS NPP 3.6849E-08 RMS PFTbiomass 6.4373E-04 RMS TOTECOSYSC 1.4648E-03 RMS TOTLITC 2.1305E-06 RMS TOTSOMC 1.2207E-04 RMS T_SCALAR 1.5954E-04 NORMALIZED 6.2319E-01
NORMALIZED 6.2319E-01
NORMALIZED 6.2323E-01
NORMALIZED 1.8167E-02
NORMALIZED 9.6514E-05
NORMALIZED 1.3018E-06
NORMALIZED 6.2319E-01
NORMALIZED 1.5750E-02
NORMALIZED 1.5750E-02
NORMALIZED 4.7671E-02
NORMALIZED 1.9449E-02
NORMALIZED 1.7393E-06
NORMALIZED 7.7218E-05
NORMALIZED 6.4070E-01
NORMALIZED 6.4070E-01
NORMALIZED 6.4141E-01
NORMALIZED 1.8454E-02
NORMALIZED 3.8013E-04
NORMALIZED 2.9305E-06
NORMALIZED 6.5991E-01
NORMALIZED 4.4221E-02
NORMALIZED 4.4221E-02
NORMALIZED 9.3921E-02
NORMALIZED 4.4718E-05
NORMALIZED 1.5571E-06
NORMALIZED 5.7711E-03
NORMALIZED 1.3388E-07
NORMALIZED 3.0967E-04

ekluzek · 2018-01-12T18:20:37Z

The test with fates_s1.3.1_a2.0.0_n02_clm4_5_17_r265 passed the same exact restart test. The test fails for: fates_s1.4.0_a3.0.0_n04_clm4_5_17_r265 and passes for fates_s1.3.1_a2.0.0_n03_clm4_5_17_r265 on hobart_pgi. So the changes between those two tags cause the issue. there are some small changes in clm there, as well as the update from fates_s1.3.1_a2.0.0 to fates_s1.4.0_a3.0.0_rev2.

rgknox · 2018-01-12T20:26:45Z

@ekluzek , when fates_s1.3.1_a2.0.0_n02_clm4_5_17_r265 passes, which fates tag is used for that one? is it https://github.com/NGEET/fates/releases/tag/sci.1.3.1_api.2.0.0?

ekluzek · 2018-01-12T20:58:10Z

@rgknox yes, it uses: fates_s1.3.1_a2.0.0, which is sci.1.3.1_api.2.0.0. But, I updated an earlier comment to point to the version with the fail in it which is the update from fates_s1.3.1_a2.0.0 to fates_s1.4.0_a3.0.0. I don't think the CLM changes are likely to be the problem. I looked through the differences in fates, especially for new variables that aren't being set. the difference is large over 4k lines so it's difficult to say. But, I did notice the variable trunk_product in biogeochem/EDPhysiologyMod.F90 in subroutine CWD_Input isn't set for all conditions.

ekluzek · 2018-01-12T22:53:18Z

In the version in fates_clm, the compset and testmods names are different. But, I reproduced a fail
with fates-clm at clm4_5_15_r234_v1.1.0_fatesAPI_v3.0.0 and fates at sci.1.4.1_api.3.0.0 for ERS_D_Mmpi-serial_Ld5.1x1_brazil.ICLM45ED.hobart_pgi.clm-edTest. This also shows this problem isn't just for CLM50 as my original test showed, but also for CLM45, and for the older CLM version that's in fates-clm.

billsacks · 2018-03-08T13:07:07Z

In my testing for ESCOMP/CTSM#311 (using billsacks/ctsm@f9ff7cf) I got

FAIL ERS_D_Mmpi-serial_Ld5.1x1_brazil.I2000Clm45FatesGs.cheyenne_intel.clm-Fates COMPARE_base_rest

The differences between the base and restart run are in just one field:

 RMS NPP_BY_AGE                       3.2910E-07            NORMALIZED  8.1129E-02

I think the only relevant difference in this branch is that I am using a different fates initial conditions file, which is likely substantially different from the one that had been used before (see notes in the above-referenced PR for details).

@ekluzek and @rgknox do you think it's reasonable to attribute that failure to this issue, and to add this test to the ExpectedFails list in CTSM?

(For details, see /glade/scratch/sacks/tests_0306-2105c/ERS_D_Mmpi-serial_Ld5.1x1_brazil.I2000Clm45FatesGs.cheyenne_intel.clm-Fates.GC.0306-2105c_i on cheyenne.)

rgknox · 2019-03-25T17:51:42Z

Ok, putting the pieces together here.
My take is that maybe we should mark this as an expected fail, and see how these tests react to a more recent version of fates.
Is it true that only the 1x1 brazil test is reporting these errors, and that is the only fates test that uses an finitdat?

glemieux · 2021-09-17T23:48:05Z

Looking at the fates expected failures list, these test pass on the I2000Clm50FatesCru version of the tests listed (we don't use the Clm45 version of the tests anymore). @ekluzek do you think its ok to close this out?

ekluzek added the science: bug Bugs that are specific to the implementation of a scientific model label Jan 12, 2018

rgknox mentioned this issue Jan 12, 2018

Fixed uninitialized dhdd #317

Merged

ekluzek mentioned this issue Mar 25, 2019

Exact restart problem with Fates ESCOMP/CTSM#667

Closed

ekluzek mentioned this issue Aug 6, 2019

problem with FATES exact restarts ESCOMP/CTSM#110

Closed

rgknox closed this as completed May 19, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Exact restart fail on test with update to fates_s1.4.1_a3.0.0 #315

Exact restart fail on test with update to fates_s1.4.1_a3.0.0 #315

ekluzek commented Jan 12, 2018

ekluzek commented Jan 12, 2018 •

edited

Loading

rgknox commented Jan 12, 2018 •

edited

Loading

ekluzek commented Jan 12, 2018 •

edited

Loading

ekluzek commented Jan 12, 2018

billsacks commented Mar 8, 2018 •

edited

Loading

rgknox commented Mar 25, 2019

glemieux commented Sep 17, 2021

Exact restart fail on test with update to fates_s1.4.1_a3.0.0 #315

Exact restart fail on test with update to fates_s1.4.1_a3.0.0 #315

Comments

ekluzek commented Jan 12, 2018

ekluzek commented Jan 12, 2018 • edited Loading

rgknox commented Jan 12, 2018 • edited Loading

ekluzek commented Jan 12, 2018 • edited Loading

ekluzek commented Jan 12, 2018

billsacks commented Mar 8, 2018 • edited Loading

rgknox commented Mar 25, 2019

glemieux commented Sep 17, 2021

ekluzek commented Jan 12, 2018 •

edited

Loading

rgknox commented Jan 12, 2018 •

edited

Loading

ekluzek commented Jan 12, 2018 •

edited

Loading

billsacks commented Mar 8, 2018 •

edited

Loading