Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

NOAA issues and testing stats #155

Closed
eclare108213 opened this issue Jun 27, 2018 · 8 comments
Closed

NOAA issues and testing stats #155

eclare108213 opened this issue Jun 27, 2018 · 8 comments
Assignees
Labels

Comments

@eclare108213
Copy link
Contributor

From Bob G:

I'm now largely successful in the base suite on theia with intel -- 99 pass, 6 fail. The bulk of the problem was compiler options -- though I copied a file labeled intel, it was actually gnu options.

One of the failures is the 40x4 roundrobin, which I think is invoking a problem we have long had with CICE here (theia/ NOAA R+D) -- model getting in to a state where it isn't writing output, but also isn't crashing gracefully. I killed it after an hour of no change.

Do we have any way of collecting stats on which tests are most likely to pass or fail? How long they take? It'd be convenient to put a fast, frequently failing, test up front in the queue when we're to a point of requiring all to pass.

@eclare108213 eclare108213 self-assigned this Jun 27, 2018
@eclare108213
Copy link
Contributor Author

Which file labeled intel had gnu options in it? We need to fix that.

The test suites now produce timings, which will be flagged in tests that run significantly differently from previous tests. I don't think we've started collecting statistics, but the results are available at
https://github.com/CICE-Consortium/Test-Results/wiki

@rgrumbine
Copy link
Contributor

rgrumbine commented Oct 11, 2018

intel compiler version was the source of 5 of the 6 failures. Default on theia is intel 14. Any version 15 to 18.1.163 clears the problems.

The remaining problem is the 40x4 roundrobin

@eclare108213
Copy link
Contributor Author

@rgrumbine Did you get the 40x4 roundrobin case to work?

@rgrumbine
Copy link
Contributor

Not yet, I've been off on other things. Just starting back to the benchmark tests.

@rgrumbine
Copy link
Contributor

All regression tests now pass on phase2, but refuses to compile (reprosum, no_I8 issue(?)) on phase3 and theia.

@eclare108213
Copy link
Contributor Author

@rgrumbine, just checking in on this issue. Did you figure it out?

@rgrumbine
Copy link
Contributor

rgrumbine commented Aug 30, 2019 via email

@rgrumbine
Copy link
Contributor

All regression tests now pass on all three of phase2, phase3, and hera.

apcraig added a commit that referenced this issue May 15, 2020
* update to include a 'high_Sierra' machine with gnu compilers,
only marginally modified from the travisCI env and Macros.

* Syncing theia and upstream/master branch

* theia execution

* intel compiler version tests -- 15.0.0 or greater

* working on variations for round robin test

* syncing to main

* sync

* testing

* end test

* successful on phase2 for all regression tests

* start on phase3 execution

* add phase3 system

* syncing and phase3 modules

* update configs manually from master

* env edit

* builds on phase3 now

* test

* trying to pull/merge cice master

* theia

* successful theia build

* garbage

* update theia intel env

* mods for running in NOAA

* start towards hera

* tmp

* theia passes with most recent of all modules

* moving to hera

* clean regression run on hera

* cleanup

* phase3 work

* minor manual move towards master copy

* re-fix phase3 script

* nearing full phase3

* update path for data

* actual final version of phase3

* from master

* cicecore updating from master

* from consortium/master

* more from master

* retry sync

* manual add/subtrace

* script perms and large diff files

* retrieve hera

* docs and cicecore sync to master

* machine syncing

* couple more

* some more

* finally the right template

* finally correct template

* typography in master

* restore working batch.csh

* towards matching consortium master

* last? add for syncing

* Revert icepack to 1.1.2

Co-authored-by: Tony Craig <apcraig@users.noreply.github.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

2 participants