Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Update develop-ref after dtcenter/MET#2987 #2989

Merged
merged 141 commits into from
Oct 10, 2024

Conversation

github-actions[bot]
Copy link
Contributor

@github-actions github-actions bot commented Oct 10, 2024

Changes to the SEEP line type.
Created by @JohnHalleyGotway

Reviewed the diffs flagged in this testing workflow run and confirmed that they're all expected based on the changes to the SEEPS line type.

Howard Soh and others added 30 commits February 2, 2024 16:58
* Per #2395, add new columns to VL1L2, VAL1L2, and VCNT line types for wind direction statistics. Work still in progress.

* Per #2395, write the new VCNT columns to the output and document the additions to the VL1L2, VAL1L2, and VCNT columns.

* Per #2395, add the definition of new statistics to Appendix G.

* Per #2395, update file version history.

* Per #2395, tweak warning message about zero wind vectors and update grid-stat and point-stat to log calls to the do_vl1l2() function.

* Per #2395, refine the weights for wind direction stats, ignoring the undefined directions.

* Update src/tools/core/stat_analysis/aggr_stat_line.cc

* Update src/tools/core/stat_analysis/parse_stat_line.cc

* Update src/tools/core/stat_analysis/aggr_stat_line.cc
… broken the logic of the update_truth.yml GHA workflow. Instead of submitting a PR to merge develop into develop-ref directly, use an intermediate update_truth_for_develop branch.
* Per #2280, update to support probability threshold strings like ==8, where 8 is the number of ensemble members, to create probability bins centered on the n/8 for n = 0 ... 8.

* Per #2280, update docs about probability threshold settings.

* Per #2280, use a loose tolerance when checking for consistent bin widths.

* Per #2280, add a new unit test for grid_stat to demonstrate processing the output from gen_ens_prod.

* Per #2280, when verifying NMEP probability forecasts, smooth the obs data first.

* Per #2280, only request STAT output for the PCT line type to match unit_grid_stat.xml and minimize the new output files.

* Per #2280, update config option docs.

* Per #2280, update config option docs.
…ullptr

Feature 2673 sonarqube beta4 nullptr
…eturn

Feature 2673 sonarqube beta4 return
DanielAdriaansen and others added 27 commits August 21, 2024 09:16
#2947)

* Adds caveat regarding longitudes appearing in DEBUG statements with a different sign to the FAQ.

* Update appendixA.rst

Missing paren
* Per #2938, define CRC_Array::add_uniq(...) member function which is now used in PB2NC

* Per #2938, replace n_elements() with n() to make the code more concise. Refine log/warning message when multiple message center times are encountered.
* Per #1371, add -input command line argument and add support for ALL for the CTC, MCTC, SL1L2, and PCT line types.

* Per #1371, rename the -input command line option as -aggregate instead

* Per #1371, work in progress

* Per #1371, just comments

* Per #1371, working on aggregating CTC counts

* Per #1371, work in progress

* Per #1371, update timing info using time stamps in the aggr file

* Per #1371, close the aggregate data file

* Per #1371, define set_event() and set_nonevent() member functions

* Per #1371, add logic to aggregate MCTC and PCT counts

* Merging changes from develop

* Per #1371, work in progress aggregating all the line statistics types. Still have several issues to address

* Per #1371, switch to using get_stat() functions

* Per #1371, work in progress. More consolidation

* Per #1371, correct expected output file name

* Per #1371, consistent regridding log messages and fix the Series-Analysis PairDataPoint object handling logic.

* Per #1371, check the return status when opening the aggregate file.

* Per #1371, fix prc/pjc typo

* Per #1371, fix the series_analysis PCT aggregation logic and add a test to unit_series_analysis.xml to demonstrate.

* Per #1371, resolve a few SonarQube findings

* Per #1371, make use of range-based for loop, as recommeded by SonarQube

* Per #1371, update series-analysis to apply the valid data threshold properly using the old aggregate data and the new pair data.

* Per #1371, update series_analysis to buffer data and write it all at once instead of storing data value by value for each point.

* Per #1371, add useful error message when required aggregation variables are not present in the input -aggr file.

* Per #1371, print a Debug(2) message listing the aggregation fields being read.

* Per #1371, correct operator+= logic in met_stats.cc for SL1L2Info, VL1L2Info, and NBRCNTInfo. The metadata settings, like fthresh and othresh, were not being passed to the output.

* Per #1371, the DataPlane for the computed statistics should be initialized to a field of bad data values rather than the default value of 0. Otherwise, 0's are reported for stats a grid points with no data when they should really be reported as bad data!

* Per #1371, update logic of the compute_cntinfo() function so that CNT statistics can be derived from a single SL1L2Info object containing both scalar and scalar anomaly partial sums. These changes enable CNT:ANOM_CORR to be aggregated in the Series-Analysis tool.

* Per #1371, fix logic of climo log message.

* Per #1371, this is actually related to MET #2924. In compute_pctinfo() used obs climo data first, if provided. And if not, use fcst climo data.

* Per #1371, fix indexing bug (+i instead of +1) when check the valid data count. Also update the logic of read_aggr_total() to return a count of 0 for bad data.

* Per #1371, add logic to aggregate the PSTD BRIERCL and BSS statistics in the do_climo_brier() function. Tested manually to confirm that it works.

* Per #1371, switch to using string literals to satisfy SonarQube

* Per #1371, update series_analysis tests in unit_climatology_1.0deg.xml to demonstrate aggregating climo-based stats.

* Per #1371, remove extra comment

* Per #1371, skip writing the PCT THRESH_i columns to the Series-Analysis output since they are not used

* Per #1371, fix the R string literals to remove \t and \n escape sequences.

* Per #1371, update the read_aggr_data_plane() suggestion strings.

* Per #1371, ignore unneeded PCT 'THRESH_' variables both when reading and writing ALL PCT columns.

* Per #1371, update the test named series_analysis_AGGR_CMD_LINE to include data for the F42 lead time that had previously been included for the same run in the develop branch. Note however that the timestamps in the output file for the develop branch (2012040900_to_2012041100) were wrong and have been corrected here (2012040900_to_2012041018) to match the actual data.

* Per #1371, update the -aggr note to warn users about slow runtimes
* Per #2948, updating versions of ecbuild, eckit, and atlas

* Per #2948, Adding MET_CXX_STANDARD

* Per #2948, updated wording for MET_CXX_STANDARD description

* Per #2948, updating script to work with two versions of ecbuild, eckit, and atlas

* Per #2948, without this change, there are compilation problems if the user wants to compile wihtout python

* Per #2948, fixing logic for MET_CXX_STANDARD

* Per #2928, adding missing end bracket

* Per #2948, fixed the logic for compiling versions of ecbuild, eckit, and atlas

* Per 948, fixed syntax for setting CXXFLAGS

* Per #2948, adding new Makefile.in files and configure and changing METbaseimage 3.2 to 3.3.

* Per #2948, updating version of met base tag from 3.2 to 3.3

* Per #2948, adding --enable-all MET_CXX_STANDARD=11 job

* Update compilation_options.yml

* Per #2948, added a job10 for MET_CXX_STANDARD=14

* Per #2948, added brief documentation for the MET_CXX_STANDARD option

---------

Co-authored-by: Julie Prestopnik <jpresto@seneca.rap.ucar.edu>
Co-authored-by: John Halley Gotway <johnhg@ucar.edu>
* #1729 Allow to change to differnt grid size if the raw size is 0

* Added build_grid_by_grid_string and build_grid_by_grid_string

* #1729 Calls build_grid_by_grid_string

* #1729 Added set_attr_grid at the -field option

* #1729 Set obs_type to TYPE_NCCF if the file_type is given at the config file

* #1729 Support set_sttr_grid and changed Error messages to Warning

* #1729 FIxed SonmarQube findings

* #1729 Initial release for unit test

* #1729 Added update_missing_values

* #1729 Deleted a shadowed local variable

* #2673 Added more is_eq

* #2673 Added get_exe_duration

* 2673 Reducded nested statements

* 2673 Fixed SonarGube findings

* 2673 Fixed SonarQube findings

* 2673 Fixed SonarQube findings

* #1729 Added aan unittest plot_data_plane_set_attr_grid

* #1729 Added aan unittest point2grid_cice_set_attr_grid

* #1729 Added changed back the verbose level

* #1729 Corrected typo

---------

Co-authored-by: Howard Soh <hsoh@seneca.rap.ucar.edu>
* #2936 Support 1D lat/lon values

* #2936 Initial release

* #2936 Cast the data type to avoid a compile warning

* #2936 Added an unittest point2grid_gfs_1D_lat_lon

---------

Co-authored-by: Howard Soh <hsoh@seneca.rap.ucar.edu>
* #2968 Corrected set_attr_grid for point2grid_cice_set_attr_grid

* #2968 Compare the DataPlane size and the variable data size

* #2968 nx and ny are not ignored with set_attr_grid

* #2968 Compare the DataPlane size and the variable data size

---------

Co-authored-by: Howard Soh <hsoh@seneca.rap.ucar.edu>
* added single quotes around env var/val pairs in export statements in cmd only mode

* updated logic in unit() to check exec return value against expected return value; created TEST xml file to test this feature

* deleted TEST_ xml, added test with retval 1 to unit_ascii2nc

---------

Co-authored-by: Natalie Babij <nbabij@seneca.rap.ucar.edu>
* Per #2887, update NumArray::vals() to return a reference to the vector rather a pointer to doubles.

* Per #2887, switch over the whole ContingencyTable class heirarchy from storing integer counts to storing double-precision weights.

* Add ContingencyTable::is_integer() member function to check whether the table contains all integers

* Per #2887, update parse_stat_line.cc to get it to compile after changing PCT to store thresholds in a std::vector.

* Per #2887, update PCTInfo::clear() logic.

* Per #2887, update ctc_by_row() logic to create reproducible results with the develop branch.

* Per #2887, update logic of define_prob_bins() to add a final >=1.0 threshold if needed. While ==0.1 works fine, I found that ==0.05 did not because the last >=1.0 threshold was missing likely do to floating point precision issues. This change should fix that problem.

* Per #2887, update roc_auc() function to match the develop branch

* Per #2887, fix bug if computation of far()

* Per #2887, replaced all ==0 integer equality checks with calls to is_eq() instead and fix a couple of equations to snuff out diffs in some CTS statistics.

* Per #2887, address some of the 34 SonarQube code smells flagged for this PR. Note that the compute_ci.h/.cc changes are necessary and good since we should be computing CI's using doubles instead of integer counts.

* Per #2887, update run_sonarqube.sh to specify the target CXX standard as 11. The hope is that that will limit the findings to only those features available in the C++11 standard.

* Per #2887, update to SonarQube version 6.1.0.4477 released on 6/27/2024.

* Per #2887, updating build_met_sonarqube.sh to specify --std=c++11 since c++17 is used by default
…efine_prob_bins() utility function so that ==n probability thresholds result in the correct number of probability thresholds. We were adding an unncessary 10-th bin (from 1.07143 to 1.0) for the ==7 probability threshold type.
The docs directory was moved up to the top-level of the repository but this workflow was not updated. Changing the ignore setting so that doc-only updates do not trigger the full METplus testing workflow.
* testing AREA and AUTO changes

* Keywords B thru L

* thru R

* adding quotes back in for lower case items

* S thru the end of the document

* Removing double quotes around 3 key words

* Per #2023, adding a label name for the Attributes section

* Per #2023, adding an internal link for the MODE tool Attributes section.

* Adding quotes around Valid basins entries

* more double quote updates

* more complex updates with Julie P help

* removing double quotes

* fixing typos

* removing double quotes

* unbolding SURFACE and putting it in double quotes

* fixing grammar

* grammar

* fixing typo

* fixing typo

---------

Co-authored-by: Julie Prestopnik <jpresto@ucar.edu>
* Per #2924, remove GenEnsProd config file comment about parsing desc separately from each obs.field entry because the obs dictionary does not exist in the GenEnsProd config file.

* Per #2924, update list of needed config entry names

* Per #2924, remove const from the parent() member function so that we can perform lookups for the parent.

* Per #2924, update the signature for and logic of the utility functions that retrieve the climatology data. Rather than requiring all the climo_mean and climo_stdev dictionary entries to be defined at the same config file context level, parse each one individually. This enables the METplus wrappers to only partially override this dictionary and still rely on the default values provided in MET's default configuration files.

* Per #2924, update all calls to the climatology utility functions based on the new function signature. Also update the tools to check the number of climo fields separately for the forecast and observation climos.

* Per #2924, update the parsing logic for the climatology regrid dictionary. Use config.fcst.climo_mean.regrid first, config.fcst.regrid second, and config.climo_mean.regrid third. Notably, DO NOT use config.regrid. This is definitely the problem with having regrid specified at mutliple config file context levels. It makes the logic for which to use when very messy.

* Per #2924, forgot to add an else to print an error

* Per #2924, remove extraneous semicolon

* Per #2924, move 'fcst.regrid' into 'fcst.climo_mean.regrid'. Defining the climatology regridding logic inside fcst is problematic because it applies to the forecast data as well and you end up with the verification grid being undefined. So the climo regridding logic must be defined in 'climo_mean.regrid' either within the 'fcst' and 'obs' dictionaries or at the top-level config context.

* Per #2924, based on PR feedback from @georgemccabe, add the Upper_Left, Upper_Right, Lower_Right, and Lower_Left interpolation methods to the list of valid options for regridding, as already indicated in the MET User's Guide.

* Per #2924, update the logic of parse_conf_regrid() to (hopefully) make it work the way @georgemccabe expects it to. It now uses pointers to both the primary and default dictionaries and parses each entry individually.

* Per #2924, need to check for non-null pointer before using it

* Per #2924, revise the climo_name dictionary lookup logic when parsing the regrid dictionary.

* Per #2924, update logic for handling RegridInfo

* Per #2924, remove the default regridding information from the 'Searching' log message to avoid confusion.

---------

Co-authored-by: MET Tools Test Account <met_test@seneca.rap.ucar.edu>
* Per #2924, remove GenEnsProd config file comment about parsing desc separately from each obs.field entry because the obs dictionary does not exist in the GenEnsProd config file.

* Per #2924, update list of needed config entry names

* Per #2924, remove const from the parent() member function so that we can perform lookups for the parent.

* Per #2924, update the signature for and logic of the utility functions that retrieve the climatology data. Rather than requiring all the climo_mean and climo_stdev dictionary entries to be defined at the same config file context level, parse each one individually. This enables the METplus wrappers to only partially override this dictionary and still rely on the default values provided in MET's default configuration files.

* Per #2924, update all calls to the climatology utility functions based on the new function signature. Also update the tools to check the number of climo fields separately for the forecast and observation climos.

* Per #2924, update the parsing logic for the climatology regrid dictionary. Use config.fcst.climo_mean.regrid first, config.fcst.regrid second, and config.climo_mean.regrid third. Notably, DO NOT use config.regrid. This is definitely the problem with having regrid specified at mutliple config file context levels. It makes the logic for which to use when very messy.

* Per #2924, forgot to add an else to print an error

* Per #2924, remove extraneous semicolon

* Per #2924, move 'fcst.regrid' into 'fcst.climo_mean.regrid'. Defining the climatology regridding logic inside fcst is problematic because it applies to the forecast data as well and you end up with the verification grid being undefined. So the climo regridding logic must be defined in 'climo_mean.regrid' either within the 'fcst' and 'obs' dictionaries or at the top-level config context.

* Per #2924, based on PR feedback from @georgemccabe, add the Upper_Left, Upper_Right, Lower_Right, and Lower_Left interpolation methods to the list of valid options for regridding, as already indicated in the MET User's Guide.

* Per #2924, update the logic of parse_conf_regrid() to (hopefully) make it work the way @georgemccabe expects it to. It now uses pointers to both the primary and default dictionaries and parses each entry individually.

* Per #2924, need to check for non-null pointer before using it

* Per #2924, revise the climo_name dictionary lookup logic when parsing the regrid dictionary.

* Per #2924, update logic for handling RegridInfo

* Per #2924, remove the default regridding information from the 'Searching' log message to avoid confusion.

* Per #2924, escape sequences, like \n, cannot be used inside R-string literals.

* Per #2924, update the logic of check_climo_n_vx()

* Per #2924, revise logic in read_climo_data_plane_array(). Check the number of climo fields provided. If there's 0, just return since no data has been requested. If there's 1, use it regardless of the number of input fields. If there's more than 1, just use the requested i_vx index value.

* Per #2924, update Series-Analysis to set both i_fcst and i_obs when looping over the series entries.

* Per #2924, no real change. Just whitespace.

* Unrelated to #2924, superficial changes to formatting of method_name strings for consistency.

* Per #2924, add a new series_analysis test that ERRORS OUT prior to this PR but works after the changes in this PR.

---------

Co-authored-by: MET Tools Test Account <met_test@seneca.rap.ucar.edu>
* Per #2949, updating installation instructions

* Per #2949, adding missing colon from note directive

* Per #2949, third attempt to get the new note to show up

* Per #2949, modifying text and format

* Per #2949, removing images in favor of code blocks for easier modification

* Per #2949, modified wording for clarity

* Per #2929, corrected typo

* Update installation.rst

No changes to content, only whitespace for consistency, mostly removing tabs.

* Update docs/Users_Guide/installation.rst

Co-authored-by: John Halley Gotway <johnhg@ucar.edu>

* Per #2949, testing variable replacement, expect failures

* Per #2949, reverting to orignal state after testing

---------

Co-authored-by: John Halley Gotway <johnhg@ucar.edu>
* Per #2979, remove nc_grid.h/.cc and replace it with calls to the read_netcdf_grid(...) and write_netcdf_proj(...) library utility functions. Note that these changes do compile but I haven't tested whether they actually fix the underlying problem. Also note that nc_utils_local.h/.cc can also likely be replaced with calls to common library functions.

* Per #2979, remove references to nc_grid.o from the MTD test code.

* Per #2979, insert a newline in unit.py output between the env vars and the command.

* Per #2979, insert a newline in unit.py output between the env vars and the command.

* Per #2979, the write_netcdf_proj(...) utility function adds the lat and lon dimensions. Update mtd to NOT define those dimensions prior to calling write_netcdf_proj(...).

* Per #2979, minor changes to is_eq() calls to fix compiler warning messages

* Per #2979, for the develop branch, also replace nc_utils_local.h/.cc with calls to common library code. Also remove commented out code.

* Per #2979, delete commented out code and make error/warning message formatting consistent.

* Fixes for SonarQube

---------

Co-authored-by: MET Tools Test Account <met_test@seneca.rap.ucar.edu>
* Per #2880, remove Point2Grid quality_mark_thresh config option and add obs_quality_inc and obs_quality_exc from the config file.

* Per #2880, update point2grid docs and reformat whitespace throughout.

* Per #2880, remove quality_mark_thresh and add obs_quality_inc and obs_quality_exc.

* Unrelated to #2880, fix formatting of this R-string which cannot include any special formatting, such as \n.

* Per #2880, not working quite right yet but this is progress

* Per #2880, add a write_css(IntArray) utility function.

* Per #2880, update NcPointObsData class to read the obs quality values from the input file.

* Per #2880, update the log message about the quality  control filter options applied.

* Per #2880, tweak log messages.

* Per #2880, tweak log messages.

* Per #2880, add a Point2Grid unit test to demonstrate using the obs_quality_inc and obs_quality_exc options.

* Per #2880, declare these get accessor functions as const to avoid SonarQube code smell.

* Per #2880, many changes to the vx_nc_obs library and point2grid application to replace dynamically allocated memory with STL vectors to satisfy SonarQube code smells.

* Per #2880, fix compilation error.

* Per #2880, fix compilation error

* Per #2880, revert skip_times back to vector<bool> since I wasn't postive the casting (int *) to (bool *) would actually work in the way I expect. This is safer.

* Per #2880, rename the -qc command line option as -goes_qc, but still quietly support -qc

* Per #2880, based on guidance from @hsoh, reset the var_cell_mapping vector for each loop iteration.
* Update seeps.h

Change variable names to reduce ambiguity for interpretation and aid useability.

* Update seeps.cc

Pull through variable name changes and renaming of functions to aid legibility and clarity. Introduced some additional debug print statements.

* Update grid-stat.rst

Add documentation about the location of the gridded climatology files for SEEPS and which environment variable to use.

* Replace read_seeps_scores() with get_seeps_climo_grid()

* Manually merging Rachel's patch-1 changes.

* Getting close to getting these seeps changes to compile. But it's failing in pair_data_point.cc

* Per #2882, get branch feature_2882_seeps_qa compiling again. Recommend revisiting the volume of SEEPS-related Debug log messages and reducing them once its fully tested.

* Per #2882, need to update the handling of the PPT24_seepsweights_grid.nc file name. Rename as _v12.0.nc for the updated version with the new names so that the existing regressions tests and nightly builds for main_v11.1 and develop continue to work. We can remove the _v12.0 once this feature branch is merged into develop but for the time being, we need both versions to exist.

* Per #2882, rename the SEEPS columns from S12, S13, S21, S23, S31, S32 to the more descriptive ODFL, ODFH, OLFD, OLFH, OHFD, OHFL names.

* Per #2882, update SEEPS details

* Per #2882, store and report the weighted mean fcst and mean obs, just like the SEEPS score itself so that they're handled in a consistent manner. Note however that it's hard-coded to NOT write the weighted means/score, only the unweighted ones.

* Per #2882, change SEEPS debug log levels and correct the storage of mean_fcst and mean_obs values.

* Per #2882, correct SEEPS column name lookups

* Per #2882, call is_bad_data() instead of is_eq(..., -9999.0) to get rid of compiler warning message.

* Per #2882, add 2 more variations of the is_eq() function with mixed float and double inputs to satisfy compiler pb2nc compiler warnings.

* Per #2882, switch from dynamically allocated arrays to std::vector

* Per #2882, enhance Stat-Analysis to write the SEEPS line type to an output .stat file.

* Per #2882, update the aggregated seeps computation to use better-initialized vectors.

* Per #2882, resolve a few more SonarQube code smells.

* Per #2882, now that this PR is ready to merge, remove the v12.0 version number from the gridded SEEPS climo file name ci-skip-all

---------

Co-authored-by: mpm-meto <64001904+mpm-meto@users.noreply.github.com>
@JohnHalleyGotway JohnHalleyGotway added this to the MET-12.0.0 milestone Oct 10, 2024
@JohnHalleyGotway JohnHalleyGotway merged commit 43a1119 into develop-ref Oct 10, 2024
1 check passed
@JohnHalleyGotway JohnHalleyGotway deleted the update_develop_5b4ac7a8 branch October 10, 2024 18:52
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
No open projects
Status: 🏁 Done
Development

Successfully merging this pull request may close these issues.

9 participants