-
Notifications
You must be signed in to change notification settings - Fork 102
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Alert data file gfs*special*. #245
Merged
WenMeng-NOAA
merged 1 commit into
NOAA-EMC:release/gfs_v16
from
WenMeng-NOAA:post_alert_fix
Dec 22, 2020
Merged
Alert data file gfs*special*. #245
WenMeng-NOAA
merged 1 commit into
NOAA-EMC:release/gfs_v16
from
WenMeng-NOAA:post_alert_fix
Dec 22, 2020
Conversation
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
EricJames-NOAA
pushed a commit
to EricJames-NOAA/UPP
that referenced
this pull request
Dec 14, 2022
* Introduce the new workflow variables DO_ENSEMBLE and NUM_ENS_MEMBERS. The user can enable ensemble forecasts by setting DO_ENSEMBLE to "TRUE" and NUM_ENS_MEMBERS to the number of ensemble members to use. Note that NUM_ENS_MEMBERS also specifies the number of digits to use in the names of the ensemble member directories, e.g. whether to use mem1, mem2, ..., mem8 or mem01, mem02, ..., mem08. For example, if NUM_ENS_MEMBERS is set to "8", then the member directory names will be mem1, mem2, ..., mem8, whereas if NUM_ENS_MEMBERS is set to "08", then the member directory names will be mem01, mem02, ..., mem08. * During the experiment generation step, generate the full list of cycle dates/times to run and create a directory for each cycle. Previously, the cycle directory for each cycle was created during the make_ics or make_lbcs task of that cycle (whichever ran first). * When running ensemble forecasts, create a set of ensemble member directories under each cycle directory and use those ensemble directories as the FV3 run directories. Note that these ensemble directories are created when the make_ics or make_lbcs task for that cycle and ensemble member runs (whichever of those two tasks happens to run first); they are not created during experiment generation, although that could be done as well. * Modify the ush/generate_FV3SAR_wflow.sh script and the ush/templates/FV3SAR_wflow.xml jinja2 template to add the capability to have more than one cycle within an experiment. This capability was previously present but was inadvertently disabled during transition to generating the rocoto XML using a jinja2 template. * Add three new WE2E tests for running ensemble forecasts: two in community mode (community_ensemble_2mems and community_ensemble_008mems) and another in NCO mode (nco_ensemble). * The two community ensemble WE2E tests differ in that one of them uses NUM_ENS_MEMBERS="2" while the other uses NUM_ENS_MEMBERS="008". The first is the simplest case of two ensemble members with member directorieds named mem1 and mem2, while the second is meant to test the ability to add leading zeros to the digits in the member directory names (should get mem001, mem002, ..., mem008) and to check that the workflow doesn't run into problems with integers being interpreted in octal because they start with leading 0's (as can happen in bash). * In order to test the (restored) capability of the workflow to run multiple cycles (possibly on different days), set up the community ensemble tests to have two cycle hours per day (instead of just one) and different starting and ending cycle days (ending is one day later than starting). For the same reason, set up the nco ensemble test to have multiple cycle hours per day (but not different starting and ending days because only one day of data is available for this nco-mode test). (A) To clarify that the contents of the subdirectory in which the external model files are placed are not the ICs or LBCs (on the native grid) but external model files that will be used to generate the ICs or LBCs, change the name of that subdirectory from "ICS" or "LBCS" to "for_ICS" or "for_LBCS", respectively. (B) Insert the new environment variable SLASH_ENSMEM_SUBDIR anywhere CYCLE_DIR appears. This variable is passed in by the rocoto XML. If not running ensemble forecasts, it is simply set to an empty string; if running ensembles, it is set to the string "/${ensmem_subdir}" where ${ensmem_subdir} is the subdirectory of the current ensemble member under the current cycle directory. This allows the subdirectories containing ICS, LBCS, and RESTART files to be placed directly under the current cycle directory when NOT running ensembles and for them to be placed under the current ensemble member directory (which is one level down from the current cycle directory) when running ensembles. (C) For clarity, add new local variable run_dir that gets set to the run directory based on the current cycle and, if applicable, the ensemble member. Note that if not running ensembles, run_dir is identical to the cycle directory (cycle_dir or CYCLE_DIR). (D) To follow convention used in which the arguments to a function are in lowercase (because they are local variables within the function itself), change the argument CYCLE_DIR of exregional_run_fcst() function to cycle_dir. (E) If the call to the function set_FV3nml_sfc_climo_filenames() fails, call print_err_msg_exit to print out an error message and exit. (F) If running ensemble forecasts, call the new function set_FV3nml_stoch_params() that takes a base FV3 namelist file and generates from it a new FV3 namelist file for each ensemble member that contains a unique set of stochastic parameters (relative to other ensemble members) and places it at the top level of the experiment directory. (G) Call the new function create_diag_table_files (in the new file ush/create_diag_table_files.sh) to create the diagnostics table files. (H) Rename the variable FV3_NML_BASE_FN to FV3_NML_BASE_SUITE_FN to clarify that it specifies the name of the FV3 namelist file for the base physics suite (which is used to generate the namelist file specific to the user-specified physic suite). This is done to better distinguish this base namelist file from the base namelist file used to generate namelist files for the various ensemble members. (The name of the latter is specified in the new workflow variable FV3_NML_BASE_ENS_FN.) (I) Introduce the new workflow variable FV3_NML_BASE_ENS_FN that specifies the name to use for the base FV3 namelist file from which to generate the namelist file for each ensemble member. This variable is not used if not running ensemble forecasts (i.e. if DO_ENSEMBLE is not set to "TRUE"). (J) Add the local variable dummy_run_dir that specifies the (dummy) directory with respect to which to set the relaive paths of the fixed files (i.e. those in the FIXam directory) in the FV3 namelist file. When running ensembles, this path is two levels up from the run directory; without ensembles, it is only one level up (as was originally the case). (K) Edit informational and/or error messages to the user. (L) Remove commented-out code. (M) Edit comments. jobs/JREGIONAL_GET_EXTRN_MDL_FILES: (A) jobs/JREGIONAL_MAKE_ICS: (B) jobs/JREGIONAL_MAKE_LBCS: (B) jobs/JREGIONAL_RUN_FCST: (B), (C), (D), (M) * Pass in the environment variables ENSMEM_INDX and SLASH_ENSMEM_SUBDIR as arguments to the function exregional_run_fcst(). These are set by the rocoto XML. jobs/JREGIONAL_RUN_POST: (B), (C) (M) * In NCO mode, change location where ensemble directories are created to be under the cycle directory instead of above it. Do this using the new environment variable SLASH_ENSMEM_SUBDIR. This change is analogous to the change to community mode described in (B). * In community mode, place the "postprd" subdirectory under the run directory instead of under CYCLE_DIR (since now, cycle directories are one level up if running ensembles and thus would be the incorrect place to create the "postprd" subdirectory). * Change the argument cycle_dir of the function exregional_run_post to run_dir since that's what we really want in this function. This is instead of passing in cycle_dir and then forming run_dir. * Create the new argument cdate to the function exregional_run_post() and pass in the environment variable CDATE for its value (this is instead of using CDATE directly in exregional_run_post()). scripts/exregional_make_grid.sh: (E), (F), (G), (M) scripts/exregional_make_ics.sh: (A) scripts/exregional_make_lbcs.sh: (A) scripts/exregional_run_fcst.sh: (C), (D), (K), (M) * Introduce new input arguments ensmem_indx and slash_ensmem_subdir that get set to the rocoto-specified environment variables ENSMEM_INDX and SLASH_ENSMEM_SUBDIR, respectively, in the call to this function in jobs/JREGIONAL_RUN_FCST. * Change cycle_dir to run_dir in most places to make the directory name more general. This is because the run directory will be the cycle directory only when not running ensembles. When running ensembles, the run directory will be one of the ensemble member directories, which will be one level down from the current cycle directory. * Fix typo where there is an extra "}" printed after $target in error messages. * If running ensemble forecasts, use the new workflow array variable FV3_NML_ENSMEM_FPS (which contains the full paths to the FV3 namelist files for each ensemble member) when creating a link in the run directory to the FV3 namelist file for the current ensemble member. Note that these namelist files are cycle-independent and thus are created only once (during the experiment generation step). * Move the creation of diagnostics table files to a new function (in ush/create_diag_table_files.sh), and call that function during experiment generation (in ush/generate_FV3SAR_wflow.sh) instead of here in exregional_run_fcst.sh. We do this because the diagnostics table files depend only on the cycle, not the ensemble member. Thus, since we know the cycles to run at experiment generation time, we generate the diagnostics file for each cycle at that time and place each in its corresponding cycle directory. * If running ensembles, create symlinks in the run directory to the diagnostics table and model configure files in the cycle directory (which will be one level up from the run directory). We don't do this when NOT running ensembles because in that case, the run directory is the cycle directory (and these two files already exist in that directory; they are created during experiment generation time). scripts/exregional_run_post.sh: (C), (L), (M) * Create the new input argument cdate [which gets set to the global variable CDATE in the call to this function, i.e. exregional_run_post(), in jobs/JREGIONAL_RUN_POST] and use it instead of the global variable CDATE. * Change the argument cycle_dir to run_dir since that's more useful in this function. Do this instead of passing in cycle_dir and then forming run_dir. * Make the local variables "POST_..." lowercase to follow the convention that local variables be in lower case. tests/baseline_configs/config.community_ensemble_008mems.sh: * New workflow configuration file to perform WE2E test of ensemble forecasts in community mode with NUM_ENS_MEMBERS set to "008". This should result in eight workflow members with leading zeros in the digits in the names of the member directories, i.e. directories should be named mem001, mem002, ..., mem008. To test the newly (re)added workflow capability to run multiple cycles, this test also uses first and last cycle dates (DATE_FIRST_CYCL and DATE_LAST_CYCL) that are one day apart (instead of both being the same day as in most other WE2E tests) and two cycle hours per day. tests/baseline_configs/config.community_ensemble_2mems.sh: * New workflow configuration file to perform WE2E test of ensemble forecasts in community mode with NUM_ENS_MEMBERS set to "2". This should result in two workflow members without leading zeros in the digits in the names of the member directories, i.e. directories should be named mem1 and mem2. To test the newly (re)added workflow capability to run multiple cycles, this test also uses first and last cycle dates (DATE_FIRST_CYCL and DATE_LAST_CYCL) that are one day apart (instead of both being the same day as in most other WE2E tests) and two cycle hours per day. tests/baseline_configs/config.nco_ensemble.sh: * New workflow configuration file to perform WE2E test of ensemble forecasts in NCO mode with NUM_ENS_MEMBERS set to "2". This should result in two workflow members without leading zeros in the digits in the names of the member directories, i.e. directories should be named mem1 and mem2. To test the newly (re)added workflow capability to run multiple cycles, this test also uses two cycle hours per day ("12" and "18"). However, unlike the ensemble tests in community mode, it uses the same first and last cycle dates (DATE_FIRST_CYCL and DATE_LAST_CYCL) because currently, the external model data for the next day (which would be 20190902) is not staged on hera. tests/baselines_list.txt: * Add the new WE2E tests for running ensemble forecasts described above. ush/config_defaults.sh: (H), (I) * Move the section for user-staged external model files parameters up to after the section that defines external model parameters. * Move the section on stochastic parameters to after the new section that specifies whether or not to run ensemble forecasts (i.e. the variables DO_ENSEMBLE and NUM_ENS_MEMBERS; see below). * Introduce the new workflow variable DO_ENSEMBLE that specifies whether or not to run ensemble forecasts. Enable ensemble forecasts by setting DO_ENSEMBLE to "TRUE". * Introduce the new workflow variable NUM_ENS_MEMBERS that specifies the number of ensemble members. This variable also specifies the number of digits to use in the names of the ensemble member directories, e.g. whether to use mem1, mem2, ..., mem8 or mem01, mem02, ..., mem08. For example, if NUM_ENS_MEMBERS is set to "8", then the member directory names will be mem1, mem2, ..., mem8; and if NUM_ENS_MEMBERS is set to "08", then the member directory names will be mem01, mem02, ..., mem08. This variable is not used if DO_ENSEMBLE is not set to "TRUE". ush/create_diag_table_files.sh: * New file that defines a function [create_diag_table_files()] that creates a diagnostics table file for each cycle date and places it in the corresponding cycle directory. ush/create_model_config_files.sh: * New file that defines a function [create_model_config_files()] that creates a model configuration file for each cycle date and places it in the corresponding cycle directory. ush/generate_FV3SAR_wflow.sh: (H), (J) (L), (M) * Add new ensemble-related parameters to the "settings" variable that is used to customize the jinja2 template for the rocoto XML file. These new parameters allow the resulting XML to loop over ensemble members, to name rocoto tasks and log files such that they contain the ensemble member name (and are thus unique within an experiment), and to pass to the j-jobs the subdirectory of the current ensemble member under the current cycle directory. * Change "settings" variable used to set parameters in the jinja template for the rocoto XML to add capability to have more than one cycle in the experiment. This capability was previously present but was inadvertently disabled during transition to generating the rocoto XML using a jinja2 template. * Use the new workflow array variable ALL_CDATES (containing all the cycle dates/times to run) to create all the cycle directories. Previously, the cycle directories were created during the make_ics or make_lbcs task. Now, this must be done during the experiment generation step because now, the model configuration file(s) and possibly also the diagnostics table file(s) (if the MAKE_GRID_TN step is being skipped), which are cycle-dependent but ensemble-member-independent, are created and placed in the cycle directories during experiment generation. * Call the new function create_model_config_files() to create a model configuration file within each cycle directory. * If not running the MAKE_GRID_TN task, then do (E), (F), and (G). ush/set_FV3nml_sfc_climo_filenames.sh: (J) ush/set_FV3nml_stoch_params.sh: * New file that defines a function [set_FV3nml_stoch_params()] that, for each ensemble member, takes the base FV3 namelist file and generates from it a new FV3 namelist file containing a unique set of stochastic parameters (relative to other ensemble members) and places it at the top level of the experiment directory. ush/set_cycle_dates.sh: * New file that defines a function [set_cycle_dates()] that sets all the cycle dates/times to run in the experiment. ush/setup.sh: (H), (I), (V), (M) * Remove unneeded local variables YYYY_FIRST_CYCL, MM_FIRST_CYCL, DD_FIRST_CYCL, and HH_FIRST_CYCL. * Call the new function set_cycle_dates() to set the new worklow array variable ALL_CDATES containing all the cycle days/times to be run as part of the experiment. * Set the new workflow variable NUM_CYCLES (defined as the number of forecasts to run as part of the experiment) to the number of elements in ALL_CDATES. * Rename FCST_LEN_HRS_MAX to fcst_len_hrs_max since it is a local variable and thus (by convention) should be lower case. * Make sure that the new workflow variable DO_ENSEMBLE is set to a valid value. * Set the new workflow variable NDIGITS_ENSMEM_NAMES that specifies the number of digits to use in the names of the ensemble member directories, e.g. whether to use mem1, mem2, ..., mem8 or mem01, mem02, ..., mem08. Note that this is not a user-specifiable variable; it is obtained by counting the number of characters in NUM_ENS_MEMBERS. For example, if NUM_ENS_MEMBERS is set to "8", then NDIGITS_ENSMEM_NAMES will get set to "1" and the member directory names will be mem1, mem2, ..., mem8; and if NUM_ENS_MEMBERS is set to "08", then NDIGITS_ENSMEM_NAMES will get set to "2" and the member directory names will be mem01, mem02, ..., mem08. * Set the new workflow array variable ENSMEM_NAMES containing the names of the ensemble members. These are used to set the ensemble member subdirectory names. * Set the new workflow array variable FV3_NML_ENSMEM_FPS containing the full paths to the FV3 namelist files of the ensemble members. ush/templates/FV3.input.yml: * Remove setting of consv_te to 1.0 for the FV3_CPT_v0 suite because it generates an FV3 runtime error that states that this variable needs to be set to 0 for all regional runs. ush/templates/FV3SAR_wflow.xml: (L) * Modify jinja2 code to allow for multiple cycles to be run. This capability was previously present but was inadvertently disabled during transition to generating the rocoto XML using a jinja2 template. * Bug fix - Change file name "make_grid_task_complete.txt" to "&MAKE_GRID_TN;_task_complete.txt" to make it changeable with the task name. * Place a jinja2-controlled metatask around all tasks starting with MAKE_ICS_TN that loops over all ensemble members if do_ensemble is set to TRUE (note that do_ensemble gets set to the workflow variable DO_ENSEMBLE in ush/generate_FV3SAR_wflow.sh). Make the print format of the loop index dependent on ndigits_ensmem_names (which in turn gets set to the workflow variable NDIGITS_ENSMEM_NAMES in ush/generate_FV3SAR_wflow.sh) to get the correct names for the ensemble member subdirectories (in terms of the number of leading zeros to use for the number portion of the subdirectory name). * For tasks that are within the metatask that loops over the ensemble members, add a string (uscore_ensmem_name) that identifies the ensemble member to task names, corresponding job names, and log file names. Note that this variable is set to an empty string if not running ensemble forecasts. * For tasks that are within the metatask that loops over the ensemble members, create the new environment variable SLASH_ENSMEM_SUBDIR that gets set to the jinja2 variable slash_ensmem_subdir, which in turn gets set (in ush/generate_FV3SAR_wflow.sh) to a null string if not running ensembles and to the string "/${name_of_ensemble_member}" when running ensembles, where ${name_of_ensemble_member} is the name of the current ensemble member. * In the RUN_FCST_TN task, create a new environment variable named ENSMEM_INDX and set it to the current value of the index used in the loop over ensemble members. This is passed via the j-job jobs/JREGIONAL_RUN_FCST to the ex-script scripts/exregional_run_fcst.sh, where, if DO_ENSEMBLE is set to "TRUE", it is used to create the symlink in the run directory to the FV3 namelist file of the current ensemble member (which is in the top level of the experiment directory). Note that this variable is set to an empty string if not running ensemble forecasts (in that case, it is not used in the ex-script). * In the dependencies section of the RUN_POST_TN task, add slash_ensmem_subdir to the paths of the dynf*.nc and phyf*.nc files (since when running ensemble forecasts, these files will be in the ensemble member directories under the cycle directories). ush/valid_param_vals.sh: * Specify valid values for the new workflow variable DO_ENSEMBLE. Co-authored-by: Benjamin.Blake EMC <Benjamin.Blake@m71a3.ncep.noaa.gov>
EricJames-NOAA
pushed a commit
to EricJames-NOAA/UPP
that referenced
this pull request
Dec 14, 2022
* adding files for getting nomads data new files: ush/NOMADS_get_extrn_mdl_files_grib.sh ush/NOMADS_get_extrn_mdl_files_nemsio.sh * updated code for getting data online in one file new file: ush/NOMADS_get_extrn_mdl_files.sh deleted files: ush/NOMADS_get_extrn_mdl_files_grib.sh ush/NOMADS_get_extrn_mdl_files_nemsio.sh * Enable ensemble forecasts (NOAA-EMC#245) * Modify workflow to enable ensemble forecasts. Summary of modifications: ------------------------ * Introduce the new workflow variables DO_ENSEMBLE and NUM_ENS_MEMBERS. The user can enable ensemble forecasts by setting DO_ENSEMBLE to "TRUE" and NUM_ENS_MEMBERS to the number of ensemble members to use. * When running ensemble forecasts, create/insert a set of ensemble member directories and create the cycle directories under these member directories. These ensemble member directories are placed at the directory level that cycle directory levels would be placed when not running ensemble forecasts. * Regardless of whether or not ensembles are enabled, change location where external model files are staged so that they are not in the cycle directories but instead one (without ensembles) or two (with ensembles) directory levels up. In the case with ensembles, this needs to be done so that the external model files are not duplicated within each ensemble member directory; they do not need to be because all ensemble members use the same external model files. This is also done for the case without ensembles in order to minimize the difference in worklfow behavior between the with and without ensemble cases. To make this change of location of external model files, the new workflow variable EXTRN_MDL_FILES_BASEDIR is introduced (it is not a user-specified variable but a secondary one). * Add two new WE2E tests for running ensemble forecasts, one in community mode (community_ensemble) and another in NCO mode (nco_ensemble). Modifications common to more than one file (used below in listing of file-by-file modifications): ------------------------------------------------------------------------------------------------ (A) Fix/add/delete comments and/or informational and/or error messages. (B) Remove commented out code. (C) Change location where external model files are staged so that they are not in the cycle directories (which are now underneath each ensemble member directory) but instead one level up. This needs to be done so that the external model files are not duplicated within each ensemble member directory; they do not need to be because all ensemble members use the same external model files. (D) Add a call to the new function set_FV3nml_stoch_params() that takes a base FV3 namelist file and generates from it a new FV3 namelist file for each ensemble member containing a unique set of stochastic parameters (relative to other ensemble members) and places it at the top level of that ensemble member's directory (so that all cycles in that member directory can create symlinks to it). (E) Rename the variable FV3_NML_BASE_FN to FV3_NML_BASE_SUITE_FN to clarify that it specifies the name of the FV3 namelist file for the base physics suite (which is used to generate the namelist file specific to the user-specified physic suite). This is done to better distinguish this base namelist file from the base namelist file used to generate namelist files for the various ensemble members. (The name of the latter is specified in the new workflow variable FV3_NML_BASE_ENS_FN.) (F) Introduce the new workflow variable FV3_NML_BASE_ENS_FN that specifies the name to use for the base FV3 namelist file from which to generate the namelist file for each ensemble member. This variable is not used if not running ensemble forecasts (i.e. if DO_ENSEMBLE is not set to "TRUE"). (G) Add the local variable dummy_cyc_dir that specifies the (dummy) directory with respect to which to set the relaive paths of the fixed files (i.e. those in the FIXam directory) in the FV3 namelist file. When running ensembles, this path is two levels up from the cycle directory; without ensembles, it is only one level up (as was originally the case). File-by-file description of modifications: ----------------------------------------- jobs/JREGIONAL_GET_EXTRN_MDL_FILES: (C) jobs/JREGIONAL_RUN_FCST: * Change CYCLE_DIR to cycle_dir since it is a local variable in this context (it is an argument to the script exregional_run_fcst.sh). jobs/JREGIONAL_RUN_POST: * For NCO mode, change directory in which output from the RUN_POST_TN task is stored such that if running ensemble forecasts, subdirectories are created under COMOUT_BASEDIR for each ensemble member. This is done via the variable SLASH_ENSMEM_DIR, which is set to either "/mem$NN" where $NN is the member number (if running ensemble forecasts) or to a null string (if not running ensemble forecasts). For community mode, the output from the post task is under CYCLE_DIR, which now gets set in the rocoto XML such that it is under an ensemble member directory (see below description of modifications to ush/templates/FV3SAR_wflow.xml). modulefiles/tasks/hera/make_ics.local: * Add wgrib2 (must have been removed by mistake?). modulefiles/tasks/hera/make_lbcs.local: * Add wgrib2 (must have been removed by mistake?). scripts/exregional_make_grid.sh: (A), (D) scripts/exregional_make_ics.sh: (C) scripts/exregional_make_lbcs.sh: (C) scripts/exregional_run_fcst.sh: * Change CYCLE_DIR to cycle_dir since it is a local variable. * Add a check such that if running ensemble forecasts, the symlink for the FV3 namelist file that must be present in the cycle directory points to the namelist file at the top level of the ensemble directory under which that cycle directory is located. tests/baseline_configs/config.community_ensemble.sh: * New workflow configuration file to perform WE2E test of ensemble forecasts in community mode. tests/baseline_configs/config.nco_ensemble.sh: * New workflow configuration file to perform WE2E test of ensemble forecasts in NCO mode. tests/baselines_list.txt: * Add two new WE2E tests for running ensemble forecasts, one in community mode (community_ensemble) and another in NCO mode (nco_ensemble). ush/config_defaults.sh: (A), (E), (F) * Introduce the new workflow variable DO_ENSEMBLE that specifies whether or not to run ensemble forecasts. Enable ensemble forecasts by setting DO_ENSEMBLE to "TRUE". * Introduce the new workflow variable NUM_ENS_MEMBERS that specifies the number of ensemble members. This variable is not used if DO_ENSEMBLE is not set to "TRUE". ush/generate_FV3SAR_wflow.sh: (A), (B), (D), (E), (G) * Add new ensemble-related parameters to the "settings" variable that is used to customize the jinja2 template for the rocoto XML file. These new parameters allow the resulting XML to loop over ensemble members, rename rocoto tasks and log files such that they contain the member number (and are thus unique), and modify cycle directories so that they are member-specific. ush/set_FV3nml_sfc_climo_filenames.sh: (G) ush/set_FV3nml_stoch_params.sh * File to define new function that takes a base FV3 namelist file and generates from it a new FV3 namelist file for each ensemble member containing a unique set of stochastic parameters (relative to other ensemble members) and places it at the top level of that ensemble member's directory (so that all cycles in that member directory can create symlinks to it). ush/setup.sh: (E), (F) * Rename FCST_LEN_HRS_MAX to fcst_len_hrs_max since it is a local variable. * Introduce the new workflow variable EXTRN_MDL_FILES_BASEDIR that specifies the base directory under which the external model files will be staged. Under this directory, a subdirectory will be created for each external model (one for ICs, another for LBCs if different from the one for ICs), and under these, subdirectories will be created for each cycle in which to stage the files. Note that EXTRN_MDL_FILES_BASEDIR is a secondary variable in the sense that it is not user-specifiable. * Rename FV3_NML_BASE_FP to FV3_NML_BASE_SUITE_FP for the same reason as renaming of FV3_NML_BASE_FN to FV3_NML_BASE_SUITE_FN (see (E) above). * Create new workflow variable FV3_NML_BASE_ENS_FP that specifies the full path to the base FV3 namelist file from which the namelist files for the individual ensemble members are generated. * Introduce the new workflow array variable ENS_MEMBER_DIRS. If running ensemble forecasts, set its elements to the ensemble member directories immediately under the experiment directory. * If running ensemble forecasts, create the ensemble directories specified in the new workflow array variable ENS_MEMBER_DIRS. * Record new variables to the workflow variable definitions file. ush/templates/FV3SAR_wflow.xml: * Bug fix - Change file name "make_grid_task_complete.txt" to "&MAKE_GRID_TN;_task_complete.txt" to make it changeable with the task name. * Remove CYCLE_BASEDIR as an environment variable from the GET_EXTRN_ICS_TN and GET_EXTRN_LBCS_TN tasks. This variable is no longer needed because the external model files are now staged outside of the cycle directories (under EXTRN_MDL_FILES_BASEDIR). * Place a jinja2-controlled metatask around all tasks starting with MAKE_ICS_TN that loops over all ensemble members if do_ensemble is set to TRUE. * For tasks that are within the metatask that loops over the ensemble members, add a string (uscore_ensmem_name) that identifies the ensemble member to task names, corresponding job names, and log file names. Note that this variable is set to an empty string if not running ensemble forecasts. * For tasks that are within the metatask that loops over the ensemble members, add a string (slash_ensmem_dir) that inserts the ensemble member directory to the definition of CYCLE_DIR (since when running ensembles, the cycle directories are under the member directories). Note that this variable is set to an empty string if not running ensemble forecasts. * Set the ensemble index (ENSMEM_INDX) as an environment variable in the RUN_FCST_TN task. This is needed in the ex-script exregional_run_fcst.sh to be able to the symlink in the cycle directory to the FV3 namelist file in the correct ensemble member directory. Note that this variable is set to an empty string if not running ensemble forecasts (in that case, it is not used). * Set the ensemble member subdirectory preceded by a slash (SLASH_ENSMEM_DIR) as an environment variable in the RUN_POST_TN task. This is needed in NCO mode when setting the directory in which to place the output of UPP. Note that this variable is set to an empty string if not running ensemble forecasts. ush/valid_param_vals.sh: * Specify valid values for the new workflow variable DO_ENSEMBLE. * Minor changes to code comments. * Apparently there is now a requirement in the FV3 code that consv_te be set to 0 on any regional grid. Make this change for the FV3_CPT_v0 suite (which is the only one for which consv_te had been set to a nonzero value). * Bug fix in a diag_table that should already be in the develop branch. * Bug fix -- Fix inconsistency in the way the ensemble member directories are named in different scripts. The workflow generation scripts create ensemble directories named, e.g., mem1, mem2, ..., mem8, but the exregional_run_fcst.sh script assumes they are mem01, mem02, ..., mem08. Make these consistent. Now, the naming convention used depends on whether or not leading zeros are included in NUM_ENS_MEMBERS. For example, if NUM_ENS_MEMBERS is set to "8", then the member directory names will be mem1, mem2, ..., mem8; and if NUM_ENS_MEMBERS is set to "08", then the member directory names will be mem01, mem02, ..., mem08. * Add new WE2E test to test use of leading zeros in ensemble member names. * Change directory structure so that ensemble member directories are beneath the cycle directories (instead of the opposite). Details below. Summary of modifications: ------------------------ * Place cycle directories above ensemble member directories, i.e. each cycle directory will contain a full set of ensemble member subdirectories that are used as the run directories. Previously, it was the other way around, i.e. each member directory contained all cycle subdirectories. * Move the external model directories into each cycle directory (instead of being in their own directory called extrn_mdl_files under the main experiment directory). * During the experiment generation step, generate the full list of cycle dates/times to run and create a directory for each cycle. * Add capability to have more than one cycle. This capability was previously present but was inadvertently disabled during transition to generating the rocoto XML using a jinja2 template. * In order to test the capability of the workflow to run multiple cycles (possibly on different days), modify the WE2E tests community_ensemble_2mems and nco_ensemble so that there are two cycle hours per day. Also, modify community_ensemble_2mems so that the starting and ending days are different (one day later). Modifications common to more than one file (used below in listing of file-by-file modifications): ------------------------------------------------------------------------------------------------ (A) Change the location where external model files are stored to be under each cycle directory (instead of a separate directory specified by EXTRN_MDL_FILES_BASEDIR under the main experiment directory). (B) Insert the new environment variable SLASH_ENSMEM_SUBDIR anywhere CYCLE_DIR appears. This variable is passed in by the rocoto XML. If not running ensemble forecasts, it is simply set to an empty string, and if running ensembles, it is set to the string "/${ensmem_subdir}" where ensmem_subdir is the subdirectory of the current ensemble member under the current cycle directory. This allows the subdirectories containing ICS, LBCS, and RESTART files to be placed directly under the current cycle directory when NOT running ensembles and for them to be placed under the current ensemble member directory (which is one level down from the current cycle directory) when running ensembles. (C) For clarity, add new local variable run_dir that gets set to the run directory based on the current cycle and, if applicable, the ensemble member. (D) Call the new function create_diag_table_files (in the new file create_diag_table_files.sh) to create diagnostics table files. (D) For correctness, rename the local variable dummy_cyc_dir to dummy_run_dir. (E) Remove any use of EXTRN_MDL_FILES_BASEDIR since it is no longer needed as a workflow variable. (F) Remove any use of ENS_MEMBER_DIRS since it is no longer needed as a workflow variable. (V) Remove unused code. (W) Edit informational and/or error messages. (X) Remove trailing whitespace. (Y) Remove commented out code. (Z) Edit comments. File-by-file description of modifications: ----------------------------------------- jobs/JREGIONAL_GET_EXTRN_MDL_FILES: (A) jobs/JREGIONAL_MAKE_ICS: (B) jobs/JREGIONAL_MAKE_LBCS: (B) jobs/JREGIONAL_RUN_FCST: (B), (C), (Z) * Pass in ENSMEM_INDX and SLASH_ENSMEM_SUBDIR as arguments to the function exregional_run_fcst(). jobs/JREGIONAL_RUN_POST: (B), (Z) * Create the new local variable run_dir in which to store the path to the run directory (for the current cycle and possibly ensemble member). * In NCO mode, change location where ensemble directories are created to be under the cycle directory instead of above it (analogous change to NCO mode as is done in (B) for community mode). * In community mode, place the postprd subdirectory under the run directory instead of under CYCLE_DIR (since now, cycle directories are one level up if running ensembles and thus would be the incorrect place to create postprd). * Create the new argument cdate to the function exregional_run_post() and pass in the environment variable CDATE for its value (this is instead of using CDATE directly in exregional_run_post()). * Change the argument cycle_dir of the function exregional_run_post to run_dir since that's what we really want in that function. This is instead of passing in cycle_dir and then forming run_dir. scripts/exregional_make_grid.sh: (D) scripts/exregional_make_ics.sh: (A), (X) scripts/exregional_make_lbcs.sh: (A), (X) scripts/exregional_run_fcst.sh: (C), (W) * Introduce new input arguments ensmem_indx and slash_ensmem_subdir that get set to the rocoto-specified environment variables ENSMEM_INDX and SLASH_ENSMEM_SUBDIR, respectively, in the call to this function in jobs/JREGIONAL_RUN_FCST. * Change cycle_dir to run_dir in most places to make the directory name more general. This is because the run directory will be the cycle directory only when not running ensembles. When running ensembles, the run directory will be one of the ensemble member directories, which will be one level down from the current cycle directory. * Fix typo where there is an extra "}" printed after $target in error messages. * Use the new workflow array variable FV3_NML_ENSMEM_FPS (which contains the full paths to the FV3 namelist files for each ensemble member) when creating a link in the run directory to the FV3 namelist file for the current ensemble member. Note that these namelist files are cycle-independent and thus are created only once during the experiment generation step. * Move the creation of diagnostics table files to a new function (in ush/create_diag_table_files.sh), and call that function during experiment generation (in ush/generate_FV3SAR_wflow.sh) instead of here in exregional_run_fcst.sh. We do this because the diagnostics table files depend only on the cycle, not the ensemble member. Thus, since we know the cycles to run at experiment generation time, we generate the diagnostics file for each cycle then and place each in its corresponding cycle directory. * If running ensembles, create symlinks in the run directory to the diagnostics table and model configure files in the cycle directory (which will be one level up from the run directory). We don't do this when NOT running ensembles because in that case, the run directory is the cycle directory (and these two files already exist in that directory; they are created during experiment generation time). scripts/exregional_run_post.sh: (C), (Y) * Create the new input argument cdate (which gets set to the global variable CDATE in the call to this function (exregional_run_post)) and use it instead of the global variable CDATE. * Change the argument cycle_dir to run_dir since that's more useful in this function. This is instead of passing in cycle_dir and then forming run_dir. * Make the local variables "POST_..." lowercase to follow the convention that local variables be in lower case. tests/baseline_configs/config.community_ensemble_2mems.sh: * Modify settings in this test configuration so that the starting and ending days of the cycles are not the same and so that there are two cycle hours per day. This is to have more thorough testing of the ensembles feature in community mode. tests/baseline_configs/config.nco_ensemble.sh: * Modify settings in this test configuration so that there are two cycle hours per day. This is to have more thorough testing of the ensembles feature in NCO mode. ush/create_diag_table_files.sh: * New file containing a function that creates a diagnostics table file for each cycle date and places it in the corresponding cycle directory. ush/create_model_config_files.sh: * New file containing a function that creates a model configuration file for each cycle date and places it in the corresponding cycle directory. ush/set_cycle_dates.sh: * New function that sets all the cycle dates/times to run. ush/generate_FV3SAR_wflow.sh: (D) * For clarity and consistency with other scripts, change variable name from slash_ensmem_dir to slash_ensmem_subdir. * Change "settings" variable used to set parameters in the jinja template for the rocoto XML to add capability to have more than one cycle. This capability was previously present but was inadvertently disabled during transition to generating the rocoto XML using a jinja2 template. * Use the new workflow array variable ALL_CDATES (containing all the cycle dates/times to run) to create all the cycle directories. Previously, the cycle directories were created during the make_ics or make_lbcs task, but it is clearer to do it during experiment generation. Also, it now must be done during experiment generation because now, the model configuration file(s) and possibly also the diagnostics table file(s) (if the MAKE_GRID_TN step is being skipped), which are cycle-dependent but ensemble-member-independent, are created and placed in the cycle directories during experiment generation. * Call the new function create_model_config_files() to create a model configuration file within each cycle directory. * If not running the MAKE_GRID_TN task, call the new function create_diag_table_files() to create a diagnostics table file within each cycle directory. ush/set_FV3nml_sfc_climo_filenames.sh: (D) ush/set_FV3nml_stoch_params.sh: (Z) * For consistency with other scripts, rename the variable fv3_nml_ens_fp to fv3_nml_ensmem_fp. * Use the new workflow array variable FV3_NML_ENSMEM_FPS (which contains the full paths to the FV3 namelist files for each ensemble member) to set the full path to the current ensemble member's FV3 namelist file. ush/setup.sh: (E), (F), (V), (Z) * Call the new function set_cycle_dates() to set the new worklow array variable ALL_CDATES containing all the cycle days/times to be run. * Set the new workflow variable NUM_CYCLES to the number of elements in ALL_CDATES. * Set the new workflow array variable ENSMEM_NAMES containing the names of the ensemble members. * Set the new workflow array variable FV3_NML_ENSMEM_FPS containing the full paths to the FV3 namelist files of the ensemble members. * Remove creation of ensemble member directories. These are now created in the j-jobs of the MAKE_ICS_TN or the MAKE_LBCS_TN task (whichever runs first). ush/templates/FV3SAR_wflow.xml: (Y) * Modify jinja code to allow for multiple cycles to be run. This capability was previously present but was inadvertently disabled during transition to generating the rocoto XML using a jinja2 template. * Add CYCLE_DIR as an environment variable to the GET_EXTRN_ICS_TN and GET_EXTRN_LBCS_TN tasks. * In the MAKE_ICS_TN, MAKE_LBCS_TN, RUN_FCST_TN, and RUN_POST_TN tasks, change the way CYCLE_DIR is set so that it does not include the ensemble member subdirectory. * In the MAKE_ICS_TN, MAKE_LBCS_TN, RUN_FCST_TN, and RUN_POST_TN tasks, create the new environment variable SLASH_ENSMEM_SUBDIR that gets set to a null string if not running ensembles and to the string "/${name_of_ensemble_member}" when running ensembles. * Bug fixes. These bugs were introduced during the previous merge of the develop branch into this fork (feature/ensemble). * To test the multiple-days and multiple-cycle-hours capabilities with ensemble forecasts in community mode, change WE2E test "community_ensemble_008" to include 2 days and 2 cycle hours per day (instead of 1 and 1, respectively). * Add WCOSS changes for the feature/ensemble branch * Minor changes to code comments. Co-authored-by: Benjamin.Blake EMC <Benjamin.Blake@m71a3.ncep.noaa.gov> * updated codes according to the comments modified files: scripts/exregional_get_extrn_mdl_files.sh ush/NOMADS_get_extrn_mdl_files.sh ush/config_defaults.sh ush/generate_FV3SAR_wflow.sh ush/valid_param_vals.sh * Change filename: ush/generate_FV3SAR_wflow.sh -> ush/generate_FV3LAM_wflow.sh * adding one test to WE2W for downloading files new file for the test: tests/baseline_configs/config.user_download_extrn_files.sh modified file for the test: tests/baselines_list.txt modified files for recent changes(SAR to LAM, JPgrid to ESGgrid) scripts/exregional_get_extrn_mdl_files.sh ush/config_defaults.sh ush/valid_param_vals.sh Co-authored-by: Linlin.Pan <Linlin.Pan@noaa.gov> Co-authored-by: gsketefian <31046882+gsketefian@users.noreply.github.com> Co-authored-by: Benjamin.Blake EMC <Benjamin.Blake@m71a3.ncep.noaa.gov>
EricJames-NOAA
pushed a commit
to EricJames-NOAA/UPP
that referenced
this pull request
Dec 14, 2022
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Add a dbnet alert for the data file gfsspecial in GFS V16 post workflow.