Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

CASMCMS-9162: Add BOS option: cfs-read-timeout #5475

Draft
wants to merge 2 commits into
base: release/1.5
Choose a base branch
from
Draft
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
26 changes: 25 additions & 1 deletion operations/boot_orchestration/Options.md
Original file line number Diff line number Diff line change
Expand Up @@ -22,16 +22,19 @@ Example output:

```json
{
"cfs_read_timeout": 20,
"cleanup_completed_session_ttl": "7d",
"component_actual_state_ttl": "4h",
"default_retry_policy": 3,
"disable_components_on_completion": true,
"discovery_frequency": 300,
"logging_level": "INFO",
"max_boot_wait_time": 600,
"max_component_batch_size": 1800,
"max_power_off_wait_time": 180,
"max_power_on_wait_time": 30,
"polling_frequency": 60
"polling_frequency": 60,
"session_limit_required": false
}
```

Expand All @@ -43,6 +46,12 @@ The values for all BOS global options can be modified with the `cray bos v2 opti

The following are the BOS global options:

* `cfs_read_timeout`

The amount of time in seconds BOS will wait for a response from CFS to a request. After this time, the request will
time out. The default is 10 seconds.
Note: This option is only available as a 'hotfix' in CSM-1.5.
jsollom-hpe marked this conversation as resolved.
Show resolved Hide resolved

jsollom-hpe marked this conversation as resolved.
Show resolved Hide resolved
* `cleanup_completed_session_ttl`

Delete complete sessions that are older than `cleanup_completed_session_ttl` (in hours). `0h` disables cleanup behavior.
Expand Down Expand Up @@ -81,6 +90,12 @@ The following are the BOS global options:

How long BOS will wait for a node to boot into a usable state before rebooting it again (in seconds).

* `max_component_batch_size`

The maximum number of components that BOS will group together in a single API request it makes. This can be used to limit the load
on other services by forcing BOS to break up its requests into smaller chunks.
Note: This option is only available as a 'hotfix' in CSM-1.5.

* `max_power_off_wait_time`

How long BOS will wait for a node to power off before forcefully powering it off (in seconds).
Expand All @@ -92,3 +107,12 @@ The following are the BOS global options:
* `polling_frequency`

How frequently the BOS operators check component state for needed actions (in seconds).

* `session_limit_required`

If enabled, BOS sessions cannot be created without specifying the `limit` parameter.
This can be helpful in avoiding accidental reboots of more components than intended.
If this option is enabled, it is still possible to effectively create a session with no limit
by specifying `*` as the limit parameter (if this is done on the command line, it must be
quoted it in order to prevent it from being interpreted by the shell).
Note: This option is only available as a 'hotfix' in CSM-1.5.