Tablet throttler: get remote tablets metrics from Realtime Stats, with auto-detection #13034
…imeStats Signed-off-by: Shlomi Noach <2607934+shlomi-noach@users.noreply.github.com>
…id actively probing for relevant tablet Signed-off-by: Shlomi Noach <2607934+shlomi-noach@users.noreply.github.com>
This PR is being marked as stale because it has been open for 30 days with no activity. If no action is taken within 7 days, this PR will be closed.
We're not going to pursue this path. Instead, we will replace the throttler's HTTP calls with RPC calls.
Description
An enhancement of #13018; per #13018 (comment), this is a modification where the newly introduced `--feature-throttler-read-realtime-stats` command line flag is not required, and is removed in this PR.

In this PR we track the availability of throttler metrics in `RealtimeStats`. If a throttler metric was seen in `RealtimeStats` in the past minute, we do not run probes on the relevant tablet. If no metric has been seen for a tablet in the past minute, then the throttler runs the usual probes (currently HTTP based) for that tablet.

I'm not sure this approach is better than #13018, and the reason has to do with probe frequency. The `PRIMARY` tablet runs the standard probes at subsecond intervals. However, it has no control over the probing frequency in other tablets. Thus, if `--health_check_interval` is high on replica tablets, say `10s`, that means the `PRIMARY` has low resolution for throttler metrics (in particular, replication lag).

It does make sense when the throttler's threshold accommodates `--health_check_interval`. For example, a `health_check_interval` of `5s` makes sense if the throttler is configured for the default replication lag metric, and the threshold is configured to, say, `30`. But if the threshold is at `5s`, then I'd expect a `1s-2s` value for `health_check_interval`.
Related Issue(s)
Checklist
Deployment Notes