[statsd] Disable statsd buffering by default #692

sgnn7 · 2021-10-13T18:58:41Z

What does this PR do?

Because enabling buffering by default has affected users of other clients,
especially in environments and frameworks where fork() is used, we will
disable buffering by default for the time being, until a decision is made
on the path forward. Buffering can still be turned on with the
disable_buffering = False flag; for now the flag simply defaults to
True.
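For context on the fork() concern, here is a minimal, POSIX-only sketch of the general mechanism: a background thread started in the parent does not exist in a forked child. The `flusher` function and `flush_thread_survives_fork` helper are hypothetical names used only for illustration; this is not the datadog client's code, just a stand-in for any thread-based periodic flush.

```python
import os
import threading


def flusher(stop):
    # Hypothetical stand-in for a periodic buffer-flush loop.
    while not stop.wait(0.05):
        pass


def flush_thread_survives_fork():
    """Return (alive_in_parent, alive_in_child) for a background thread."""
    stop = threading.Event()
    thread = threading.Thread(target=flusher, args=(stop,), daemon=True)
    thread.start()

    read_fd, write_fd = os.pipe()
    pid = os.fork()  # POSIX only; only the calling thread is copied
    if pid == 0:
        # Child process: report whether the flush thread is still running.
        os.close(read_fd)
        os.write(write_fd, b"1" if thread.is_alive() else b"0")
        os._exit(0)

    # Parent process: collect the child's answer, then clean up.
    os.close(write_fd)
    alive_in_child = os.read(read_fd, 1) == b"1"
    os.close(read_fd)
    os.waitpid(pid, 0)
    alive_in_parent = thread.is_alive()
    stop.set()
    thread.join()
    return alive_in_parent, alive_in_child


parent_alive, child_alive = flush_thread_survives_fork()
print("parent:", parent_alive, "child:", child_alive)
```

If a child process keeps appending to a buffer that only such a thread would flush, those metrics would sit in memory, which is the kind of disruption that makes an unbuffered default the safer choice for now.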

Description of the Change

Use of DogStatsd via statsd module-level object defaults to synchronous (un-buffered) metric sending.
Use of DogStatsd via the default constructor defaults to synchronous (un-buffered) metric sending.

Alternate Designs

In discussion (TBD). We may revert this and instead automatically detect environments
that are incompatible with thread-based buffering.

Possible Drawbacks

Lower performance in high-throughput environments.

Verification Process

Unit tests cover most of this:

"python$(python --version 2>&1 | cut -c8-8)" -m unittest -vvv tests.unit.dogstatsd.test_statsd

Additional Notes

This change may be reversed in the future, but for now the risk of user
disruption is too high.

Release Notes

Disable statsd buffering by default. Buffering can be enabled with the disable_buffering = False flag.

Note: Since this is just a partial revert of #670, both release notes can be removed, as it's a net-zero change.

Review checklist (to be filled by reviewers)

Feature or bug fix MUST have appropriate tests (unit, integration, etc...)
PR title must be written as a CHANGELOG entry (see why)
Files changes must correspond to the primary purpose of the PR as described in the title (small unrelated changes should have their own PR)
PR must have one changelog/ label attached. If applicable it should have the backward-incompatible label attached.
PR should not have do-not-merge/ label attached.
If Applicable, issue must have kind/ and severity/ labels attached at least.

Because enabling buffering by default has affected users of other clients, especially in environments and frameworks where `fork()` is used, we will disable buffering by default for the time being, until a decision is made on the path forward. Buffering can still be turned on with the `disable_buffering = False` flag; for now the flag simply defaults to `True`.

Since we are at least for the time being disabling buffering by default, the docs here are being updated to match the changes applied.

sgnn7 · 2021-10-13T19:09:45Z

/azp run

azure-pipelines · 2021-10-13T19:10:01Z

Azure Pipelines successfully started running 2 pipeline(s).

With a non-buffered environment, our context manager may not work properly. This change ensures that we properly test this behavior.

truthbk

This looks good to me, I added a note, but if the answer dismisses any concerns I'm OK with the change.

truthbk · 2021-10-13T21:27:07Z

datadog/dogstatsd/base.py

@@ -453,16 +450,16 @@ def open_buffer(self, max_buffer_size=None):

        self._manual_buffer_lock.acquire()

+        # XXX Remove if `disable_buffering` default is changed to False
+        self._send = self._send_to_buffer


This stuff is not thread-safe. I presume it never really was and I assume that's beyond the expected use-cases.

@truthbk Generally things should be thread-safe due to various locks on the socket/buffer/context ops but you may get unexpectedly-buffered metrics if some threads are opening/closing buffers while another thread is just writing metrics without a context manager or open_buffer. Overall though, there should be no data loss or exceptions. As for the setter here, the lock + GIL makes this effectively an atomic operation. We do have a test around at least part of this here. If I'm missing something though, I'll fix up the PR for sure.

I presume it never really was and I assume that's beyond the expected use-cases.

Somewhat. The way it was coded originally was not really thread safe but the changes made subsequently over this year have closed up most of the glaring exception issues. The buffering-by-default fixed the self._send = GIL reliance but we're backing that part out with this PR for the time being 😢 .
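To make the "unexpectedly-buffered metrics" case above concrete, here is a minimal sketch. `ToyStatsd` is a hypothetical stand-in written for this illustration, not the datadog `DogStatsd` class; it only mirrors the idea from the diff that `open_buffer` takes a lock and swaps `self._send` to a buffering function.

```python
import threading


class ToyStatsd:
    """Hypothetical toy client; NOT the datadog implementation."""

    def __init__(self):
        self.sent = []  # packets that "left" the client, in order
        self._buffer = []
        self._buffer_lock = threading.RLock()
        self._manual_buffer_lock = threading.RLock()
        self._send = self._send_to_server  # unbuffered by default

    def _send_to_server(self, packet):
        self.sent.append(packet)

    def _send_to_buffer(self, packet):
        with self._buffer_lock:
            self._buffer.append(packet)

    def open_buffer(self):
        self._manual_buffer_lock.acquire()
        # Every caller of self._send now buffers, not just this thread.
        self._send = self._send_to_buffer

    def close_buffer(self):
        try:
            self.flush()
        finally:
            self._send = self._send_to_server
            self._manual_buffer_lock.release()

    def flush(self):
        with self._buffer_lock:
            if self._buffer:
                self.sent.append("\n".join(self._buffer))
                self._buffer = []

    def __enter__(self):
        self.open_buffer()
        return self

    def __exit__(self, *exc_info):
        self.close_buffer()

    def increment(self, name):
        self._send(f"{name}:1|c")


client = ToyStatsd()
client.increment("a")  # sent immediately
with client:
    client.increment("b")
    # Another thread writing without a context manager gets buffered too.
    other = threading.Thread(target=client.increment, args=("d",))
    other.start()
    other.join()
client.increment("e")  # sent immediately again
print(client.sent)
```

In this toy version nothing is lost and nothing raises: the other thread's metric is simply delivered with the batch when the buffer closes, which matches the behavior described in the reply.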

therve · 2021-10-14T06:45:14Z

/azp run

azure-pipelines · 2021-10-14T06:45:30Z

Azure Pipelines successfully started running 2 pipeline(s).

Since DataDog/datadogpy#692, datadog emits the following logs for every request:

```
pypi-warehouse-web-84cff6b7c7-w7brv web {"logger": "datadog.dogstatsd", "level": "INFO", "event": "Statsd buffering is disabled", "thread": 140486667159360}
pypi-warehouse-web-84cff6b7c7-w7brv web {"logger": "datadog.dogstatsd", "level": "INFO", "event": "Statsd periodic buffer flush is disabled", "thread": 140486667159360}
```

This sets the log level to silence these and clean up production logs.
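A minimal sketch of one way to silence these messages with the standard `logging` module. The logger name `datadog.dogstatsd` is taken from the log output above; the `quiet_dogstatsd_logs` helper and the choice of `WARNING` as the threshold are assumptions for illustration, not necessarily what the linked change does.

```python
import logging


def quiet_dogstatsd_logs(level=logging.WARNING):
    """Raise the threshold of the logger named in the messages above."""
    logger = logging.getLogger("datadog.dogstatsd")
    logger.setLevel(level)  # INFO records from this logger are now dropped
    return logger


logger = quiet_dogstatsd_logs()
```

Setting the level on the named logger leaves the root logger and other libraries untouched, so only the two INFO messages above are affected.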

sgnn7 added the resource/dogstatsd, documentation, changelog/Changed, kind/feature-request, and severity/normal labels Oct 13, 2021

sgnn7 added this to the Next milestone Oct 13, 2021

sgnn7 requested review from a team as code owners October 13, 2021 18:58

sgnn7 self-assigned this Oct 13, 2021

sgnn7 added 2 commits October 13, 2021 14:05

[statsd] Update notes about buffering in statsd

a8a12cd

Since we are at least for the time being disabling buffering by default, the docs here are being updated to match the changes applied.

sgnn7 force-pushed the sgnn7/temporarly-disable-dsd-buffering branch from c802331 to a8a12cd Compare October 13, 2021 19:05

gh123man previously approved these changes Oct 13, 2021

[tests] Ensure that context manager works in non-buffered envs

bbca063

With a non-buffered environment, our context manager may not work properly. This change ensures that we properly test this behavior.

sgnn7 dismissed gh123man’s stale review via bbca063 October 13, 2021 19:27

gh123man approved these changes Oct 13, 2021

truthbk approved these changes Oct 13, 2021

therve approved these changes Oct 14, 2021

therve merged commit 2cba9f6 into master Oct 14, 2021

therve deleted the sgnn7/temporarly-disable-dsd-buffering branch October 14, 2021 12:35

ewdurbin mentioned this pull request Jan 4, 2022

Quiet logs from datadog pypi/warehouse#10553

Merged
