Disable cleanup_timeout by default in docker and kubernetes autodiscover #24681

jsoriano · 2021-03-22T16:24:13Z

What does this PR do?

Disable cleanup_timeout by default in docker and kubernetes autodiscover for all beats except Filebeat.

It is kept at 60 seconds in Filebeat, to give it time to finish collecting logs.
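
For users who still want a grace period in a Beat other than Filebeat, the setting can be given explicitly in the provider configuration. The fragment below is a minimal sketch, not taken from this PR's diff: it assumes a Metricbeat deployment using the kubernetes autodiscover provider with hints enabled, and it assumes 60s is the value to restore (the same value Filebeat keeps). The `NODE_NAME` environment variable is an assumption about how the deployment exposes the node name.

```yaml
# metricbeat.yml (sketch): opt back in to a cleanup grace period.
metricbeat.autodiscover:
  providers:
    - type: kubernetes
      node: ${NODE_NAME}      # assumed to be injected by the deployment manifest
      hints.enabled: true
      # Keep configurations running for 60s after the container stops.
      # With this PR, leaving the option out disables the grace period
      # in every Beat except Filebeat.
      cleanup_timeout: 60s
```

The same `cleanup_timeout` option applies to the docker provider.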

Why is it important?

Keeping configurations running for some time after containers have stopped is needed in some cases to complete the collection of logs. In other cases it is usually not needed, and it leads to errors when querying endpoints that are known to be down.
It can also lead to querying IPs that have been reused by newer containers, which can be misleading if the newer pod answers, because these events will still carry the metadata of the old container.

Checklist

My code follows the style guidelines of this project
I have commented my code, particularly in hard-to-understand areas
I have made corresponding changes to the documentation
I have made corresponding changes to the default configuration files
I have added tests that prove my fix is effective or that my feature works
I have added an entry in CHANGELOG.next.asciidoc or CHANGELOG-developer.next.asciidoc.

How to test this PR locally

Start metricbeat and filebeat in Kubernetes with autodiscover and some configuration.
Create a pod in Kubernetes that matches the existing configurations.
Check that metricbeat and filebeat start collecting metrics and logs.
Delete the pod.
Check that metricbeat stops collecting metrics.
Check that filebeat stops collecting logs about 60 seconds later.
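
As an illustration of the "create a pod" step, the manifest below is a sketch of a pod that hints-based autodiscover would pick up. It assumes both Beats run with `hints.enabled: true`; the pod name, image, and choice of the redis module are hypothetical and only serve as an example of a matching workload.

```yaml
# test-pod.yml (sketch): a pod annotated for hints-based autodiscover.
apiVersion: v1
kind: Pod
metadata:
  name: cleanup-timeout-test        # hypothetical name
  annotations:
    # Metricbeat hints: collect redis metrics from this pod.
    co.elastic.metrics/module: redis
    co.elastic.metrics/metricsets: info
    co.elastic.metrics/hosts: '${data.host}:6379'
    co.elastic.metrics/period: 10s
spec:
  containers:
    - name: redis
      image: redis                  # hypothetical image choice
      ports:
        - containerPort: 6379
```

Deleting this pod after both Beats have picked it up should exercise the two behaviours described in the steps above.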

Related issues

Fixes #20543 (Set a different autodiscover cleanup_timeout per beat).

elasticmachine · 2021-03-22T16:24:20Z

Pinging @elastic/integrations (Team:Integrations)

elasticmachine · 2021-03-22T18:32:38Z

💚 Build Succeeded

Build stats

Build Cause: Pull request #24681 updated
Start Time: 2021-03-24T13:00:35.295+0000
Duration: 68 min 29 sec
Commit: 224623d

Test stats 🧪

Test	Results
Failed	0
Passed	46419
Skipped	5104
Total	51523

💚 Flaky test report

Tests succeeded.

jsoriano · 2021-03-22T18:36:29Z

/test

jsoriano · 2021-03-23T10:24:00Z

/package

jsoriano · 2021-03-23T10:24:18Z

/test

ChrsMark

lgtm!

jsoriano · 2021-03-23T17:51:17Z

/test

jsoriano · 2021-03-24T09:16:35Z

/test

…ver (elastic#24681) It is kept to 60 seconds in Filebeat, to give a time to collect logs. Keeping configurations running for some time after containers have stopped is needed in some cases to complete the collection of logs. But in the rest of cases it is not usually needed, and leads to errors when querying endpoints known to be down. It can also lead to query IPs that are being reused in newer containers, what can be misleading if the newer pod answers because these events will still have the metadata of the old container. (cherry picked from commit 439b808)

…ver (#24681) (#24730) It is kept to 60 seconds in Filebeat, to give a time to collect logs. Keeping configurations running for some time after containers have stopped is needed in some cases to complete the collection of logs. But in the rest of cases it is not usually needed, and leads to errors when querying endpoints known to be down. It can also lead to query IPs that are being reused in newer containers, what can be misleading if the newer pod answers because these events will still have the metadata of the old container. (cherry picked from commit 439b808)

jsoriano added the review, needs_backport, Team:Integrations, breaking change, test-plan, and v7.13.0 labels Mar 22, 2021

jsoriano self-assigned this Mar 22, 2021

botelastic bot added and removed the needs_team label Mar 22, 2021

Disable cleanup_timeout by default in docker and kubernetes autodiscover

f4ffe6f

jsoriano force-pushed the disable-cleanup-timeout branch from 4ae2235 to f4ffe6f Compare March 22, 2021 16:26

ChrsMark approved these changes Mar 23, 2021

Merge remote-tracking branch 'origin/master' into disable-cleanup-tim…

224623d

…eout

jsoriano merged commit 439b808 into elastic:master Mar 24, 2021

jsoriano deleted the disable-cleanup-timeout branch March 24, 2021 14:26

jsoriano mentioned this pull request Mar 24, 2021

Cherry-pick #24681 to 7.x: Disable cleanup_timeout by default in docker and kubernetes autodiscover #24730

Merged

6 tasks

jsoriano removed the needs_backport label Mar 24, 2021

andresrc added the test-plan-added label Apr 22, 2021
