Add telemetry job #1448

pleshakov · 2024-01-05T18:32:15Z

Proposed changes

Problem:

We want to have a telemetry job that periodically reports product telemetry every 24h. For now, telemetry data is empty and report is sent to the debug log.

Solution:

Refactor leader election to use controller-runtime manager capabilities. This simplifies the existing code and make it easier to add a telemetry Job.
Add a telemetry Job that periodically reports empty telemetry to the debug log.
Make the period configurable at build time via TELEMETRY_REPORT_PERIOD Makefile variable.

Note: leader elector refactoring changes behavior of NGF process when leadership gets lost:
Before: the Manager would shutdown waiting for the runnables to exit. After: the Manager doesn't wait. It similar to NGF process panicing. This should be OK, as NGF container will restart and recover any potentially broken state (update not fully populated statuses, restore correct NGINX configuration).

Testing:

Unit tests
Manual testing:
- Ensure leader election works as expected - both leader and non-pods run successfully.
- Ensure NGF container exits when stop being leader.
- Ensure an upgrade from Release 1.1.0 is successful for leader election - the leader gets elected among the new pods.
- Ensure the telemetry Job reports telemetry multiple times, using a small value of ELEMETRY_REPORT_PERIOD

CLOSES #1382

More notes:

disabling telemetry will be covered in Mechanism to Opt Out of all Telemetry #1317
documentation in Telemetry Documentation #1319

Checklist

Before creating a PR, run through this checklist and mark each as complete.

I have read the CONTRIBUTING doc
I have added tests that prove my fix is effective or that my feature works
I have checked that all unit tests pass after adding my changes
I have updated necessary documentation
I have rebased my branch onto main
I will ensure my PR is targeting the main branch and pulling from my branch from my own fork

internal/mode/static/manager.go

internal/mode/static/telemetry/job.go

.goreleaser.yml

kate-osborn

Just a couple of questions, but it looks good to me!

internal/framework/runnables/runnables_test.go

internal/mode/static/manager.go

internal/mode/static/telemetry/job_test.go

Problem: We want to have a telemetry job that periodically reports product telemetry every 24h. For now, telemetry data is empty and report is sent to the debug log. Solution: - Refactor leader election to use controller-runtime manager capabilities. This simplifies the existing code and make it easier to add a telemetry Job. - Add a telemetry Job that periodically reports empty telemetry to the debug log. - Make the period configurable at build time via TELEMETRY_REPORT_PERIOD Makefile variable. Note: leader elector refactoring changes behavior of NGF process when leadership gets lost: Before: the Manager would shutdown waiting for the runnables to exit. After: the Manager doesn't wait. It similar to NGF process panicing. This should be OK, as NGF container will restart and recover any potentially broken state (update not fully populated statuses, restore correct NGINX configuration). Testing: - Unit tests - Manual testing: - Ensure leader election works as expected - both leader and non-pods run successfully. - Ensure NGF container exits when stop being leader. - Ensure an upgrade from Release 1.1.0 is successful for leader election - the leader gets elected among the new pods. - Ensure the telemetry Job reports telemetry multiple times, using a small value of ELEMETRY_REPORT_PERIOD CLOSES nginxinc#1382

Co-authored-by: Saylor Berman <s.berman@f5.com>

pleshakov requested a review from a team as a code owner January 5, 2024 18:32

github-actions bot added the enhancement New feature or request label Jan 5, 2024

pleshakov mentioned this pull request Jan 5, 2024

Add telemetry job - option 2 #1392

Closed

6 tasks

sjberman reviewed Jan 5, 2024

View reviewed changes

internal/mode/static/manager.go Outdated Show resolved Hide resolved

internal/mode/static/manager.go Outdated Show resolved Hide resolved

internal/mode/static/telemetry/job.go Show resolved Hide resolved

pleshakov requested a review from lucacome January 8, 2024 16:19

pleshakov commented Jan 8, 2024

View reviewed changes

.goreleaser.yml Show resolved Hide resolved

kate-osborn reviewed Jan 8, 2024

View reviewed changes

internal/framework/runnables/runnables_test.go Show resolved Hide resolved

internal/mode/static/manager.go Outdated Show resolved Hide resolved

internal/mode/static/telemetry/job_test.go Show resolved Hide resolved

pleshakov requested review from sjberman and kate-osborn January 8, 2024 22:57

sjberman approved these changes Jan 9, 2024

View reviewed changes

bjee19 reviewed Jan 9, 2024

View reviewed changes

internal/mode/static/telemetry/job_test.go Outdated Show resolved Hide resolved

pleshakov and others added 10 commits January 10, 2024 10:36

Update internal/mode/static/manager.go

744b402

Co-authored-by: Saylor Berman <s.berman@f5.com>

Updated jitter

d226662

Update error message

f05598a

Update goreleaser settings

3050949

Fix YAML linting

bb12043

Simplify test for Start of Job

57b2576

Changed LeaderElectionReleaseOnCancel to false and added a comment

be8b4e0

Add an assertion in TestEnableAfterBecameLeader

1a27c77

Update minReports and extend the comment

06832aa

pleshakov force-pushed the feature/telemetry-job branch from 5c5fcf1 to 06832aa Compare January 10, 2024 15:37

pleshakov requested a review from bjee19 January 10, 2024 15:38

bjee19 approved these changes Jan 10, 2024

View reviewed changes

kate-osborn approved these changes Jan 10, 2024

View reviewed changes

Merge branch 'main' into feature/telemetry-job

b7a40aa

pleshakov merged commit 9d9c1f2 into nginxinc:main Jan 10, 2024
27 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add telemetry job #1448

Add telemetry job #1448

pleshakov commented Jan 5, 2024 •

edited

Loading

kate-osborn left a comment

Add telemetry job #1448

Add telemetry job #1448

Conversation

pleshakov commented Jan 5, 2024 • edited Loading

Proposed changes

Checklist

kate-osborn left a comment

Choose a reason for hiding this comment

pleshakov commented Jan 5, 2024 •

edited

Loading