fix(discovery): plugin registration bugfixes #1650

andrewazores · 2023-09-01T18:57:59Z

Welcome to Cryostat! 👋

Before contributing, make sure you have:

Read the contributing guidelines
Linked a relevant issue which this PR resolves
Linked any other relevant issues, PRs, or documentation, if any
Resolved all conflicts, if any
Rebased your branch PR on top of the latest upstream main branch
Attached at least one of the following labels to the PR: [chore, ci, docs, feat, fix, test]
Signed all commits using a GPG signature

To recreate commits with a GPG signature: git fetch upstream && git rebase --force --gpg-sign upstream/main

Fixes: https://github.com/cryostatio/cryostat/issues/1633
See also cryostatio/cryostat-agent#193
Based on #1636

Description of the change:

Improves error handling and cleanup when plugin registrations fail and are associated with stored credentials.
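
As a rough illustration of the cleanup behaviour described above, the following is a minimal sketch and not the actual Cryostat implementation. The class `RegistrationCleanupSketch`, its methods, and the `ping` predicate are all hypothetical names invented for this example. It assumes an in-memory registration table and credential store, and shows a stored credential being removed when the callback ping fails at registration time, on deregistration, and when stale registrations are pruned.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;
import java.util.UUID;
import java.util.function.Predicate;

// Hypothetical sketch: these class and method names are illustrative and
// are not taken from the Cryostat codebase.
public class RegistrationCleanupSketch {

    // In-memory stand-ins for the registration table and the credential store.
    private final Map<UUID, String> callbacks = new HashMap<>();
    private final Map<UUID, String> credentials = new HashMap<>();
    // Assumed reachability check for a plugin's callback URL.
    private final Predicate<String> ping;

    public RegistrationCleanupSketch(Predicate<String> ping) {
        this.ping = ping;
    }

    // Registers a plugin; if the callback ping fails, nothing is left behind.
    public UUID register(String callback, String credential) {
        if (callbacks.containsValue(callback)) {
            throw new IllegalStateException("duplicate callback: " + callback);
        }
        UUID id = UUID.randomUUID();
        credentials.put(id, credential);
        if (!ping.test(callback)) {
            // Roll back the stored credential before reporting the failure.
            credentials.remove(id);
            throw new IllegalStateException("callback unreachable: " + callback);
        }
        callbacks.put(id, callback);
        return id;
    }

    // Removes the registration together with its stored credential.
    public void deregister(UUID id) {
        callbacks.remove(id);
        credentials.remove(id);
    }

    // Deregisters every plugin whose callback no longer answers; returns the count.
    public int pruneStale() {
        List<UUID> stale = new ArrayList<>();
        for (Map.Entry<UUID, String> e : callbacks.entrySet()) {
            if (!ping.test(e.getValue())) {
                stale.add(e.getKey());
            }
        }
        stale.forEach(this::deregister);
        return stale.size();
    }

    public int registrationCount() {
        return callbacks.size();
    }

    public int credentialCount() {
        return credentials.size();
    }
}
```

In this sketch, a plugin whose callback has become unreachable is dropped by `pruneStale()` along with its credential, so a later `register` call for the same callback URL is no longer rejected as a duplicate.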

Motivation for the change:

Together with cryostatio/cryostat-agent#193, this increases the resiliency of the server/agent registration system. Temporary networking failures, registration conflicts, or other bugs are now less likely to leave the server and agent in a state where neither recognizes the other, yet neither is able to clean up and reset the registration status.

How to manually test:

Run CRYOSTAT_IMAGE=quay.io... sh smoketest.sh...
Check that agent instances register and publish properly, and are visible in the Topology view.
Use podman kill on some agent instances to prevent a clean shutdown.
Use podman run (see smoketest.sh for the exact invocation) to restart some of the killed agent instances.
Stop the smoketest, then restart it without clearing databases, and ensure that the agent instances are able to re-register. It may take a couple of minutes for the server to recognize that the old state is stale, clear it, and allow agents to register again.

github-actions · 2023-09-01T18:58:12Z

Hi @andrewazores! Add at least one of the required labels to this PR

Required labels are : chore,ci,cleanup,docs,feat,fix,perf,refactor,style,test

github-actions · 2023-09-01T18:58:44Z

Hi @andrewazores! Add at least one of the required labels to this PR

Required labels are : chore,ci,cleanup,docs,feat,fix,perf,refactor,style,test

andrewazores · 2023-09-05T18:37:12Z

/request_review

github-actions · 2023-09-07T00:42:53Z

This PR/issue depends on:

~~cryostatio/cryostat#1636~~
By Dependent Issues (🤖). Happy coding!

andrewazores · 2023-09-07T13:01:17Z

/build_test

github-actions · 2023-09-07T13:24:43Z

ARCH	IMAGE
amd64	ghcr.io/cryostatio/cryostat:pr-1650-b15566908b022e3cd352af56e64550f2f3b748a6-linux-amd64
arm64	ghcr.io/cryostatio/cryostat:pr-1650-b15566908b022e3cd352af56e64550f2f3b748a6-linux-arm64

To run smoketest:

# amd64
CRYOSTAT_IMAGE=ghcr.io/cryostatio/cryostat:pr-1650-b15566908b022e3cd352af56e64550f2f3b748a6-linux-amd64 sh smoketest.sh

# or arm64
CRYOSTAT_IMAGE=ghcr.io/cryostatio/cryostat:pr-1650-b15566908b022e3cd352af56e64550f2f3b748a6-linux-arm64 sh smoketest.sh

src/main/java/io/cryostat/net/NetworkModule.java

smoketest.sh

tthvo

Comments for testing Mergify config :))

aali309 · 2023-09-07T18:50:52Z

> Comments for testing Mergify config :))

Looks like this worked as expected here.

* fix(discovery): delete plugin stored credentials automatically on deregistration/stale prune
* delete any stored credentials on plugin callback ping failure
* bump minimum event loop pool size
* use more specific response code for duplicate matchexpression

github-actions bot added the needs-triage label Sep 1, 2023

mergify bot added the safe-to-test label Sep 1, 2023

andrewazores mentioned this pull request Sep 1, 2023

fix(registration): discovery plugin registration bugfixes and refactor cryostatio/cryostat-agent#193

Merged

andrewazores added the fix label and removed the needs-triage label Sep 1, 2023

github-actions bot added the dependent label Sep 1, 2023

andrewazores force-pushed the gh1633 branch from 1e92f76 to 02218be September 5, 2023 14:31

andrewazores marked this pull request as ready for review September 5, 2023 18:08

andrewazores requested review from aali309 and mwangggg September 5, 2023 18:08

github-actions bot added review-requested and removed dependent labels Sep 5, 2023

andrewazores force-pushed the gh1633 branch from 02218be to b155669 September 7, 2023 13:01

andrewazores force-pushed the gh1633 branch from b155669 to 570fd2f September 7, 2023 17:57

mergify bot requested a review from a team September 7, 2023 18:11

andrewazores force-pushed the gh1633 branch from 78cd776 to e3064fc September 7, 2023 18:11

tthvo reviewed Sep 7, 2023

src/main/java/io/cryostat/net/NetworkModule.java (resolved)

tthvo reviewed Sep 7, 2023

smoketest.sh (outdated, resolved)

tthvo reviewed Sep 7, 2023

mergify bot removed the review-requested label Sep 7, 2023

andrewazores mentioned this pull request Sep 7, 2023

[Bug] Application startup tasks can block the Vertx event loop #1661

Open

andrewazores force-pushed the gh1633 branch from cbb0d6f to 8d4fb70 September 8, 2023 21:24

mwangggg approved these changes Sep 12, 2023

andrewazores added 10 commits September 13, 2023 09:51

chore(discovery): refactor plugin registration to extract method

85f31c0

fix(discovery): delete plugin stored credentials automatically on deregistration/stale prune

68a29b8

fixup! fix(discovery): delete plugin stored credentials automatically on deregistration/stale prune

e6b02ea

delete any stored credentials on plugin callback ping failure

a672372

vertx-fib-demos don't dual-discover with duplicate connection URLs

42c2199

give quarkus-test-agent instances distinct names

7bbb0c2

bump minimum event loop pool size

891fc38

update podman discovery hostname

c6d3c8f

fixup! update podman discovery hostname

d7f5201

use more specific response code for duplicate matchexpression

e6d8f5e

andrewazores force-pushed the gh1633 branch from 8d4fb70 to e6d8f5e September 13, 2023 13:51

andrewazores merged commit 0197f7b into cryostatio:main Sep 13, 2023
8 checks passed

andrewazores deleted the gh1633 branch September 13, 2023 13:53

andrewazores mentioned this pull request Nov 23, 2023

[Task] Update discovery plugin registration scheme to match 2.4 cryostatio/cryostat#189

Closed
