Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add healing feature #1113

Merged
merged 9 commits into from
Nov 11, 2021

Conversation

glazychev-art
Copy link
Contributor

@glazychev-art glazychev-art commented Oct 21, 2021

Description

This PR adds new healing feature

Continuation of #1107

@glazychev-art glazychev-art force-pushed the reselect_heal branch 28 times, most recently from 3d72a00 to 9c2180e Compare October 28, 2021 13:14
@glazychev-art glazychev-art force-pushed the reselect_heal branch 2 times, most recently from 2415e07 to 1a7a430 Compare October 29, 2021 13:43
Signed-off-by: Artem Glazychev <artem.glazychev@xored.com>
@glazychev-art glazychev-art force-pushed the reselect_heal branch 2 times, most recently from 69652b3 to 47f40c6 Compare November 10, 2021 10:32
Copy link
Member

@denis-tingaikin denis-tingaikin left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Look good, only one moment related to heal client.

Could you explain why we can not use heal client in sandbox tests?

pkg/tools/sandbox/node.go Outdated Show resolved Hide resolved
pkg/tools/sandbox/node.go Outdated Show resolved Hide resolved
Signed-off-by: Artem Glazychev <artem.glazychev@xored.com>
@glazychev-art
Copy link
Contributor Author

We can use it. The point was to keep heal tests separate. PR was updated

@denis-tingaikin
Copy link
Member

denis-tingaikin commented Nov 10, 2021

@edwarnicke with 16a15e5 we do not need to change grpcfd. Is 16a15e5 look good for you?

@edwarnicke
Copy link
Member

LGTM

I'm going to hold off on merging till integration-k8s-kind is passing from the P2MP work.

@denis-tingaikin denis-tingaikin merged commit 0f652be into networkservicemesh:main Nov 11, 2021
nsmbot pushed a commit to networkservicemesh/cmd-nsc-init that referenced this pull request Nov 11, 2021
…k@main

PR link: networkservicemesh/sdk#1113

Commit: 0f652be
Author: Artem Glazychev
Date: 2021-11-11 19:50:13 +0700
Message:
  - Add healing feature (#1113)
* Add begin.WithReselect option

Signed-off-by: Ed Warnicke <hagbard@gmail.com>

* begin: Merge new outside Requests with heal/refresh information

Signed-off-by: Ed Warnicke <hagbard@gmail.com>

* Add trimpath chain elements

Signed-off-by: Ed Warnicke <hagbard@gmail.com>

* [qfix] panic protection in dialer chain element

Signed-off-by: Ed Warnicke <hagbard@gmail.com>

* Very very rough first pass at heal.

Signed-off-by: Ed Warnicke <hagbard@gmail.com>

* Not quite working restoration of the nsmgr heal tests

TestNSMGR_HealForwarder is still failing.  It is probably failing
because the interpose chain element does not reselect Forwarders
if one does not become available.

Signed-off-by: Ed Warnicke <hagbard@gmail.com>

* heal: fix TestNSMGR_HealForwarder and remove grpc.WaitTillReady() and grpc.WithBlock() from sandbox.DialOptions()

Signed-off-by: Ed Warnicke <hagbard@gmail.com>

* Fixes failing tests

Signed-off-by: Artem Glazychev <artem.glazychev@xored.com>

* Add healClient option

Signed-off-by: Artem Glazychev <artem.glazychev@xored.com>
Signed-off-by: NSMBot <nsmbot@networkservicmesh.io>
nsmbot pushed a commit to networkservicemesh/cmd-nsmgr-proxy that referenced this pull request Nov 11, 2021
…k@main

PR link: networkservicemesh/sdk#1113

Commit: 0f652be
Author: Artem Glazychev
Date: 2021-11-11 19:50:13 +0700
Message:
  - Add healing feature (#1113)
* Add begin.WithReselect option

Signed-off-by: Ed Warnicke <hagbard@gmail.com>

* begin: Merge new outside Requests with heal/refresh information

Signed-off-by: Ed Warnicke <hagbard@gmail.com>

* Add trimpath chain elements

Signed-off-by: Ed Warnicke <hagbard@gmail.com>

* [qfix] panic protection in dialer chain element

Signed-off-by: Ed Warnicke <hagbard@gmail.com>

* Very very rough first pass at heal.

Signed-off-by: Ed Warnicke <hagbard@gmail.com>

* Not quite working restoration of the nsmgr heal tests

TestNSMGR_HealForwarder is still failing.  It is probably failing
because the interpose chain element does not reselect Forwarders
if one does not become available.

Signed-off-by: Ed Warnicke <hagbard@gmail.com>

* heal: fix TestNSMGR_HealForwarder and remove grpc.WaitTillReady() and grpc.WithBlock() from sandbox.DialOptions()

Signed-off-by: Ed Warnicke <hagbard@gmail.com>

* Fixes failing tests

Signed-off-by: Artem Glazychev <artem.glazychev@xored.com>

* Add healClient option

Signed-off-by: Artem Glazychev <artem.glazychev@xored.com>
Signed-off-by: NSMBot <nsmbot@networkservicmesh.io>
nsmbot pushed a commit to networkservicemesh/cmd-registry-memory that referenced this pull request Nov 11, 2021
…k@main

PR link: networkservicemesh/sdk#1113

Commit: 0f652be
Author: Artem Glazychev
Date: 2021-11-11 19:50:13 +0700
Message:
  - Add healing feature (#1113)
* Add begin.WithReselect option

Signed-off-by: Ed Warnicke <hagbard@gmail.com>

* begin: Merge new outside Requests with heal/refresh information

Signed-off-by: Ed Warnicke <hagbard@gmail.com>

* Add trimpath chain elements

Signed-off-by: Ed Warnicke <hagbard@gmail.com>

* [qfix] panic protection in dialer chain element

Signed-off-by: Ed Warnicke <hagbard@gmail.com>

* Very very rough first pass at heal.

Signed-off-by: Ed Warnicke <hagbard@gmail.com>

* Not quite working restoration of the nsmgr heal tests

TestNSMGR_HealForwarder is still failing.  It is probably failing
because the interpose chain element does not reselect Forwarders
if one does not become available.

Signed-off-by: Ed Warnicke <hagbard@gmail.com>

* heal: fix TestNSMGR_HealForwarder and remove grpc.WaitTillReady() and grpc.WithBlock() from sandbox.DialOptions()

Signed-off-by: Ed Warnicke <hagbard@gmail.com>

* Fixes failing tests

Signed-off-by: Artem Glazychev <artem.glazychev@xored.com>

* Add healClient option

Signed-off-by: Artem Glazychev <artem.glazychev@xored.com>
Signed-off-by: NSMBot <nsmbot@networkservicmesh.io>
nsmbot pushed a commit to networkservicemesh/cmd-nsmgr that referenced this pull request Nov 11, 2021
…k@main

PR link: networkservicemesh/sdk#1113

Commit: 0f652be
Author: Artem Glazychev
Date: 2021-11-11 19:50:13 +0700
Message:
  - Add healing feature (#1113)
* Add begin.WithReselect option

Signed-off-by: Ed Warnicke <hagbard@gmail.com>

* begin: Merge new outside Requests with heal/refresh information

Signed-off-by: Ed Warnicke <hagbard@gmail.com>

* Add trimpath chain elements

Signed-off-by: Ed Warnicke <hagbard@gmail.com>

* [qfix] panic protection in dialer chain element

Signed-off-by: Ed Warnicke <hagbard@gmail.com>

* Very very rough first pass at heal.

Signed-off-by: Ed Warnicke <hagbard@gmail.com>

* Not quite working restoration of the nsmgr heal tests

TestNSMGR_HealForwarder is still failing.  It is probably failing
because the interpose chain element does not reselect Forwarders
if one does not become available.

Signed-off-by: Ed Warnicke <hagbard@gmail.com>

* heal: fix TestNSMGR_HealForwarder and remove grpc.WaitTillReady() and grpc.WithBlock() from sandbox.DialOptions()

Signed-off-by: Ed Warnicke <hagbard@gmail.com>

* Fixes failing tests

Signed-off-by: Artem Glazychev <artem.glazychev@xored.com>

* Add healClient option

Signed-off-by: Artem Glazychev <artem.glazychev@xored.com>
Signed-off-by: NSMBot <nsmbot@networkservicmesh.io>
nsmbot pushed a commit to networkservicemesh/cmd-nse-vfio that referenced this pull request Nov 11, 2021
…k@main

PR link: networkservicemesh/sdk#1113

Commit: 0f652be
Author: Artem Glazychev
Date: 2021-11-11 19:50:13 +0700
Message:
  - Add healing feature (#1113)
* Add begin.WithReselect option

Signed-off-by: Ed Warnicke <hagbard@gmail.com>

* begin: Merge new outside Requests with heal/refresh information

Signed-off-by: Ed Warnicke <hagbard@gmail.com>

* Add trimpath chain elements

Signed-off-by: Ed Warnicke <hagbard@gmail.com>

* [qfix] panic protection in dialer chain element

Signed-off-by: Ed Warnicke <hagbard@gmail.com>

* Very very rough first pass at heal.

Signed-off-by: Ed Warnicke <hagbard@gmail.com>

* Not quite working restoration of the nsmgr heal tests

TestNSMGR_HealForwarder is still failing.  It is probably failing
because the interpose chain element does not reselect Forwarders
if one does not become available.

Signed-off-by: Ed Warnicke <hagbard@gmail.com>

* heal: fix TestNSMGR_HealForwarder and remove grpc.WaitTillReady() and grpc.WithBlock() from sandbox.DialOptions()

Signed-off-by: Ed Warnicke <hagbard@gmail.com>

* Fixes failing tests

Signed-off-by: Artem Glazychev <artem.glazychev@xored.com>

* Add healClient option

Signed-off-by: Artem Glazychev <artem.glazychev@xored.com>
Signed-off-by: NSMBot <nsmbot@networkservicmesh.io>
nsmbot pushed a commit to networkservicemesh/cmd-admission-webhook-k8s that referenced this pull request Nov 11, 2021
…k@main

PR link: networkservicemesh/sdk#1113

Commit: 0f652be
Author: Artem Glazychev
Date: 2021-11-11 19:50:13 +0700
Message:
  - Add healing feature (#1113)
* Add begin.WithReselect option

Signed-off-by: Ed Warnicke <hagbard@gmail.com>

* begin: Merge new outside Requests with heal/refresh information

Signed-off-by: Ed Warnicke <hagbard@gmail.com>

* Add trimpath chain elements

Signed-off-by: Ed Warnicke <hagbard@gmail.com>

* [qfix] panic protection in dialer chain element

Signed-off-by: Ed Warnicke <hagbard@gmail.com>

* Very very rough first pass at heal.

Signed-off-by: Ed Warnicke <hagbard@gmail.com>

* Not quite working restoration of the nsmgr heal tests

TestNSMGR_HealForwarder is still failing.  It is probably failing
because the interpose chain element does not reselect Forwarders
if one does not become available.

Signed-off-by: Ed Warnicke <hagbard@gmail.com>

* heal: fix TestNSMGR_HealForwarder and remove grpc.WaitTillReady() and grpc.WithBlock() from sandbox.DialOptions()

Signed-off-by: Ed Warnicke <hagbard@gmail.com>

* Fixes failing tests

Signed-off-by: Artem Glazychev <artem.glazychev@xored.com>

* Add healClient option

Signed-off-by: Artem Glazychev <artem.glazychev@xored.com>
Signed-off-by: NSMBot <nsmbot@networkservicmesh.io>
nsmbot pushed a commit to networkservicemesh/sdk-k8s that referenced this pull request Nov 11, 2021
…k@main

PR link: networkservicemesh/sdk#1113

Commit: 0f652be
Author: Artem Glazychev
Date: 2021-11-11 19:50:13 +0700
Message:
  - Add healing feature (#1113)
* Add begin.WithReselect option

Signed-off-by: Ed Warnicke <hagbard@gmail.com>

* begin: Merge new outside Requests with heal/refresh information

Signed-off-by: Ed Warnicke <hagbard@gmail.com>

* Add trimpath chain elements

Signed-off-by: Ed Warnicke <hagbard@gmail.com>

* [qfix] panic protection in dialer chain element

Signed-off-by: Ed Warnicke <hagbard@gmail.com>

* Very very rough first pass at heal.

Signed-off-by: Ed Warnicke <hagbard@gmail.com>

* Not quite working restoration of the nsmgr heal tests

TestNSMGR_HealForwarder is still failing.  It is probably failing
because the interpose chain element does not reselect Forwarders
if one does not become available.

Signed-off-by: Ed Warnicke <hagbard@gmail.com>

* heal: fix TestNSMGR_HealForwarder and remove grpc.WaitTillReady() and grpc.WithBlock() from sandbox.DialOptions()

Signed-off-by: Ed Warnicke <hagbard@gmail.com>

* Fixes failing tests

Signed-off-by: Artem Glazychev <artem.glazychev@xored.com>

* Add healClient option

Signed-off-by: Artem Glazychev <artem.glazychev@xored.com>
Signed-off-by: NSMBot <nsmbot@networkservicmesh.io>
nsmbot pushed a commit to networkservicemesh/cmd-registry-proxy-dns that referenced this pull request Nov 11, 2021
…k@main

PR link: networkservicemesh/sdk#1113

Commit: 0f652be
Author: Artem Glazychev
Date: 2021-11-11 19:50:13 +0700
Message:
  - Add healing feature (#1113)
* Add begin.WithReselect option

Signed-off-by: Ed Warnicke <hagbard@gmail.com>

* begin: Merge new outside Requests with heal/refresh information

Signed-off-by: Ed Warnicke <hagbard@gmail.com>

* Add trimpath chain elements

Signed-off-by: Ed Warnicke <hagbard@gmail.com>

* [qfix] panic protection in dialer chain element

Signed-off-by: Ed Warnicke <hagbard@gmail.com>

* Very very rough first pass at heal.

Signed-off-by: Ed Warnicke <hagbard@gmail.com>

* Not quite working restoration of the nsmgr heal tests

TestNSMGR_HealForwarder is still failing.  It is probably failing
because the interpose chain element does not reselect Forwarders
if one does not become available.

Signed-off-by: Ed Warnicke <hagbard@gmail.com>

* heal: fix TestNSMGR_HealForwarder and remove grpc.WaitTillReady() and grpc.WithBlock() from sandbox.DialOptions()

Signed-off-by: Ed Warnicke <hagbard@gmail.com>

* Fixes failing tests

Signed-off-by: Artem Glazychev <artem.glazychev@xored.com>

* Add healClient option

Signed-off-by: Artem Glazychev <artem.glazychev@xored.com>
Signed-off-by: NSMBot <nsmbot@networkservicmesh.io>
nsmbot pushed a commit to networkservicemesh/sdk-kernel that referenced this pull request Nov 11, 2021
…k@main

PR link: networkservicemesh/sdk#1113

Commit: 0f652be
Author: Artem Glazychev
Date: 2021-11-11 19:50:13 +0700
Message:
  - Add healing feature (#1113)
* Add begin.WithReselect option

Signed-off-by: Ed Warnicke <hagbard@gmail.com>

* begin: Merge new outside Requests with heal/refresh information

Signed-off-by: Ed Warnicke <hagbard@gmail.com>

* Add trimpath chain elements

Signed-off-by: Ed Warnicke <hagbard@gmail.com>

* [qfix] panic protection in dialer chain element

Signed-off-by: Ed Warnicke <hagbard@gmail.com>

* Very very rough first pass at heal.

Signed-off-by: Ed Warnicke <hagbard@gmail.com>

* Not quite working restoration of the nsmgr heal tests

TestNSMGR_HealForwarder is still failing.  It is probably failing
because the interpose chain element does not reselect Forwarders
if one does not become available.

Signed-off-by: Ed Warnicke <hagbard@gmail.com>

* heal: fix TestNSMGR_HealForwarder and remove grpc.WaitTillReady() and grpc.WithBlock() from sandbox.DialOptions()

Signed-off-by: Ed Warnicke <hagbard@gmail.com>

* Fixes failing tests

Signed-off-by: Artem Glazychev <artem.glazychev@xored.com>

* Add healClient option

Signed-off-by: Artem Glazychev <artem.glazychev@xored.com>
Signed-off-by: NSMBot <nsmbot@networkservicmesh.io>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants