Further improved availability recovery #3711

eskimor · 2021-08-24T17:22:56Z

Don't fetch more chunks than needed, except when we have a non-zero error rate.
Properly track undead (soft-cancelled) requests, so that enough "live" requests are always in flight.
Add metrics that should be useful.
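
To illustrate the second point, here is a minimal sketch of how live and undead requests could be counted separately. The `InFlight` type, its fields, and the soft timeout are hypothetical names for illustration only and are not taken from the PR's code.

```rust
use std::time::Duration;

/// Hypothetical bookkeeping for in-flight requests (not the PR's actual type).
struct InFlight {
    /// Time elapsed since each outstanding request was started.
    ages: Vec<Duration>,
    /// Assumed soft timeout: older requests are kept around ("undead"),
    /// but are no longer counted as live.
    soft_timeout: Duration,
}

impl InFlight {
    /// Requests that are still within the soft timeout.
    fn live(&self) -> usize {
        self.ages.iter().filter(|age| **age < self.soft_timeout).count()
    }

    /// Soft-cancelled requests: they may still answer, but we stop relying on them.
    fn undead(&self) -> usize {
        self.ages.len() - self.live()
    }

    /// Number of new requests to start so that `desired` live requests are in flight.
    fn to_launch(&self, desired: usize) -> usize {
        desired.saturating_sub(self.live())
    }
}
```

With three outstanding requests of which two have exceeded the soft timeout, `to_launch(3)` would ask for two new requests, while the two undead ones can still deliver their chunks late.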

node/primitives/src/lib.rs

Lldenaurois · 2021-08-26T15:23:48Z

node/network/availability-recovery/src/lib.rs

 		while let Some(request_result) =
-			self.requesting_chunks.next().timeout(MAX_CHUNK_WAIT).await.flatten()
+			self.requesting_chunks.next_with_timeout(TIMEOUT_START_NEW_REQUESTS).await


rphmeier · 2021-08-27T20:00:11Z

node/network/availability-recovery/src/lib.rs

+		// 2. We request more chunks to make up for it
+		// 3. Bandwidth is spread out even more, so we get even more timeouts
+		// 4. We request more chunks to make up for it ...
+		let max_requests_boundary = std::cmp::min(N_PARALLEL, threshold);


Side note: we should revisit `N_PARALLEL`. Perhaps this is something the lower-level networking code should care about, rather than the higher-level code.
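
The feedback loop described in the hunk above is what the upper bound is meant to limit. A rough sketch of how such a bound could combine with error-rate compensation follows. Only `max_requests_boundary`, `N_PARALLEL`, and `threshold` appear in the diff; the value of `N_PARALLEL`, the function name, the other parameters, and the compensation formula are assumptions made for illustration.

```rust
/// Assumed cap on parallel requests; the real value is not shown in this excerpt.
const N_PARALLEL: usize = 50;

/// Hypothetical helper: how many requests we would like to have in flight.
///
/// * `threshold`: number of chunks needed for recovery
/// * `received`: chunks already received
/// * `total_requests` / `errors`: counters used to estimate the error rate
fn desired_request_count(
    threshold: usize,
    received: usize,
    total_requests: usize,
    errors: usize,
) -> usize {
    // Never exceed the cap, so timeouts cannot snowball into ever more requests.
    let max_requests_boundary = std::cmp::min(N_PARALLEL, threshold);
    // Chunks still missing.
    let remaining = threshold.saturating_sub(received);
    // Extra requests in proportion to the observed error rate (zero if nothing was sent yet).
    let extra = (remaining * errors).checked_div(total_requests).unwrap_or(0);
    std::cmp::min(max_requests_boundary, remaining + extra)
}
```

With no errors, this requests exactly the missing chunks; with a non-zero error rate it asks for a few more, but never beyond the boundary.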

rphmeier · 2021-08-27T20:02:00Z

node/network/availability-recovery/src/metrics.rs

+				registry,
+			)?,
+			time_chunk_request: prometheus::register(
+				prometheus::Histogram::with_opts(prometheus::HistogramOpts::new(


Are the default buckets definitely good enough for this?

eskimor · 2021-08-28

Good point - I will check!
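
If the default buckets turn out to be too coarse or too fine, custom bounds could be generated along these lines. This is a standalone sketch: the helper `exponential_buckets` below is written here for illustration, the start value and factor are arbitrary, and how the resulting bounds are passed to the histogram options depends on the metrics crate in use.

```rust
/// Hypothetical helper: `count` bucket upper bounds (in seconds), starting at
/// `start` and multiplying by `factor` at each step.
fn exponential_buckets(start: f64, factor: f64, count: usize) -> Vec<f64> {
    let mut bounds = Vec::with_capacity(count);
    let mut next = start;
    for _ in 0..count {
        bounds.push(next);
        next *= factor;
    }
    bounds
}
```

For example, `exponential_buckets(0.25, 2.0, 4)` yields bounds at 0.25, 0.5, 1 and 2 seconds.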

eskimor added 7 commits August 20, 2021 20:28

WiP.

df5294c

Merge branch 'master' into rk-availabilty-recovery-v2

a19363a

Things compile.

7cf2d23

cargo fmt

77f70a5

Passing tests + fix warnings.

49a44a5

Metrics for availability recovery.

f45424b

Basic test.

6fdf002

eskimor added the labels A0-please_review (pull request needs code review), B0-silent (changes should not be mentioned in any release notes), and C1-low (PR touches the given topic and has a low impact on builders) Aug 24, 2021

eskimor and others added 9 commits August 24, 2021 19:31

Fix typos and actually check for overflow.

3650380

Merge branch 'master' into rk-availabilty-recovery-v2

326467e

cargo fmt

36d5e46

Register metrics.

147e0b8

Merge branch 'master' into rk-availabilty-recovery-v2

0259770

More tests.

ee7f548

Fix warning.

c82673e

Merge branch 'master' into rk-availabilty-recovery-v2

699dd98

* master: Fix Try-Runtime (#3725) XCM v2: Scripting, Query responses, Exception handling and Error reporting (#3629) Bump async-trait from 0.1.50 to 0.1.51 (#3721) allow some overhead in MERKLE_NODE_MAX_SIZE (#3724)

cargo +nightly fmt

56c8c62

ordian reviewed Aug 26, 2021

node/primitives/src/lib.rs (outdated, resolved)

bkchr and others added 4 commits August 26, 2021 14:49

Fix metrics

5d8b0ad

Get rid of unsafe.

4545527

tabify

62f9158

spellcheck

39abaed

Lldenaurois approved these changes Aug 26, 2021

ordian approved these changes Aug 26, 2021

ordian added this to the v0.9.10 milestone Aug 27, 2021

ordian merged commit 8fab1d8 into master Aug 27, 2021

ordian deleted the rk-availabilty-recovery-v2 branch August 27, 2021 16:59

ordian added a commit that referenced this pull request Aug 27, 2021

Merge branch 'master' into bernhard-malus-fx

7449c86

* master: Further improved availability recovery (#3711) node/service: Update finality target to fix disputes tests (#3732)

rphmeier reviewed Aug 27, 2021

chevdor mentioned this pull request Sep 1, 2021

release v0.9.10 pr set1 #3763

Merged

rphmeier mentioned this pull request Sep 7, 2021

Flakey Test in Availability Recovery #3798

Closed
