This repository has been archived by the owner on Apr 18, 2024. It is now read-only.

Consider observed latencies in weighing #65

Closed · wants to merge 13 commits

Conversation

aarshkshah1992
Contributor

@aarshkshah1992 aarshkshah1992 commented Mar 7, 2023

  • Give nodes a weight boost if they're in the 80th to 99th percentile of the fastest nodes by download speed.
  • However, only do so once we have speed observations across enough different nodes, and a node meets a minimum threshold on its number of speed observations.
  • Also introduces a cool-off period between speed-based weight bumps, so that only nodes that show consistent speed are rewarded.
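The boost logic described above can be sketched roughly as follows. This is an illustrative Python model, not the actual Caboose code (which is Go); every name and threshold here is hypothetical.

```python
# Toy sketch of percentile-based weight boosting with a minimum-observation
# gate and a cool-off period. All constants are made up for illustration.

MIN_OBSERVATIONS = 20     # per-node minimum before its speed counts
MIN_NODES_OBSERVED = 10   # need data across enough distinct nodes
COOL_OFF_SECONDS = 300    # at most one boost per node per window

def percentile(sorted_vals, p):
    """Nearest-rank percentile of a non-empty sorted list."""
    idx = min(len(sorted_vals) - 1, int(p / 100 * len(sorted_vals)))
    return sorted_vals[idx]

def speed_boosts(speeds, counts, last_boost, now):
    """Return the set of node ids that earn a weight boost.

    speeds:     node id -> mean observed download speed
    counts:     node id -> number of speed observations
    last_boost: node id -> timestamp of the node's last boost
    """
    # Gate 1: only consider nodes with enough observations.
    eligible = {n: s for n, s in speeds.items()
                if counts.get(n, 0) >= MIN_OBSERVATIONS}
    # Gate 2: need enough distinct nodes before trusting percentiles.
    if len(eligible) < MIN_NODES_OBSERVED:
        return set()
    vals = sorted(eligible.values())
    p80, p99 = percentile(vals, 80), percentile(vals, 99)
    boosted = set()
    for node, speed in eligible.items():
        in_band = p80 <= speed <= p99  # 80th..99th percentile band
        cooled = now - last_boost.get(node, 0) >= COOL_OFF_SECONDS
        if in_band and cooled:
            boosted.add(node)
    return boosted
```

The cool-off check means a node that was just boosted must keep demonstrating speed through the next window before it can be boosted again.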

@aarshkshah1992 aarshkshah1992 changed the title [WIP] Consider observed latencies in weighing Consider observed latencies in weighing Mar 7, 2023
Contributor

@willscott willscott left a comment


What I'd like at some point is a higher-level check that, over a lot of requests, we see reasonable steady-state behavior.

so, e.g.

  • set up backends that work 90% of the time.
  • send a bunch of requests while pushing forward time (or having a small de-bounce)
  • make sure there's a healthy pool at the end
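The steps above could look something like the following synthetic test. This is a toy pool model with made-up eviction rules, purely to illustrate the shape of the test; the real membership logic lives in Caboose.

```python
import random

FAILURE_LIMIT = 5  # hypothetical: evict after this many consecutive failures

def run_steady_state(n_backends=10, n_requests=2000, success_rate=0.9, seed=42):
    """Simulate requests against flaky backends; return surviving pool size.

    Each backend succeeds with probability `success_rate`. A backend is
    evicted after FAILURE_LIMIT consecutive failures, and any success
    resets its failure counter. A healthy steady state means most
    backends survive the run.
    """
    rng = random.Random(seed)
    failures = {b: 0 for b in range(n_backends)}
    pool = set(failures)
    for _ in range(n_requests):
        if not pool:
            break
        backend = rng.choice(sorted(pool))
        if rng.random() < success_rate:
            failures[backend] = 0        # success resets the streak
        else:
            failures[backend] += 1
            if failures[backend] >= FAILURE_LIMIT:
                pool.discard(backend)    # evict the flaky backend
    return len(pool)
```

In a real harness the random clock advance ("pushing forward time") would drive cool-off and refresh timers rather than a request counter, but the assertion at the end is the same: after many requests against 90%-reliable backends, the pool should still be healthy.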

@aarshkshah1992
Contributor Author

aarshkshah1992 commented Mar 8, 2023

@willscott

We already have the Bifrost staging environment we deploy to and collect pool health metrics on, and now we have L1 load-distribution metrics on the Saturn side too. I am going to ask Lidel to deploy this PR and see how those metrics shape up over a day.

Or do you imagine a more automated setup here, where we set up our own actual L1 backends for testing Caboose?

@willscott
Contributor

Or do you imagine a more automated setup here, where we set up our own actual L1 backends for testing Caboose?

Happy to have synthetic L1s.
Something to give us a sense of what the stable dynamics of changes will be that's lighter weight than deploying to real traffic. We've had to roll back a couple of times because that's the only way we have to test right now.

@aarshkshah1992
Contributor Author

@willscott Saturn does have a testnet with a few L1s. I'll sync up with the Saturn team and write a load-testing tool or script to exercise the Caboose <-> Saturn flow without having to deploy to prod. This makes sense to me.

@willscott
Contributor

If it's something we can run ourselves, or have run against PRs in CI, that would let us experiment much more than needing to wait on an external team's schedule for deployment.

lidel added a commit to ipfs-inactive/bifrost-gateway that referenced this pull request Mar 8, 2023
@lidel
Contributor

lidel commented Mar 8, 2023

Deployed to bifrost-stage1-ny

2023/03/08 17:50:16 Starting bifrost-gateway 2023-03-08-7d6ef21

Looks good:

[screenshot: 2023-03-08_19-09]

@lidel
Contributor

lidel commented Mar 9, 2023

Deployed cd9c1d8 in ipfs-inactive/bifrost-gateway@9eac2e7 to staging:

root@bifrost-stage1-ny:~# docker logs -f bifrost-gw
2023/03/09 18:58:03 Starting bifrost-gateway 2023-03-09-9eac2e7

Contributor

@lidel lidel left a comment


(I only reviewed the metrics code, and deployed to staging; it seems to work OK.)

@willscott
Contributor

@aarshkshah1992 is this still relevant or has it been superseded by subsequent changes?

@aarshkshah1992
Contributor Author

@willscott I think we can close this, as we have the shiny new L1 server timings now and can use those for weighing.
