Changing the default post-parallel-reads value #9074

Closed
5 of 15 tasks
Tracked by #10338
rjan90 opened this issue Jul 22, 2022 · 6 comments · Fixed by #10365
Labels
area/proving (Area: Proving), kind/enhancement (Kind: Enhancement)

Comments

@rjan90
Contributor

rjan90 commented Jul 22, 2022

Checklist

  • This is not a new feature or an enhancement to the Filecoin protocol. If it is, please open an FIP issue.
  • This is not a new feature request. If it is, please file a feature request instead.
  • This is not brainstorming ideas. If you have an idea you'd like to discuss, please open a new discussion on the lotus forum and select the category as Ideas.
  • I have a specific, actionable, and well motivated improvement to propose.

Lotus component

  • lotus daemon - chain sync
  • lotus miner - mining and block production
  • lotus miner/worker - sealing
  • lotus miner - proving (WindowPoSt)
  • lotus miner/market - storage deal
  • lotus miner/market - retrieval deal
  • lotus miner/market - data transfer
  • lotus client
  • lotus JSON-RPC API
  • lotus message management (mpool)
  • Other

Improvement Suggestion

Background
The windowPoSt worker has a post-parallel-reads option that lets a storage provider set an upper bound on how many challenges are read from storage simultaneously during windowPoSt. That value currently defaults to 128.
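For context on what the knob bounds: post-parallel-reads caps how many challenge reads are in flight at once. A minimal Go sketch of that pattern (hypothetical names, not the actual Lotus code), using a buffered channel as a semaphore:

```go
package main

import (
	"fmt"
	"sync"
	"time"
)

// readChallenges is a stand-in for the worker's challenge-reading loop:
// a buffered channel caps how many reads run concurrently.
func readChallenges(sectors []int, limit int) {
	sem := make(chan struct{}, limit)
	var wg sync.WaitGroup
	for _, s := range sectors {
		wg.Add(1)
		sem <- struct{}{} // blocks once `limit` reads are in flight
		go func(sector int) {
			defer wg.Done()
			defer func() { <-sem }()
			time.Sleep(10 * time.Millisecond) // simulate a storage read
			fmt.Println("read challenge for sector", sector)
		}(s)
	}
	wg.Wait()
}

func main() {
	sectors := make([]int, 256)
	for i := range sectors {
		sectors[i] = i
	}
	readChallenges(sectors, 32) // e.g. the more conservative value discussed below
}
```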

Issue
We have been getting reports that, when you have a full partition, this setting can cause very short network timeouts, which in turn cause sectors to be skipped (marked as bad) because the challenges for those sectors can't be read before the timeout hits. See #8957, especially this comment, for the full context of this issue.

The issue with skipped sectors was mitigated either by tuning down the post-parallel-reads value or by disabling the windowPoSt PreChecks, which are mostly redundant on windowPoSt workers (see the sketch below).
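For reference, the two mitigations look roughly as follows. This is a sketch from memory of the Lotus docs: the flag name comes from this issue's title, and the config key is assumed from the [Proving] section, so verify both against your Lotus version.

```
# Cap parallel challenge reads on the windowPoSt worker:
lotus-worker run --windowpost --post-parallel-reads=32

# Or, in the miner's config.toml, skip the mostly redundant pre-checks
# (key name assumed; check your version's config docs):
[Proving]
  DisableWDPoStPreChecks = true
```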

Improvement suggestion
I suggest that we reduce the default post-parallel-reads value to something more conservative to mitigate these issues, and instead let SPs fine-tune the value upward if their architecture can handle it.

A big thanks to @benjaminh83 for the very detailed write-up of the issues and investigation of the fixes.

@rjan90 added the need/triage, kind/enhancement (Kind: Enhancement), and area/proving (Area: Proving) labels, and removed need/triage, on Jul 22, 2022
@donkabat

I checked a few values:
128 (default): 10% bad
50: 2% bad
30: 0 bad

@rjan90
Contributor Author

rjan90 commented Sep 14, 2022

Thanks for adding your input here @donkabat!

@benjaminh83 which value were you using for post-parallel-reads again?

@magik6k Not sure if we should wait for more datapoints around this, or make some changes based on the values/tests we already have?

@benjaminh83

benjaminh83 commented Sep 14, 2022

@rjan90 I was running it at 32: 0 bad.
This matches what @donkabat was experiencing. I cannot reproduce it now, since sector reads only failed while I had a degraded cluster storage attached that performed very poorly under stress; it was not reproducible when using storage workers.
Lastly, reducing the parallel reads to 32 did not have much impact on overall wdPoSt times, so it may simply be a safer setting for reducing the risk of saturating the network/storage and hitting the timeout.

@donkabat

About my case:
I moved from NFS to storage-only workers. wdPoSt times dropped from ~15-20 min to 6-10 min.
After setting post-parallel-reads to 30, wdPoSt takes ~10 min.

@donkabat

After yesterday's wdPoSt, sectors:
total: 25,755
faults: 12
post-parallel-reads = 30

example of error:

```
CheckProvable Sector FAULT: generating vanilla proof
{"sector": {"ID":{"Miner":1127678,"Number":28953},"ProofType":8},
 "err": "do request: Post \"http://192.168.88.16:1168/remote/vanilla/single\": context deadline exceeded",
 "errVerbose": "do request:
    github.com/filecoin-project/lotus/storage/paths.(*Remote).GenerateSingleVanillaProof
        /home/filecoin/networks/mainnet/build/lotus/storage/paths/remote.go:819
  - Post \"http://192.168.88.16:1168/remote/vanilla/single\": context deadline exceeded"}
```
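The "context deadline exceeded" part is the request-scoped timeout firing on the HTTP call that fetches the vanilla proof from the remote store. A minimal Go sketch of that failure mode (the 5-second deadline is illustrative only, not the actual Lotus value):

```go
package main

import (
	"context"
	"fmt"
	"net/http"
	"time"
)

// fetchVanillaProof shows how a slow remote read surfaces as
// "context deadline exceeded": the POST is cancelled once its
// context deadline passes, and the sector is then counted as a fault.
func fetchVanillaProof(url string) error {
	// Illustrative deadline; the real value in Lotus may differ.
	ctx, cancel := context.WithTimeout(context.Background(), 5*time.Second)
	defer cancel()

	req, err := http.NewRequestWithContext(ctx, http.MethodPost, url, nil)
	if err != nil {
		return err
	}
	resp, err := http.DefaultClient.Do(req) // fails if storage responds too slowly
	if err != nil {
		return fmt.Errorf("do request: %w", err) // e.g. context deadline exceeded
	}
	defer resp.Body.Close()
	return nil
}

func main() {
	if err := fetchVanillaProof("http://192.168.88.16:1168/remote/vanilla/single"); err != nil {
		fmt.Println("CheckProvable FAULT:", err)
	}
}
```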

@donkabat

post-parallel-reads = 25

total: 25,755
faults: 0
