Long-duration protocol exercise in simulation #125

anorth · 2024-03-15T00:10:21Z

Before production use, we should exercise the protocol extensively over long durations under a range of simulated chain and network conditions. This is an umbrella issue that motivates a number of specific improvements.

The simulator was built mainly to hand-craft specific situations that exercise protocol and coordination boundaries, usually within a single protocol instance. This is good, but it only covers the scenarios we can think of. We should complement it with stochastic exercise over many instances under different macro conditions. The existing simulator can probably serve as a base, but the EC chain views and the network model need more complexity.

Some things to consider:

- More complexity in the EC chain: differing views of the head, uneven propagation of EC information to nodes, long-running forks, etc. (see Support divergent EC chain views in simulator #115)
- Network instability: periodic network halts, dropped messages
- Dynamic participation, including some nodes refusing to participate
- Realistic network sizes of thousands of participants
- Power distributions ranging from highly uneven to nearly even

Some such exercises will be too slow to run in CI, but we should set up the capability to run multi-hour simulations on demand.

These exercises should be accompanied by additional metrics so we can measure things like total network traffic, signature verification rate, and the count of instances that make progress versus the base chain (this might warrant a separate issue).

BigLep · 2025-01-23T02:18:48Z

@masih: I have added this to "MX: Priority and sequencing TBD", since I'm assuming this isn't needed anytime between now and shortly after F3 activation. Let me know if that is wrong.

anorth added the testing Related to testing and validation label Mar 15, 2024

rjan90 added this to FilOz Mar 15, 2024

anorth mentioned this issue Apr 3, 2024

Use delayed power table to validate or drop messages from future instances #151

Closed

masih self-assigned this Apr 29, 2024

jennijuju added this to F3 May 15, 2024

masih mentioned this issue May 16, 2024

Comprehensive Simulation Testing #226

Closed

jennijuju removed this from FilOz May 16, 2024

Kubuxu added this to the Milestone 1: Passive Testing Completed milestone May 16, 2024

masih mentioned this issue May 17, 2024

Epic: F3 Testing & Network Testing #249

Closed

anorth mentioned this issue May 31, 2024

Test power-table lookback and message queue means no rebroadcast needed when participants lag #295

Open

masih modified the milestones: Milestone 1: Passive Testing Readiness, Milestone 2: Harderning and Mainnet Deployment Readiness Aug 7, 2024

BigLep moved this to Todo in F3 Aug 28, 2024

Stebalien removed this from the Milestone 2: Harderning and Mainnet Deployment Readiness milestone Aug 30, 2024

BigLep added this to the MX: Priority and sequencing TBD milestone Jan 23, 2025
