feat(slashing): lazily slash reluctant block proposers #2550

guidiaz · 2024-12-09T11:51:43Z

drcpu-github

I guess the idea of this patch is that by increasing the replication factor, we'll eventually find a validator willing to propose a block and at that point we can enforce power slashing?

drcpu-github · 2024-12-09T18:55:29Z

data_structures/src/chain/mod.rs

-            4
+            // linearly increasing replication factor
+            return if epoch >= prev_epoch {
+                u16::try_from(12 * (epoch - prev_epoch)).unwrap_or(u16::MAX)


A default replication factor of 12 is kind of high, no?

The higher the growing factor, the less probable that a big enough group of malicious stake entries can collude to temporarily censor the chain. For taking over up to 10 consecutive blocks, an attacker would need to keep at least 120 stake entries placed on the top ranking, simultaneously and during at least 10 epochs. Depending on the actual census, and whether multiple stake entries per validator address is enabled or not, this number could either be considered relatively high, or most probably (in the long run, if we expect census to grow in time), relatively low.

Nevertheless, the replication factor is proposed to be capped to some proportion of the total census. Two thirds has been initially proposed under this rationale: "if more than two thirds of census decide to censor the chain, the chain is to be censored".

In terms of P2P network traffic, one could argue that up to 2/3rds of eligible block proposers may potentially flood the network with competing block proposals, but not really:

Nodes should only propagate received block candidates only if better than the best validated so far, and also if signed by a different validator.

Eligible validators will only be able to propose one single block, no matter how many stake entries they have within the top ranking.

I'm okay with the capping, but was thinking that it would be better to follow the same power law as with data request eligibility: 4 * 2 ** (epoch - prev_epoch - 1).

epoch power linear

1 4 12

2 8 24

3 16 36

4 32 48

5 64 60

6 128 72

7 256 84

8 512 96

9 1024 108

10 2048 120

In the end it's unlikely to really matter though, both for the ratio of blocks mined and actual liveness.

We did consider the idea of implementing an exponentially increasing replication factor for mining, but discarded it for two main reasons:

Exponential increase opens the door to pass too quickly from a situation where all candidates in epoch N refrain from propsoing blocks, to a situation in epoch N+1 where half of eligible validators are prone to propose.

Being realistic, and as long as we manage wit/2.0 to not foster "sock puppets" behaviours, we can expect the census to be in the order of hundreds or thousands.

I'm not fully bullish on going linear instead of exponential, though. We just don't have enough info and cannot forecast the future. If we were afraid of DoS attacks being kind of probable, even those lasting more than 10 epochs, yeah, going exponential could be a thing. Then again, I don't believe these attacks will be that easy, and tend to think that going exponential may introduce more instability than the one we pretend to avoid.

Btw, an improved witness eligibility and slashing mechanisms are about to be proposed. Spoiler alert: "witnesses should be randomly selected out from the first ranking tertile of the mining census".

Yeah, the idea behind an exponential approach is obviously start smaller, ramp up quicker so we do not have 10 epochs of missing blocks due to an attack. I doubt we'll need this in any realistic scenario though.

Any attack longer than an epoch or two will be quite costly or even straight up impossible (taking into account the total circulating supply). If we assume a base factor of 12, you need to control the top 12 most powerful stake entries, which means you'd likely need to control 120M WIT as a start, 240M the epoch after it etc. The assumption here of course is that there will be honest actors with the maximum stake of 10M (but it would be quite a pity if that is not true).

That's my line of thinking too. When assessing the right replication factor RF, we need to take into account how credibly can an attacker take over the top RF positions in the rank. Under consideration of the economic criteria presented by @drcpu-github above, and based on what we've seen in the 4 years of mainnet, it is reasonable to think that 12 may be a bit too much for a start.

On the other hand, what's the downside of a high-ish initial RF value? Bloating the network with redundant messages doesn't sound specially of concern here, under the assumption that block candidates are still subject to comparison and filtering as they travel through the network.

As per the power vs. linear progression, I favor power because it has worked pretty decently so far, but don't oppose to linear if it is found to be more suitable for PoS.

When assessing the right replication factor RF, we need to take into account how credibly can an attacker take over the top RF positions in the rank.

An RF of 4 seems somewhat likely that someone can suppress block production in the network for a single epoch as that only requires ~40M WIT. Doubling or quadrupling that for the second and third epoch (using a power progression) requirement seems like it would already require significant (capital) effort, but still possible. Anything higher seems quite unlikely given what we know about the coin distribution.

It is certainly true that a power progression starting off with a low base RF (e.g., 4) is more likely to lead to single or double epoch denial-of-blocks than a linear progression with a high base RF (e.g., 12). I guess we could make the base RF 8 (4 * 2 ** (epoch - previous_epoch)) as a middle ground if we choose power progression after all.

In the end, I'm pretty ambivalent to the final choice, it's just that I like powers-of-two more. 😉

On the other hand, what's the downside of a high-ish initial RF value?

A couple of things I can think of, but all are pretty unlikely:

If the ranking does not work entirely as expected on breaking ties, selecting more initial proposers could introduce some imbalance.

More valid block candidates in the network means a higher likelihood of partitioning under less-than-ideal network connections between peers.

data_structures/src/staking/stakes.rs

node/src/actors/chain_manager/mod.rs

data_structures/src/staking/stakes.rs

… protocol versions

…tor's stake entries

…ing blocks

guidiaz force-pushed the feat/linear-replication-factor branch from 0c340a5 to 06dae81 Compare December 9, 2024 18:58

drcpu-github reviewed Dec 9, 2024

View reviewed changes

data_structures/src/staking/stakes.rs Outdated Show resolved Hide resolved

guidiaz force-pushed the feat/linear-replication-factor branch from 06dae81 to dc1f500 Compare December 10, 2024 09:21

guidiaz mentioned this pull request Dec 10, 2024

Make a final decision on first approach to slashing #2475

Open

drcpu-github reviewed Dec 10, 2024

View reviewed changes

node/src/actors/chain_manager/mod.rs Outdated Show resolved Hide resolved

guidiaz force-pushed the feat/linear-replication-factor branch from 9b170f7 to bf46b07 Compare December 10, 2024 17:37

drcpu-github reviewed Dec 10, 2024

View reviewed changes

data_structures/src/staking/stakes.rs Outdated Show resolved Hide resolved

guidiaz force-pushed the feat/linear-replication-factor branch from dd74cb0 to 0a23b5f Compare December 11, 2024 09:11

aesedepece and others added 13 commits December 11, 2024 18:09

feat(versioning): make it easier to derive activation timestamps from…

bc3eeda

… protocol versions

fix(session): allow synchronization across protocol version hot swaps

98c3428

feat(staking): add PartialOrd trait to Epoch

a53659b

feat(staking): linearly increasing mining replication factor

989e855

fix(staking): ranked entries should instead iterate on StakeKey<Address>

dda3a0b

fix(staking): mining eligibility strategy

ccbb294

feat(slashing): lazyly slash reluctant block proposers

0334202

fix(staking): census entries should instead iterate on StakeKey<Address>

4a548e4

chore(slashing): polish comment and debug message

62708fc

fix(staking): query_power must take the max power out from the valida…

5b90ff5

…tor's stake entries

fix(staking): use block's epoch instead of current epoch when validat…

07be4c5

…ing blocks

fix(stakes): fix mining age reset function and add test

ee023de

chore(staking): accepting linear -> exponential replication factor

f4f1c62

guidiaz force-pushed the feat/linear-replication-factor branch from d736586 to f4f1c62 Compare December 11, 2024 17:12

aesedepece merged commit f4f1c62 into witnet:master Dec 11, 2024
1 check failed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(slashing): lazily slash reluctant block proposers #2550

feat(slashing): lazily slash reluctant block proposers #2550

guidiaz commented Dec 9, 2024 •

edited

Loading

drcpu-github left a comment

drcpu-github Dec 9, 2024

guidiaz Dec 10, 2024 •

edited

Loading

drcpu-github Dec 10, 2024

guidiaz Dec 10, 2024 •

edited

Loading

drcpu-github Dec 10, 2024

aesedepece Dec 10, 2024

drcpu-github Dec 10, 2024

feat(slashing): lazily slash reluctant block proposers #2550

feat(slashing): lazily slash reluctant block proposers #2550

Conversation

guidiaz commented Dec 9, 2024 • edited Loading

drcpu-github left a comment

Choose a reason for hiding this comment

drcpu-github Dec 9, 2024

Choose a reason for hiding this comment

guidiaz Dec 10, 2024 • edited Loading

Choose a reason for hiding this comment

drcpu-github Dec 10, 2024

Choose a reason for hiding this comment

guidiaz Dec 10, 2024 • edited Loading

Choose a reason for hiding this comment

drcpu-github Dec 10, 2024

Choose a reason for hiding this comment

aesedepece Dec 10, 2024

Choose a reason for hiding this comment

drcpu-github Dec 10, 2024

Choose a reason for hiding this comment

guidiaz commented Dec 9, 2024 •

edited

Loading

guidiaz Dec 10, 2024 •

edited

Loading

guidiaz Dec 10, 2024 •

edited

Loading