Change sequence number in gossipsub to be linearly increasing #3453

MarcoPolo · 2023-02-10T22:04:02Z

Summary

Apologies, if this is hashed out somewhere else, but I couldn't find any discussion on this.

The pubsub spec says

The seqno field (optional) is a 64-bit big-endian uint that is a linearly increasing number that is unique among messages originating from each given peer

Right now the implementation in this repo uses a random number per message: https://github.com/libp2p/rust-libp2p/blob/master/protocols/gossipsub/src/lib.rs#L50

The Go implementation starts with the current nano timestamp, then increments from there: https://github.com/libp2p/go-libp2p-pubsub/blob/master/pubsub.go#L296

I don't believe the current go implementation relies on an increasing sequence number, but I could imagine an optimization where it does (e.g. instead of keeping a map of all message IDs seen, we only keep the highest ID seen and discard anything lower).

I'm guessing the reason to pick a random number is to not rely on the current time.

The risk of relying on the current time is if you reboot your node and start with a an earlier time, you may be reusing IDs. However, I'd counter that argument by saying many other things would break in this scenario (including any TLS encryption). In practice, it hasn't seemed to be a problem in the Go implementation.

mxinden · 2023-02-13T16:43:02Z

Apologies, if this is hashed out somewhere else

I remember discussing this with @AgeManning in the past, searching through some history, in 2019:

#898 (comment)

Relevant quote by @AgeManning:

From memory, the go-implementation uses sequential number and rust has chosen to use randomize them. I am indifferent about which way to go, but maintained the randomness and the note for consistency in the rust implementation.

@AgeManning are you still of the same opinion? Maybe @divagant-martian do you have thoughts on this? I am indifferent as well.

However, I'd counter that argument by saying many other things would break in this scenario (including any TLS encryption).

@MarcoPolo can you expand on how TLS's security model breaks with clock drifts <1s?

MarcoPolo · 2023-02-13T19:46:22Z

@MarcoPolo can you expand on how TLS's security model breaks with clock drifts <1s?

I didn't say anything about <1s. It breaks with significant clock drifts > 1hr / days. At the point where the Not Before / Not After are incorrectly invalid.

In the situation with a libp2p node, a node will be up for a bit of time (>1s) and then maybe restart. If it restarts, it will have been up for a bit so even if there's a bit of clock drift, we won't reuse IDs.

I'll also point out that WireGuard uses incrementing numbers in a similar way and hasn't had a problem in practice:

... we include a TAI64N timestamp in the first message. The server keeps track of the greatest timestamp received per client and discards packets containing timestamps less than or equal to it.

From https://www.wireguard.com/protocol/#dos-mitigation

AgeManning · 2023-02-13T23:32:55Z

There was a discussion where this was argued, but I can't find the repo or the comment.

Here are some relevant issues/PRs tho:

Here is how I remember the arguments going.

Before we had message signing, there was an attack vector, where if sequence numbers were linearly increasing, then an attacker could send a message with a target peer id and many future sequence numbers such that our node would then filter out all of the future messages from that peer as they would have been "already seen" based on the default message-id (which was/is a concatenation of the source peer-id with the sequence number).

I believe in rust-libp2p we opted for random sequence numbers to mitigate this kind of thing.

With the introduction of message signing, an attacker cannot perform this kind of attack. However, in rust-libp2p we leave message signing optional as some use cases (I'm thinking of me in particular in Ethereum consensus) we do not use message signing (although we don't use sequence numbers either :p).

So the risk here, is that currently message signing is optional. If a user were to not use message signing and keep the default message-id function, then they would be vulnerable to this kind of attack again.

We are aware that the gossipsub implementation does use increasing sequence numbers, I haven't looked if they use optional message signing tho (I assume they must because we have go ethereum clients). Perhaps they leave it up to the user to make sure they don't use the default message-id.

Have you experienced any issue with this or is this just to be more spec compliant?

I believe someone was going to make a spec PR to make the specs more accommodating in this regard.

AgeManning · 2023-02-13T23:41:18Z

I just realized in one of the PRs I linked, I actually kept the wording "linearly increasing" in the specs. Woops.

MarcoPolo · 2023-02-28T21:00:58Z

This came up again since we're adding a (non-default) validator that keeps track of the max seqno per peer: https://github.com/libp2p/go-libp2p-pubsub/pull/525/files. I think this validator will break against rust peers.

Shouldn't break anyone by default, but just another thing to keep in mind.

vyzo · 2023-02-28T21:11:00Z

Yeah, by not monotonically increasing, it breaks all possibility of having such a validator.

We want to use this to eliminate perpetual messages when the diameter exceeds the span of the seen message timecache.
This has been observed in practice, so we are adding it as a pluggable default validator.

AgeManning · 2023-02-28T23:28:21Z

Ok, so I'm not particularly concerned about which way we go here.

We can adjust the rust implementation to use linearly increasing sequence numbers and we switch to random sequence numbers if message signing is disabled (and sequence numbers are enabled).

Does this sound like a reasonable way forward?

vyzo · 2023-03-01T01:43:37Z

Yes, that's perfectly fine.

This modifies the gossipsub implementation to use monotonically increasing sequence numbers for signed messages (as dictated by the specification). There is a discussion about this in #3453. This change will make rust-libp2p gossipsub align with the go-implementation when messages are signed. Messages will however still use randomized sequence numbers when messages are unsigned for security reasons (as discussed in the issue linked). This shouldn't change any user-level API, only the seqno behavior. It is fully backwards compatible. Resolves #3453. Pull-Request: #3551.

MarcoPolo mentioned this issue Feb 28, 2023

Default validator support libp2p/go-libp2p-pubsub#525

Merged

1 task

AgeManning mentioned this issue Mar 6, 2023

fix(gossipsub): signed messages use monotonically increasing seq numbers #3551

Merged

mergify bot closed this as completed in #3551 Mar 14, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Change sequence number in gossipsub to be linearly increasing #3453

Change sequence number in gossipsub to be linearly increasing #3453

MarcoPolo commented Feb 10, 2023 •

edited

Loading

mxinden commented Feb 13, 2023

MarcoPolo commented Feb 13, 2023

AgeManning commented Feb 13, 2023 •

edited

Loading

AgeManning commented Feb 13, 2023

MarcoPolo commented Feb 28, 2023

vyzo commented Feb 28, 2023 •

edited

Loading

AgeManning commented Feb 28, 2023

vyzo commented Mar 1, 2023

Change sequence number in gossipsub to be linearly increasing #3453

Change sequence number in gossipsub to be linearly increasing #3453

Comments

MarcoPolo commented Feb 10, 2023 • edited Loading

Summary

mxinden commented Feb 13, 2023

MarcoPolo commented Feb 13, 2023

AgeManning commented Feb 13, 2023 • edited Loading

AgeManning commented Feb 13, 2023

MarcoPolo commented Feb 28, 2023

vyzo commented Feb 28, 2023 • edited Loading

AgeManning commented Feb 28, 2023

vyzo commented Mar 1, 2023

MarcoPolo commented Feb 10, 2023 •

edited

Loading

AgeManning commented Feb 13, 2023 •

edited

Loading

vyzo commented Feb 28, 2023 •

edited

Loading