identify: specify max message size #492

lidel · 2022-12-06T20:45:33Z

This PR proposes specifying max size of Identify message to unify behavior across implementations.

Limit of 4096 bytes is based on current value in rust-libp2p, but we can pick something else.
(I was not able to quickly find any limit in go-libp2p)

License: MIT Signed-off-by: Marcin Rataj <lidel@lidel.org>

marten-seemann · 2022-12-06T21:49:08Z

identify/README.md

@@ -83,6 +83,8 @@ message Identify {
 }
 ```

+Implementations must reject `Identify` messages bigger than 4096 bytes.


Suggested change

Implementations must reject `Identify` messages bigger than 4096 bytes.

Implementations SHOULD NOT send `Identify` messages bigger than 4096 bytes.

Proposal by @marten-seemann sounds good to me.

To add some rational.

There might be valid use-cases where > 4096 bytes is a good idea. Thus using SHOULD. This would allow individual implementations to offer a flag to disable the check.

Pushing the requirement to the sender instead of the receiver would follow the robustness princicple, i.e. be conservative of what one sends, but liberal in what one accepts. Though in the end, I don't think it matters much.

@lidel in addition of the above proposal, what do you think of mentioning the main motivation, resilience against DOS attacks. Something along the lines: "This limit allows receivers to drop messages exceeding the limit, thus better defend against DOS attacks."

sgtm, will apply

Suggested change

Implementations must reject `Identify` messages bigger than 4096 bytes.

Implementations SHOULD NOT send `Identify` messages bigger than 4096 bytes.

This limit allows receivers to drop messages exceeding the limit, thus better defend against DOS attacks.

mxinden

Looks good to me from the rust-libp2p side. Thanks @lidel for the patch!

@Menduist what do you think for nim-libp2p?

@Nashatyrev what do you think for jvm-libp2p?

mxinden · 2022-12-08T14:59:29Z

identify/README.md

@@ -83,6 +83,8 @@ message Identify {
 }
 ```

+Implementations must reject `Identify` messages bigger than 4096 bytes.


Proposal by @marten-seemann sounds good to me.

To add some rational.

There might be valid use-cases where > 4096 bytes is a good idea. Thus using SHOULD. This would allow individual implementations to offer a flag to disable the check.

Pushing the requirement to the sender instead of the receiver would follow the robustness princicple, i.e. be conservative of what one sends, but liberal in what one accepts. Though in the end, I don't think it matters much.

@lidel in addition of the above proposal, what do you think of mentioning the main motivation, resilience against DOS attacks. Something along the lines: "This limit allows receivers to drop messages exceeding the limit, thus better defend against DOS attacks."

Menduist · 2022-12-08T15:08:07Z

LGTM, notes that this implies a limit on the SPR too

mxinden · 2022-12-14T15:40:17Z

Unless there are any objections, I will move forward here next week.

@lidel could you please bump the specification revision and accept @marten-seemann's suggestion above?

License: MIT Signed-off-by: Marcin Rataj <lidel@lidel.org>

lidel · 2022-12-15T00:56:18Z

@mxinden done in e4cb42c :)

Nashatyrev · 2022-12-15T05:52:15Z

Just for the reference here is Jvm-libp2p commit addressing the issue: libp2p/jvm-libp2p@5e23467
We set the Identify message size limit to 1Mb there.

May be 4Kb is too restrictive for general use? There could be applications with a lot of supported protocols which wouldn't fit to Identify.protocols field

From my perspective it's important to bound the size with any reasonable value. So may be 1Mb would be as safe as 4Kb.

Nashatyrev · 2022-12-15T06:12:06Z

E.g. Ethereum has just 6 RPC methods, but each of them

is identified by ~64 symbol protocol name
has several versions (distinct protocol for each version)
during hardforks the total number of protocols might increase x2 (as the protocol name incorporates fork identifier)

So even now the protocols field could be as large as 2Kb.
After adding 2-3 new method versions it could exceed 4Kb

I see that 4Kb is kind of 'recommendation' but just a bit worried that this limit could be hardcoded to some Libp2p implementation and could just stop working at some point.
(upd): like it's currently done in Rust :)

Winterhuman · 2022-12-15T14:02:18Z

Maybe the maximum could be 16KiB to mimic the IPNS record max size? If IPNS is safe using 16KiB then it seems it should be a safe value in general

mxinden

I see that 4Kb is kind of 'recommendation' but just a bit worried that this limit could be hardcoded to some Libp2p implementation and could just stop working at some point.

I doubt we will find a value that will fit all use-cases. Thus agreed that this is a recommendation and should be configurable by implementations.

Thanks for sharing details for the Ethereum network.

@Nashatyrev unless you feel strongly about bumping this value, I suggest we proceed here.

identify/README.md

Nashatyrev · 2022-12-30T14:19:45Z

@Nashatyrev unless you feel strongly about bumping this value, I suggest we proceed here.

I'm totally fine with this 👍

marten-seemann · 2022-12-30T20:19:01Z

I disagree that setting a small message size is the way to go. This has already led us to implement a chunked encoding (https://github.com/libp2p/go-libp2p/pull/958/files). I would have hoped that the days of chunked encodings were behind us...

Regarding DoS defense, there's little advantage to splitting up a large message into multiple smaller ones. Yes, you get incremental verifiability, but this hardly counts as an advantage when we're dealing with chunks of a few kB.

Really, if we actually think that the maximum of data we can accept without risking exposing ourselves to a DoS attack is that close to sizes that are actually useful for the protocol, there's something wrong with the protocol (I do think that Identify is a bloated protocol, but that's a different story).
We need to acknowledge: Shipping lists of supported protocols and lists of multiaddrs takes up space. A DoS protection limit should be set such that well-behaved implementations don't have to worry about running into this limit. DoS limits exist to deter malicious nodes, not to make the life of well-behaved nodes miserable.

Looking at memory consumption of typical a libp2p connection, a size limit of 4 kB seems extreme stingy. We allow stream muxers to consume up to 15 MB in both go-libp2p and rust-libp2p. A memory limit of 128 - 256 kB per Identify wouldn't be crazy, if the number of concurrent Identify streams is limited.

identify: specify max message size

9c869ee

License: MIT Signed-off-by: Marcin Rataj <lidel@lidel.org>

lidel requested review from marten-seemann and mxinden December 6, 2022 20:45

marten-seemann reviewed Dec 6, 2022

View reviewed changes

mxinden approved these changes Dec 8, 2022

View reviewed changes

identify: apply feedback from review

e4cb42c

License: MIT Signed-off-by: Marcin Rataj <lidel@lidel.org>

mxinden reviewed Dec 29, 2022

View reviewed changes

identify/README.md Outdated Show resolved Hide resolved

Update identify/README.md

f6a5f2e

Menduist mentioned this pull request Feb 28, 2023

feat(webrtc): add WebRTC (prev. browser-to-browser) spec #497

Merged

2 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

identify: specify max message size #492

identify: specify max message size #492

lidel commented Dec 6, 2022 •

edited

Loading

marten-seemann Dec 6, 2022

mxinden Dec 8, 2022

lidel Dec 15, 2022

mxinden left a comment

mxinden Dec 8, 2022

Menduist commented Dec 8, 2022

mxinden commented Dec 14, 2022

lidel commented Dec 15, 2022

Nashatyrev commented Dec 15, 2022

Nashatyrev commented Dec 15, 2022 •

edited

Loading

Winterhuman commented Dec 15, 2022

mxinden left a comment

Nashatyrev commented Dec 30, 2022

marten-seemann commented Dec 30, 2022

	Implementations must reject `Identify` messages bigger than 4096 bytes.
	Implementations SHOULD NOT send `Identify` messages bigger than 4096 bytes.

	Implementations must reject `Identify` messages bigger than 4096 bytes.
	Implementations SHOULD NOT send `Identify` messages bigger than 4096 bytes.
	This limit allows receivers to drop messages exceeding the limit, thus better defend against DOS attacks.

identify: specify max message size #492

Are you sure you want to change the base?

identify: specify max message size #492

Conversation

lidel commented Dec 6, 2022 • edited Loading

marten-seemann Dec 6, 2022

Choose a reason for hiding this comment

mxinden Dec 8, 2022

Choose a reason for hiding this comment

lidel Dec 15, 2022

Choose a reason for hiding this comment

mxinden left a comment

Choose a reason for hiding this comment

mxinden Dec 8, 2022

Choose a reason for hiding this comment

Menduist commented Dec 8, 2022

mxinden commented Dec 14, 2022

lidel commented Dec 15, 2022

Nashatyrev commented Dec 15, 2022

Nashatyrev commented Dec 15, 2022 • edited Loading

Winterhuman commented Dec 15, 2022

mxinden left a comment

Choose a reason for hiding this comment

Nashatyrev commented Dec 30, 2022

marten-seemann commented Dec 30, 2022

lidel commented Dec 6, 2022 •

edited

Loading

Nashatyrev commented Dec 15, 2022 •

edited

Loading