protocols/relay: Implement circuit relay specification #1838

mxinden · 2020-11-15T17:10:27Z

This pull request implements the libp2p circuit relay specification. It is based on previous work from #1134.

Instead of altering the Transport trait, the approach taken in this pull request is to wrap an existing implementation of Transport allowing one to:

Intercept dial requests with a relayed address.
Inject incoming relayed connections with the local node being the destination.
Intercept listen_on requests pointing to a relay, ensuring to keep a constant connection to the relay, waiting for incoming requests with the local node being the destination.

More concretely one would wrap an existing Transport implementation as seen below, allowing the Relay behaviour and the RelayTransport to communicate via channels.

Example

let (relay_transport, relay_behaviour) = new_transport_and_behaviour(
    RelayConfig::default(),
    MemoryTransport::default(),
);

let transport = relay_transport
    .upgrade(upgrade::Version::V1)
    .authenticate(plaintext)
    .multiplex(YamuxConfig::default())
    .boxed();

let mut swarm = Swarm::new(transport, relay_behaviour, local_peer_id);

let relay_addr = Multiaddr::from_str("/memory/1234").unwrap()
    .with(Protocol::P2p(PeerId::random().into()))
    .with(Protocol::P2pCircuit);
let dst_addr = relay_addr.clone().with(Protocol::Memory(5678));

// Listen for incoming connections via relay node (1234).
Swarm::listen_on(&mut swarm, relay_addr).unwrap();

// Dial node (5678) via relay node (1234).
Swarm::dial_addr(&mut swarm, dst_addr).unwrap();

Status Quo

This pull request is ready to be reviewed and tested. See #1838 (comment) for details.

Closes #725.

mxinden · 2021-03-06T17:12:12Z

Something I wasn't quite sure about yet from the implementation: This does not cover the ability for "multi hop relaying" as specified under "future work" in the spec, or does it?

Correct. It does not. There is also a proposal for circuit relay v2 (libp2p/go-libp2p-circuit#125) which would be worth considering in the future.

mxinden · 2021-03-06T18:17:11Z

I am reasonably sure I addressed all the comments above, most notably the re-work of the listener logic (#1838 (comment)).

In case you have time for another review @romanb, that would be terrific.

romanb

Looks good to me! Glad to see the Arc<Mutex> gone. I only left two more minor comments.

protocols/relay/src/behaviour.rs

romanb · 2021-03-10T10:42:22Z

protocols/relay/src/handler.rs

+            } => {
+                let err_code = match error {
+                    ProtocolsHandlerUpgrErr::Timeout => {
+                        self.pending_error = Some(ProtocolsHandlerUpgrErr::Timeout);


Do we really want to terminate the connection on a single relay substream negotiation timeout? Other protocols may use the connection (successfully) as well, right?

If I understand correctly, given that libp2p-relay uses the default timeout, this will only happen after 10 seconds. I would deem a node that is not able to respond to the light-weight circuit relay requests within 10 seconds as either misbehaving or overloaded. For the former, disconnecting seems to be the way to go. For the latter I would say disconnecting is beneficial for both sides. The local node might be able to succeed in whatever it is up to via another relay or destination node. In addition I would guess other protocols running on the same connection are not making progress either if the light-weight circuit relay negotiation does not make progress. The overloaded remote node would receive less traffic (both through the relay protocol and any other protocol out there) and thus become less overloaded overall

The above said, I don't know what the correct behaviour is and I don't know whether it should be consistent across protocols. E.g. libp2p-request-response does not terminate the whole connection on Timeout, which makes sense to me as the payloads being exchanged could potentially be very large.

rust-libp2p/protocols/request-response/src/handler.rs

Lines 252 to 260 in f48bb15

fn inject_listen_upgrade_error(

&mut self,

info: RequestId,

error: ProtocolsHandlerUpgrErr<io::Error>

) {

match error {

ProtocolsHandlerUpgrErr::Timeout => {

self.pending_events.push_back(RequestResponseHandlerEvent::InboundTimeout(info))

}

With my reasoning above in mind, what do you think @romanb we should do in case of a ProtocolsHandlerUpgrErr::Timeout?

In addition I would guess other protocols running on the same connection are not making progress either if the light-weight circuit relay negotiation does not make progress.

I think conceptually, different protocols using different substreams on the same connection should be considered independently, while necessarily sharing the same network resource. Making it a requirement that all protocols used on a connection work well (e.g. without timeouts) all the time seems a bit problematic. Especially when it comes to timeouts on (outbound) substreams, in my mind, these should be reported to client code which can then a) decide whether to retry and if so how often and with what kind of back-off strategy and b) whether the entire connection should be closed after x timeouts. The concrete application knows what protocols are used on a particular connection, whereas a single protocol does not know with which others it must share the connections. These were also the considerations for libp2p-request-response. Making the fixed choice within a particular protocol that the connection is killed if a particular substream protocol upgrade does not complete within 10 seconds seems very rigid. It is probably fine in this particular instance, so I'm not opposed, but in general I think any errors other than protocol violations, (outbound) timeouts in particular, should just be reported on the API and left to client code to handle.

Add `actively_connect_to_dst_nodes` configuration option. Configures whether to actively establish an outgoing connection to a destination node, when being asked by a source node to relay a connection to said destination node. For security reasons this behaviour is disabled by default, thus a relay node will not actively establish an outgoing connection to a destination node in case it is not yet connected to said destination node. Instead a destination node should establish a connection to a relay node before advertising their relayed address via that relay node to a source node.

mxinden · 2021-03-10T15:01:41Z

I want to draw attention to the most recent commit 7956f0b which is not based on any of the above review comments.

protocols/relay: Disable active relay behaviour by default

Add actively_connect_to_dst_nodes configuration option. Configures
whether to actively establish an outgoing connection to a destination
node, when being asked by a source node to relay a connection to said
destination node.

For security reasons this behaviour is disabled by default, thus a relay
node will not actively establish an outgoing connection to a destination
node in case it is not yet connected to said destination node. Instead a
destination node should establish a connection to a relay node before
advertising their relayed address via that relay node to a source node.

For comparison, here is the same configuration option of the Golang implementation.

mxinden · 2021-03-10T17:58:05Z

Continuing on the discussion in #1838 (comment) above.

In addition I would guess other protocols running on the same connection are not making progress either if the light-weight circuit relay negotiation does not make progress.

I think conceptually, different protocols using different substreams on the same connection should be considered independently, while necessarily sharing the same network resource. Making it a requirement that all protocols used on a connection work well (e.g. without timeouts) all the time seems a bit problematic. Especially when it comes to timeouts on (outbound) substreams, in my mind, these should be reported to client code which can then a) decide whether to retry and if so how often and with what kind of back-off strategy and b) whether the entire connection should be closed after x timeouts. The concrete application knows what protocols are used on a particular connection, whereas a single protocol does not know with which others it must share the connections. These were also the considerations for libp2p-request-response. Making the fixed choice within a particular protocol that the connection is killed if a particular substream protocol upgrade does not complete within 10 seconds seems very rigid. It is probably fine in this particular instance, so I'm not opposed, but in general I think any errors other than protocol violations, (outbound) timeouts in particular, should just be reported on the API and left to client code to handle.

The above reasoning makes sense to me. Thank you for the detailed write-up.

In general this seems like something worth striving for in a consistent manner across ProtocolHandler implementations. E.g. libp2p-gossipsub currently terminates the connection whereas, as mentioned above, libp2p-request-response doesn't.

In the specific case here d55bd99 makes libp2p-relay not close connections on dial and incoming upgrade Timeout errors. Instead connections would be eventually closed through the keep alive mechanism, in case both libp2p-relay as well as all other protocols have no more use for the connection. In the future one could explore actively setting KeepAlive::No on Timeout errors, only switching back to KeepAlive::Yes or KeepAlive::Until in case of any future success. That would speed up the connection garbage collection process.

mxinden · 2021-03-10T18:00:32Z

Glad to see the Arc<Mutex> gone.

Very much agreed!

mxinden · 2021-03-11T15:11:23Z

After 133 commits this is finally merged. 🎉

Thanks goes to @tomaka for the initial version (#1134), @romanb for the reviews and @dvc94ch for the initial testing.

I will publish a release of libp2p-relay v0.1.0 soon.

With libp2p/rust-libp2p#1838 merged rust-libp2p supports the circuit relay v1 protocol. Tagged as "Usable" given that it is still a bit bare bone.

tomaka and others added 30 commits May 19, 2019 20:42

Add back the relay

69bc317

Merge remote-tracking branch 'upstream/master' into relay

11b0c9a

Merge branch 'libp2p/master' into relay

ccbc906

*: Add unit test instantiating a relay

a4d4049

protocols/transport: Add channel between transport and behaviour

c404b45

protocols/relay: Add custom Transport structs

6a17ff5

protocols/relay: Send relayed dial to behaviour

2721f79

protocols/relay/tests: Connect to relay via listen_on

96f2294

protocols/relay: Add TODO on how to specify relays

7dc0142

protocols/relay: Send listen requests to behaviour

5dc6ab8

protocols/relay: Track progress of listen requests in behaviour

43a2eec

protocols/relay: Poll inner upgrade listener

6cbd22f

protocols/relay: Track relay listeners

98a5ff1

protocols/relay/tests: Introduce node a dialing node b via relay

ed98b91

protocols/relay: Split relay and destination address

77469d5

protocols/relay: Dial relay on dial request

e9b7edd

protocols/relay: Send hop request to relay

ff37628

protocols/relay: Parse relay request

0f0e052

protocols/relay: Ask the remote to act as a destination

240d258

protocols/relay: Accept incoming requests

db56783

protocols/relay: Pass incoming substream to transport

559527e

protocols/relay: Pass source address to transport.rs

a513866

protocols/relay: Copy bytes from source to destination and vice versa

babd49b

protocols/relay: Pass negotiated connection back to dialer

2a99f5f

protocols/relay: Stricten test execution

55956eb

protocols/relay: Fix warnings

1dd5cd6

protocols/relay: Refactor test

43ce960

protocols/relay: Add Ping to behaviour

88a3dae

Merge branch 'libp2p/master' into relay

18bcc70

protocols/relay/src/lib: Add terminology section

d300c36

mxinden added 4 commits March 6, 2021 17:54

protocols/relay/protocol: Use Bytes returned by split_to

2ef5e51

protocols/relay: Rename RelayTransportWrapper to RelayTransport

4f16531

protocols/relay: Terminate listener when channel to behaviour closed

6bd1c98

protocols/relay/behaviour: Ignore error when returning dialed conection

b8f7a00

mxinden added 4 commits March 6, 2021 18:16

protocols/relay: Fix broken intra doc links

5899ecd

protocols/relay: Do not panic when accepting dst req fails

a6f1fe7

protocols/relay: Track peer id as peer_id

bc4959d

protocols/relay: Implement Display for RelayError

4c37a94

romanb approved these changes Mar 10, 2021

View reviewed changes

mxinden added 2 commits March 10, 2021 12:32

protocols/relay: Print trace log when dropping event to unknown listener

c5aa466

protocols/relay: Do not close connection on upgrade timeouts

d55bd99

mxinden added 2 commits March 11, 2021 14:42

Merge branch 'master' into relay

0c7f825

*: Update changelogs

ded2d51

mxinden merged commit 2f9c175 into libp2p:master Mar 11, 2021

This was referenced Mar 15, 2021

Update minicbor requirement from 0.7 to 0.8 #1998

Merged

protocols/relay: Ignore IdentifyEvent::Error #2001

Merged

Deprecation of Sentry Nodes paritytech/substrate#6845

Closed

mxinden added a commit to mxinden/website-1 that referenced this pull request Mar 19, 2021

data/bundles: Tag Rust libp2p-relay as Usable

fe8b301

With libp2p/rust-libp2p#1838 merged rust-libp2p supports the circuit relay v1 protocol. Tagged as "Usable" given that it is still a bit bare bone.

mxinden mentioned this pull request Mar 19, 2021

data/bundles: Tag Rust libp2p-relay as Usable libp2p/website#118

Merged

romanb mentioned this pull request Mar 22, 2021

Update to libp2p-0.36 paritytech/substrate#8420

Merged

elenaf9 mentioned this pull request Apr 9, 2021

Feat/comms relay iotaledger/stronghold.rs#183

Merged

7 tasks

mxinden mentioned this pull request Apr 24, 2021

relay/README: Add rust-libp2p to list of implementations libp2p/specs#318

Merged

elenaf9 mentioned this pull request Jun 14, 2021

Relay: Destination peer disconnects after idle time #2102

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

protocols/relay: Implement circuit relay specification #1838

protocols/relay: Implement circuit relay specification #1838

mxinden commented Nov 15, 2020 •

edited

Loading

mxinden commented Mar 6, 2021

mxinden commented Mar 6, 2021

romanb left a comment

romanb Mar 10, 2021

mxinden Mar 10, 2021

romanb Mar 10, 2021 •

edited

Loading

mxinden commented Mar 10, 2021

mxinden commented Mar 10, 2021

mxinden commented Mar 10, 2021

mxinden commented Mar 11, 2021

	fn inject_listen_upgrade_error(
	&mut self,
	info: RequestId,
	error: ProtocolsHandlerUpgrErr<io::Error>
	) {
	match error {
	ProtocolsHandlerUpgrErr::Timeout => {
	self.pending_events.push_back(RequestResponseHandlerEvent::InboundTimeout(info))
	}

protocols/relay: Implement circuit relay specification #1838

protocols/relay: Implement circuit relay specification #1838

Conversation

mxinden commented Nov 15, 2020 • edited Loading

Example

Status Quo

mxinden commented Mar 6, 2021

mxinden commented Mar 6, 2021

romanb left a comment

Choose a reason for hiding this comment

romanb Mar 10, 2021

Choose a reason for hiding this comment

mxinden Mar 10, 2021

Choose a reason for hiding this comment

romanb Mar 10, 2021 • edited Loading

Choose a reason for hiding this comment

mxinden commented Mar 10, 2021

mxinden commented Mar 10, 2021

mxinden commented Mar 10, 2021

mxinden commented Mar 11, 2021

mxinden commented Nov 15, 2020 •

edited

Loading

romanb Mar 10, 2021 •

edited

Loading