Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

relay/DCUtR: Add Direct Connection Upgrade through Relay protocol #173

Merged
merged 29 commits into from
Aug 23, 2021
Merged
Changes from 18 commits
Commits
Show all changes
29 commits
Select commit Hold shift + click to select a range
727f8b1
initial DCUtR draft
vyzo May 29, 2019
9db77f0
add paragraph about stream migration
vyzo May 29, 2019
fee2b99
add boilerplate
vyzo May 29, 2019
97e5d61
fix formatting.
raulk May 29, 2019
75ed30b
Merge branch 'libp2p/master' into rfc/dcutr
mxinden Aug 11, 2021
4ccccf5
relay/DCUtR: Copy Protocol Buffer schema from Golang impl
mxinden Aug 11, 2021
4b9549a
relay/DCUtR: Add table of contents
mxinden Aug 11, 2021
73064f9
relay/DCUtR: Add note on protocol id
mxinden Aug 11, 2021
dfc988c
relay/DCUtR: Remove go specific implementation considerations
mxinden Aug 11, 2021
46bd410
relay/DCUtR: Add TODO for retry logic
mxinden Aug 11, 2021
4e94481
relay/DCUtR: Document message length prefixing
mxinden Aug 13, 2021
9d42524
relay/DCUtR: Document maximum message size
mxinden Aug 13, 2021
9958df2
relay/DCUtR: Wrap at 80 chars
mxinden Aug 17, 2021
4b7c1ce
relay/DCUtR: Add mxinden to interest group
mxinden Aug 17, 2021
fe64a21
relay/DCUtR: Document retry logic
mxinden Aug 17, 2021
db9475e
relay/DCUtR: Remove implementation specific event emission
mxinden Aug 17, 2021
2d8b38f
relay/DCUtR: Remove note on obs address sending
mxinden Aug 17, 2021
6530d45
relay/DCUtR: Update date
mxinden Aug 17, 2021
0076c69
relay/DCUtR: Add Marten to interest group
mxinden Aug 22, 2021
b420064
relay/DCUtR: Assign roles and describe hole punching on QUIC (#361)
marten-seemann Aug 23, 2021
af0b9bb
relay/DCUtR: Stress that one should connect to all addresses
mxinden Aug 23, 2021
6f475de
relay/DCUtR: Fix typo
mxinden Aug 23, 2021
17f6275
relay/DCUtR: Mention addressing specification for ObsAddrs field
mxinden Aug 23, 2021
5943d3b
relay/DCUtR: Do not reuse same stream on retry
mxinden Aug 23, 2021
6f558f1
relay/DCUtR: Reword steps for each address
mxinden Aug 23, 2021
f7b43df
relay/DCUtR: Detail on success case
mxinden Aug 23, 2021
85f567d
relay/DCUtR: Inline `Sync` reasoning
mxinden Aug 23, 2021
cab60cc
relay/DCUtR: Remove concrete success rates
mxinden Aug 23, 2021
8001cd9
relay/DCUtR: Reword
mxinden Aug 23, 2021
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
158 changes: 158 additions & 0 deletions relay/DCUtR.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,158 @@
# Direct Connection Upgrade through Relay

| Lifecycle Stage | Maturity | Status | Latest Revision |
|-----------------|---------------|--------|--------------------|
| 1A | Working Draft | Active | r0, 2021-08-17 |

Authors: [@vyzo]

Interest Group: [@raulk], [@stebalien], [@whyrusleeping], [@mxinden]
mxinden marked this conversation as resolved.
Show resolved Hide resolved

[@vyzo]: https://github.com/vyzo
[@raulk]: https://github.com/raulk
[@stebalien]: https://github.com/stebalien
[@whyrusleeping]: https://github.com/whyrusleeping
[@mxinden]: https://github.com/mxinden
mxinden marked this conversation as resolved.
Show resolved Hide resolved

See the [lifecycle document](https://github.com/libp2p/specs/blob/master/00-framework-01-spec-lifecycle.md)
for context about maturity level and spec status.

## Table of Contents

- [Direct Connection Upgrade through Relay](#direct-connection-upgrade-through-relay)
- [Table of Contents](#table-of-contents)
- [Introduction](#introduction)
- [The Protocol](#the-protocol)
- [RPC messages](#rpc-messages)
- [FAQ](#faq)
- [References](#references)

## Introduction

NAT traversal is a quintessential problem in peer-to-peer networks.

We currently utilize relays, which allow us to traverse NATs by using
a third party as proxy. Relays are a reliable fallback, that can
connect peers behind NAT albeit with a high-latency, low-bandwidth
connection. Unfortunately, they are expensive to scale and maintain
if they have to carry all the NATed node traffic in the network.

It is often possible for two peers behind NAT to communicate directly
by utilizing a technique called _hole punching_[1]. The technique
relies on the two peers synchronizing and simultaneously opening
connections to each other to their predicted external address. It
works well for UDP, with an estimated 80% success rate, and reasonably
well for TCP, with an estimated 60% success rate.
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Didn't we see much better numbers than this?

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think they have been in the same ballpark, but I might as well be mistaken. Unfortunately I am unable to access the data from project flare phase 1. Either the data or my access seems to be removed.

@vyzo do you know more here?

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Discussion continued on #173 (comment).


The problem in hole punching, apart from not working all the time, is
the need for rendezvous and synchronization. This is usually
accomplished using dedicated signaling servers [2]. However, this
introduces yet another piece of infrastructure, while still requiring
the use of relays as a fallback for the cases where a direct
connection is not possible.

In this specification, we describe a synchronization protocol for direct
connectivity with hole punching that eschews signaling servers and utilizes
existing relay connections instead. That is, peers start with a relay connection
and synchronize directly, without the use of a signaling server. If the hole
punching attempt is successful, the peers _upgrade_ their connection to a direct
connection and they can close the relay connection. If the hole punching attempt
fails, they can keep using the relay connection as they were.

## The Protocol

Consider two peers, `A` and `B`. `A` wants to connect to `B`, which is
behind a NAT and advertises relay addresses. `A` may itself be behind
a NAT or be a public node.

The protocol starts with the completion of a relay connection from `A`
to `B`. Upon observing the new connection, the inbound peer (here `B`)
raulk marked this conversation as resolved.
Show resolved Hide resolved
checks the addresses advertised by `A` via identify. If that set
includes public addresses, then `A` _may_ be reachable by a direct
Copy link

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Isn't it possible that A may also be directly reachable at a private address if A and B are on the same local network?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yes it is possible, but that would have been dialed directly as the private addresses are still advertised with relay addresses.

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think @albrow has a point. @vyzo: while that should be the case, if we want to be resilient and robust, this protocol should not make assumptions about how any other part of the system behaves. Usually those implicit assumptions make systems brittle.

Luckily our spec lifecycle process allows us to add this topic as an active discussion:

To facilitate open progress tracking and observability, as the Working Draft
evolves, the author(s) SHOULD assemble a checklist of items that are pending
specification, explicitly stating which items are compulsory for promoting the
spec to a Candidate Recommendation.

from: https://github.com/libp2p/specs/blob/master/00-framework-01-spec-lifecycle.md

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Not making this assumption will make us dial private addresses in vain multiple times.
We already have a problem with that.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

At best, we can consider dialing them in the bidirectional part of the protocol.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Also, if A is public and B is private, we can't possibly be behind the same NAT.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Furthermore, for the bidirectional part of the protocol we could check the public address of the other node. If that doesn't match our own, we can't possibly be behind the same NAT and dialing private addrs is pointless.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

It would be nice to avoid dialing private addrs if we can avoid it though. Perhaps we could still exchange them, but in a separate field. Then they can be ignored unless your public address matches the other node and you infer that you're behind the same NAT. Or your implementation may be able to always ignore them, since they would have been dialed previously.

Anyway, I agree that we could punt on this for this round and discuss when we promote to candidate rec.

connection, in which case `B` attempts a unilateral connection upgrade
by initiating a direct connection to `A`.

If the unilateral connection upgrade attempt fails or if `A` is itself a NATed
peer that doesn't advertise public address, then `B` initiates the direct
connection upgrade protocol as follows:
1. `B` opens a stream to `A` using the `/libp2p/dcutr` protocol.
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Note the protocol name /libp2p/dcutr.

2. `B` sends to `A` a `Connect` message containing its observed (and possibly
predicted) addresses from identify and starts a timer to measure RTT of the
relay connection.
3. Upon receving the `Connect`, `A` responds back with a `Connect` message
containing its observed (and possibly predicted) addresses.
4. Upon receiving the `Connect`, `B` sends a `Sync` message and starts a timer
for half the RTT measured from the time between sending the initial `Connect`
and receiving the response.
5. Simultaneous Connect
mxinden marked this conversation as resolved.
Show resolved Hide resolved
- Upon receiving the `Sync`, `A` immediately starts a direct dial to B using
the addresses obtained from the `Connect` message.
mxinden marked this conversation as resolved.
Show resolved Hide resolved
- Upon expiry of the timer, `B` starts a direct dial to `A` using the
addresses obtained from the `Connect` message.
mxinden marked this conversation as resolved.
Show resolved Hide resolved
6. On failure go back to step (2), reusing the same stream opened in (1).
Inbound peers (here `B`) SHOULD retry twice (thus a total of 3 attempts)
before considering the upgrade as failed.
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Note the added retry logic. Also see FAQ further below for additional reasoning.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

That change makes a lot of sense to me.


The purpose of the `Sync` message and `B`'s timer is to allow the two peers to
synchronize so that they perform a simultaneous open that allows hole punching
to succeed.

If the direct connection is successful, then the peers should migrate
marten-seemann marked this conversation as resolved.
Show resolved Hide resolved
to it by prioritizing over the existing relay connection. All new
streams should be opened in the direct connection, while the relay
connection should be closed after a grace period. Existing indefinite
duration streams will have to be recreated in the new connection once
the relay connection is closed.


### RPC messages

All RPC messages sent over a stream are prefixed with the message length in
bytes, encoded as an unsigned variable length integer as defined by the
[multiformats unsigned-varint spec][uvarint-spec].

raulk marked this conversation as resolved.
Show resolved Hide resolved
Implemntations SHOULD refuse encoded RPC messages (length prefix excluded)
mxinden marked this conversation as resolved.
Show resolved Hide resolved
larger than 4 KiB.

RPC messages conform to the following protobuf schema:

```proto
syntax = "proto2";

package holepunch.pb;

message HolePunch {
enum Type {
CONNECT = 100;
SYNC = 300;
}

optional Type type=1;

repeated bytes ObsAddrs = 2;
mxinden marked this conversation as resolved.
Show resolved Hide resolved
}
```

## FAQ

- *Why exchange `CONNECT` and `SYNC` messages once more on each retry?*

Doing an additional CONNECT and SYNC for each retry prevents a flawed RTT
measurement on the first attempt to distort all following retry attempts.

- *Why reuse the same stream for retries?*

Stream opening and stream protocol negotiation might distort the measured
round-trip-time. Reusing the stream from the first attempt allows cutting out
mxinden marked this conversation as resolved.
Show resolved Hide resolved
these distortions, allowing a more precise round-trip-time measurement on the
second and third attempt.

## References

1. Peer-to-Peer Communication Across Network Address Translators. B. Ford and P.
Srisuresh. https://pdos.csail.mit.edu/papers/p2pnat.pdf
2. Interactive Connectivity Establishment (ICE): A Protocol for Network Address
Translator (NAT) Traversal for Offer/Answer Protocols. IETF RFC 5245.
https://tools.ietf.org/html/rfc5245

[uvarint-spec]: https://github.com/multiformats/unsigned-varint