core, eth: split eth package, implement snap protocol #21482

Merged
23 commits merged into ethereum:master from the snapshot-sync-b branch on Dec 14, 2020

Conversation

karalabe
Member

This PR is a rebase of #20800. Since the code changed quite a lot and I needed to do an ugly rebase, I wanted the old code to exist somewhere to compare against if something doesn't behave correctly.

WIP

Database ethdb.Database // Database for direct sync insertions
Chain *core.BlockChain // Blockchain to serve data from
TxPool txPool // Transaction pool to propagate from
Network uint64 // Network identifier to adfvertise
Contributor

Suggested change
Network uint64 // Network identifier to adfvertise
Network uint64 // Network identifier to advertise


func (pm *ProtocolManager) runPeer(p *peer) error {
if !pm.chainSync.handlePeerEvent(p) {
// runEthPeer
Contributor

missing documentation

// Read the next message from the remote peer, and ensure it's fully consumed
msg, err := p.rw.ReadMsg()
if err != nil {
// runSnapPeer
Contributor

missing documentation here too
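
Purely as a sketch of what such doc comments could say (the wording and the simplified stub signatures below are assumptions, not the documentation that actually landed in the PR):

```go
package eth

// Illustrative stub only; the real handlers take the concrete peer types from this PR.
type peer struct{}

// runEthPeer registers the remote as an `eth` peer with the sync machinery and
// services its message loop until the peer disconnects or errors out.
func runEthPeer(p *peer) error { return nil }

// runSnapPeer does the same for a `snap` satellite connection, reading and
// dispatching snapshot-protocol messages until the connection is torn down.
func runSnapPeer(p *peer) error { return nil }
```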

if eth != nil {
eth.Peer.Disconnect(p2p.DiscUselessPeer)
}
if snap != nil {
Contributor

is there a reason for doing the additional check for snap != nil down here too?

can you disconnect the useless peer after unregistering it, after L346?

eth/peerset.go (outdated, resolved)
func (d *Downloader) RegisterPeer(id string, version int, peer Peer) error {
logger := log.New("peer", id)
func (d *Downloader) RegisterPeer(id string, version uint, peer Peer) error {
logger := log.New("peer", id[:16])
Contributor

geth copydb causes this to panic:

	logger := log.New("peer", id[:16])
	logger.Trace("Registering sync peer")
	if err := d.peers.Register(newPeerConnection(id, version, peer, logger)); err != nil {
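
As a quick illustration of the panic (the short id below is invented, and the length guard is just one possible workaround, not necessarily the fix taken here): Go slice expressions are bounds-checked, so id[:16] panics whenever the peer id is shorter than 16 bytes, which is apparently what geth copydb ends up registering.

```go
package main

import "fmt"

func main() {
	id := "copydb" // stand-in for a short synthetic peer id (made up)
	// id[:16] would panic here: slice bounds out of range [:16] with length 6.
	short := id
	if len(short) > 16 {
		short = short[:16] // only truncate when there is actually something to truncate
	}
	fmt.Println(short)
}
```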

@karalabe karalabe force-pushed the snapshot-sync-b branch 2 times, most recently from dcdc3de to 0bf5df5 on October 9, 2020 09:25
@holiman holiman left a comment

A few comments

eth/protocols/snap/handler.go (outdated, resolved)
eth/protocols/snap/handler.go (outdated, resolved)
eth/protocols/snap/handler.go (resolved)
eth/protocols/snap/handler.go (outdated, resolved)
@holiman holiman left a comment

Some potential DoS vectors (?)

eth/protocols/snap/handler.go (outdated, resolved)
eth/protocols/snap/handler.go (outdated, resolved)
eth/protocols/snap/handler.go (outdated, resolved)
trie/proof.go (outdated)
Comment on lines 464 to 466
for i := 0; i < len(keys)-1; i++ {
if bytes.Compare(keys[i], keys[i+1]) >= 0 {
return errors.New("range is not monotonically increasing"), false
return nil, nil, false, errors.New("range is not monotonically increasing")
}
}
Contributor

Wouldn't it make sense to have this check include firstKey and lastKey as well? Something like this:

	var pKey = firstKey
	for i := 0; i < len(keys)-1; i++ {
		if bytes.Compare(pKey, keys[i]) >= 0 {
			return nil, nil, false, errors.New("range is not monotonically increasing")
		}
		pKey = keys[i]
	}
	if bytes.Compare(pKey, keys[len(keys)-1]) >= 0 {
		return nil, nil, false, errors.New("range is not monotonically increasing")
	}

Contributor

Or rather

	// Ensure the received batch is monotonic increasing.
	if firstKey != nil && len(keys) > 0 {
		if bytes.Compare(firstKey, keys[0]) > 0 {
			return nil, nil, false, errors.New("range is not monotonically increasing [0]")
		}
	}
	for i := 0; i < len(keys)-1; i++ {
		if bytes.Compare(keys[i], keys[i+1]) >= 0 {
			return nil, nil, false, errors.New("range is not monotonically increasing [1]")
		}
	}
	if lastKey != nil && len(keys) > 0 {
		if bytes.Compare(keys[len(keys)-1], lastKey) > 0 {
			return nil, nil, false, errors.New("range is not monotonically increasing [2]")
		}
	}

@holiman holiman Nov 6, 2020

For the snap implementation, this doesn't matter, since AFAICT the snap callers always only send in the origin and the last key as firstKey and lastKey.
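
As an aside, a tiny runnable illustration of the boundary case the extra comparisons above are about (the byte values are made up): a batch that is sorted internally but lies entirely below firstKey passes the original pairwise loop, and only the added firstKey comparison flags it.

```go
package main

import (
	"bytes"
	"fmt"
)

func main() {
	firstKey := []byte{0x05}
	keys := [][]byte{{0x03}, {0x04}} // internally sorted, but entirely below firstKey

	inOrder := true
	for i := 0; i < len(keys)-1; i++ {
		if bytes.Compare(keys[i], keys[i+1]) >= 0 {
			inOrder = false
		}
	}
	fmt.Println(inOrder)                              // true: the original pairwise check passes
	fmt.Println(bytes.Compare(firstKey, keys[0]) > 0) // true: the added firstKey check rejects the range
}
```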

Comment on lines +106 to +113
defer func() {
// Cancel active request timers on exit. Also set peers to idle so they're
// available for the next sync.
for _, req := range active {
req.timer.Stop()
req.peer.SetNodeDataIdle(int(req.nItems), time.Now())
}
}()
Contributor

Hm, odd that we don't have this on master, if it's something we should have?

@karalabe
Member Author

Random memo: We can eventually make the protocol types (messages, numbers) public so they can be used by external testing frameworks without redefining them locally.

@holiman
Contributor

holiman commented Dec 12, 2020

On a branch of mine, snapshot-sync-c, I rebased this onto current master and did two regular fast-syncs for some sanity checking. Both completed in ~7-8 hours: https://geth-bench.ethdevops.io/d/Jpk-Be5Wk/dual-geth?orgId=1&from=1607712736289&to=1607756400000&var-exp=bench03&var-master=bench04&var-percentile=50. All good!

@holiman
Contributor

holiman commented Dec 12, 2020

On that same this-PR-on-master branch, I set bench03 and 04 to do snap-sync instead (ansible-playbook configure_bench03-04.yaml -t wipe,prep,bench -e "img_a=holiman/geth-experimental:latest" -e "img_b=holiman/geth-experimental:latest" -e '{"geth_args_custom":["--snapshot"]}' -e "syncmode=snap"). Bench03 seems to have picked up a peer to sync with (using DNS discovery only), without even having been added as a priority client at the bootnode(s).

Note though, they're not running with karalabe#42, so they might stall eventually.

@holiman
Contributor

holiman commented Dec 12, 2020

bench03 got synced in about 6.5 hours: https://geth-bench.ethdevops.io/d/Jpk-Be5Wk/dual-geth?orgId=1&from=1607762821723&to=1607788883492&var-exp=bench03&var-master=bench04&var-percentile=50

It seems that bench03 had only one snap-capable peer

enode://f6b362a821ba4898b52fffc7a365a494791a3090b3c7ca855f0f292be97be0c7417d0d48d8ccc520c464802a25a5cb34891f64696171f7755b2a4cb348b52027@52.56.142.13:30303

@holiman
Contributor

holiman commented Dec 12, 2020

Hm, it's this one: https://github.com/ethereum/discv4-dns-lists/blob/master/snap.mainnet.ethdisco.net/nodes.json#L18. So apparently there are 6 snap-1 nodes on mainnet - that particular one was bench01.

@holiman
Contributor

holiman commented Dec 13, 2020

Later in the evening, at about 20:30, bench04 also found a snap peer, and finished the sync in ~3 hours. https://geth-bench.ethdevops.io/d/Jpk-Be5Wk/dual-geth?orgId=1&from=1607764411539&to=1607828594396&var-exp=bench03&var-master=bench04&var-percentile=50

@holiman holiman merged commit 017831d into ethereum:master Dec 14, 2020
@holiman holiman added this to the 1.10.0 milestone Dec 14, 2020
} else {
cfg.TrieCleanCache += cfg.SnapshotCache
cfg.SnapshotCache = 0 // Disabled
}
Member Author

if !ctx.GlobalIsSet(SnapshotFlag.Name) && cfg.SyncMode != downloader.SnapSync {
	cfg.TrieCleanCache += cfg.SnapshotCache
	cfg.SnapshotCache = 0 // Disabled
}

// If snap-sync is requested, this flag is also required
if cfg.SyncMode == downloader.SnapSync {
log.Info("Snap sync requested, enabling --snapshot")
ctx.Set(SnapshotFlag.Name, "true")
@karalabe karalabe Jan 5, 2021

This line is a noop.

@@ -460,7 +460,7 @@ func (d *Downloader) syncWithPeer(p *peerConnection, hash common.Hash, td *big.I
}
}()
if p.version < 64 {
return errTooOld
return fmt.Errorf("%w, peer version: %d", errTooOld, p.version)
Member Author

%w: :P
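
For context on the %w remark: wrapping with %w keeps errTooOld in the error chain, so callers that match on the sentinel via errors.Is keep working while the message gains the peer version. A minimal standalone sketch (only the version check and the fmt.Errorf line mirror the diff above; the sentinel's message text is made up):

```go
package main

import (
	"errors"
	"fmt"
)

var errTooOld = errors.New("peer is too old") // illustrative message, not the real sentinel text

func checkVersion(version uint) error {
	if version < 64 {
		// %w wraps the sentinel, so the extra context doesn't break matching upstream.
		return fmt.Errorf("%w, peer version: %d", errTooOld, version)
	}
	return nil
}

func main() {
	err := checkVersion(63)
	fmt.Println(err)                       // "peer is too old, peer version: 63"
	fmt.Println(errors.Is(err, errTooOld)) // true: the wrapped sentinel still matches
}
```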

@hadv
Contributor

hadv commented Nov 10, 2021

@karalabe Would you explain in a bit more detail why we need the heal phase for the snap protocol? Thank you so much!

@karalabe
Member Author

@hadv Geth maintains 1 persistent snapshot layer on disk and 128 diff layers in memory. This means that if you start syncing the head snapshot, after 128 blocks (~30 minutes) that snapshot will not be available in the network any more. Geth then switches to continuing the sync on a new snapshot, but that won't match the old one. We do this until we sync everything, but by that time we have pieces of many different snapshots (the count depending on how fast you can sync up). The heal phase just "glues" these snapshot layers together, plus integrates any changes that have occurred since you downloaded a particular snapshot segment.
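
A very rough toy model of the situation described above (illustrative only, none of this is the real go-ethereum implementation): the network only serves recent state roots, so a long range download ends up stitched together from pieces fetched against different roots, and the heal phase is what reconciles that patchwork onto a single final root.

```go
package main

import "fmt"

// piece records one downloaded key range and the "state root" it was served from.
type piece struct {
	root        int // modelled simply as the block height the root belongs to
	first, last int // key range covered by this piece
}

func main() {
	const totalKeys = 100 // keys we need to cover
	const window = 25     // keys we manage to fetch before the served root moves on (~128 blocks in reality)

	head := 0
	var pieces []piece
	for covered := 0; covered < totalKeys; covered += window {
		head += window // the chain advanced while we were downloading
		pieces = append(pieces, piece{root: head, first: covered, last: covered + window - 1})
	}
	fmt.Println("downloaded pieces:", pieces) // every piece belongs to a different root

	// Heal phase, conceptually: walk the state at the final root and re-fetch
	// whatever is missing or stale, i.e. at worst everything that was fetched
	// against an older root and has changed since.
	stale := 0
	for _, p := range pieces {
		if p.root != head {
			stale += p.last - p.first + 1
		}
	}
	fmt.Printf("up to %d of %d keys may need healing against root %d\n", stale, totalKeys, head)
}
```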
