core: remove unnecessary fields in log #17106

rjl493456442 · 2018-06-30T08:43:29Z

This PR drop some unnecessary fields in Recepit, TxLookup and TransactionLog structs to save database storage.

karalabe · 2018-07-02T08:11:06Z

The interesting question we need to figure out is how to do the upgrade path. The code as is in theory works, but is neither forward nor backward compatible.

The way we did seamless upgrades until now was to support both formats (for a few geth releases) and run a background thread that makes the conversion. I think it would be important to at least implement support for the combo-format. The database upgrade is not that important since pruning might require a resync anyway, but its still important for auto-updating nodes to remain operational.

So, what would be essential for this PR is to expand the rawdb.ReadReceipts method, so it tries to decode the receipt in the new format... but if it fails, it will try to decode the old format too before erroring out. This would ensure that a node which starts running this PR on top of an old database will remain operational.

Downgrade of course is not possible, so this PR needs a major version bump, but seamless upgrade (or at least continuous operation) is essential.

core/types/log.go

AlexeyAkhunov · 2018-07-02T14:28:24Z

Is there a corresponding PR for receipts? Because you can remove some fields there too, including bloom filters. Here is what I have done in Turbo-geth: AlexeyAkhunov@017a9e8

rjl493456442 · 2018-07-04T16:57:03Z

@AlexeyAkhunov Thank you for the reference. Will add the optimization to my PR!

eth/backend.go

holiman · 2019-01-09T13:09:12Z

This is problematic..
So with the db versioning,

If I start this PR with 4 on top of old db, 0, it will not show me a log that it does indeed update the db, but it will write an rlp 4. Subsequent tries will discover the same id.
If I then go back and use an older version, it will for some reason read that 4 as a 0, and just try to overwrite it again with a 3.
If I then start this PR again, it will read that attempted 3 as a 0, and again just silently overwrite it.

So the db versioning does not work, at all :(

holiman

I'd recommend to include the db version in the log output, even if skipDbVersionCheck is used -- having it in the logs will help us debug potential errors if people switch back and forth between the broken one which overwrites the version number with nil.

I suggest the following modification:

diff --git a/eth/backend.go b/eth/backend.go
index 2a9d56c5c..0b3625c41 100644
--- a/eth/backend.go
+++ b/eth/backend.go
@@ -139,16 +139,20 @@ func New(ctx *node.ServiceContext, config *Config) (*Ethereum, error) {
 		bloomIndexer:   NewBloomIndexer(chainDb, params.BloomBitsBlocks, params.BloomConfirms),
 	}
 
-	log.Info("Initialising Ethereum protocol", "versions", ProtocolVersions, "network", config.NetworkId)
+	bcVersion := rawdb.ReadDatabaseVersion(chainDb)
+	var dbVer = "<nil>"
+	if bcVersion != nil {
+		dbVer = fmt.Sprintf("%d", *bcVersion)
+	}
+	log.Info("Initialising Ethereum protocol", "versions", ProtocolVersions, "network", config.NetworkId, "db version", dbVer)
 
 	if !config.SkipBcVersionCheck {
-		bcVersion := rawdb.ReadDatabaseVersion(chainDb)
 		if bcVersion != nil && *bcVersion > core.BlockChainVersion {
 			return nil, fmt.Errorf("database version is v%d, Geth %s only supports v%d", *bcVersion, params.VersionWithMeta, core.BlockChainVersion)
-		} else if bcVersion != nil && *bcVersion < core.BlockChainVersion {
-			log.Warn("Upgrade blockchain database version", "from", *bcVersion, "to", core.BlockChainVersion)
+		}else if bcVersion == nil || *bcVersion < core.BlockChainVersion{
+			log.Warn("Upgrading blockchain database version", "from", dbVer, "to", core.BlockChainVersion)
+			rawdb.WriteDatabaseVersion(chainDb, core.BlockChainVersion)
 		}
-		rawdb.WriteDatabaseVersion(chainDb, core.BlockChainVersion)
 	}
 	var (
 		vmConfig = vm.Config{

Db versioning now seems to work fine, I've tested with nil, lower, same and higher version numbers.

holiman · 2019-01-24T07:17:51Z

core/rawdb/accessors_indexes.go

 		return nil, common.Hash{}, 0, 0
 	}
-	return receipts[receiptIndex], blockHash, blockNumber, receiptIndex
+	receipts := ReadReceipts(db, blockHash, *blockNumber)


The ReadReceipts method iterates over the data and returns an []receipts, and this method iterates over those again to pick out the one we're interested in. Would it be worthwhile to instead have a readReceipt method that only returns the one we're interested it, or is that just an unnecessary optimisation?

Since normally a block only contains 200 transactions, so I think it won't hit too much performance if we iterate receipt slice twice.
We can have a similar implementation as ReadRecepits(read blob from db, decode, assemble log), but kind of redundant.

holiman

LGTM

core/rawdb/schema.go

core/rawdb/accessors_indexes.go

core/types/receipt.go

karalabe

LGTM

Matthalp-zz · 2019-03-03T03:41:02Z

@karalabe @holiman Was there a reason not to remove the txHash field on storedReceipts (32 bytes) instead of the index on (Legacy)TxLookupEntry (8 bytes)?

rjl493456442 · 2019-03-03T11:51:49Z

@matthalp My original idea for keeping txhash in receipt is: If we delete txhash, we need to read blockBody to assemble txhash field during retrieve receipt. But I think blockBody and Receipts are not necessarily bound. For example, in some scenarios, receive may be stored, but blockBody is not stored. It is kind of weird that receipts content should rely on the blockBody to assemble.

For the txLookupEntry, BlockBody and Transaction, these three stuff is truly bound. It doesn't make any sense if we only store the txLookupEntry while relative BlockBody is missing.

But these all are my own understanding.

Matthalp-zz · 2019-03-03T15:05:01Z

For example, in some scenarios, receive may be stored, but blockBody is not stored.

@rjl493456442 Could you point me to an example? I don't believe we store receipts unless we have assume we will have its corresponding body (even in light). My mental model is that if we have the block body, we should have the block header and if we have the block receipts we should have the block body. The logic in blockchain.go also makes similar assumptions at times.

It is kind of weird that receipts content should rely on the blockBody to assemble.

From what I can tell, almost all of the use cases were txhash is frequently used are in close proximity to a block body where that information could be recovered. While there could be some overhead of doing an independent lookup, this can be resolves with a good caching layer as the work in #19200 is working towards.

…ereum#17106) * core: remove unnecessary fields in log * core: bump blockchain database version * core, les: remove unnecessary fields in txlookup * eth: print db version explicitly * core/rawdb: drop txlookup entry struct wrapper

The encoding of Log and LogForStorage is exactly the same now. After tracking it down it seems like #17106 changed the storage schema of logs to be the same as the consensus encoding. Support for the legacy format was dropped in #22852 and if I'm not wrong there's no reason anymore to have these two equivalent types. Since the RLP encoding simply contains the first three fields of Log, we can also avoid creating a temporary struct for encoding/decoding, and use the rlp:"-" tag in Log instead. Note: this is an API change in core/types. We decided it's OK to make this change because LogForStorage is an implementation detail of go-ethereum and the type has zero uses outside of package core/types. Co-authored-by: Felix Lange <fjl@twurst.com>

The encoding of Log and LogForStorage is exactly the same now. After tracking it down it seems like ethereum#17106 changed the storage schema of logs to be the same as the consensus encoding. Support for the legacy format was dropped in ethereum#22852 and if I'm not wrong there's no reason anymore to have these two equivalent types. Since the RLP encoding simply contains the first three fields of Log, we can also avoid creating a temporary struct for encoding/decoding, and use the rlp:"-" tag in Log instead. Note: this is an API change in core/types. We decided it's OK to make this change because LogForStorage is an implementation detail of go-ethereum and the type has zero uses outside of package core/types. Co-authored-by: Felix Lange <fjl@twurst.com>

rjl493456442 requested review from holiman and karalabe as code owners June 30, 2018 08:43

rjl493456442 changed the title ~~core: remove unnecessary fields in log~~ WIP core: remove unnecessary fields in log Jun 30, 2018

rjl493456442 force-pushed the downsize_log branch 2 times, most recently from 19c7446 to 87f5938 Compare July 2, 2018 07:34

karalabe added this to the 1.9.0 milestone Jul 2, 2018

karalabe reviewed Jul 2, 2018

View reviewed changes

core/types/log.go Outdated Show resolved Hide resolved

rjl493456442 force-pushed the downsize_log branch 4 times, most recently from aa441f0 to bfad48f Compare July 6, 2018 12:30

rjl493456442 force-pushed the downsize_log branch from bfad48f to ae0e3c2 Compare December 12, 2018 06:02

holiman reviewed Jan 9, 2019

View reviewed changes

eth/backend.go Outdated Show resolved Hide resolved

rjl493456442 added 2 commits January 22, 2019 15:57

core: remove unnecessary fields in log

1ba971e

core: bump blockchain database version

b41c584

rjl493456442 force-pushed the downsize_log branch from 1f8520c to b41c584 Compare January 22, 2019 08:01

rjl493456442 changed the title ~~WIP core: remove unnecessary fields in log~~ core: remove unnecessary fields in log Jan 22, 2019

rjl493456442 mentioned this pull request Jan 22, 2019

types, rawdb, core, miner: add block location fields to receipt #17662

Merged

rjl493456442 requested a review from zsfelfoldi as a code owner January 23, 2019 06:47

core, les: remove unnecessary fields in txlookup

49c1f9c

rjl493456442 force-pushed the downsize_log branch from 64b4f7c to 49c1f9c Compare January 24, 2019 04:30

holiman requested changes Jan 24, 2019

View reviewed changes

eth: print db version explicitly

3d5019b

holiman approved these changes Jan 24, 2019

View reviewed changes

karalabe reviewed Feb 21, 2019

View reviewed changes

core/rawdb/schema.go Outdated Show resolved Hide resolved

karalabe reviewed Feb 21, 2019

View reviewed changes

core/rawdb/accessors_indexes.go Outdated Show resolved Hide resolved

karalabe reviewed Feb 21, 2019

View reviewed changes

core/types/receipt.go Show resolved Hide resolved

core/rawdb: drop txlookup entry struct wrapper

e8b4262

karalabe approved these changes Feb 21, 2019

View reviewed changes

karalabe merged commit 7fd0cca into ethereum:master Feb 21, 2019

karalabe mentioned this pull request Feb 28, 2019

core/types: fix receipt legacy decoding #19182

Merged

Matthalp-zz mentioned this pull request Apr 11, 2019

core, eth, les, light: avoid storing computable receipt metadata #19345

Merged

s1na mentioned this pull request Jul 7, 2021

core/types: remove LogForStorage type #23173

Merged

gzliudan mentioned this pull request Aug 19, 2024

reduce the receipt RLP size in chain database XinFinOrg/XDPoSChain#570

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

core: remove unnecessary fields in log #17106

core: remove unnecessary fields in log #17106

rjl493456442 commented Jun 30, 2018 •

edited

Loading

karalabe commented Jul 2, 2018 •

edited

Loading

AlexeyAkhunov commented Jul 2, 2018

rjl493456442 commented Jul 4, 2018

holiman commented Jan 9, 2019

holiman left a comment

holiman Jan 24, 2019

rjl493456442 Jan 24, 2019

holiman left a comment

karalabe left a comment

Matthalp-zz commented Mar 3, 2019

rjl493456442 commented Mar 3, 2019

Matthalp-zz commented Mar 3, 2019

core: remove unnecessary fields in log #17106

core: remove unnecessary fields in log #17106

Conversation

rjl493456442 commented Jun 30, 2018 • edited Loading

karalabe commented Jul 2, 2018 • edited Loading

AlexeyAkhunov commented Jul 2, 2018

rjl493456442 commented Jul 4, 2018

holiman commented Jan 9, 2019

holiman left a comment

Choose a reason for hiding this comment

holiman Jan 24, 2019

Choose a reason for hiding this comment

rjl493456442 Jan 24, 2019

Choose a reason for hiding this comment

holiman left a comment

Choose a reason for hiding this comment

karalabe left a comment

Choose a reason for hiding this comment

Matthalp-zz commented Mar 3, 2019

rjl493456442 commented Mar 3, 2019

Matthalp-zz commented Mar 3, 2019

rjl493456442 commented Jun 30, 2018 •

edited

Loading

karalabe commented Jul 2, 2018 •

edited

Loading