UTxO-HD targeting `main` #1267

jasagredo · 2024-09-26T09:45:04Z

Description

The changes from UTxO-HD span ouroboros-consensus, ouroboros-consensus-diffusion and ouroboros-consensus-cardano. The core change is:

The UTxO set is extracted from the LedgerState in the form of LedgerTables.
These tables are stored in the LedgerDB, which can keep them in memory or on disk.
When performing an action that requires UTxOs, we have to ask the LedgerDB for those. This might perform IO.

Here I will explain how I would review this enormous PR. Instead of listing files I will describe concepts, and my suggestion is to go look at the mentioned files (or search for the concepts) then mark the file as viewed to offload it from the brain.

The ledger tables

The first step would be to understand the concept of LedgerTables, see Ouroboros.Consensus.Ledger.Tables.* modules. The LedgerTables are parametrized by l (in the end it will be by blk) and by mk (or MapKinds). MapKinds are just types parametrized by the Key and Value of l. These will be TxIn|TxOut for unitary blocks and CanonicalTxIn|HardForkTxOut for hard fork blocks.
LedgerTables are barbies-like, see Ouroboros.Consensus.Ledger.Tables.Combinators.
LedgerTables are (most commonly) empty (EmptyMK), a (possibly restricted) UTxO set (ValuesMK), a set of TxIns (KeysMK), a sequence of differences (DiffMK) or a combination of values + diffs (TrackingMK). The only non-obvious one is DiffMK, which is a map of sequences of changes to a value (in the UTxO case values don't change, they are created and destroyed, so there will be at most 2 elements there). On top of that there is a DiffSeqMK, which is a fingertree of differences and is only used in V1 (see below).
The LedgerState is itself parametrized by this same mk. The data instances will then make use of that mk to define tables associated with the block. So the Byron ledger state ignores it, the Shelley ledger state has a new field with the tables, and the hard fork ledger state will propagate the mk through the telescope, therefore having an mk of the particular state in the Telescope.
The LedgerTables can live on their own, which for unitary blocks doesn't make a difference, but for the Cardano Block, we go from an mk passed to the Telescope (therefore tables at the tip of the Telescope) to CardanoLedgerTables, in which each value is a HardForkTxOut. This cost is non-trivial and we only want to pay it when applying a new block/transaction.
LedgerTables can be extracted and injected into the ledger state via (un)stowLedgerTables.
The ledger tables of the Extended ledger state are the same as the ones from the LedgerState.
A very important bit that maybe was not clear above is that the HardForkBlock has no canonical tables, because our definitions are not compositional for the HF block. Only the CardanoBlock has "hard fork tables". See the constraints of HasHardForkLedgerTables.
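
As a rough illustration of the map kinds listed above, here is a minimal, self-contained sketch. It is not the code in Ouroboros.Consensus.Ledger.Tables.*: the constructor shapes, the single Delta per key (instead of a sequence of changes), the Int/String stand-ins for TxIn/TxOut, and the helpers restrict, applyDiff, forgetTables and withTables are all assumptions made for the example.

```haskell
import           Data.Map.Strict (Map)
import qualified Data.Map.Strict as Map
import           Data.Set (Set)
import qualified Data.Set as Set

-- Hypothetical stand-ins for the real key and value types.
type TxIn  = Int
type TxOut = String

-- | A change to one key. UTxO entries are only ever created or destroyed.
-- (Simplification: one change per key rather than a sequence of changes.)
data Delta v = Insert v | Delete
  deriving (Eq, Show)

-- Toy map kinds: each one is a type parametrised by a key and a value type.
data    EmptyMK    k v = EmptyMK                      deriving (Eq, Show)
newtype KeysMK     k v = KeysMK (Set k)               deriving (Eq, Show)
newtype ValuesMK   k v = ValuesMK (Map k v)           deriving (Eq, Show)
newtype DiffMK     k v = DiffMK (Map k (Delta v))     deriving (Eq, Show)
data    TrackingMK k v = TrackingMK (Map k v) (Map k (Delta v))
  deriving (Eq, Show)

-- | A toy ledger state parametrised by the map kind of its tables.
data ToyLedgerState mk = ToyLedgerState
  { toyTipSlot :: Int
  , toyUtxo    :: mk TxIn TxOut
  }

-- | Drop the tables, keeping the rest of the state.
forgetTables :: ToyLedgerState mk -> ToyLedgerState EmptyMK
forgetTables st = st { toyUtxo = EmptyMK }

-- | Attach tables to a state that carries none.
withTables :: ToyLedgerState EmptyMK -> mk TxIn TxOut -> ToyLedgerState mk
withTables st tbs = st { toyUtxo = tbs }

-- | Restrict a UTxO set to the requested keys.
restrict :: Ord k => ValuesMK k v -> KeysMK k v -> ValuesMK k v
restrict (ValuesMK vs) (KeysMK ks) = ValuesMK (Map.restrictKeys vs ks)

-- | Apply creations and deletions to a set of values.
applyDiff :: Ord k => ValuesMK k v -> DiffMK k v -> ValuesMK k v
applyDiff (ValuesMK vs) (DiffMK ds) = ValuesMK (Map.foldlWithKey' step vs ds)
  where
    step acc k (Insert v) = Map.insert k v acc
    step acc k Delete     = Map.delete k acc

-- | An example key set, as a block or transaction might request.
exampleKeys :: KeysMK TxIn TxOut
exampleKeys = KeysMK (Set.fromList [1, 3])
```

The point of the sketch is only that the same state type can carry no tables, keys, values or diffs depending on mk, so most code can work on ToyLedgerState EmptyMK and touch values only where it needs them.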

Applying and ticking (Ouroboros.Consensus.Ledger.Abstract/Basics)

When ticking a block, some differences might be created, and no values are needed. So the types go from l EmptyMK to Ticked1 l DiffMK. This happens in at least two cases: when going from Byron to Shelley (all values are created here) and when going from Shelley to Allegra (AVVM addresses are deleted). See the relevant functions: translateLedgerStateByronToShelley and translateLedgerStateShelleyToAllegra.

When applying a block, we get the inputs needed (getBlockKeySets then read those from the LedgerDB), tick the ledger state without tables (possibly creating diffs), apply those diffs on the values from the LedgerDB, then call the ledger rules. We then diff the input and output tables to get a set of differences from applying a block, to which we will prepend the ones from ticking. See applyBlockResult and the Shelley functions for applying blocks.
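
The block-application steps just described can be sketched on a toy model. This is only an illustration under stated assumptions: Block, rules, diffValues, prependDiff and applyBlockSketch are hypothetical names, the LedgerDB read is modelled as a plain function, and the "ledger rules" merely check that the spent inputs exist.

```haskell
import           Data.Map.Strict (Map)
import qualified Data.Map.Strict as Map
import           Data.Set (Set)
import qualified Data.Set as Set

type TxIn  = Int
type TxOut = String

data Delta = Insert TxOut | Delete deriving (Eq, Show)
type Values = Map TxIn TxOut
type Diff   = Map TxIn Delta

-- | Toy block: the inputs it spends and the outputs it creates.
data Block = Block { spends :: Set TxIn, creates :: Map TxIn TxOut }

-- | Which keys must be read before the block can be applied.
getBlockKeys :: Block -> Set TxIn
getBlockKeys = spends

applyDiff :: Values -> Diff -> Values
applyDiff = Map.foldlWithKey' step
  where
    step acc k (Insert v) = Map.insert k v acc
    step acc k Delete     = Map.delete k acc

-- | Diff between input and output tables (UTxO values never change in place).
diffValues :: Values -> Values -> Diff
diffValues before after =
  Map.union (Map.map Insert (after Map.\\ before))
            (Map.map (const Delete) (before Map.\\ after))

-- | Combine diffs; for the same key the later change wins.
prependDiff :: Diff -> Diff -> Diff
prependDiff earlier later = Map.union later earlier

-- | Stand-in for the ledger rules: all spent inputs must be present.
rules :: Block -> Values -> Maybe Values
rules blk vs
  | spends blk `Set.isSubsetOf` Map.keysSet vs =
      Just (Map.union (creates blk) (Map.withoutKeys vs (spends blk)))
  | otherwise = Nothing

applyBlockSketch
  :: (Set TxIn -> Values)  -- ^ stand-in for reading from the LedgerDB
  -> Diff                  -- ^ differences produced by ticking
  -> Block
  -> Maybe Diff
applyBlockSketch readKeys tickDiff blk = do
  let fromDb = readKeys (getBlockKeys blk)   -- read the needed inputs
      ticked = applyDiff fromDb tickDiff     -- apply the ticking diffs
  after <- rules blk ticked                  -- call the "ledger rules"
  pure (prependDiff tickDiff (diffValues ticked after))
```

The result is a single set of differences covering both ticking and block application, which is what would then be pushed to the LedgerDB.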

The story with transactions is pretty similar.

The LedgerDB versions (Ouroboros.Consensus.Storage.LedgerDB)

There are two flavors of the LedgerDB, each one having two implementations:

V1 (Ouroboros.Consensus.Storage.LedgerDB.V1): we keep a sequence of EmptyMK ledger states and dump the values into a BackingStore. We can get back values from the backing store at any ledger state, by opening a BackingStoreValueHandle and reading from it. The BackingStore consists of a "complete" UTxO set at some anchor and then a sequence of differences. To get values at a given point we have to read the anchor, then reapply the differences up to the desired point. This is "wasteful" if done in memory (why keep diffs and have to reapply them every time if we can just apply them in place?) but it is useful on the on-disk implementation which puts the "complete" UTxO set on the disk, offloading it from memory. There are two implementations:
- OnDisk: It uses LMDB underneath. See the Ouroboros.Consensus.Storage.LedgerDB.V1.BackingStore.Impl.LMDB.* modules.
- InMemory: Not intended for real use. As mentioned above it is wasteful. It serves as a reference impl for the OnDisk implementation.
V2 (Ouroboros.Consensus.Storage.LedgerDB.V2): We keep a sequence of StateRefs, which are EmptyMK ledger states together with a tables handle from which we can read values monadically. This is very similar to the previous LedgerDB, in which we kept a sequence of (complete) LedgerStates. There are two implementations:
- InMemory
- LSM: still a WIP
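
To make the V1 read path concrete, here is a minimal in-memory sketch of "anchor plus differences". It is an assumption-laden toy, not the BackingStore API: BackingStoreSketch, readAt and flushUpTo are hypothetical names, slots are plain Ints, and differences are stored oldest first.

```haskell
import           Data.Map.Strict (Map)
import qualified Data.Map.Strict as Map
import qualified Data.Set as Set

type Slot  = Int
type TxIn  = Int
type TxOut = String

data Delta = Insert TxOut | Delete deriving (Eq, Show)
type Values = Map TxIn TxOut
type Diff   = Map TxIn Delta

-- | A "complete" UTxO set at the anchor plus later differences, oldest first.
data BackingStoreSketch = BackingStoreSketch
  { anchorValues :: Values
  , pendingDiffs :: [(Slot, Diff)]
  }

applyDiff :: Values -> Diff -> Values
applyDiff = Map.foldlWithKey' step
  where
    step acc k (Insert v) = Map.insert k v acc
    step acc k Delete     = Map.delete k acc

-- | Values for the given keys as of the given slot: start from the anchor
-- and reapply every difference up to (and including) that slot.
readAt :: Slot -> [TxIn] -> BackingStoreSketch -> Values
readAt slot ks bs =
  Map.restrictKeys (foldl applyDiff (anchorValues bs) wanted) (Set.fromList ks)
  where
    wanted = [d | (s, d) <- pendingDiffs bs, s <= slot]

-- | Advance the anchor by applying in place all differences up to the slot.
flushUpTo :: Slot -> BackingStoreSketch -> BackingStoreSketch
flushUpTo slot bs = bs
  { anchorValues = foldl applyDiff (anchorValues bs) (map snd old)
  , pendingDiffs = new
  }
  where
    (old, new) = span ((<= slot) . fst) (pendingDiffs bs)
```

Reading after a flush gives the same answer for any slot at or after the new anchor, which is why flushing is safe once a point is immutable.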

Evaluating forks

To evaluate forks, we introduced the concept of Forkers, which each LedgerDB implementation realizes in its own way. A Forker is just an abstract interface for querying values and pushing differences, which can eventually be dumped back into the LedgerDB (only ChainSelection does this; other clients use ReadOnlyForkers). Note that Forkers allocate resources, so there is some juggling with ResourceRegistries there.

Ledger queries (Ouroboros.Consensus.Ledger.Query)

Some queries will have to look at the UTxO set, in particular GetUtxoByAddress, GetUtxoWhole and GetUtxoByTxIn. We categorize them by means of QueryFootprint, and we process each category differently.

Other queries use QFNoTables; GetUtxoByTxIn uses QFLookupTables and will have to read a single value from the tables; GetUtxoWhole and GetUtxoByAddress use QFTraverseTables as they will have to scan the whole UTxO set.

For the HardForkBlock there is another class Ouroboros.Consensus.HardFork.Combinator.Ledger.Query.BlockSupportsHFLedgerQuery which has faster implementations than projecting the tables into the particular tip of the Telescope, because we can usually judge whether we want the result without upgrading the TxOut to the latest era.

In essence, queries are now monadic. Queries that don't look at the UTxO set are artificially monadic (just a pure of the already existing logic).
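
A minimal sketch of how a footprint index can drive query answering follows. The footprint constructor names come from the description above; everything else (the toy Query GADT, the GetTipSlot query, TablesHandle, answer and pureHandle) is hypothetical and much simpler than the real Ouroboros.Consensus.Ledger.Query.

```haskell
{-# LANGUAGE DataKinds      #-}
{-# LANGUAGE GADTs          #-}
{-# LANGUAGE KindSignatures #-}

import           Data.Functor.Identity (Identity (..))
import           Data.Map.Strict (Map)
import qualified Data.Map.Strict as Map
import           Data.Set (Set)
import qualified Data.Set as Set

type TxIn  = Int
type Addr  = String
type TxOut = (Addr, Int)

data QueryFootprint = QFNoTables | QFLookupTables | QFTraverseTables

-- | Toy queries, indexed by how much of the tables they need.
data Query (fp :: QueryFootprint) result where
  GetTipSlot       :: Query 'QFNoTables Int               -- hypothetical query
  GetUtxoByTxIn    :: Set TxIn -> Query 'QFLookupTables (Map TxIn TxOut)
  GetUtxoByAddress :: Addr -> Query 'QFTraverseTables (Map TxIn TxOut)
  GetUtxoWhole     :: Query 'QFTraverseTables (Map TxIn TxOut)

-- | Stand-in for a read-only forker: monadic access to the tables.
data TablesHandle m = TablesHandle
  { handleTip :: Int
  , readKeys  :: Set TxIn -> m (Map TxIn TxOut)
  , readAll   :: m (Map TxIn TxOut)
  }

-- | Every query is answered monadically; the table-free one is just 'pure'.
answer :: Monad m => TablesHandle m -> Query fp result -> m result
answer h q = case q of
  GetTipSlot         -> pure (handleTip h)
  GetUtxoByTxIn ks   -> readKeys h ks
  GetUtxoByAddress a -> Map.filter ((== a) . fst) <$> readAll h
  GetUtxoWhole       -> readAll h

-- | A pure handle over an in-memory map, for experimenting.
pureHandle :: Int -> Map TxIn TxOut -> TablesHandle Identity
pureHandle slot utxo = TablesHandle
  { handleTip = slot
  , readKeys  = \ks -> Identity (Map.filterWithKey (\k _ -> k `Set.member` ks) utxo)
  , readAll   = Identity utxo
  }
```

With a handle in IO instead of Identity, the same answer function would perform the reads against the LedgerDB.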

The mempool

The mempool in essence will have to acquire (read only) forkers on the LedgerDB at the tip, then read values for the incoming transactions and apply them. The returned diffs are appended to the ones in the mempool, which keeps a TrackingMK with the current values and past diffs.

When revalidating transactions we cannot know if the UTxO set changed so we will have to re-read the values from the (new) forker.

The internal state is now a TMVar because we need to acquire >> read tables >> update where read tables is in IO and the others are in STM.
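
The acquire >> read tables >> update pattern can be sketched as below. This is only a toy under stated assumptions: Tx, MempoolState and tryAddTx are hypothetical, the forker read is modelled as an IO function, and validation just checks that the spent inputs exist after applying the mempool's own differences.

```haskell
import           Control.Concurrent.STM
import           Control.Exception (onException)
import           Data.Map.Strict (Map)
import qualified Data.Map.Strict as Map
import           Data.Set (Set)
import qualified Data.Set as Set

type TxIn  = Int
type TxOut = String

data Delta = Insert TxOut | Delete
type Diff  = Map TxIn Delta

data Tx = Tx { txSpends :: Set TxIn, txCreates :: Map TxIn TxOut }

data MempoolState = MempoolState
  { poolTxs  :: [Tx]   -- ^ accepted transactions, newest first
  , poolDiff :: Diff   -- ^ differences accumulated by those transactions
  }

applyDiff :: Map TxIn TxOut -> Diff -> Map TxIn TxOut
applyDiff = Map.foldlWithKey' step
  where
    step acc k (Insert v) = Map.insert k v acc
    step acc k Delete     = Map.delete k acc

tryAddTx
  :: TMVar MempoolState
  -> (Set TxIn -> IO (Map TxIn TxOut))  -- ^ stand-in for reading via a forker
  -> Tx
  -> IO Bool
tryAddTx var readTables tx = do
  -- acquire (STM): taking the TMVar blocks other writers
  st <- atomically (takeTMVar var)
  -- read tables (IO): restore the state if the read throws
  fromDb <- readTables (txSpends tx) `onException` atomically (putTMVar var st)
  let available = applyDiff fromDb (poolDiff st)
      ok        = txSpends tx `Set.isSubsetOf` Map.keysSet available
      txDiff    = Map.union (Map.map Insert (txCreates tx))
                            (Map.fromSet (const Delete) (txSpends tx))
      st'       = if ok
                    then st { poolTxs  = tx : poolTxs st
                            , poolDiff = Map.union txDiff (poolDiff st) }
                    else st
  -- update (STM): put the new state back, releasing the TMVar
  atomically (putTMVar var st')
  pure ok
```

A TVar would not work here because the IO read sits between the two STM steps; the empty TMVar acts as a lock while the read is in flight. The sketch does not mask asynchronous exceptions, which a real implementation would need to consider.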

The snapshots

We now store snapshots in a new format:

V1-OnDisk: a copy of the lmdb database and a (Haskell-CBOR) serialization of the LedgerState.
V*-InMemory: a (Haskell-CBOR) serialization of the UTxO set and a (Haskell-CBOR) serialization of the LedgerState.

Note that for V2 we can take snapshots of the immutable tip at any time, but for V1 we first have to flush some differences from the BackingStore into the anchor to advance it to the immutable tip.

This is abstracted by either implementation in Ouroboros.Consensus.Storage.LedgerDB.V*...tryTakeSnapshot

The forging loop

The forging loop didn't change much. Each iteration runs with a resource registry (to allocate the forkers). Then we use the forker to provide values for the mempool snapshot acquisition, in case of a revalidation.

Changes in Byron/Shelley/Cardano

The changes here are mostly fulfilling everything that was described above, to make all the types match. There are some specific things which are interesting to look at because they might be non-trivial:

Translation functions (with the two examples I already mentioned)
The TxIn|TxOut data instances, the LedgerState data instance and the HasLedgerTables instances
applyBlock for Shelley. The Cardano one is just the HFC one, which injects the CardanoTables into the tip of the Telescope (here is where we do the costly step, but it usually won't be that costly because the UTxO set for a block is small).
The Cardano.Ledger module which defines the CardanoTxIn and CardanoTxOut.

Other changes

The rest of the changes are mainly just following GHC, adjusting the types here and there. Most other code doesn't use tables so an abstract mk or EmptyMK is used to make the kind well-formed.

jasagredo

I did a pass over the non-testing libraries.

ouroboros-consensus/src/ouroboros-consensus/Ouroboros/Consensus/Mempool/Query.hs

...boros-consensus/src/ouroboros-consensus/Ouroboros/Consensus/Storage/ChainDB/Impl/ChainSel.hs

ouroboros-consensus/src/ouroboros-consensus/Ouroboros/Consensus/Storage/LedgerDB/API.hs

...os-consensus/src/ouroboros-consensus/Ouroboros/Consensus/Storage/LedgerDB/V1/BackingStore.hs

...onsensus/src/ouroboros-consensus/Ouroboros/Consensus/Storage/LedgerDB/V1/BackingStore/API.hs

...ros-consensus/src/ouroboros-consensus/Ouroboros/Consensus/Storage/LedgerDB/V1/DbChangelog.hs

...oros-consensus-diffusion/src/ouroboros-consensus-diffusion/Ouroboros/Consensus/NodeKernel.hs

ouroboros-consensus-cardano/src/shelley/Ouroboros/Consensus/Shelley/ShelleyHFC.hs

nfrisby

This is the result of my first pass on the Ouroboros.Consensus.Ledger.Tables.* modules.

ouroboros-consensus/src/ouroboros-consensus/Ouroboros/Consensus/Ledger/Tables.hs

ouroboros-consensus/src/ouroboros-consensus/Ouroboros/Consensus/Ledger/Tables/Utils.hs

ouroboros-consensus/src/ouroboros-consensus/Ouroboros/Consensus/Ledger/Tables/MapKind.hs

nfrisby

Another round of comments. This is all of the *Hard* files, except for Query.hs.

...consensus-cardano/src/ouroboros-consensus-cardano/Ouroboros/Consensus/Cardano/CanHardFork.hs

nfrisby · 2024-10-29T17:14:29Z

...consensus-cardano/src/ouroboros-consensus-cardano/Ouroboros/Consensus/Cardano/CanHardFork.hs

@@ -753,9 +899,14 @@ translateLedgerStateBabbageToConwayWrapper =
            -- we monkey-patch the governance state by ticking across the
            -- era/epoch boundary using Babbage logic, and set the governance
            -- state to the updated one /before/ translating.
+            --
+            -- NOTE we are ignoring the differences created by the ticking that


Maybe write "ignoring the DiffMK" instead of "ignoring the differences" to avoid alarm/confusion.

This code is luckily gone in #1297, so rebasing will resolve this.

Rewrote it as Nick suggests, will deal with it when I do a rebase.

I'm leaving this thread open to see it again in the future.

...-consensus-cardano/src/unstable-cardano-testlib/Test/ThreadNet/Infra/ShelleyBasedHardFork.hs

ouroboros-consensus-diffusion/test/consensus-test/Test/Consensus/HardFork/Combinator.hs

ouroboros-consensus-diffusion/test/consensus-test/Test/Consensus/HardFork/Combinator/A.hs

...boros-consensus/src/ouroboros-consensus/Ouroboros/Consensus/HardFork/Combinator/InjectTxs.hs

nfrisby · 2024-10-29T19:29:21Z

...boros-consensus/src/ouroboros-consensus/Ouroboros/Consensus/HardFork/Combinator/InjectTxs.hs

+type TelescopeWithTxList g f tx xs' xs  =
+   Telescope g (Product (ListOfTxs tx xs') f) xs
+
+matchPolyTxs' ::


This implementation allocates a list of txs for each previous era.

I wonder if an approach similar to composeTxOutTranslations $ ipTranslateTxOut hardForkEraTranslation from Combinator/Ledger.hs might make the logic easier to follow and also avoid allocating extraneous lists.

Perhaps let's chat about this on a call, I'm not sure what you mean here.

...ros-consensus/src/ouroboros-consensus/Ouroboros/Consensus/HardFork/Combinator/State/Types.hs

nfrisby · 2024-10-29T20:12:09Z

...ros-consensus/src/ouroboros-consensus/Ouroboros/Consensus/HardFork/Combinator/State/Types.hs

+-- translated to newer eras. This function fills that hole and allows us to
+-- promote tables from one era into tables from the next era.
+--
+-- TODO(jdral): this is not optimal. If either 'translateTxInWith' or


reifying a TODO about skipping traversals if the key/value translations are id

Might also consider combining the two traversals into a single traversal. And maybe also something about monotonicity.

ouroboros-consensus/src/ouroboros-consensus/Ouroboros/Consensus/Ledger/Tables.hs

nfrisby

All *Query*hs files, except:

ouroboros-consensus/src/ouroboros-consensus/Ouroboros/Consensus/Storage/ChainDB/Impl/Query.hs
ouroboros-consensus/test/consensus-test/Test/Consensus/MiniProtocol/LocalStateQuery/Server.hs
ouroboros-consensus/src/ouroboros-consensus/Ouroboros/Consensus/Mempool/Query.hs

...ros-consensus-cardano/src/ouroboros-consensus-cardano/Ouroboros/Consensus/Cardano/QueryHF.hs

ouroboros-consensus-cardano/src/shelley/Ouroboros/Consensus/Shelley/Ledger/Query.hs

nfrisby · 2024-11-05T00:45:18Z

...os-consensus/src/ouroboros-consensus/Ouroboros/Consensus/HardFork/Combinator/Ledger/Query.hs

+    where
+      lcfg = configLedger cfg
+
+  answerBlockQueryTraverse


Some suspicious duplication with answerBlockQueryLookup right above.

Yes, these are pretty similar but the footprint is concretized on each. I tried to unify them but ran into extremely cryptic GHC errors. I will leave this open to revisit but maybe we have to live with this.

answerMonadicQueryVia ::
     forall m xs footprint result.
     ( MonadSTM m
     , All SingleEraBlock xs
     , HardForkHasLedgerTables xs
     , BlockSupportsHFLedgerQuery xs
     , CanHardFork xs
     )
  => (   forall result'.
         NP ExtLedgerCfg xs
      -> QueryIfCurrent xs footprint result'
      -> ReadOnlyForker' m (HardForkBlock xs)
      -> m result'
     )
  -> ExtLedgerCfg (HardForkBlock xs)
  -> BlockQuery (HardForkBlock xs) footprint result
  -> ReadOnlyForker' m (HardForkBlock xs)
  -> m result
answerMonadicQueryVia answerVia (ExtLedgerCfg cfg) qry forker = do
    st@(HardForkLedgerState hardForkState) <- ledgerState <$> atomically (roforkerGetLedgerState forker)
    let ei   = State.epochInfoLedger lcfg hardForkState
        cfgs = hmap ExtLedgerCfg $ distribTopLevelConfig ei cfg
    case qry of
      QueryIfCurrent (queryIfCurrent :: QueryIfCurrent xs footprint result') ->
        answerVia cfgs queryIfCurrent forker
      -- We only call this with QFLookupTables or QFTraverseTables, so these
      -- two matches below are effectively dead.
      QueryAnytime queryAnytime (EraIndex era) ->
        pure $ interpretQueryAnytime lcfg queryAnytime (EraIndex era) hardForkState
      QueryHardFork queryHardFork ->
        pure $ interpretQueryHardFork lcfg queryHardFork st
  where
    lcfg = configLedger cfg

src\ouroboros-consensus\Ouroboros\Consensus\HardFork\Combinator\Ledger\Query.hs:262:17: error: [GHC-25897]
    • Couldn't match type ‘result1’
                     with ‘Either (MismatchEraInfo xs) result1’
      Expected: QueryIfCurrent xs footprint result
        Actual: QueryIfCurrent xs footprint result1
      ‘result1’ is a rigid type variable bound by
        a pattern with constructor:
          QueryIfCurrent :: forall (xs :: [*]) (footprint :: QueryFootprint) result.
                            QueryIfCurrent xs footprint result
                            -> BlockQuery (HardForkBlock xs) footprint (HardForkQueryResult xs result),
        in a case alternative
        at src\ouroboros-consensus\Ouroboros\Consensus\HardFork\Combinator\Ledger\Query.hs:259:9-78
    • In the second argument of ‘answerVia’, namely ‘queryIfCurrent’
      In the expression: answerVia cfgs queryIfCurrent forker
      In a case alternative:
          QueryIfCurrent (queryIfCurrent :: QueryIfCurrent xs footprint result')
            -> answerVia cfgs queryIfCurrent forker
    • Relevant bindings include
        queryIfCurrent :: QueryIfCurrent xs footprint result1
          (bound at src\ouroboros-consensus\Ouroboros\Consensus\HardFork\Combinator\Ledger\Query.hs:259:25)
        cfgs :: NP ExtLedgerCfg xs
          (bound at src\ouroboros-consensus\Ouroboros\Consensus\HardFork\Combinator\Ledger\Query.hs:257:11)
        hardForkState :: State.HardForkState (Flip LedgerState EmptyMK) xs
          (bound at src\ouroboros-consensus\Ouroboros\Consensus\HardFork\Combinator\Ledger\Query.hs:255:31)
        st :: LedgerState (HardForkBlock xs) EmptyMK
          (bound at src\ouroboros-consensus\Ouroboros\Consensus\HardFork\Combinator\Ledger\Query.hs:255:7)
        lcfg :: LedgerConfig (HardForkBlock xs)
          (bound at src\ouroboros-consensus\Ouroboros\Consensus\HardFork\Combinator\Ledger\Query.hs:278:7)
        forker :: ReadOnlyForker' m (HardForkBlock xs)
          (bound at src\ouroboros-consensus\Ouroboros\Consensus\HardFork\Combinator\Ledger\Query.hs:254:5)
        qry :: BlockQuery (HardForkBlock xs) footprint result
          (bound at src\ouroboros-consensus\Ouroboros\Consensus\HardFork\Combinator\Ledger\Query.hs:253:5)
        (Some bindings suppressed; use -fmax-relevant-binds=N or -fno-max-relevant-binds)
    |
262 |                 queryIfCurrent
    |                 ^^^^^^^^^^^^^^
Failed, 215 modules loaded.

Perhaps I just need to stare at it a bit more...

...os-consensus/src/ouroboros-consensus/Ouroboros/Consensus/HardFork/Combinator/Ledger/Query.hs

ouroboros-consensus/src/ouroboros-consensus/Ouroboros/Consensus/Ledger/Query.hs

nfrisby

*LedgerDB* files, except I stopped when I got to the LMDB impl. I'll pick up there tomorrow.

ouroboros-consensus/src/ouroboros-consensus/Ouroboros/Consensus/Storage/LedgerDB/API.hs

ouroboros-consensus/src/ouroboros-consensus/Ouroboros/Consensus/Storage/LedgerDB/V1/Args.hs

...onsensus/src/ouroboros-consensus/Ouroboros/Consensus/Storage/LedgerDB/V1/BackingStore/API.hs

...rc/ouroboros-consensus/Ouroboros/Consensus/Storage/LedgerDB/V1/BackingStore/Impl/InMemory.hs

nfrisby

My previous review was the *LedgerDB* files up to but excluding LMDB.

This review picks up there and stops before V2.

...us/src/ouroboros-consensus/Ouroboros/Consensus/Storage/LedgerDB/V1/BackingStore/Impl/LMDB.hs

ouroboros-consensus/src/ouroboros-consensus/Ouroboros/Consensus/Storage/LedgerDB/V1/Lock.hs

...boros-consensus/src/ouroboros-consensus/Ouroboros/Consensus/Storage/LedgerDB/V1/Snapshots.hs

nfrisby

These are the LedgerDB*V2 files.

ouroboros-consensus/src/ouroboros-consensus/Ouroboros/Consensus/Storage/LedgerDB/V2/Common.hs

...boros-consensus/src/ouroboros-consensus/Ouroboros/Consensus/Storage/LedgerDB/V2/LedgerSeq.hs

ouroboros-consensus/src/ouroboros-consensus/Ouroboros/Consensus/Storage/LedgerDB/V2/Init.hs

nfrisby · 2024-11-06T18:42:58Z

ouroboros-consensus/src/ouroboros-consensus/Ouroboros/Consensus/Storage/LedgerDB/V2/Init.hs

+        ) . mkFsPath . (:[])) dirs
+
+-- | Testing only! Truncate all snapshots in the DB.
+implIntTruncateSnapshots :: MonadThrow m => SomeHasFS m -> m ()


Could share a control HOF with destroySnapshots.

nfrisby · 2024-11-06T18:45:30Z

ouroboros-consensus/src/ouroboros-consensus/Ouroboros/Consensus/Storage/LedgerDB/V2/Init.hs

+  -> HandleArgs
+  -> (LedgerDB m l blk, TestInternals m l blk)
+implMkLedgerDb h bss = (LedgerDB {
+      getVolatileTip            = getEnvSTM  h implGetVolatileTip


Some of these impl* functions share a worrying amount of code with the V1 impl.

Yes, but they use different types. Perhaps we should hide this in some typeclass? We will eventually delete V1 so I'm unsure how worth this is.

ouroboros-consensus/src/ouroboros-consensus/Ouroboros/Consensus/Storage/LedgerDB/V2/LSM.hs

nfrisby

These are the LedgerDB files after V2, i.e. the tests.

nfrisby · 2024-11-06T18:48:56Z

ouroboros-consensus/test/storage-test/Test/Ouroboros/Storage/LedgerDB.hs

+        , DbChangelog.Unit.tests
+        , DbChangelog.QuickCheck.tests
+    ]
+    , SnapshotPolicy.tests


Which of these tests V2?

ouroboros-consensus/test/storage-test/Test/Ouroboros/Storage/LedgerDB/StateMachine/TestBlock.hs

ouroboros-consensus/test/storage-test/Test/Ouroboros/Storage/LedgerDB/StateMachine.hs

nfrisby · 2024-11-06T20:24:46Z