Account for full transaction encoding when batching messages #2575

mzabaluev · 2022-08-18T05:23:47Z

Simple, Cosmos-specific fix for #2560

Closes: #2560

Description

Use the estimated encoded transaction size to ensure that the transaction created from each message batch does not exceed max_tx_size.

PR author checklist:

Added changelog entry, using unclog.
Added tests: integration (for Hermes) or unit/mock tests (for modules).
Linked to GitHub issue.
Updated code comments and documentation (e.g., docs/).
Tagged one reviewer who will be the one responsible for shepherding this PR.

Reviewer checklist:

Reviewed Files changed in the GitHub PR explorer.
Manually tested (in case integration/unit/mock tests are absent).

mzabaluev · 2022-08-18T10:03:07Z

relayer/src/chain/cosmos/batch.rs

+    fn test_fixture() -> (TxConfig, KeyEntry, Account) {
+        todo!()
+    }


@soareschen Is there anything in the test framework to provide usable values here? Unfortunately, the unit tests become more complicated.

There could be a mock instance for TxConfig, if you don't call the API URLs. For KeyEntry you would have to construct a private key pair. For Account the main essential information is the account sequence, which could be set to something like 0 or 999999.

Good, I'm creating mock values (not pushed yet), but had doubts about all fields of TxConfig.

mzabaluev · 2022-08-18T10:04:36Z

relayer/src/chain/cosmos/batch.rs

+    config: &TxConfig,
    max_msg_num: MaxMsgNum,
    max_tx_size: MaxTxSize,
+    key_entry: &KeyEntry,
+    account: &Account,
+    tx_memo: &Memo,


This parameter lists is used repetitively, maybe there should be a struct to arrange some of them.

It was originally not repetitive, but you made them repetitive now. The point of explicit parameters is to make it obvious of cascading dependencies and help us avoid that.

Instead, why don't just pass tx_envelope_len as a parameter and call encoded_tx_len from the call site? It also looks like encoded_tx_len can be estimated once and applicable for all transactions, rather than recomputing it every time.

@soareschen The same parameters are used in other functions in this module, I did not want to break the pattern at this point. As you point out, this flattening was done in an earlier refactoring to reduce unnecessary data dependencies and useless mutability, but I think we should follow it by some creative parameter grouping where it makes sense, to avoid excessive repetition.

It also looks like encoded_tx_len can be estimated once and applicable for all transactions

I was not so sure, there may be some variation in how e.g. addresses are encoded, and there's also the memo field. There needs to be some context above the send_batched_messages_and_wait_* API to safely make this assumption.

The key challenge for using a common struct in this case is that the references are borrowed from different places. So if we define a struct like TxContext we would have to move or clone the values into it. And if we define a struct containing references like TxContext<'a>, then it does not really contribute much other than being a syntactic sugar. It would also risk overgeneralizing functions, especially when catering for those few places that requires a &mut Account.

That said, if we want to refactor the code, it would better to define a proper TxHandler struct that specializes in sending transactions, and decouple it from the ChainHandle implementation, which is the real source of this mess.

The address should all have the same length, and is the same within the same ChainHandle. Same for the memo field, which has the same value configured. The only thing that could potentially change length is the account sequence, unless it has a fixed length encoding like u64.

t would better to define a proper TxHandler struct that specializes in sending transactions, and decouple it from the ChainHandle implementation, which is the real source of this mess.

I agree, splitting the "god trait" into smaller traits is a good way to refactor.

Same for the memo field, which has the same value configured.

So there might be a case for a struct that contains these less-variable parameters.

The only thing that could potentially change length is the account sequence, unless it has a fixed length encoding like u64.

Yes, there is a number of these varint fields and length delimiters that can grow between one tx to another, so you either have to compute the length on the fully composed transaction as it would be sent, or put in some upper bound for the overhead, neither of which is an ideal approach.

The function evaluates the length of an encoded transaction, to perform more accurate batching.

Move utilities from chain::cosmos::batch to share them with other modules' unit tests.

In batch_messages, the varint encoding of the length of the tx_body field of TxRaw may increase the length of the envelope as the body size grows.

The tx_body member of TxRaw encodes more than the messages, so its total length needs to be taken into account when estimating the total transaction size from the previously computed envelope and the vector of messages.

This reverts commit 6a8e31f.

When the TxBody struct is completely empty (all fields have default values), it is encoded as an empty array and the body_bytes field is then also omitted from the encoding of TxRaw. Add transaction size verification to the unit tests of batch_messages, by creating encoded tx from the batched messages (using the maximum fee to keep unit tests simple) and checking the size corresponds to the `MaxTxSize` parameter.

Test larger numbers of messages, up to 100 (maximum allowed per MaxMsgNum limits), with length increasing in the arithmetic progression. For each round, make sure that the size limit is just equal to the size of an encoded tx with all messages except the last one. Verify that the last message is spilled into the next batch and the resulting tx size for the first batch equals the limit.

…lsystems#2575) * Remove a useless clone * De-clone estimate_tx_fees * cosmos: More de-owning around sign_and_encode_tx * Simplify prepending of ClientUpdate * cosmos: encoded_tx_len function The function evaluates the length of an encoded transaction, to perform more accurate batching. * Account for field encoding in tx batch size * Account for tx envelope in message batching * More credible test fixture for batch_messages * Fix calculations in batch_messages, adjust tests * Initialize KeyEntry data from a seed file * relayer: module chain::cosmos::test_utils Move utilities from chain::cosmos::batch to share them with other modules' unit tests. * Take tx_body varint lenth delimiter into account In batch_messages, the varint encoding of the length of the tx_body field of TxRaw may increase the length of the envelope as the body size grows. * Account for body encoding when batching messages The tx_body member of TxRaw encodes more than the messages, so its total length needs to be taken into account when estimating the total transaction size from the previously computed envelope and the vector of messages. * Document EncodedTxMetrics * Revert "relayer: module chain::cosmos::test_utils" This reverts commit 6a8e31f. * batch: fix size estimation for empty body When the TxBody struct is completely empty (all fields have default values), it is encoded as an empty array and the body_bytes field is then also omitted from the encoding of TxRaw. Add transaction size verification to the unit tests of batch_messages, by creating encoded tx from the batched messages (using the maximum fee to keep unit tests simple) and checking the size corresponds to the `MaxTxSize` parameter. * Improve batch_does_not_exceed_max_tx_size test Test larger numbers of messages, up to 100 (maximum allowed per MaxMsgNum limits), with length increasing in the arithmetic progression. For each round, make sure that the size limit is just equal to the size of an encoded tx with all messages except the last one. Verify that the last message is spilled into the next batch and the resulting tx size for the first batch equals the limit. * Remove an unnecessary branch * Add changelog entry Co-authored-by: Romain Ruetschi <romain@informal.systems>

mzabaluev requested a review from romac August 18, 2022 05:23

mzabaluev commented Aug 18, 2022

View reviewed changes

romac added this to the v1.1 milestone Aug 22, 2022

mzabaluev added 8 commits August 22, 2022 17:04

Remove a useless clone

cf012a9

De-clone estimate_tx_fees

081ef00

cosmos: More de-owning around sign_and_encode_tx

6c67f9f

Simplify prepending of ClientUpdate

87b95de

cosmos: encoded_tx_len function

886e983

The function evaluates the length of an encoded transaction, to perform more accurate batching.

Account for field encoding in tx batch size

13ce6a5

Account for tx envelope in message batching

f0e9c58

More credible test fixture for batch_messages

4f90ce3

mzabaluev force-pushed the mikhail/tx-size-fixes branch from 03b2fb0 to 4f90ce3 Compare August 22, 2022 14:10

seanchen1991 marked this pull request as ready for review August 22, 2022 15:38

mzabaluev added 9 commits August 22, 2022 20:43

Fix calculations in batch_messages, adjust tests

dde83df

Merge branch 'master' into mikhail/tx-size-fixes

368808d

Initialize KeyEntry data from a seed file

3f0e881

relayer: module chain::cosmos::test_utils

6a8e31f

Move utilities from chain::cosmos::batch to share them with other modules' unit tests.

Merge branch 'master' into mikhail/tx-size-fixes

464c7a6

Take tx_body varint lenth delimiter into account

c8cebc0

In batch_messages, the varint encoding of the length of the tx_body field of TxRaw may increase the length of the envelope as the body size grows.

Account for body encoding when batching messages

4a83ed5

The tx_body member of TxRaw encodes more than the messages, so its total length needs to be taken into account when estimating the total transaction size from the previously computed envelope and the vector of messages.

Document EncodedTxMetrics

be3e0ed

Revert "relayer: module chain::cosmos::test_utils"

8d62478

This reverts commit 6a8e31f.

mzabaluev marked this pull request as draft August 26, 2022 08:03

mzabaluev marked this pull request as ready for review August 26, 2022 21:28

mzabaluev and others added 4 commits August 29, 2022 00:10

Remove an unnecessary branch

1742569

Add changelog entry

3acf895

Merge branch 'master' into mikhail/tx-size-fixes

c6d7267

romac approved these changes Aug 30, 2022

View reviewed changes

romac merged commit a087c47 into master Aug 30, 2022

romac deleted the mikhail/tx-size-fixes branch August 30, 2022 08:30

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Account for full transaction encoding when batching messages #2575

Account for full transaction encoding when batching messages #2575

mzabaluev commented Aug 18, 2022 •

edited by romac

Loading

mzabaluev Aug 18, 2022

soareschen Aug 19, 2022

mzabaluev Aug 19, 2022

mzabaluev Aug 18, 2022

soareschen Aug 19, 2022

mzabaluev Aug 19, 2022

soareschen Aug 19, 2022

mzabaluev Aug 23, 2022

Account for full transaction encoding when batching messages #2575

Account for full transaction encoding when batching messages #2575

Conversation

mzabaluev commented Aug 18, 2022 • edited by romac Loading

Description

PR author checklist:

Reviewer checklist:

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

mzabaluev commented Aug 18, 2022 •

edited by romac

Loading