Investigate unrealistic base cost of `DataReceiptCreationConfig` #4482

Longarithm · 2021-07-08T19:21:24Z

In recent runtime-params-estimator launches, DataReceiptCreationConfig base cost is 50 TGas, which is unrealistically high comparing to current cost of 4.5 TGas.

Examples:
#4231
#4455

data_receipt_creation_config: DataReceiptCreationConfig {
  base_cost: Fee {
      send_sir: 50745691155500,
      send_not_sir: 50745691155500,
      execution: 50745691155500,
  }
  ...
}

cc @bowenwang1996 @olonho

The text was updated successfully, but these errors were encountered:

bowenwang1996 · 2021-07-08T23:24:57Z

To be clear, the current cost of 4.5Tgas is also unrealistically high. See #3279 for a discussion on this.

Longarithm · 2021-07-14T23:35:58Z

From the first sight, difference is explained by taking IO costs into account:
#3771 (comment) - 50 TGas
#3771 (comment) - 300 Ggas
My local computation in docker confirm this.

But there is conflicting info here:
#3771 (comment) - 40 Ggas in both cases.
Need to investigate further

cc #4474 @matklad

Longarithm · 2021-07-27T15:58:36Z

Data receipt fee estimation depends on its place in the vector of metrics: https://github.com/near/nearcore/blob/master/runtime/runtime-params-estimator/src/cases.rs#L536

Consider an example:

    let v = calls_helper! {
        data_receipt_base_10b_1000_1 => data_receipt_base_10b_1000,
        data_receipt_10b_1000_1 => data_receipt_10b_1000,
        data_receipt_100kib_1000_1 => data_receipt_100kib_1000
        cpu_ram_soak_test => cpu_ram_soak_test,
        base_1M => base_1M,
        read_memory_10b_10k => read_memory_10b_10k,
        ...
        data_receipt_base_10b_1000_2 => data_receipt_base_10b_1000,
        data_receipt_10b_1000_2 => data_receipt_10b_1000,
        data_receipt_100kib_1000_2 => data_receipt_100kib_1000
    };

Computation of DataReceipt fees based on data_receipt_base_10b_1000_1, data_receipt_10b_1000_1, data_receipt_100kib_1000_1 yields 30 Ggas; same computation for data_receipt_*_2 fees yields 50 TGas.

Also, if we cleanup data by re-creating RuntimeTestbed before computation of data_receipt_*_2 fees, we get 30 Ggas again.

Supposed explanation:

initially storage is in optimal state A
we compute data_receipt_*_1 fees and get 30 Ggas
computation of some metric M moves storage to suboptimal state B
we compute data_receipt_*_2 fees and get 50 Tgas

I currently suspect M = storage_write_10kib_key_10b_value_1k.

It makes sense to use separate RuntimeTestbed for each computation. But anyway we have to separate the following two cases:

on mainnet, storage never comes to state B, thus 30 Ggas is the correct realistic price
on mainnet, there is a way to put storage to state B, and then we risk to have x1000 underestimated DataReceipt prices for unknown period of time.

To do so, I plan to understand more deeply what exactly causes the difference by profiling tools.

cc @matklad @olonho @nearmax

bowenwang1996 · 2021-07-28T01:05:05Z

@Longarithm is it correct that data_receipt_creation_cost measures the cost of one storage write? If not, what exactly does it measure?

Longarithm · 2021-07-28T11:45:43Z

There is also a curious dependency between fees and accounts number.
In the notation of #4482 (comment):

+----------+----------------------+----------------------+
| Accounts | data_receipt_*_1 fee | data_receipt_*_2 fee |
+----------+----------------------+----------------------+
| 10K      | 37 Ggas              | 34 Ggas              |
| 20K      | 35 Ggas              | 50 Tgas              |
| 50K      | 40 Ggas              | 120 Tgas             |
| 100K     | 37 Ggas              | 258 Tgas             |
+----------+----------------------+----------------------+

UPD: added row for 100K accounts

Longarithm · 2021-07-28T14:54:29Z

@bowenwang1996
Base cost measures cost of DataReceipt creation and processing: https://github.com/near/nearcore/blob/master/core/primitives-core/src/runtime/fees.rs#L71-L85
Strictly speaking, it measures cost of:

set, get and remove for ReceiptData, e.g.

nearcore/core/store/src/lib.rs

Line 341 in 2c4f6dc

pub fn set_received_data(
set, get and remove for PostponedReceiptId, e.g.

nearcore/runtime/runtime/src/lib.rs

Line 925 in 28d9b7d

set(
update of pending data count

nearcore/runtime/runtime/src/lib.rs

Line 898 in 28d9b7d

set(

Though I'm not entirely sure that all these operations are called ~1000 times, as we expect in the measurement.
It's possible that 1000 DataReceipts are processed before processing ActionReceipt which collects them, and in such case we don't save postponed receipt ids:

nearcore/runtime/runtime/src/lib.rs

Line 914 in 28d9b7d

ReceiptEnum::Action(ref action_receipt) => {

I need to double-check this. cc @olonho @matklad

Note that we don't consider bytes cost here, in which I see no discrepancies.

bowenwang1996 · 2021-07-28T18:23:46Z

@Longarithm is the base cost calculated based on some trivial data or through some statistical method? It looks like this involves at most 4 storage operations and it should not be more expensive than 4 storage writes of [some trivial data]. Also it seems to me that this fee depends on the shape of the trie, but we also have a separate touching_trie_node fee that accounts for this.

MaksymZavershynskyi · 2021-07-28T19:22:03Z

This explains lots of discrepancies that we are observing: https://near.zulipchat.com/#narrow/stream/295306-dev-contract-runtime/topic/fees.20.26.20state.20size/near/247505050

**Idea** We currently reuse the same `RuntimeTestbed` for computing metrics in `calls_helper`. Presumably it saved some time (actually not a lot), and allowed us to skip initialization of testbed. But this led to issues with fees computation: #4482 (comment) Explanation: https://near.zulipchat.com/#narrow/stream/295306-dev-contract-runtime/topic/fees.20.26.20state.20size/near/248169740 So we need to create separate testbeds with different `/tmp/data` folders. **Testing** Check for discrepancies in resulting `RuntimeConfig`s

Longarithm · 2021-08-13T11:44:18Z

Closing, because we have a decent explanation for the issue, and it was fixed in #4647.

@matklad

Stabilize features lowering costs for new release: * #4795 * #4865 Quality control: * We run param estimator several times and got consistent results: * beginning of Sep 2021, my GCP instance https://hackmd.io/w6ODyKjUReuuofXTuqdyFQ * end of Sep 2021, @matklad instance #4778 (comment) * The current fee values are explained: * Data receipt costs - investigated here #4482, the reason was relatively explained and the observed issue was fixed. Note that we don't know the exact root cause, but we assume that it is related to a separate fee (touching_trie_node) for taking store size into account. This fee is problematic because it assumes the constant height of a trie, but we treat it as a separate problem. * Ecrecover cost - follow links from this one: #4778 (comment)

Longarithm added the A-contract-runtime Area: contract compilation and execution, virtual machines, etc label Jul 8, 2021

bowenwang1996 added the T-core Team: issues relevant to the core team label Jul 8, 2021

bowenwang1996 assigned Longarithm Jul 8, 2021

Longarithm mentioned this issue Aug 6, 2021

Create separate RuntimeTestbed for each metric #4647

Merged

Longarithm linked a pull request Aug 6, 2021 that will close this issue

Create separate RuntimeTestbed for each metric #4647

Merged

Longarithm mentioned this issue Aug 6, 2021

Check that RuntimeConfig fees are stable relative to accounts number #4492

Open

near-bulldozer bot closed this as completed in #4647 Aug 13, 2021

Longarithm mentioned this issue Oct 7, 2021

feat: stabilize features lowering costs #4948

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Investigate unrealistic base cost of `DataReceiptCreationConfig` #4482

Investigate unrealistic base cost of `DataReceiptCreationConfig` #4482

Longarithm commented Jul 8, 2021

bowenwang1996 commented Jul 8, 2021

Longarithm commented Jul 14, 2021 •

edited

Loading

Longarithm commented Jul 27, 2021

bowenwang1996 commented Jul 28, 2021

Longarithm commented Jul 28, 2021 •

edited

Loading

Longarithm commented Jul 28, 2021 •

edited

Loading

bowenwang1996 commented Jul 28, 2021

MaksymZavershynskyi commented Jul 28, 2021

Longarithm commented Aug 13, 2021

Investigate unrealistic base cost of DataReceiptCreationConfig #4482

Investigate unrealistic base cost of DataReceiptCreationConfig #4482

Comments

Longarithm commented Jul 8, 2021

bowenwang1996 commented Jul 8, 2021

Longarithm commented Jul 14, 2021 • edited Loading

Longarithm commented Jul 27, 2021

bowenwang1996 commented Jul 28, 2021

Longarithm commented Jul 28, 2021 • edited Loading

Longarithm commented Jul 28, 2021 • edited Loading

bowenwang1996 commented Jul 28, 2021

MaksymZavershynskyi commented Jul 28, 2021

Longarithm commented Aug 13, 2021

Investigate unrealistic base cost of `DataReceiptCreationConfig` #4482

Investigate unrealistic base cost of `DataReceiptCreationConfig` #4482

Longarithm commented Jul 14, 2021 •

edited

Loading

Longarithm commented Jul 28, 2021 •

edited

Loading

Longarithm commented Jul 28, 2021 •

edited

Loading