feat: VM Execution Lanes #10551

vyzo · 2023-03-23T15:20:04Z

Overview

This pr implements effective execution lanes for the fvm, supporting a default and a priority execution lane.

The priority execution lane reserves a number of execution units so that there is always some capacity for it even if default execution load spikes, typically through user load.

Implementation Notes

Configuration is handled through the LOTUS_FVM_CONCURRENCY (also used by the fvm itself to reserve execution units) and LOTUS_FVM_CONCURRENCY_RESERVED env vars.
Currently, only ApplyBlocks execution is prioritized, as it is critical for sync/consensus.

…re appropriate

chain/consensus/compute_state.go

chain/stmgr/call.go

chain/vm/execution.go

raulk · 2023-03-24T13:24:51Z

chain/vm/execution.go

+
+const (
+	DefaultAvailableExecutionLanes = 4
+	DefaultPriorityExecutionLanes  = 2


As we discussed synchronously, we need to justify this number by looking at the critical path usages in the chain sync subsystem. From my understanding, those are the following, but @magik6k will probably have better ideas.

Incoming block validation via gossip

Head coalescer executions.

Actual deferred execution.

Also note that time-sensitive miner gas estimations and mpool pushes will fall under the "user" section. I'm not sure how much of a problem that could be, probably not much since those daemons won't be facing arbitrary user workloads. But worth confirming.

chain/vm/vmi.go

metrics/metrics.go

raulk · 2023-03-24T13:28:23Z

chain/vm/execution.go

I haven't reviewed this whole file yet. Pending.

chain/vm/execution.go

arajasek

Broadly looks like it should work, some notes.

chain/vm/execution.go

chain/consensus/compute_state.go

chain/stmgr/call.go

chain/vm/vmi.go

fridrik01

Looks good, just some minor comments

chain/vm/execution.go

chain/vm/vmi.go

fridrik01 · 2023-03-28T10:02:44Z

chain/vm/execution.go

+	// DefaultPriorityExecutionLanes is the number of reserved execution lanes for priority computations.
+	// This is purely userspace, but we believe it is a reasonable default, even with more available
+	// lanes.
+	DefaultPriorityExecutionLanes = 2


Can we add a comment somewhere that DefaultAvailableExecutionLanes include the DefaultPriorityExecutionLanes so they are not added together (unless I am somehow terribly mistaken).

ok, this should not be confusing.

fridrik01 · 2023-03-28T10:11:37Z

chain/vm/execution.go

+		reserving := 0
+		if e.reserved > 0 {
+			e.reserved--
+			reserving = 1
+		}


So I understand, is the executionToken.reserved mainly used to handle the case where we have a ExecutionLanePriority where all the priority lanes are used/full but there are free default ones?

exactly, it is the spill mechanism.

Co-authored-by: Aayush Rajasekaran <arajasek94@gmail.com>

arajasek

LGTM, but let's let @fridrik01 also approve.

fridrik01

LGTM also

Stebalien · 2023-03-28T21:22:06Z

Doing this entirely in lotus is looking pretty gnarly. Someone is going to forget to call Done in some error path and cause a deadlock (or call Done multiple times).

Please don't merge this yet, I think there's a better way and I'll spend a few minutes trying to flesh it out.

Stebalien · 2023-03-28T21:37:55Z

Ok, so:

The benefit of this approach is that it limits the lotus node to LOTUS_FVM_CONCURRENCY concurrent machines, limiting memory usage.
The downsides are:
1. This prevents interleaving message execution. This doesn't matter for throughput, but it increases the latency of otherwise fast requests by forcing them to block on other slower requests.
2. It infects everything with .Done() statements.

An alternative approach is to handle this at the ApplyMessage level. I.e.:

ApplyBlocks calls ApplyMessage with a priority of 0.
Everything else calls ApplyMessage with the default priority (e.g., 100).
Internally, ApplyMessage uses a semaphore to stop "normal" priority calls when the number of available lanes reaches LOTUS_FVM_CONCURRENCY_RESERVED.

Unlike the current approach, this doesn't require any deferred calls to some "done" method. Really, this can be entirely handled with a flag in the context passed to ApplyMessage.

In the future, this can be refined further: Instead of always reserving a couple of lanes, ApplyBlocks can raise the number of reserved priority lanes when called and reduce them when it returns.

vyzo · 2023-03-29T02:33:29Z

calling Done multiple times is idempotent, so that part is safe.

vyzo · 2023-03-29T02:37:27Z

Your proposal would use a similar mechanism (ie tokens for execution lanes).

The advantage it has is that the lock is more finegrained and that the Done call is limited in scope at ApplyMessage.

vyzo · 2023-03-29T02:39:47Z

I am not opposed to doing it that way, but the benefit really is marginal imo.

vyzo · 2023-03-29T02:51:12Z

Also, we dont need the context option at all, we can do the locking at the ApplyMessage shim and have the priority come from vmopts as it currently does.

In short, we can hide the whole getToken/putToken/Done business inside the ApplyMessage shim of the executor, and it's not the end of the world if you forget to call Done outside.

That's a change I am more willing to implement.

vyzo · 2023-03-29T13:52:28Z

See #10590 for implementation of the finegrained lock.

Honestly, I don't think perf will improve (it will likely be slightly worse for ApplyBlocks as we'll have to take the lock a thousand times), but not having to call Done is a win safety-wise.

VM Execution Lanes Part II: Hide the lock

chain/vm/execution.go

chain/consensus/compute_state.go

chain/stmgr/call.go

chain/vm/execution.go

vyzo added 2 commits March 23, 2023 16:53

introduce execution lanes

6550abd

update VM interface references to use the executor, and call Done whe…

7362556

…re appropriate

vyzo requested a review from raulk March 23, 2023 15:20

vyzo added 5 commits March 23, 2023 17:28

only call Atoi on non empty strings

ee6c0f8

call Executor.Done where appropriate in stmgr uses

2bb89d9

make token.Done idempotent

2a06604

add some sanity checks for execution concurrency parameters

317a87d

fix incorrect deferred vm release

f11a7f8

maciejwitowski added this to the Lotus - v1.23.0 milestone Mar 23, 2023

vyzo marked this pull request as ready for review March 23, 2023 19:12

vyzo requested a review from a team as a code owner March 23, 2023 19:12

add vm execution metrics

4b590e2

raulk reviewed Mar 24, 2023

View reviewed changes

address review comments

0813455

maciejwitowski reviewed Mar 27, 2023

View reviewed changes

chain/vm/execution.go Outdated Show resolved Hide resolved

maciejwitowski reviewed Mar 27, 2023

View reviewed changes

chain/vm/execution.go Show resolved Hide resolved

arajasek reviewed Mar 27, 2023

View reviewed changes

chain/vm/execution.go Outdated Show resolved Hide resolved

chain/vm/execution.go Show resolved Hide resolved

chain/consensus/compute_state.go Outdated Show resolved Hide resolved

chain/stmgr/call.go Outdated Show resolved Hide resolved

chain/vm/vmi.go Outdated Show resolved Hide resolved

vyzo requested a review from Stebalien March 27, 2023 16:16

vyzo added 3 commits March 27, 2023 20:56

add execution metrics to the chain node views

ddebdfb

use Count instead of LastValue

a0f908d

no, Sum it is.

6ecaf82

fridrik01 reviewed Mar 28, 2023

View reviewed changes

vyzo and others added 4 commits March 28, 2023 16:58

make gen

dcd9869

Update chain/vm/execution.go

b2b78e9

Co-authored-by: Aayush Rajasekaran <arajasek94@gmail.com>

rename confusing variable

b271216

rename newVM to makeVM for a happy yushie

71650cd

arajasek approved these changes Mar 28, 2023

View reviewed changes

fridrik01 approved these changes Mar 28, 2023

View reviewed changes

vyzo added 2 commits March 29, 2023 16:45

refactor execution lanes: hide the lock

4184ce9

fix tests

52d70d5

vyzo mentioned this pull request Mar 29, 2023

VM Execution Lanes Part II: Hide the lock #10590

Merged

Merge pull request #10590 from filecoin-project/vyzo/feat/exec-lanes-2

bc7dafc

VM Execution Lanes Part II: Hide the lock

Stebalien reviewed Mar 29, 2023

View reviewed changes

vyzo added 3 commits March 30, 2023 18:11

revert dead code

54a80a8

add comment about Signal unsoundness

7b4e682

reorg initialization code for better readability, remove unused import

d71b528

Stebalien approved these changes Mar 30, 2023

View reviewed changes

vyzo merged commit bf666a3 into master Mar 30, 2023

vyzo deleted the vyzo/feat/exec-lanes branch March 30, 2023 19:19

Fatman13 mentioned this pull request Mar 31, 2023

[venus] VM execution lane / vm 执行通道 filecoin-project/venus#5882

Closed

1 task

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: VM Execution Lanes #10551

feat: VM Execution Lanes #10551

vyzo commented Mar 23, 2023 •

edited

Loading

raulk Mar 24, 2023

raulk Mar 24, 2023

arajasek left a comment

fridrik01 left a comment

fridrik01 Mar 28, 2023

vyzo Mar 28, 2023

fridrik01 Mar 28, 2023

vyzo Mar 28, 2023

arajasek left a comment

fridrik01 left a comment

Stebalien commented Mar 28, 2023

Stebalien commented Mar 28, 2023

vyzo commented Mar 29, 2023

vyzo commented Mar 29, 2023

vyzo commented Mar 29, 2023

vyzo commented Mar 29, 2023

vyzo commented Mar 29, 2023

feat: VM Execution Lanes #10551

feat: VM Execution Lanes #10551

Conversation

vyzo commented Mar 23, 2023 • edited Loading

Overview

Implementation Notes

raulk Mar 24, 2023

Choose a reason for hiding this comment

raulk Mar 24, 2023

Choose a reason for hiding this comment

arajasek left a comment

Choose a reason for hiding this comment

fridrik01 left a comment

Choose a reason for hiding this comment

fridrik01 Mar 28, 2023

Choose a reason for hiding this comment

vyzo Mar 28, 2023

Choose a reason for hiding this comment

fridrik01 Mar 28, 2023

Choose a reason for hiding this comment

vyzo Mar 28, 2023

Choose a reason for hiding this comment

arajasek left a comment

Choose a reason for hiding this comment

fridrik01 left a comment

Choose a reason for hiding this comment

Stebalien commented Mar 28, 2023

Stebalien commented Mar 28, 2023

vyzo commented Mar 29, 2023

vyzo commented Mar 29, 2023

vyzo commented Mar 29, 2023

vyzo commented Mar 29, 2023

vyzo commented Mar 29, 2023

vyzo commented Mar 23, 2023 •

edited

Loading