Backport #346 #347 #349 #367
Conversation
* feat: Add opcode enumerating helpers using macros

  Previously, we had to maintain the opcodes manually so that the list of opcode definitions stayed in sync with the `INSTRUCTION_OPCODE_NAMES` array, which was a tedious step. It was also quite easy to forget an opcode or two somewhere in the implementations. This change leverages purpose-built macros that let us enumerate all the opcodes, generating accessor code for each of them. This way we can define opcodes in one place and make sure every enumeration contains all the opcodes correctly. Note that using macros does have the drawback of making the code less clear to read, so we also provide a one-line command to generate the underlying Rust code. One can also refer to the Rust docs to see all the defined opcodes.

* ci: Format

* perf: Threadify Rust interpreter

  This change leverages a technique called a "threaded interpreter" to speed up the Rust interpreter. Basically, it splits a giant match statement into multiple smaller functions, each handling a specific opcode. Given a group of opcodes (e.g., a basic block), we can extract the handler function (also called a "thread") for each instruction's opcode, then simply run each handler in turn to execute the instructions. This helps the CPU's branch predictor better predict what code to execute next. Note this work is inspired by @mohanson's original work at nervosnetwork@8422373. Reference: http://www.emulators.com/docs/nx25_nostradamus.htm

* test: Add a test to ensure opcodes are defined sequentially
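The "define once, enumerate everywhere" idea from the first commit can be sketched with a callback-style `macro_rules!` pattern. This is an illustrative toy, not the actual ckb-vm macros: the opcode names and values below are made up, and only `INSTRUCTION_OPCODE_NAMES` is taken from the description above.

```rust
// Single source of truth: one macro holds the opcode list and hands it
// to whatever callback macro it is given.
macro_rules! for_each_inst {
    ($callback:ident) => {
        $callback! {
            OP_ADD => 0,
            OP_SUB => 1,
            OP_JAL => 2,
        }
    };
}

// Callback 1: generate the numeric opcode constants.
macro_rules! define_opcodes {
    ($($name:ident => $value:expr),* $(,)?) => {
        $(pub const $name: u8 = $value;)*
    };
}
for_each_inst!(define_opcodes);

// Callback 2: generate the matching name table from the same list, so
// it can never drift out of sync with the constants.
macro_rules! define_names {
    ($($name:ident => $value:expr),* $(,)?) => {
        pub const INSTRUCTION_OPCODE_NAMES: &[&str] = &[$(stringify!($name)),*];
    };
}
for_each_inst!(define_names);

fn main() {
    assert_eq!(OP_SUB, 1);
    assert_eq!(INSTRUCTION_OPCODE_NAMES[OP_SUB as usize], "OP_SUB");
}
```

Adding a new opcode then means touching only the list inside `for_each_inst`; every generated table picks it up automatically.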
That reminds me of a question: do we need to move ThreadFactory to the heap? I'm not entirely sure how far LLVM is willing to go here. Maybe moving ThreadFactory to the heap will save further stack space, maybe not; I'm just not sure.
I did a test: if the ThreadFactory is moved to the heap, it saves ~7K of stack space for each vm.
7K * 57 == 399K, which would be a rather valuable saving. But we can leave this to a future change.
It might help a little, but not enough to fix this bug.
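The heap idea discussed above can be sketched with `Box`. This is a stand-in illustration, not the real ckb-vm types: `ThreadFactory` here is a dummy struct holding a large handler table, and the sizes are only meant to show why boxing shrinks the per-VM stack footprint.

```rust
use std::mem::size_of;

// Dummy handler type; the real table holds per-opcode functions.
type Handler = fn(&mut u64);

// A large dispatch table: 1024 optional function pointers is already
// 8 KiB when stored by value (fn pointers are non-null, so
// Option<Handler> is pointer-sized).
struct ThreadFactory {
    handlers: [Option<Handler>; 1024],
}

// Variant 1: the table lives inline, so every VM instance carries the
// whole 8 KiB on the stack.
struct VmInline {
    factory: ThreadFactory,
}

// Variant 2: the table lives on the heap; the VM only holds a pointer,
// so deeply nested vm-in-vm calls grow the stack far more slowly.
struct VmBoxed {
    factory: Box<ThreadFactory>,
}

fn main() {
    println!("inline: {} bytes", size_of::<VmInline>());
    println!("boxed:  {} bytes", size_of::<VmBoxed>());
    assert!(size_of::<VmBoxed>() < size_of::<VmInline>());
}
```

As the thread notes, this shrinks each VM's stack cost but does not by itself bound the recursion depth, so it mitigates rather than fixes the overflow.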
When the Rust interpreter executes instructions, it uses a huge match to dispatch on opcodes. This leads to a problem: the function is too big and needs to store too many variables on the stack. In some cases (such as calling a vm inside a vm), this causes a stack overflow.
@xxuejie's work avoids this problem. In the worst case (vm calls vm, looping 57 times), only about 900K of stack space is now needed.
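The fix described above can be illustrated with a toy threaded interpreter. This is a minimal sketch, not the actual ckb-vm code: the `Machine`, opcodes, and handler table below are invented for the example. The point is that each opcode gets its own small handler function, and dispatch walks a table of function pointers instead of re-entering one giant match whose frame holds every case's locals.

```rust
struct Machine {
    pc: usize,
    acc: i64,
}

// Each opcode handler is a small function with its own tiny stack
// frame, instead of one arm inside a huge match.
type Handler = fn(&mut Machine, i64);

fn op_add(m: &mut Machine, imm: i64) { m.acc += imm; m.pc += 1; }
fn op_mul(m: &mut Machine, imm: i64) { m.acc *= imm; m.pc += 1; }
fn op_halt(m: &mut Machine, _imm: i64) { m.pc = usize::MAX; }

// Handler table indexed by opcode.
const HANDLERS: [Handler; 3] = [op_add, op_mul, op_halt];

fn run(m: &mut Machine, program: &[(u8, i64)]) {
    // "Threading": fetch each instruction's handler from the table and
    // call it, one short, predictable function at a time.
    while m.pc < program.len() {
        let (opcode, imm) = program[m.pc];
        HANDLERS[opcode as usize](m, imm);
    }
}

fn main() {
    // acc = (0 + 5) * 3, then halt.
    let program = [(0u8, 5i64), (1, 3), (2, 0)];
    let mut m = Machine { pc: 0, acc: 0 };
    run(&mut m, &program);
    assert_eq!(m.acc, 15);
}
```

Because `run` only holds a few locals itself, its stack frame stays small even though the instruction set is large, which is why nested vm-in-vm execution no longer overflows.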
Additional change: ckb currently uses `paste` 1.0.6, so I downgraded the `paste` version of ckb-vm to match.