Add documents on the Mono interpreter analysis #2857

janvorli · 2024-12-05T13:26:16Z

This change adds documents on the Mono interpreter byte code, the transformation and execution phase, debugger, exception handling and stack walking.

This change adds document describing how the functions in the transform.c will be converted to use the Jit2EEInterface methods instead of Mono APIs.

…sform-doc

First part from my notes

…melab into add-transform-doc

janvorli · 2024-12-05T13:29:11Z

cc: @jkotas, @AaronRobinsonMSFT, @radekdoulik, @BrzVlad, @cshung, @steveisok, @thaystg, @kg, @davidwrighton, @pragmanomos, @mangod9

BrzVlad · 2024-12-05T15:06:40Z

docs/design/interpreter/exception-handling.md

+
+Resuming execution in the parent of a catch handler after the catch handler exits depends on whether the catch handler is in an interpreted code or in compiled managed code. 
+For the compiled code case, the existing CoreCLR EH code will handle it without changes. It would just restore the context at the resume location.
+For the interpreted code though, CoreCLR will resume execution in the mono_interp_exec_method, then pop some InterpFrame instances from the local linked list (depending on how many interpreted managed frames need to be removed) and restore the interpreter SP from the last popped one and the interpreter IP will be set to the resume location IP.


I am not a big fan of context restoring into C code as we currently do on mono. I think it would be simpler to always have a C++ try/catch when we leave interpreter. Asking to resume into interpreter would be a simple throw. Catchers would check if the resume information is for the current block of linked interpreter frames, if it is not found then it would just rethrow. This is more or less the approach described below that we would need to take for wasm anyway.

I didn't say that we would necessarily restore context into the C code, I said we would do that for the compiled managed code case. However, from perf point of view, it might be better to restore context back into the interp_throw on other than WASM. I have found unwinding native frames to be very costy, see my change #108480 where just getting rid of two levels of native code when propagating an exception resulted in a significant perf win.

AaronRobinsonMSFT · 2024-12-05T18:01:36Z

docs/design/interpreter/stackwalk.md

+To enable seamless integration of interpreter with GC, exception handling and debugger, we need to be able to walk stack that contains interleaved sequences of interpreted and AOT / JIT compiled frames. Fortunately, the same StackFrameIterator is used by the GC, exception handling and debugger, so adding a support to walk stack containing interpreter frames to this StackFrameIterator will give us support for all of the three scenarios.
+
+Here is the list of necessary changes:
+* Add a new explicit frame, the InterpreterFrame. Instance of this frame would be created in the mono_interp_exec_method. It will enable stack frame iterator to move through the interpreted frames managed by that mono_interp_exec_method. Please note that interpreter calls to other interpreted code are not recursively calling mono_interp_exec_method and each interpreted frame is represented by an instance of InterpFrame (these form a linked list).


Are InterpreterFrame and InterpFrame different here? Not trying to quibble over names, but using names like this in these design documents is very difficult to parse. I find this sort of long form description difficult to understand - too many words and missing the logic.

As a suggestion in general, having basic steps (1), (2), ... (N) is much easier to follow and avoids encoding steps in long form descriptions.

They are different. InterpFrame is a currently existing data structure in the Mono interpreter while the InterpreterFrame will be a new explicit frame derived from Frame in coreclr. I think we will end up renaming the InterpFrame to something else and use the InterpreterFrame for the explicit one, but I didn't want to rename the Mono structure for the sake of this document and I didn't have a better idea for naming of the new InterpreterFrame.

jkotas · 2024-12-06T07:12:26Z

docs/design/interpreter/exception-handling.md

+For the compiled code case, the existing CoreCLR EH code will handle it without changes. It would just restore the context at the resume location.
+For the interpreted code though, CoreCLR will resume execution in the mono_interp_exec_method, then pop some InterpFrame instances from the local linked list (depending on how many interpreted managed frames need to be removed) and restore the interpreter SP from the last popped one and the interpreter IP will be set to the resume location IP.
+
+WASM doesn't support stack unwinding and context manipulation, so the resuming mechanism cannot use context restoring. Mono currently returns from the mono_handle_exception after the catch is executed back to the interp_throw. When the resume frame is in the interpreted frames belonging to the current mono_interp_exec_method, it uses the mechanism described in the previous paragraph to "restore" to the resume context. But when the resume frame is above all the frames belonging to the current mono_interp_exec_method, it exits from the mono_interp_exec_method and then throws a C++ exception (of int32 * type set to NULL) that will propagate through native frames until it is caught in a compiled managed code or in another interpreter function up the call chain (see usage of mono_llvm_catch_exception) where the propagation through the interpreted frames continues the same way as in the previous interpreted frames block.


How is C++ exception handling implemented in Wasm? Does Wasm have any primitives for stack unwinding to support C++ exception handling?

WASM has exception handling support, see https://github.com/WebAssembly/exception-handling/tree/main/proposals/exception-handling

Yes, it adds new instructions, new tags section in the wasm format and modifies few existing sections.

Note that the current wasm EH, which is implemented in most browser and engines, is deprecated. The new EH with the exnref type is being standardized and is becoming available or included as preview feature. Here you can see the feature extensions table https://webassembly.org/features/

All of our current Wasm runtimes also support targeting VMs with no exception extensions. This is one of those cases where it is useful to think of the browser and WASI separately because the all the major browsers currently support the deprecated Wasm exception instructions and none of the WASI runtimes do.

To answer the unwinding part of the question, the stack unwinding is done by the VM itself and not by the running code. So after exception is thrown, the stack is unwound by VM and control flow is transferred to the catch instruction.

On wasm, the clang/llvm handles C++ exceptions by inserting a call to a wrapper in the libunwind after the catch instruction. The wrapper then calls the libcxxabi's personality function. More details can be found in https://llvm.org/docs/ExceptionHandling.html#overview and in the comments on top of https://github.com/dotnet/llvm-project/blob/dotnet/main-19.x/llvm/lib/CodeGen/WasmEHPrepare.cpp

kg · 2024-12-06T16:57:11Z

docs/design/interpreter/stackwalk.md

+To enable seamless integration of interpreter with GC, exception handling and debugger, we need to be able to walk stack that contains interleaved sequences of interpreted and AOT / JIT compiled frames. Fortunately, the same StackFrameIterator is used by the GC, exception handling and debugger, so adding a support to walk stack containing interpreter frames to this StackFrameIterator will give us support for all of the three scenarios.
+
+Here is the list of necessary changes:
+* Add a new explicit frame, the InterpreterFrame. Instance of this frame would be created in the mono_interp_exec_method. It will enable stack frame iterator to move through the interpreted frames managed by that mono_interp_exec_method. Please note that interpreter calls to other interpreted code are not recursively calling mono_interp_exec_method and each interpreted frame is represented by an instance of InterpFrame (these form a linked list).


Should we make this sort of thing clear by having names like ClrInterpreterFrame and MonoInterpreterFrame?

I guess that would make it even more confusing by naming existing Mono InterpFrame differently in the doc. I think we will end up renaming many data types in the interpreter, but if we rename it in this doc, the link between the doc and the real Mono state will be lost.
I think I'll just rename the InterpreterFrame here to something different to make the doc less ambiguous and keep the InterpFrame.

kg · 2024-12-06T17:23:22Z

docs/design/interpreter/transformation.md

+* mono_class_from_mono_type_internal - N.A., just use CORINFO_CLASS_HANDLE
+
+### mono_interp_jit_call_supported
+* Return false, we will not support JIT (at least not in first version)


Note that in some cases 'jit calls' can be calls into AOT-compiled code on mono. It still seems valid to scope this out in the first version though.

This is good point, when I was writing this comment, I didn't know yet that that JIT call here means in fact AOT call and that it has JIT in the name for historical reasons.

kg · 2024-12-06T19:16:41Z

docs/design/interpreter/transformation.md

+* interp_get_ldind_for_mt
+* interp_get_stind_for_mt
+* generate_compacted_code
+* mono_jiterp_insert_ins


anything with 'jiterp' in the name can be safely ignored as it is specific to the wasm jiterpreter

kg · 2024-12-06T19:17:53Z

docs/design/interpreter/ir-byte-code.md

+ * Both the transformation and interpretation use Mono type / method  / vtable etc data structures
+ * The arguments to the opcode are immediate constants (16, 32 or 64 bit), indexes into local variables table or indexes into a per-method special data items array.
+## Limitations
+ * The local variable offsets in most of the IR codes are 16 bit unsigned integers, so a method can have max 65536 kB of local variables


i think this is a 'max stack frame size of 64kb', not just local variables since stuff like stackalloc eats the space too

kg · 2024-12-06T21:00:19Z

docs/design/interpreter/ir-byte-code.md

+ * MINT_MOV_VT(u16 target_local, u16 source_local, u16 size)
+### Hot reload
+ * MINT_METADATA_UPDATE_LDFLDA(u16 target_local, u16 source_local, u16 type_index, u16 fielddef_token_index)
+### JITerpreter


This is a WASM specific JIT, these opcodes are WASM only and can be ignored

janvorli and others added 12 commits November 26, 2024 22:36

Add documentation of the functions in transform.c

833e518

This change adds document describing how the functions in the transform.c will be converted to use the Jit2EEInterface methods instead of Mono APIs.

Add methods that don't call mono_ functions or m_ macros

a4cff10

Add methods which use just glib or memory allocations

a1ba34a

Added functions from my notes, first part

64da4f2

Feedback

3bceb5a

Add next bunch of functions

04bae85

Add documentation of the functions in transform.c

9c624b5

This change adds document describing how the functions in the transform.c will be converted to use the Jit2EEInterface methods instead of Mono APIs.

Last bunch of functions

5cb1d3d

Merge remote-tracking branch 'origin/add-transform-doc' into add-tran…

e16ab63

…sform-doc

Merge pull request #1 from radekdoulik/pr-interp-transformdoc-1

030602d

First part from my notes

Add docs on debugger, EH, stack walk, byte code

96a9d00

Merge branch 'add-transform-doc' of https://github.com/janvorli/runti…

29c0a23

…melab into add-transform-doc

janvorli added the area-CoreclrInterpreter label Dec 5, 2024

janvorli self-assigned this Dec 5, 2024

BrzVlad reviewed Dec 5, 2024

View reviewed changes

steveisok requested review from kg and lateralusX December 5, 2024 15:26

AaronRobinsonMSFT reviewed Dec 5, 2024

View reviewed changes

jkotas reviewed Dec 6, 2024

View reviewed changes

kg reviewed Dec 6, 2024

View reviewed changes

kg approved these changes Dec 7, 2024

View reviewed changes

janvorli added 2 commits December 10, 2024 18:11

Reflect feedback

8283984

Add two missing links

bea8538

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add documents on the Mono interpreter analysis #2857

Add documents on the Mono interpreter analysis #2857

janvorli commented Dec 5, 2024

janvorli commented Dec 5, 2024

BrzVlad Dec 5, 2024

janvorli Dec 5, 2024

AaronRobinsonMSFT Dec 5, 2024

janvorli Dec 5, 2024

jkotas Dec 6, 2024 •

edited

Loading

janvorli Dec 6, 2024

radekdoulik Dec 6, 2024

lewing Dec 6, 2024

radekdoulik Dec 6, 2024

kg Dec 6, 2024

janvorli Dec 6, 2024

kg Dec 6, 2024

janvorli Dec 6, 2024

kg Dec 6, 2024

kg Dec 6, 2024 •

edited

Loading

kg Dec 6, 2024

Add documents on the Mono interpreter analysis #2857

Are you sure you want to change the base?

Add documents on the Mono interpreter analysis #2857

Conversation

janvorli commented Dec 5, 2024

janvorli commented Dec 5, 2024

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

jkotas Dec 6, 2024 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

kg Dec 6, 2024 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

jkotas Dec 6, 2024 •

edited

Loading

kg Dec 6, 2024 •

edited

Loading