Fix Loads and Stores for SHA256 benchmark #185

Merged
merged 5 commits into main from sha_full on Feb 6, 2024

Conversation

aborg-dev commented on Jan 9, 2024

This PR adds the remaining features necessary to run SHA256 benchmark:

  • Misaligned loads and stores
  • Loads and stores of size < 64 bit

These instructions are generated by the rustc compiler for the wasm32 target and are quite common.

The current implementation is not verifiable as it uses external (free) inputs. It turned out to be quite challenging to implement these operations in a verifiable fashion because they involve many bitwise operations, which are quite expensive.

For now, there is value in having a non-verifiable implementation that makes sure we can run tests and benchmarks related to memory. I've linked a few TODOs to do a verifiable implementation, but first, we would need a proper design for that part. Most likely, we will need to modify the zkAsm processor to support these operations efficiently.

Implementation-wise, there are three steps (sketched below):

  • Conversion from address + offset to slot + offset
  • Read/write the value at the correct offset
  • Narrowing down the value to the desired type width
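A minimal sketch of these three steps in Rust (illustrative only, not the actual lowering; the 8-byte slot size, little-endian packing, and the function shape are assumptions):

// Illustrative sketch: memory modeled as 64-bit slots, `width` given in bytes.
fn load_narrow(mem: &[u64], addr: usize, width: usize) -> u64 {
    assert!(width >= 1 && width <= 8);
    // Step 1: convert the byte address into a slot index plus a byte offset.
    let slot = addr / 8;
    let offset = addr % 8;
    // Step 2: read the (up to) two slots the value may span and merge them.
    let lo = mem[slot] >> (8 * offset);
    let hi = if offset == 0 { 0 } else { mem[slot + 1] << (8 * (8 - offset)) };
    let merged = hi | lo;
    // Step 3: narrow the merged value down to the requested type width.
    if width == 8 { merged } else { merged & ((1u64 << (8 * width)) - 1) }
}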

aborg-dev mentioned this pull request on Jan 9, 2024
aborg-dev changed the base branch from main to stack_fixes on January 9, 2024 13:29
Base automatically changed from stack_fixes to main on January 9, 2024 16:43
aborg-dev force-pushed the sha_full branch 2 times, most recently from 988b67f to 776625c on January 10, 2024 12:04
aborg-dev force-pushed the sha_full branch 10 times, most recently from e481d12 to e232698 on January 29, 2024 14:54
aborg-dev changed the title from "SHA256 now fully runs" to "Fix Loads and Stores for SHA256 benchmark" on Jan 29, 2024
aborg-dev marked this pull request as ready for review on January 29, 2024 15:00
Review thread on cranelift/zkasm_data/benchmarks/sha256/from_rust.wat (outdated):
}

// Handle the case when read spans two slots.
if rem + width > 8 {
Collaborator

Can you really rely on this? What happens if wasm goes like i32.load offset=0 (i32.const 7)? I think there's no choice but to check alignment of the final computed (byte) address at runtime and/or unconditionally load two slots.

Author (aborg-dev)

Fair point, I'm not sure whether I can rely on the fact that the address in the load/store register will be word-aligned. So we at least need an assert to check this.

Given that this is a non-trivial amount of work that we might need to redo anyway soon, I'll start with an assert in the code to check that the address is word-aligned. If it is not triggered by the Rust benchmarks that we use, that would be interesting evidence to investigate how exactly these addresses are generated. If it is triggered, we'll need to address it when we have the test/benchmark that exercises it.
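A minimal sketch of such an interim check (illustrative Rust only; the helper name and the 8-byte word size are assumptions, not the actual Cranelift code):

// Illustrative only: trap early when the computed byte address is not
// aligned to the assumed 8-byte word size, instead of mis-reading memory.
fn assert_word_aligned(addr: u64) {
    assert!(addr % 8 == 0, "unaligned load/store address: {addr:#x}");
}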

Author (aborg-dev)

Well, this was very well spotted. I've added an assert and it triggered. I'll work on fixing it.

Collaborator

FWIW for a minimal reproducer something like read_unaligned would do. In architectures that require aligned loads the compiler backend would responsibly split the load into aligned loads, but WebAssembly specifically does not impose that sort of requirement, so the backend doesn't need to worry about unaligned addresses at all.
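For context, a minimal Rust reproducer along those lines could look like this (hypothetical; the thread does not show the exact reproducer):

use std::ptr;

/// Reads a u32 from a deliberately misaligned address. Targeting wasm32,
/// rustc emits the load as-is, since WebAssembly does not require alignment.
fn read_at_odd_offset(bytes: &[u8; 8]) -> u32 {
    unsafe { ptr::read_unaligned(bytes.as_ptr().add(1) as *const u32) }
}

fn main() {
    let bytes = [0u8, 1, 2, 3, 4, 5, 6, 7];
    println!("{:#010x}", read_at_odd_offset(&bytes));
}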

Author (aborg-dev)

It took some effort, but I rewrote the code without the assumption that the address is word-aligned.
I ended up unconditionally reading and writing to two slots, though I think in the future it would not be that hard to optimize the code to skip the second write when possible using a jump.

aborg-dev force-pushed the sha_full branch 2 times, most recently from ebed326 to 2484725 on January 30, 2024 14:38
aborg-dev changed the base branch from main to sha_code on January 30, 2024 14:38
Base automatically changed from sha_code to main on January 31, 2024 16:55
aborg-dev marked this pull request as draft on February 1, 2024 10:09
aborg-dev (Author)

Converted this back to draft while I figure out how to fix unaligned register addresses.

aborg-dev marked this pull request as ready for review on February 2, 2024 16:44
aborg-dev requested a review from nagisa on February 2, 2024 16:45
aborg-dev requested review from MCJOHN974 and mooori on February 6, 2024 11:08
The statement A + 3 > 8 seems to be parsed as A + (3 > 8) after the translation by the zkAsm interpreter, so parentheses were added to disambiguate this.
Comment on lines +22 to +28
${ (E) % 8 } => A
${ (E) / 8 } => E
$ => D :MLOAD(MEM:E + 1)
$ => B :MLOAD(MEM:E)
${ B >> (8 * A) } => B
${ (D << (128 - 8 * (A + 1))) | B } => B
${ B & ((1 << 8) - 1) } => B
Collaborator

This seems alright, although knowing zkasm, I think a quick improvement in step count in the future is going to involve a JNZ on the result of % 8 so that only a single load executes in the aligned case. Probably similar with stores as well.
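A rough Rust sketch of that suggestion, assuming the same 8-byte slots as in the snippet above (the concrete JNZ sequence would depend on the zkASM lowering):

// Sketch of the suggested fast path: a slot-aligned access needs only one
// load; only misaligned accesses pay for the second load and the merge.
fn load_with_fast_path(mem: &[u64], addr: usize, width: usize) -> u64 {
    let slot = addr / 8;
    let offset = addr % 8;
    let mask = if width == 8 { u64::MAX } else { (1u64 << (8 * width)) - 1 };
    if offset == 0 {
        // Aligned case (the % 8 result is zero): a single load suffices.
        mem[slot] & mask
    } else {
        // Misaligned case: the JNZ on the offset would route execution here,
        // which loads two adjacent slots and stitches the value together.
        let lo = mem[slot] >> (8 * offset);
        let hi = mem[slot + 1] << (8 * (8 - offset));
        (hi | lo) & mask
    }
}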

aborg-dev added this pull request to the merge queue on Feb 6, 2024
Merged via the queue into main with commit 3d0d0a7 on Feb 6, 2024
21 checks passed
aborg-dev deleted the sha_full branch on February 6, 2024 13:03