feat(cheatcodes): Record Account and Storage Access Cheatcodes #6310

Inphi · 2023-11-14T15:13:27Z

Motivation

This follows up the great work @refcell and @jameswenzel have done on #6087. It implements a record and storage access cheatcode interface based on #6125.

Solution

#6125 goes over the details of the implementation. However, the interface proposed has been tweaked to the following:

enum AccountAccessKind {
  Call, CallReturn, CallCode, StatiCall,
  Create,
  SelfDestruct,
  Resume,
}

struct ChainInfo { uint256 forkId; uint256 chainId; }
 
struct AccountAccess {
    ChainInfo chainInfo;
    AccountAccessKind kind;
    address account;
    address accessor;
    bool initialized;
    uint256 oldBalance;
    uint256 newBalance;
    bytes deployedCode;
    uint256 value;
    bytes data;
    bool reverted;
    StorageAccess[] storageAccesses;
}

struct StorageAccess {
    address account;
    bytes32 slot;
    bool isWrite;
    bytes32 previousValue;
    bytes32 newValue;
    bool reverted;
}

function startStateDiffRecording() external;
function stopAndReturnStateDiff() external returns (AccountAccess[] memory accesses);

Once cheats.startStateDiffRecording() is called, all state and account accesses will be recording starting from the next context, rather than the current one. For example, given the following code snippet:

function run() {
  B b = new B();
  cheats.startStateDiffRecording();
  A a = new A(b);
  a.foo();
}

contract A {
  B _b;
  constructor(B b) { b = b }
  function foo() external { _b.foo(); }
}

The recorded accesses will contain the following:

CREATE opcode on A
CALL on a.foo
CALL on b.foo

Which excludes the CREATE context for B's ctor because no AccountAccess record exists for the current context at the time recording was enabled. Without keeping track of all AccountAccess records even when recording is disabled, we cannot emit a well-defined AccountAccess for B's ctor.

Control-Flow Linking

A context being recorded may be temporarily switched via sub-calls. Once control flow is returned to that context, it's important to maintain an ordering of subsequent accesses. This is accomplished through the Resume access kind. A Resume AccountAccess record contains storage accesses that occur between context switches. For example, given the following:

function run() {
  cheats.startStateDiffRecording();
  a.foo();  
}

contract A {
  function foo() external {
    assembly { sstore(0, 0) }
    this.bar(); //
    assembly { sstore(1, 0) }
  }
  function bar() external {}
}

We record the following:

AccountAccess for the A.foo Call
- StorageAccess for slot 0 store
AccountAccess for the A.bar Call
AccountAccess for the A.foo Return
- StorageAccess for slot 1 store

Here, the Return allows users determine the context where the storage access occurred. The Return account access is only created if there were subsequent storage accesses made after a context switch. This interface only aims to provide enough information to track state changes, rather than offering a comprehensive control flow trace.

mds1

I haven't reviewed the implementation, but have some questions around UX and behavior. Overall looks great though agree with the direction!

crates/cheatcodes/defs/src/vm.rs

mds1 · 2023-11-14T16:56:08Z

crates/abi/abi/HEVM.sol

@@ -84,6 +86,9 @@ record()
 accesses(address)(bytes32[], bytes32[])
 skip(bool)

+recordStateDiff()
+getStateDiff()(AccountAccess[])


Let's say my test calls A.foo() where foo calls another contract B, so the full flow is:

A calls B

B reads slot 0

B calls C

C reads slot 3

B has control again and reads slot 5

Is the returned array ordered like this:

AccountAccess.A

AccountAccess.B

StorageAccess in B, slot 0

AccountAccess.C

StorageAccess in C, slot 3

AccountAccess.B (I guess technically it was accessed again when re-gaining control flow? If so perhaps we need a new access kind)

StorageAccess in B, slot 5

Depends on where the record was initiated. If you start recording just prior to A.foo(), then there wouldn't be any AccountAccess.A since A wasn't accessed. The first AccountAccess would be for B, with A as its accessor. So the returned array would be as you say, without the AccountAccess.A.

Yeah, I'm starting to agree with you on a new AccessKind needed for this return control flow. We do need to maintain the "chronological" order so StorageAccess C, slot 3 relates with StorageAccess B, slot 5. But it may not be immediately to users that the latter storage access isn't from C's context.

OTOH, this kinda defeats the purpose of having a storageAccesses field for AccountAccess if we need a new kind to handle this case. I'm not exactly sure the right approach should be. But I lean towards completely decoupling storage accesses from account accesses, while maintaining ordering. So you have a the following state diff array:

AccountAccess.B

StorageAccess.B, slot 0

AccountAccess.C

StorageAccess.C, slot 3

StorageAccess.B, slot 5

The main issue with this approach is that the user cannot tell the context StorageAccess.B, slot 5 is for. This should be evident from the StorageAccess.account field, but perhaps in cases like delegatecall it's a bit blurry.
This API depends on whether users need to know exactly which context an account access occurred.

Depends on where the record was initiated. If you start recording just prior to A.foo(), then there wouldn't be any AccountAccess.A since A wasn't accessed. The first AccountAccess would be for B, with A as its accessor. So the returned array would be as you say, without the AccountAccess.A.

Interesting, I'd expect this to have resulted in AccountAccess.A entry:

contract MyTest is Test { function testAccesses() external { vm.recordStateDiff(); a.foo(); // `foo` now calls `b` AccountAccess[] memory accesses = vm.getStateDiff(); } }

And I think it should, because users will want the storage accesses that took place in the a.foo() call also

Hmm, yea this is tricky. A "return control flow" seems ok but feels hacky. And I don't think Solidity is flexible enough to allow the cheat to return a flat array of (AccountAccess | StorageAccess)[] so the UX there may be clunky because I think you'd need it return a struct of:

struct AccountOrStorageAccess { AccountAccess accountAccess; // empty if it was a storage access StorageAccess storageAccess; // empty if it was an account access bool isAccountAccess; bool isStorageAccess; }

Let me clarify how recording occurs. In your code snippet, the a.foo() CALL will be recorded. What I was trying to explain is that state accesses in the same context a record begins are not recorded. That is the sstore in the following example won't be recorded:

contract MyTest is Test { function testAccesses() external { vm.recordStateDiff(); assembly { sstore(0x0, 0x0) } a.foo(); // `foo` now calls `b` AccountAccess[] memory accesses = vm.getStateDiff(); } }

The a.foo() call generates an AccountAccess record for only the b contract, but not a since that's the only account being accessed there. So we have the following record:

AccountAccess { accessor: a, account: b, data: abi.encodeCall(B.foo,()), kind: CALL, ... }

I guess, if there is a value transfer then it should generate two records for the credit and debit changes. Though this seems redundant as we'll need to represent this with new type of account access or even a new Access type struct.

Hm, I do think that a.foo() call should capture 2 account accesses—one for A and one for B. One way to think about this is that if a has storage writes those should be captured, and you can't have a storage write without first having an account access.

I do agree that sstore should not be captured though.

In your example, is the a.foo() an actual CALL, i.e. calling an external A.foo function for example? If so, then I think we're on the same page. If it's just a jump, then that kinda breaks the CALL kind concept we have going.

Oh sorry it was a CALL, I see how that is unclear

Just noting for completeness here that we discussed offline and agreed on adding a synthetic Resume access type to handle the "control flow returned to caller" access kind

crates/abi/abi/HEVM.sol

Co-authored-by: James Wenzel <wenzel.james.r@gmail.com>

Adds a couple more fields to recorded account and storage accesses.

Also rename cheats APIs

crates/cheatcodes/spec/src/vm.rs

Co-authored-by: refcell.eth <abigger87@gmail.com>

crates/abi/abi/HEVM.sol

crates/cheatcodes/spec/src/vm.rs

refcell

Looks good to me now, really nice work @Inphi

mattsse

some style nits,
defer to @mds1 and @DaniPopes

crates/cheatcodes/src/inspector.rs

mds1 · 2023-11-16T14:56:13Z

crates/abi/abi/HEVM.sol

@@ -5,6 +5,8 @@ struct DirEntry { string errorMessage; string path; uint64 depth; bool isDir; bo
 struct FsMetadata { bool isDir; bool isSymlink; uint256 length; bool readOnly; uint256 modified; uint256 accessed; uint256 created; }
 struct Wallet { address addr; uint256 publicKeyX; uint256 publicKeyY; uint256 privateKey; }
 struct FfiResult { int32 exitCode; bytes stdout; bytes stderr; }
+struct AccountAccess { address accessor; address account; uint256 kind; bool initialized; uint256 oldBalance; uint256 newBalance; bytes deployedCode; uint256 value; bytes data; bool reverted; StorageAccess[] storageAccesses; }


Can we include the enum definition in this file and that instead of uint256 kind?

Yup will do. Though currently running into an abigen issue where it fails to generate proper bindings for an AccountAccess with too many fields.

Ah interesting, should be ok if we can't use the enum here since we can still use in Vm.sol in forge-std, so will defer to you + @mattsse / @Evalir here about the abigen issue

I got it working by replacing both chainId and forkId with a struct h/t @refcell . Good to do anyways if we'll be adding this info to the rest of the cheatcode interface.

don't worry about it, crates/abi is deprecated and will be removed soon. It's currently only used for trace/log decoding

crates/cheatcodes/spec/src/vm.rs

crates/cheatcodes/src/inspector.rs

mattsse

I'd like to rename the variable

otherwise lgtm, pending @DaniPopes

I still prefer ref mut because it makes it instantly clear to me when looking at the Some that this is a &mut without checking the rhs
but I don't mind &mut

crates/cheatcodes/src/inspector.rs

DaniPopes

lgtm

mattsse · 2023-11-17T22:24:49Z

gg
sending it

refcell · 2023-11-17T22:27:51Z

Wooot thank you @mattsse @DaniPopes 👑 👑 👑 👑

mds1 · 2023-11-17T22:31:38Z

@Inphi Can you PR the cheats into forge-std's VmSafe in Vm.Sol, and the book also?

Inphi · 2023-11-17T22:32:13Z

@Inphi Can you PR the cheats into forge-std's VmSafe in Vm.Sol, and the book also?

yup. Will do

Inphi force-pushed the inphi/cheat-records branch 2 times, most recently from c217748 to bdfe4ed Compare November 14, 2023 16:02

mds1 reviewed Nov 14, 2023

View reviewed changes

mds1 reviewed Nov 15, 2023

View reviewed changes

crates/abi/abi/HEVM.sol Outdated Show resolved Hide resolved

Inphi force-pushed the inphi/cheat-records branch 2 times, most recently from 1da7115 to aebbfa3 Compare November 15, 2023 20:43

refcell and others added 10 commits November 15, 2023 22:57

Record storage and account access cheatcodes

4d1f96c

Co-authored-by: James Wenzel <wenzel.james.r@gmail.com>

expand record access cheatcode interface

b7da13f

Adds a couple more fields to recorded account and storage accesses.

fix small doc comment nit

349469e

fix(cheatcodes): account access doc comment

924d700

fix(cheatcodes): clarify reverted account access status

5684e20

fix(cheatcodes): clarify balance doc comments

f10b03d

fix(cheatcodes): clarify initialized account access field in doc comment

7d2d0c2

update Access kind to include Resumed account access

03ab22f

Also rename cheats APIs

cleanup Resume logic

b1798d4

fmt

8f5cb9b

Inphi force-pushed the inphi/cheat-records branch from c48c525 to 8f5cb9b Compare November 16, 2023 04:00

Inphi added 2 commits November 15, 2023 23:13

remove unused Resume access kind

5a63bba

add chain_id to AccountAccess

055d3d9

Inphi marked this pull request as ready for review November 16, 2023 04:47

Inphi requested review from DaniPopes, Evalir and mattsse as code owners November 16, 2023 04:47

refcell reviewed Nov 16, 2023

View reviewed changes

crates/cheatcodes/spec/src/vm.rs Outdated Show resolved Hide resolved

Update crates/cheatcodes/spec/src/vm.rs

9748aff

Co-authored-by: refcell.eth <abigger87@gmail.com>

mds1 reviewed Nov 16, 2023

View reviewed changes

crates/abi/abi/HEVM.sol Outdated Show resolved Hide resolved

crates/cheatcodes/spec/src/vm.rs Outdated Show resolved Hide resolved

refcell approved these changes Nov 16, 2023

View reviewed changes

mattsse requested changes Nov 16, 2023

View reviewed changes

mds1 mentioned this pull request Nov 16, 2023

[Cheatcode] RecordCalls #6267

Closed

mds1 reviewed Nov 16, 2023

View reviewed changes

crates/cheatcodes/spec/src/vm.rs Show resolved Hide resolved

Inphi added 2 commits November 16, 2023 10:20

add ChainInfo struct; address PR comments

f34ebe5

avoid old skool ref mut

c454b3b

DaniPopes reviewed Nov 16, 2023

View reviewed changes

crates/cheatcodes/spec/src/vm.rs Show resolved Hide resolved

DaniPopes requested changes Nov 16, 2023

View reviewed changes

crates/cheatcodes/src/inspector.rs Outdated Show resolved Hide resolved

crates/cheatcodes/src/inspector.rs Outdated Show resolved Hide resolved

Inphi added 2 commits November 16, 2023 14:32

tidy code per pr review

5aff57d

rmeove unused import

95f0326

Inphi requested review from DaniPopes, mattsse and mds1 November 17, 2023 18:51

mattsse requested changes Nov 17, 2023

View reviewed changes

crates/cheatcodes/src/inspector.rs Outdated Show resolved Hide resolved

crates/cheatcodes/src/inspector.rs Outdated Show resolved Hide resolved

Inphi added 2 commits November 17, 2023 15:25

address nits

ca03ab0

selfdesutrct on record ctx check

d032d9f

mattsse approved these changes Nov 17, 2023

View reviewed changes

DaniPopes approved these changes Nov 17, 2023

View reviewed changes

mattsse merged commit c948388 into foundry-rs:master Nov 17, 2023
19 checks passed

Inphi mentioned this pull request Nov 17, 2023

feat: add storage and access record cheats foundry-rs/forge-std#481

Merged

sakulstra mentioned this pull request Nov 30, 2023

feat(cheatcodes): ability to capture and store state diffs #2846

Closed

ditto-eth mentioned this pull request Dec 7, 2023

fix(vm.cool) Persist storage changes #5852

Open

7 tasks

This was referenced Apr 26, 2024

Reset to upstream phylaxsystems/phoundry#12

Merged

feat/forge bin as lib phylaxsystems/phoundry#13

Merged

zerosnacks mentioned this pull request Jul 4, 2024

feat(cheatcodes): Record Account + Storage Access Cheatcode #6125

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(cheatcodes): Record Account and Storage Access Cheatcodes #6310

feat(cheatcodes): Record Account and Storage Access Cheatcodes #6310

Inphi commented Nov 14, 2023 •

edited

Loading

mds1 left a comment

mds1 Nov 14, 2023

Inphi Nov 15, 2023 •

edited

Loading

mds1 Nov 15, 2023

Inphi Nov 15, 2023 •

edited

Loading

mds1 Nov 15, 2023

Inphi Nov 15, 2023

mds1 Nov 15, 2023

mds1 Nov 16, 2023

refcell left a comment

mattsse left a comment

mds1 Nov 16, 2023

Inphi Nov 16, 2023 •

edited

Loading

mds1 Nov 16, 2023

Inphi Nov 16, 2023

DaniPopes Nov 16, 2023

mattsse left a comment

DaniPopes left a comment

mattsse commented Nov 17, 2023

refcell commented Nov 17, 2023

mds1 commented Nov 17, 2023

Inphi commented Nov 17, 2023

feat(cheatcodes): Record Account and Storage Access Cheatcodes #6310

feat(cheatcodes): Record Account and Storage Access Cheatcodes #6310

Conversation

Inphi commented Nov 14, 2023 • edited Loading

Motivation

Solution

Control-Flow Linking

mds1 left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Inphi Nov 15, 2023 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Inphi Nov 15, 2023 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

refcell left a comment

Choose a reason for hiding this comment

mattsse left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Inphi Nov 16, 2023 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

mattsse left a comment

Choose a reason for hiding this comment

DaniPopes left a comment

Choose a reason for hiding this comment

mattsse commented Nov 17, 2023

refcell commented Nov 17, 2023

mds1 commented Nov 17, 2023

Inphi commented Nov 17, 2023

Inphi commented Nov 14, 2023 •

edited

Loading

Inphi Nov 15, 2023 •

edited

Loading

Inphi Nov 15, 2023 •

edited

Loading

Inphi Nov 16, 2023 •

edited

Loading