-
Notifications
You must be signed in to change notification settings - Fork 68
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add fuzzing support with docs; RawVal comparison proptests #957
Conversation
The cargo-deny CI error is about duplicate crates. I'll investigate another time. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Wow this is awesome. I've started reviewing but haven't finished yet and will have a play with it. The integration with proptest will be helpful.
I'm also particularly interested in some of those invalid RawVal cases, especially Vec (and Map) types where the contents aren't all the same type, because that's a valid Vec, for things like tuples or argument lists.
I looked into the cargo-deny errors, and pushed two commits to resolve them, but they may not be acceptable. The first commit just upgrades proptest, which resolves the quick-error dupe. The second adds rand 0.8 and its dependencies to the deny.toml ban whitelist, allowing dupes of rand, rand_core, rand_chacha, and getrandom. The problem here seems to be that all revisions of ed25519-dalek are on rand 0.7, while all revisions of proptest are on rand 0.8. I added comments to deny.toml indicating that proptest is only a dev-dependency so the dupes won't cause contract bloat, but it's possible that by being on a whitelist rand dupes could sneak into contracts in the future. I'm still poking at other solutions to the duplicated rand crate, but am not super optimistic. edit: it looks like curve25519-dalek 4.0 upgrades rand, but that crate is not released yet. |
I found something I need to change here: I have implemented Arbitrary for U256Val, but I should be using U256 instead. |
684bf79
to
ea350a0
Compare
I rebased this and made the following changes:
I also discovered that |
It's also worth noting that |
It looks like this is the issue for Timepoint and Duration: stellar/rs-soroban-env#724 Also indicates that the standard Rust String will be expected to be usable in contracts in the future, so will need implementations for Soroban Arbitrary. It's worth noting that this SorobanArbitrary is definitely a maintenance burden, as every new type that is expected to be used in contracts needs to be given explicit compatibility; and every type that might be stored in a UDT needs an implementation, and probably tests verifying they work, and I am still finding bugs and omissions like mentioned in previous comments. I expect that there will be bugs reported against it by people using UDTs in ways I have not anticipated. |
8b6210b
to
6017207
Compare
Rebased. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
👏🏻 Thanks @brson! This looks great.
There's a couple minor things with the testutils features I've commented inline, and I'll push fixes for because they're trivial to fix.
I've been playing around with it with a simple contract, and run into a couple issues. What I'm going to do is push a contract into this PR with some comments in the contract with the issue.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I pushed an example that uses the fuzzer in 22dd445. I think it'd be helpful for us to have a complete example in the test vectors of this repo. We've tried to keep the test vectors few and wherever possible keep any tests inside the sdk crate, but given that fuzzing is an out-of-crate experience, I think a test vector is suitable.
I've hit some compile errors I'm not sure what to do with though. See inline comment.
Also note the example I pushed isn't a particularly good one, I was just trying to get it working before doing anything meaningful.
I pushed some more changes so that the sdk doesn't ever generate testutils dependent code if it isn't dependent on testutils. Also, cleaned up the example that I had added. |
I think there's three things left we need to figure a story for:
|
Pushed a fix for the arbitrary re-export issue. It's not beautiful, but it works. The Arbitrary macro doesn't path-qualify the code it generates, so it assumes there'll be a module |
The solution to the multiple crate-types situation suggested by cargo devs is to build our wasms with |
I think that's reasonable. We can make that the default in docs for how to build .wasm files, and contracts can be rlib otherwise. We can keep the builds in this repo as is for the convenience of the workspace, and update the soroban-docs to show a setup without a crate-type in the toml, and using that new command. |
@leighmcculloch as far as renaming it |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The only thing that seems a little odd / unmotivated to me is the specialized typed vec and map<foo,foo> types rather than just vec and map<val,val> but it's not a big deal and I can imagine them plausibly being useful at some point, no need to delay further on that basis.
Overall this is great stuff! Thanks so much.
If this is referring to the giant list of types that might be generated for an arbitrary Val, then I agree it's not ideal. Just getting something working that covers a bunch of cases. It can be improved in the future. The current Val generator can be considered an implementation detail. |
What
Preliminary support for fuzzing Soroban contracts with cargo-fuzz. This patch is primarily concerned with introducing a pattern for implementing the Arbitrary trait, by which fuzz tests generate semi-random Rust values.
This is a new revision of previous pr #878. Little has changed functionally since that pr.
This patch additionally uses the Arbitrary impls with
proptest-arbitrary-interop
crate to add proptests that help ensure that:RawVals and ScVals can be converted between each other and their their comparison functions are equivalent, which closes stellar/rs-soroban-env#743.
This patch introduces the SorobanArbitrary trait which looks like this:
Basically every type relevant to soroban contracts implements (or should) this trait, including i32, u32, i64, u64, i128, u128, (), bool, I256Val, U256Val, Error, Bytes, BytesN, Vec, Map, Address, Symbol, TimepointVal, DurationVal.
The
#[contracttype]
macro automatically derives an implementation, along with a type that implementsSorobanArbitrary::Prototype
.In use the trait looks like
A more complete example is at https://github.com/brson/rs-soroban-sdk/blob/val-fuzz/soroban-sdk/fuzz/fuzz_targets/fuzz_rawval_cmp.rs
Why
This patch assumes it is desirable to fuzz Soroban contracts with cargo-fuzz.
Soroban reference types can only be constructed with an
Env
, but theArbitrary
trait constructs values from nothing but bytes. TheSorobanArbitrary
trait provides a pattern to bridge this gap, expecting fuzz tests to constructSorobanArbitrary::Prototype
types, and then convert them to their final type withFromVal
/IntoVal
.There are a lot of docs here and hopefully they explain what's going on well enough.
fuzz_catch_panic
This patch also introduces a helper function,
fuzz_catch_panic
, which is built off of thecall_with_suppressed_panic_hook
function in soroban-env-host.The
fuzz_target!
macro overrides the Rust panic hook to abort on panic, assuming all panics are bugs, but Soroban contracts fail by panicking, and I have found it desirable to fuzz failing contracts.fuzz_catch_panic
temporarily prevents the fuzzer from aborting on panic.Known limitations
The introduction of SorobanArbitrary requires a bunch of extra documentation to explain why Soroban contracts can't just use the stock Arbitrary trait.
As an alternative to this approach, we could instead expect users to construct XDR types, not SorobanArbitrary::Prototype types, and convert those to RawVals. I have been assuming that contract authors should rather concern themselves with contract types, and not the serialization types; and presently there are a number of XDR types that have values which can't be converted to contract types. The SorobanArbitrary::Prototype approach does allow more flexibility in the generation of contract types, e.g. we can generate some adversarial types like invalid object references and bad tags.
Contracts must use
IntoVal
to create the final types, but these traits are under-constrained for this purpose and always require type hints:This is quite unfortunate because it means the real type must be named twice.
This patch introduces a new private module
arbitrary_extra
which simply defines a bunch of new conversions likeThese conversions are required by
SorobanArbitrary
, which is only defined whencfg(feature = "testutils")
; thearbitrary_extra
module defines these conversions for all cfgs to ensure type inference is always the same, but the impls are probably useless outside of the SorobanArbitrary prototype pattern.Crates that use
#[contracttype]
need to define a "testutils" feature if they want to use Arbitrary.This patch doesn't generate "adversarial" values, which might include:
The arbitrary module has unit tests, and these help especially ensure the macros for custom types are correct, but the tests are not exhaustive.