Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Current state of rust library API and implementation #276

Open
ozgrakkurt opened this issue Dec 2, 2021 · 4 comments
Open

Current state of rust library API and implementation #276

ozgrakkurt opened this issue Dec 2, 2021 · 4 comments
Assignees

Comments

@ozgrakkurt
Copy link
Contributor

Currently rust api and implementation of the library isn't idiomatic rust code, which makes it awkward to read and understand.

I think we can change the library so it is like a normal rust library, but wrap it in javascript so It is convenient to use from javascript as well. I am currently working on a concrete proposal/PR to do this.

Also currently running clippy fails on the rust library and running fmt changes so much lines that it is hard to apply to the whole library and not break a lot of people's work.

See also:

@ozgrakkurt ozgrakkurt self-assigned this Dec 2, 2021
@rooooooooob
Copy link
Contributor

I am currently working on a concrete proposal/PR to do this.

Could you provide some details about this please @ozgrakkurt ?

After writing those comments in #264 I started trying something exploratory so that we can better weigh the options with what I mentioned in #264's comment about using a macro. I'm trying to see what could be done via procedural macros to make wrappers similar to the wrappers I linked in js-chain-libs. My hope is that it could even do parameter and error conversions but I'm a noob at proc macros so we'll have to see how it goes.

I don't want to commit to any approach at this moment but I wanted to explore this option at least to give us a better idea. This would help the usage from rust as mentioned but the usage from js would still be the same as it is now (i.e. not great).

Also I'm linking these wasm-bindgen issues here since they're highly relevant to making the JS experience better:

rustwasm/wasm-bindgen#111
rustwasm/wasm-bindgen#1742

We need to remember that the npm package users are far, far more numerous than the rust crate users as well.

@rooooooooob
Copy link
Contributor

@ozgrakkurt I was thinking about this more. We're going to need some kind of wasm_bindgen interface no matter what, whether that be auto-conversions applied programatically, or whether it be hand-written wrappers (e.g. how js-chain-libs or Ergo's sigma-rust wasm bindings work), or whether we use code-gen to do it for us.

This interface would be necessary whether we use it as the main interface (i.e. now), as well as if we try and write some JS around it to make usage nicer from JS. One important thing to keep in mind for the latter is I still can't think of a way to make memory management not utter garbage since there's no refcounting/destructors. The only way to get around that is if the rust object is temporary (e.g. is created and then detroyed in a single function for serialize() or deserialize(), or for calling crypto code, etc) but then you're still writing the non-(de)serialization code over again in JS + the conversion code between both formats which sounds like a lot. If we do just the first option then the JS is still just as awkward to use, and only the rust API has been improved. So I think unless we have a lot of JS dev time that we're stuck with a subpar JS experience.

I was hesitant about hand-writing wrappers because:

  1. You'd have comments in multiple places and that would likely lead to someone changing one but not the other by accident, etc.
  2. You have to think not just about now, but what about in the future when we need to add code (e.g. upcoming hardforks)? We've traditionally used cddl-codegen to generate 90%+ of our code from IOHK's binary CDDL spec and then we've added in useful functionality on top.

I guess Point 1. is mostly valid if we're directly consuming the wrappers rather than having a more idiomatic-JS library on top of the direct wasm_bindgen wrappers.

Point 2. has wider-spread implications regardless of how we do the wrappers since we still need to generate code for most structs unless you feel like hand-writing 1000+ lines of CDDL code for hardforks and hoping there's no mistakes. It's worth noting that you could still use the tool to generate the serialization (CBOR) components that are mostly in serialization.rs and then hand-write (or hand-edit the generated code) for everything that would be in lib.rs. If we do that then some of the serialization code would be broken depending on how far we are transforming the library to idiomatic rust.

So that brings an important question: what do we imagine this library looking like once we transform into idiomatic rust code? I assume we stop making those map and wrapper classes like Assets or ScriptHashes which are wrappers around Vec<T> BTreeMap<T, U> or even linked_hash_map::LinkedHashMap<T, U> in some places. What about access? Are we keeping it getters/setters everywhere or are going to change (some/most/all of the cbor structs whose entire definition is basically just their fields?) it to public fields? Changing parameters to be value (so we can move values into instead of cloning)? What about returns? There are traits we could implement that would be helpful for usage from rust too. Do we just axe the BigNum type entirely then and swap it for u64? What else did you have in mind?

As for the auto-wrapper-gen I was looking into proc macros which could wrap the definitions. They could return the same token stream on rust builds and on wasm builds could transform the methods a bit since you get the whole method body AST when parsing with the syn crate which with quote could be inserted into the new body with some kind of wrapper or by a conversion trait implementation. I tried writing one that would auto-implement to do nothing for most traits and only for things like Result<T, E> would have a custom one but it looks like helpful things like having trait default types (not just function bodies) is still unstable. So I think it might be more fruitful to do it just through the macro, although it's very ugly doing so since you're dealing mostly with raw ASTs. syn greatly helps though but only to an extent. Using syn you could do automatic public traits to wasm-compatible getters/setters by wrapping the struct definition in one. Comments are preserved using these macros.

@ozgrakkurt
Copy link
Contributor Author

@rooooooooob I am currently thinking of it like we make a core rust library which has all the functionality in idiomatic rust.
Then we write a thin wasm wrapper around it, sort of like writing a ffi for it. This means we need comments on the user facing wasm_bindgen layer which should be a small surface.

I think typedefs to BTreeMap<T, U> are fine. But we can make fields public maybe. Then the wrapper can define setters and getters.

Sorry I am not able to make much progress on this now

@rooooooooob
Copy link
Contributor

I guess for the comment-preservation argument you could argue that it's possible there would be some parts of it that would be different between rust/js but I think 98% would be identical.

I am currently thinking of it like we make a core rust library which has all the functionality in idiomatic rust.

It would help if you could list all the planned changes once you think of them so that we can discuss them and see how they would affect writing wasm wrappers (or auto-generating them) over top of them. Depending on the size of the next hardfork it might be worth making minor changes to the cddl-codgen tool to instead of generating the wasm_bindgen/rust compromise to instead generate more idiomatic rust (if it's small changes) and possibly even generate wrappers, depending on how we decide to handle those.

I think typedefs to BTreeMap<T, U> are fine. But we can make fields public maybe. Then the wrapper can define setters and getters.

We'd still have to generate the wasm_bindgen-compatible wrappers since those templated types can't be directly exposed to wasm with the exception of Vec<T> but only when T is a primitive type sendable to wasm like u32. Even Vec<String> isn't allowed for some reason. You'd have to double-check the list of types that implement to ToWasmABI/etc traits in wasm_bindgen though.

Sorry I am not able to make much progress on this now

Don't worry at all! This isn't super urgent and is something that should be very carefully thought through first as it would be a super big refactor/API breakage. I was mostly just concerned by your "I am currently working on a concrete proposal/PR to do this" comment since I didn't want us to jump into any particular approach without comparing all the different possibilities and their pros/cons and having a discussion between other devs familiar with the lib. I wasn't trying to make this happen ASAP but rather the opposite. We shouldn't jump into sudden heavily breaking changes.

rooooooooob added a commit that referenced this issue Dec 10, 2021
Profiling the tx builder showed that even after we removed the
fake_key_root construction in #214 due to terrible performance in asm.js
(#213) that the cryptoxide calls within `fake_full_tx()` were up a
ridiculous amount of the runtime. I investigated this as we had a report
that it was taking up to hundreds of milliseconds just to make a tx and
I wanted to see if there would be any significant improvement by
migrating to an idiomatic rust API as discussed in #276 but from a
purely performance perspective in a release build (opt-level=3) this seemed to
be pretty minor, even with the unnecessary cloning forced by this
inside of the implementation.

This will need to be slightly modified once #273 gets merged as there are
some conflicts.

Previous perf results from building 100,000 simple txs and serializing
them:
```
-   53.64%        [.] <&cryptoxide::curve25519::Fe as
core::ops::arith::Mul>::mul
    - 53.63% 0xffffffffffffffff
      + 23.03% cardano_serialization_lib::tx_builder::min_fee
      - 21.32%
cardano_serialization_lib::tx_builder::TransactionBuilder::add_change_if_needed
          - 15.21%
cardano_serialization_lib::tx_builder::TransactionBuilder::fee_for_output
              cardano_serialization_lib::tx_builder::min_fee
            + cardano_serialization_lib::tx_builder::fake_full_tx
          + 6.11%
cardano_serialization_lib::tx_builder::TransactionBuilder::min_fee
(inlined)
      - 7.78% ser_lib_perf::main
          + 6.11%
cardano_serialization_lib::tx_builder::TransactionBuilder::build
          + 1.67%
cardano_serialization_lib::tx_builder::TransactionBuilder::add_change_if_needed
      + 1.46% std::rt::lang_start_internal
-   19.40%        [.] cryptoxide::curve25519::Fe::square
    - 0xffffffffffffffff
      - 8.34% cardano_serialization_lib::tx_builder::min_fee
            cardano_serialization_lib::tx_builder::TransactionBuilder::build
            cardano_serialization_lib::tx_builder::TransactionBuilder::build_and_size
          + cardano_serialization_lib::tx_builder::fake_full_tx
      - 7.41%
cardano_serialization_lib::tx_builder::TransactionBuilder::add_change_if_needed
          + 5.58%
cardano_serialization_lib::tx_builder::TransactionBuilder::fee_for_output
          + 1.83%
cardano_serialization_lib::tx_builder::TransactionBuilder::min_fee
(inlined)
      + 2.77% ser_lib_perf::main
      + 0.88% std::rt::lang_start_internal
+    7.80%        [.] cryptoxide::curve25519::GePrecomp::maybe_set
+    5.13%        [.] cryptoxide::curve25519::Fe::invert
+    3.98%        [.] <&cryptoxide::curve25519::GeP3 as
core::ops::arith::Add<&cryptoxide::curve25519::GePrecomp>>::add
+    2.84%        [.] cryptoxide::curve25519::GePrecomp::select
+    1.22%        [.] cryptoxide::curve25519::ge_scalarmult_base
+    1.00%        [.]
cryptoxide::sha2::impl512::reference::digest_block_u64
      0.47%        [.] cryptoxide::curve25519::GeP2::dbl
      0.13%        [.] cryptoxide::curve25519::sc_muladd
      0.12%        [.] cryptoxide::curve25519::Fe::to_bytes
      0.12%        [.] cryptoxide::curve25519::sc_reduce
      0.10%        [.] cbor_event::se::Serializer<W>::write_type
      0.07%        [.]
cardano_serialization_lib::tx_builder::fake_full_tx
```

which shows that almost the entire runtime was spent on cryptoxide calls
which were only necessary for `fake_full_tx()`.

Running it again afterwards there is no overwhelming bottleneck anymore
and the remaining runtime was fairly distributed.
vsubhuman pushed a commit that referenced this issue Dec 12, 2021
Profiling the tx builder showed that even after we removed the
fake_key_root construction in #214 due to terrible performance in asm.js
(#213) that the cryptoxide calls within `fake_full_tx()` were up a
ridiculous amount of the runtime. I investigated this as we had a report
that it was taking up to hundreds of milliseconds just to make a tx and
I wanted to see if there would be any significant improvement by
migrating to an idiomatic rust API as discussed in #276 but from a
purely performance perspective in a release build (opt-level=3) this seemed to
be pretty minor, even with the unnecessary cloning forced by this
inside of the implementation.

This will need to be slightly modified once #273 gets merged as there are
some conflicts.

Previous perf results from building 100,000 simple txs and serializing
them:
```
-   53.64%        [.] <&cryptoxide::curve25519::Fe as
core::ops::arith::Mul>::mul
    - 53.63% 0xffffffffffffffff
      + 23.03% cardano_serialization_lib::tx_builder::min_fee
      - 21.32%
cardano_serialization_lib::tx_builder::TransactionBuilder::add_change_if_needed
          - 15.21%
cardano_serialization_lib::tx_builder::TransactionBuilder::fee_for_output
              cardano_serialization_lib::tx_builder::min_fee
            + cardano_serialization_lib::tx_builder::fake_full_tx
          + 6.11%
cardano_serialization_lib::tx_builder::TransactionBuilder::min_fee
(inlined)
      - 7.78% ser_lib_perf::main
          + 6.11%
cardano_serialization_lib::tx_builder::TransactionBuilder::build
          + 1.67%
cardano_serialization_lib::tx_builder::TransactionBuilder::add_change_if_needed
      + 1.46% std::rt::lang_start_internal
-   19.40%        [.] cryptoxide::curve25519::Fe::square
    - 0xffffffffffffffff
      - 8.34% cardano_serialization_lib::tx_builder::min_fee
            cardano_serialization_lib::tx_builder::TransactionBuilder::build
            cardano_serialization_lib::tx_builder::TransactionBuilder::build_and_size
          + cardano_serialization_lib::tx_builder::fake_full_tx
      - 7.41%
cardano_serialization_lib::tx_builder::TransactionBuilder::add_change_if_needed
          + 5.58%
cardano_serialization_lib::tx_builder::TransactionBuilder::fee_for_output
          + 1.83%
cardano_serialization_lib::tx_builder::TransactionBuilder::min_fee
(inlined)
      + 2.77% ser_lib_perf::main
      + 0.88% std::rt::lang_start_internal
+    7.80%        [.] cryptoxide::curve25519::GePrecomp::maybe_set
+    5.13%        [.] cryptoxide::curve25519::Fe::invert
+    3.98%        [.] <&cryptoxide::curve25519::GeP3 as
core::ops::arith::Add<&cryptoxide::curve25519::GePrecomp>>::add
+    2.84%        [.] cryptoxide::curve25519::GePrecomp::select
+    1.22%        [.] cryptoxide::curve25519::ge_scalarmult_base
+    1.00%        [.]
cryptoxide::sha2::impl512::reference::digest_block_u64
      0.47%        [.] cryptoxide::curve25519::GeP2::dbl
      0.13%        [.] cryptoxide::curve25519::sc_muladd
      0.12%        [.] cryptoxide::curve25519::Fe::to_bytes
      0.12%        [.] cryptoxide::curve25519::sc_reduce
      0.10%        [.] cbor_event::se::Serializer<W>::write_type
      0.07%        [.]
cardano_serialization_lib::tx_builder::fake_full_tx
```

which shows that almost the entire runtime was spent on cryptoxide calls
which were only necessary for `fake_full_tx()`.

Running it again afterwards there is no overwhelming bottleneck anymore
and the remaining runtime was fairly distributed.
vsubhuman pushed a commit that referenced this issue Jan 10, 2022
Profiling the tx builder showed that even after we removed the
fake_key_root construction in #214 due to terrible performance in asm.js
(#213) that the cryptoxide calls within `fake_full_tx()` were up a
ridiculous amount of the runtime. I investigated this as we had a report
that it was taking up to hundreds of milliseconds just to make a tx and
I wanted to see if there would be any significant improvement by
migrating to an idiomatic rust API as discussed in #276 but from a
purely performance perspective in a release build (opt-level=3) this seemed to
be pretty minor, even with the unnecessary cloning forced by this
inside of the implementation.

This will need to be slightly modified once #273 gets merged as there are
some conflicts.

Previous perf results from building 100,000 simple txs and serializing
them:
```
-   53.64%        [.] <&cryptoxide::curve25519::Fe as
core::ops::arith::Mul>::mul
    - 53.63% 0xffffffffffffffff
      + 23.03% cardano_serialization_lib::tx_builder::min_fee
      - 21.32%
cardano_serialization_lib::tx_builder::TransactionBuilder::add_change_if_needed
          - 15.21%
cardano_serialization_lib::tx_builder::TransactionBuilder::fee_for_output
              cardano_serialization_lib::tx_builder::min_fee
            + cardano_serialization_lib::tx_builder::fake_full_tx
          + 6.11%
cardano_serialization_lib::tx_builder::TransactionBuilder::min_fee
(inlined)
      - 7.78% ser_lib_perf::main
          + 6.11%
cardano_serialization_lib::tx_builder::TransactionBuilder::build
          + 1.67%
cardano_serialization_lib::tx_builder::TransactionBuilder::add_change_if_needed
      + 1.46% std::rt::lang_start_internal
-   19.40%        [.] cryptoxide::curve25519::Fe::square
    - 0xffffffffffffffff
      - 8.34% cardano_serialization_lib::tx_builder::min_fee
            cardano_serialization_lib::tx_builder::TransactionBuilder::build
            cardano_serialization_lib::tx_builder::TransactionBuilder::build_and_size
          + cardano_serialization_lib::tx_builder::fake_full_tx
      - 7.41%
cardano_serialization_lib::tx_builder::TransactionBuilder::add_change_if_needed
          + 5.58%
cardano_serialization_lib::tx_builder::TransactionBuilder::fee_for_output
          + 1.83%
cardano_serialization_lib::tx_builder::TransactionBuilder::min_fee
(inlined)
      + 2.77% ser_lib_perf::main
      + 0.88% std::rt::lang_start_internal
+    7.80%        [.] cryptoxide::curve25519::GePrecomp::maybe_set
+    5.13%        [.] cryptoxide::curve25519::Fe::invert
+    3.98%        [.] <&cryptoxide::curve25519::GeP3 as
core::ops::arith::Add<&cryptoxide::curve25519::GePrecomp>>::add
+    2.84%        [.] cryptoxide::curve25519::GePrecomp::select
+    1.22%        [.] cryptoxide::curve25519::ge_scalarmult_base
+    1.00%        [.]
cryptoxide::sha2::impl512::reference::digest_block_u64
      0.47%        [.] cryptoxide::curve25519::GeP2::dbl
      0.13%        [.] cryptoxide::curve25519::sc_muladd
      0.12%        [.] cryptoxide::curve25519::Fe::to_bytes
      0.12%        [.] cryptoxide::curve25519::sc_reduce
      0.10%        [.] cbor_event::se::Serializer<W>::write_type
      0.07%        [.]
cardano_serialization_lib::tx_builder::fake_full_tx
```

which shows that almost the entire runtime was spent on cryptoxide calls
which were only necessary for `fake_full_tx()`.

Running it again afterwards there is no overwhelming bottleneck anymore
and the remaining runtime was fairly distributed.

(cherry picked from commit e0b33c6)
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants