[Rust] Support mutation #164

Zk2u · 2023-01-14T22:29:57Z

Support mutating a deserialised object as seen in google/flatbuffers#5772

Maybe add setters to [x]Ref to set fields?


TethysSvensson · 2023-01-15T17:58:48Z

It wouldn't be quite as simple. All deserialized objects currently contain a &[u8] slice inside them. To support mutability we would have to change that to either &mut [u8] or &[Cell<u8>] -- but if we do that, then users would no longer be able to deserialize immutable slices.

I think a better approach would be to add additional [x]Mut types, which support mutability. We can then make a decision about whether a &mut [u8] or &[Cell<u8>] would be better. I think &mut [u8] is potentially faster, but is also less ergonomic.

One compromise could be to make the reference types generic so they could work with multiple slice types.
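To make the trade-off concrete, here is a minimal sketch of the two backing stores for a single little-endian u32 field. CounterMut, CounterCell, and aliasing_demo are hypothetical names used only for illustration; they are not planus types, and the fixed offset stands in for whatever the generated code would look up in the vtable.

```rust
use std::cell::Cell;

/// Hypothetical view backed by an exclusive borrow (`&mut [u8]`).
struct CounterMut<'a> {
    buf: &'a mut [u8],
    offset: usize,
}

impl<'a> CounterMut<'a> {
    fn get(&self) -> u32 {
        let bytes: [u8; 4] = self.buf[self.offset..self.offset + 4].try_into().unwrap();
        u32::from_le_bytes(bytes)
    }

    /// Needs `&mut self`, so only one view can exist at a time.
    fn set(&mut self, value: u32) {
        self.buf[self.offset..self.offset + 4].copy_from_slice(&value.to_le_bytes());
    }
}

/// Hypothetical view backed by shared cells (`&[Cell<u8>]`).
#[derive(Clone, Copy)]
struct CounterCell<'a> {
    buf: &'a [Cell<u8>],
    offset: usize,
}

impl<'a> CounterCell<'a> {
    fn get(&self) -> u32 {
        let mut bytes = [0u8; 4];
        for (dst, src) in bytes.iter_mut().zip(&self.buf[self.offset..self.offset + 4]) {
            *dst = src.get();
        }
        u32::from_le_bytes(bytes)
    }

    /// Only needs `&self`, so copies of the view can all write.
    fn set(&self, value: u32) {
        for (dst, src) in self.buf[self.offset..self.offset + 4].iter().zip(value.to_le_bytes()) {
            dst.set(src);
        }
    }
}

/// Writes through the exclusive view, then through two aliasing cell views.
fn aliasing_demo() -> (u32, u32) {
    let mut storage = [0u8; 8];
    let first = {
        let mut view = CounterMut { buf: &mut storage, offset: 4 };
        view.set(41);
        view.get()
    };
    // Reinterpret the same bytes as a slice of cells.
    let cells = Cell::from_mut(&mut storage[..]).as_slice_of_cells();
    let a = CounterCell { buf: cells, offset: 4 };
    let b = a; // a second view of the same bytes
    b.set(a.get() + 1);
    (first, a.get())
}
```

The cell-based view is Copy and can be handed out freely, which is the ergonomic gain; the exclusive view writes four bytes with one copy_from_slice, which is where the potential speed advantage would come from.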

TethysSvensson · 2023-01-15T22:02:00Z

Another option would be to improve the Builder to support loading in pre-serialized objects and then mutating them. I'm thinking we could extend it with three features that combined give you what you are asking for (and more):

1. Allow deserializing a partial message inside a non-finished Builder. When you use this API you would not get an [x]Ref<'a> out, but rather an Offset<[x]> without a lifetime.
2. Allow poking at the primitives in the builder using the same API. This would work, since you already have a &mut Builder.
3. Allow memcpying an existing message into a builder while asserting that it has a certain type.

All of these functions should be suitably marked as being sound (as in no segfaults), but not safe (as in they will allow invalid messages and/or wrong interpretation of messages).
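As a rough illustration of the shape such an API could take, here is a toy sketch. ToyBuilder, copy_in, read_u32, and poke_u32 are all hypothetical names; this is not the planus Builder and it does not model the real FlatBuffers layout (it appends raw bytes and addresses fields by byte position). It only shows a lifetime-free Offset<T> handle and writes that are bounds-checked but not validated against any schema.

```rust
use std::marker::PhantomData;

/// Typed handle into the builder's bytes; carries no lifetime.
struct Offset<T> {
    pos: usize,
    _marker: PhantomData<T>,
}

/// Marker for the message type the caller claims the bytes have.
struct Counter;

/// Toy stand-in for a builder: just a growable byte buffer.
struct ToyBuilder {
    bytes: Vec<u8>,
}

impl ToyBuilder {
    fn new() -> Self {
        ToyBuilder { bytes: Vec::new() }
    }

    /// Copies an existing message in. The caller asserts it has type `T`;
    /// nothing is validated, so a wrong claim gives wrong data, not a crash.
    fn copy_in<T>(&mut self, message: &[u8]) -> Offset<T> {
        let pos = self.bytes.len();
        self.bytes.extend_from_slice(message);
        Offset { pos, _marker: PhantomData }
    }

    /// Reads a little-endian u32 at `field` bytes past the handle.
    fn read_u32<T>(&self, at: &Offset<T>, field: usize) -> Option<u32> {
        let start = at.pos.checked_add(field)?;
        let bytes = self.bytes.get(start..start.checked_add(4)?)?;
        Some(u32::from_le_bytes(bytes.try_into().ok()?))
    }

    /// Overwrites a u32 in place; returns None instead of writing out of bounds.
    fn poke_u32<T>(&mut self, at: &Offset<T>, field: usize, value: u32) -> Option<()> {
        let start = at.pos.checked_add(field)?;
        let slot = self.bytes.get_mut(start..start.checked_add(4)?)?;
        slot.copy_from_slice(&value.to_le_bytes());
        Some(())
    }
}
```

Because every access goes through checked slicing, a bad offset yields None rather than undefined behaviour, which matches the "sound but not safe" distinction above: memory safety is kept, message validity is not.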

TethysSvensson · 2023-01-17T08:04:10Z

As a work-around, perhaps something like this could be of use?

table Object {
  counter: uint32;
}

use object::{Object, ObjectRef};
use planus::ReadAsRoot;

#[path = "object_generated.rs"]
mod object;

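// Applies a change by round-tripping the buffer through the owned `Object` type.
fn mutate_object(input: &[u8]) -> planus::Result<Vec<u8>> {
    // Borrowed view over the serialized bytes.
    let object_ref: ObjectRef<'_> = ObjectRef::read_as_root(input)?;
    // Convert to the owned type, copying the fields out of the buffer.
    let mut object: Object = object_ref.try_into()?;
    object.counter += 1;

    // Re-serialize the modified object into a fresh buffer.
    let mut builder = planus::Builder::new();
    let finished = builder.finish(object, None);
    Ok(finished.to_vec())
}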

Note, however, that [x]Ref<'_> can have a lot of object reuse, while there is no such reuse for [x]. This means that the generated output might be bigger than the input. For some schemas it might even be exponentially bigger if the input is constructed maliciously, which would also mean exponential runtime and exponential memory use.
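The blow-up can be illustrated without planus at all. In the sketch below, SharedNode is a hypothetical stand-in for a table whose two child fields point at the same stored object; build_shared_chain and owned_node_count are likewise made-up helpers. Converting to an owned tree has to copy a shared child once per reference, so the count doubles at every level.

```rust
/// Stand-in for a serialized table with two child references.
/// Children are indices into the same storage, so they can be shared.
struct SharedNode {
    left: Option<usize>,
    right: Option<usize>,
}

/// Builds `depth` stored nodes where node i points twice at node i - 1.
fn build_shared_chain(depth: usize) -> Vec<SharedNode> {
    (0..depth)
        .map(|i| {
            let child = if i == 0 { None } else { Some(i - 1) };
            SharedNode { left: child, right: child }
        })
        .collect()
}

/// Counts the nodes an owned (non-sharing) copy rooted at `root` would contain.
fn owned_node_count(nodes: &[SharedNode], root: usize) -> u64 {
    let node = &nodes[root];
    1 + node.left.map_or(0, |c| owned_node_count(nodes, c))
        + node.right.map_or(0, |c| owned_node_count(nodes, c))
}
```

With 10 stored nodes, the owned copy rooted at the last node contains 2^10 - 1 = 1023 nodes; each extra stored node roughly doubles that.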

Zk2u · 2023-01-18T20:49:58Z

> As a work-around, perhaps something like this could be of use?

Yep, this is what I'm currently doing. It's not as optimal as a mutation API, but it works currently.

Zk2u · 2023-01-19T09:45:51Z

> All of these functions should be suitably marked as being sound (as in no segfaults), but not safe (as in they will allow invalid messages and/or wrong interpretation of messages).
>
> 1. Allow deserializing a partial message inside a non-finished Builder. When you use this API you would not get an [x]Ref<'a> out, but rather an Offset<[x]> without a lifetime.
> 2. Allow poking at the primitives in the builder using the same API. This would work, since you already have a &mut Builder.
> 3. Allow memcpying an existing message into a builder while asserting that it has a certain type.

Is there any way we can have an option to catch errors during this? This would be O(n) to check through, but in some cases that's acceptable.

Perhaps a better way to mutate data in a client-server architecture (where buffers from the client are untrusted) is to send over the field modifications from the client instead. This would make updates to large buffers a lot faster compared to an O(n) validation of the new buffer. Mutations would then only happen in a trusted environment where safety isn't so much of an issue. We would need to write this "diff" format as another FBS schema. You could then load part or all of the buffer that's being modified and mutate the fields without changing or deserialising anything else unless needed.
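A minimal sketch of what I mean, using plain Rust types rather than generated code. Object here is a hand-written stand-in for the owned type (the label field is made up for illustration), and FieldUpdate and apply_updates are hypothetical; in practice the update list would be its own FBS schema.

```rust
/// Hand-written stand-in for the owned object on the trusted side.
#[derive(Debug, Default, PartialEq)]
struct Object {
    counter: u32,
    label: String, // made-up second field, for illustration only
}

/// One field modification sent by the client instead of a whole buffer.
enum FieldUpdate {
    SetCounter(u32),
    SetLabel(String),
}

/// Applies the updates in order; only the named fields are touched.
fn apply_updates(target: &mut Object, updates: impl IntoIterator<Item = FieldUpdate>) {
    for update in updates {
        match update {
            FieldUpdate::SetCounter(value) => target.counter = value,
            FieldUpdate::SetLabel(value) => target.label = value,
        }
    }
}
```

The server only has to validate the (small) update list, not the whole buffer, since the buffer itself never leaves the trusted side.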

Could mutation work with non-scalar values if you load the whole message? That would mean inserting the new value at the offset and shifting the rest of the message to the right (perhaps updating any offsets as needed).

Also, if none of this makes sense, please forgive me. I don't know much about the internals of FBs yet. :)

Zk2u · 2023-01-23T12:09:44Z

The above is very much use-case specific for us. Hopefully this is a better reply...

> Another option would be to improve the Builder to support loading in pre-serialized objects and then mutating them. I'm thinking we could extend it with three features that combined give you what you are asking for...

Extending the builder would probably mean smaller code size too, as I don't think you'd need extra [x]Mut types in the generated code. What would the interface to this look like using the builder to mutate the types?

I'm guessing it would be a bit clumsier to use than the syntax currently used for creating messages, but we'd want it to be developer-friendly too.

> All of these functions should be suitably marked as being sound (as in no segfaults), but not safe (as in they will allow invalid messages and/or wrong interpretation of messages).

Validating the message in an O(n) way when loading would mean using [x]Mut types as the generic builder code doesn't know about the specific message schemas.
