dist: Support a bincoded manifest file for performance reasons #2627

kinnison · 2021-01-02T12:42:31Z

This goes some of the way to mitigating #2626 but isn't a "fix" per se.

Not least, we need to be sure of whether this is valid.

Signed-off-by: Daniel Silverstone <dsilvers@digital-scurf.org>

kinnison · 2021-01-09T13:17:19Z

I've rewritten this to serialise the parsed manifest as a bincoded file. Performance is basically the same as toml-parsing the trimmed manifest, but it doesn't involve trimming, whose correctness was debatable.

We need to introduce a version indicator for this so that, if our manifest structures change, we can detect that we should fall back to reading the toml and rewrite the bincode.
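One way such a version indicator could look is sketched below, using only the standard library. The magic bytes, the version constant, and the names `wrap_cache` and `unwrap_cache` are all hypothetical and not part of this PR; the payload stands in for whatever the bincode serialiser produces.

```rust
/// Hypothetical magic bytes identifying the cache file (an assumption).
const CACHE_MAGIC: &[u8; 4] = b"RMCV";
/// Hypothetical format version; bump it whenever the cached structures change.
const CACHE_VERSION: u32 = 1;

/// Prefix an already-serialised payload with the magic bytes and the version.
pub fn wrap_cache(payload: &[u8]) -> Vec<u8> {
    let mut out = Vec::with_capacity(8 + payload.len());
    out.extend_from_slice(CACHE_MAGIC);
    out.extend_from_slice(&CACHE_VERSION.to_le_bytes());
    out.extend_from_slice(payload);
    out
}

/// Return the payload only if the header matches the current version.
/// `None` means the caller should re-read the toml and rewrite the cache.
pub fn unwrap_cache(bytes: &[u8]) -> Option<&[u8]> {
    if bytes.len() < 8 || bytes[..4] != CACHE_MAGIC[..] {
        return None;
    }
    let mut raw = [0u8; 4];
    raw.copy_from_slice(&bytes[4..8]);
    if u32::from_le_bytes(raw) == CACHE_VERSION {
        Some(&bytes[8..])
    } else {
        None
    }
}
```

With a header like this, a cache written by an older or newer build is simply treated as absent rather than being decoded into the wrong structures.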

rbtcollins · 2021-01-11T09:40:04Z

Let's use flatbuf, not bincode. https://www.reddit.com/r/rust/comments/cmq6k9/bincode_for_other_languages/ vs https://google.github.io/flatbuffers/flatbuffers_support.html

kinnison · 2021-01-11T09:46:50Z

Is there a flatbuffers crate with serde support?

rbtcollins · 2021-01-11T09:45:21Z

src/utils/raw.rs


    file.sync_data()?;

    Ok(())
 }

+pub fn write_file(path: &Path, contents: &str) -> io::Result<()> {


I don't think this pays for itself vs write_file(path, contents.as_bytes())?;

Fair enough, I'll sort out a refactor commit alongside this which pushes that up to the call sites.

rbtcollins · 2021-01-11T09:47:04Z

src/utils/raw.rs

@@ -52,20 +52,24 @@ pub fn if_not_empty<S: PartialEq<str>>(s: S) -> Option<S> {
    }
 }

-pub fn write_file(path: &Path, contents: &str) -> io::Result<()> {
+pub fn write_file_bytes(path: &Path, contents: &[u8]) -> io::Result<()> {


I note this is doing a sync_data - this is an important part of the contract of the function; if we're renaming it perhaps consider exposing that at the same time - e.g. namespacing it or adding _synced or something.

I agree with this idea and will sort it out
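A minimal sketch of what the renamed helper might look like, assuming the `_synced` suffix suggested above. The name `write_file_synced` and the use of `File::create` are assumptions; the real function body is not shown in this thread beyond the `sync_data` call.

```rust
use std::fs::File;
use std::io::{self, Write};
use std::path::Path;

/// Write `contents` to `path` and flush the data to disk before returning.
/// The `_synced` suffix makes the durability part of the contract visible.
pub fn write_file_synced(path: &Path, contents: &[u8]) -> io::Result<()> {
    // Assumption: create or truncate the file; the original may open it differently.
    let mut file = File::create(path)?;
    file.write_all(contents)?;
    // sync_data asks the OS to flush the file's contents to storage.
    file.sync_data()?;
    Ok(())
}
```

Callers holding a string would then pass `contents.as_bytes()` at the call site, which is the refactor discussed in the previous review comment.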

rbtcollins · 2021-01-11T10:11:59Z

There is for flexbuffers - https://github.com/google/flatbuffers/tree/master/rust/flexbuffers - but I'm not sure of the story for flatbuffers.

kinnison · 2021-01-11T15:55:04Z

Okay, so flexbuffers look plausible vs. bincode. Though, as it's an internal cache implementation detail, why are you adamant we shouldn't use bincode?

rbtcollins · 2021-01-12T14:18:33Z

If we need to debug it or introspect it, flatbuffers has more tooling available, as it isn't rust-only with relatively few users. Ditto flexbuffers; flatbuffers is the schema'd version. I don't think the lack of serde support should be an issue, though I haven't looked into it closely. An alternative would be protobuf; the tower protobuf glue is pretty nice.

kinnison · 2021-01-12T19:44:18Z

I'm concerned about minimising the impact of the effort if we're to do this soon. I was thinking of treating the binary as a cache and, if it failed to load, falling back to the toml. The serde capability just means it's much less effort for us in terms of implementation.

Debuggability is a good argument against bincode though. Flexbuffers look plausible if a bit more awkward to implement than bincode, yaml, json, etc.
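The cache-with-fallback idea could be sketched roughly as follows, independent of which binary format is chosen. The function `load_with_cache` and its `decode`, `parse`, and `encode` hooks are hypothetical stand-ins for the binary decoder, the toml parser, and the binary encoder; none of them come from this PR.

```rust
use std::fs;
use std::io;
use std::path::Path;

/// Load a value from `cache_path` if it decodes; otherwise parse
/// `source_path` and refresh the cache on a best-effort basis.
pub fn load_with_cache<T>(
    cache_path: &Path,
    source_path: &Path,
    decode: impl Fn(&[u8]) -> Option<T>,
    parse: impl Fn(&str) -> io::Result<T>,
    encode: impl Fn(&T) -> Vec<u8>,
) -> io::Result<T> {
    // A missing, unreadable, or undecodable cache is a miss, never an error.
    if let Some(value) = fs::read(cache_path).ok().and_then(|b| decode(&b[..])) {
        return Ok(value);
    }
    // Slow path: parse the authoritative text file.
    let value = parse(&fs::read_to_string(source_path)?)?;
    // A failed cache write must not fail the load, so the result is ignored.
    let _ = fs::write(cache_path, encode(&value));
    Ok(value)
}
```

Note that this sketch does not check whether the cache is stale relative to the toml; it only covers the "failed to load" case described above.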

bjorn3 · 2021-02-24T09:04:05Z

Cargo also uses bincode for certain caches like fingerprints. Flatbuffers having a schema would make the caches a bit bigger I think and will likely encourage others to inspect this implementation detail.

kinnison mentioned this pull request Jan 2, 2021

improve rustc wrapper startup time? #2626

Open

manifest: Support a bincoded channel manifest for performance

b7a2ccd

Signed-off-by: Daniel Silverstone <dsilvers@digital-scurf.org>

kinnison force-pushed the do-less-toml branch from c356122 to b7a2ccd Compare January 9, 2021 13:15

kinnison changed the title ~~dist: Trim the manifest toml to improve startup time~~ dist: Support a bincoded manifest file for performance reasons Jan 9, 2021

rbtcollins reviewed Jan 11, 2021

kinnison mentioned this pull request Nov 11, 2021

Optimization: parse manifest only once #2898

Merged

3 tasks

kinnison closed this Mar 24, 2024
