Speed up BSON unmarshalling #410

craiggwilson · 2017-03-17T15:16:14Z

Currently when unmarshalling into a struct which is missing a field or into bson.Raw, the unmarshaller reads the parts that should be skipped into blackHole. This allows for the entire document to be checked for corruption at the expense of speed. In particular for bson.Raw, it also means that every document is unmarshalled twice, once at the initial time of reading, and later when the bson.Raw is unmarshalled into its final form. Effectively, this PR stops doing that. It verifies the element that is getting skipped but doesn't descend into them. This is particularly relevent for containers like arrays and documents.

The effect is a massive speedup (I've measured up to 6x) depending on the complexity of documents when using commands that returns cursors as arrays. This would be the new find command and the aggregate command. The downside is that the corruption message appears later in the program than it used to and sometimes a corruption message may not occur if a field is ignored or a bson.Raw is never ultimately unmarshalled. I feel these are acceptable trade-offs.

As part of verifying, I've implemented the entire bson_corpus as generated code which is checked in (so as long as the corpus doesn't change, no need to regenerate).

…lling.

…at arose.

…ping elements.

fmpwizard · 2017-07-21T00:53:04Z

If you close and reopen the PR, travis will rerun this build and thanks to #462 , your PR should pass all tests on all mongodb versions

craiggwilson · 2017-07-21T01:58:46Z

oh, probably need to rebase...

craiggwilson added 3 commits February 28, 2017 10:57

skipping elements/documents when decoding to raw to speed up unmarsha…

edb1827

…lling.

implemented bson_corpus tests and fixed the small number of issues th…

e73b066

…at arose.

fixed issue with skipping binary and added decode only tests for skip…

ad249b3

…ping elements.

craiggwilson closed this Jul 21, 2017

craiggwilson reopened this Jul 21, 2017

craiggwilson closed this Sep 13, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Speed up BSON unmarshalling #410

Speed up BSON unmarshalling #410

craiggwilson commented Mar 17, 2017

fmpwizard commented Jul 21, 2017

craiggwilson commented Jul 21, 2017

Speed up BSON unmarshalling #410

Speed up BSON unmarshalling #410

Conversation

craiggwilson commented Mar 17, 2017

fmpwizard commented Jul 21, 2017

craiggwilson commented Jul 21, 2017