-
Notifications
You must be signed in to change notification settings - Fork 538
plumbing: packfile, apply small object reading optimization also for delta objects #1121
Conversation
…delta objects Signed-off-by: Filip Navara <navara@emclient.com>
ac0f634
to
980018a
Compare
Signed-off-by: Filip Navara <navara@emclient.com>
9abc1a1
to
72b97da
Compare
…ting new one Signed-off-by: Filip Navara <filip.navara@gmail.com>
I'm investigating a decrease in speed when using these changes it gitbase. When iterating all trees with a cold cache it's faster with these changes (35s vs 25s). We keep the cache and in a second run the previous go-git code is faster (16s vs 25s). It seems the cache is not being filled correctly and in one profile I can see that |
There's couple of flaws I didn't realize when coding this. Firstly, it slightly regresses scenario where someone was accessing the small delta objects only to check the header. Now it falls back to decoding the content which can be comparatively expensive. This notably regresses Secondly, I assumed that if the delta size is small and the expanded size is small then it should not be a big issue to load the base object and just apply the delta. The flaw is that the delta object can reference huge base object but only reuse a small part of it. I have ideas on how to fix both of the issues without introducing new regressions. It will require significant reworking of the logic and a lot of testing though. |
… plumbing.AnyObject Signed-off-by: Filip Navara <filip.navara@gmail.com>
Signed-off-by: Filip Navara <filip.navara@gmail.com>
Signed-off-by: Filip Navara <filip.navara@gmail.com>
After more investigation I've found that the iterator is not using the cache to retrieve objects so delta objects are regenerated even when they are already in the cache. After enabling it the performance increases but triggers a bug I am investigating. |
I forgot to fix the "cold" code path in the iterator to try the cache. Note that if |
Signed-off-by: Filip Navara <navara@emclient.com>
7879789
to
809027f
Compare
Signed-off-by: Filip Navara <navara@emclient.com>
I see... I will look into it tomorrow :-) Some of the tests don't handle errors properly. Thus the underlying error ("invalid delta") crashes them hard. For some reason already undeltified data get to PatchDelta as input. |
I think I figured out what's wrong... |
…osition used by getNextObject in the subsequent statement Signed-off-by: Filip Navara <navara@emclient.com>
… in FSObject Signed-off-by: Filip Navara <navara@emclient.com>
Signed-off-by: Filip Navara <navara@emclient.com>
…objects Signed-off-by: Filip Navara <navara@emclient.com>
f25873b
to
78bab69
Compare
Thanks! @mcuadros Now this patch results in better performance in all my tests. |
In #963 I addressed issue with reading many small objects from a pack file. I originally opted not to apply the optimization to delta objects. It turns out that when traversing a commit history the tree objects are often small and delta encoded. This extends the original optimization to handle this case too. Tested on https://github.com/rails/rails repository and it improves the complete history traversal time by ~20%.