-
Notifications
You must be signed in to change notification settings - Fork 22
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Lowering the memory footprint of parse results #14
Comments
hey @mrguiman thanks for opening this issue. I think creating a |
@puffyCid FYI, we try to go a little further on granular iteration, but memory consumption didn't improve : shindan-io@62dfb22 I my opinion a really great improvement would be to iterate straight away on a file itself, at We can try, but it could be a BIG rework of the existing code (even trying to be as conservative as possible). Would you be open to such a thing ? if yes, we could motivate ourself into this goal. |
@jrouaix that does sound like quite a large rewrite. I think the primary source of the memory usage is after reading the tracev3 file (~10.5 MBs) the library iterates through all the compressed chunks in the file and then returns the data needed for I think the decompressed data is probably going to be the reason for memory usage? What if instead of decompressing all chunks in the so instead of it would be something like Thoughts on perhaps that approach? |
no worries @mrguiman! There is a few merge conflicts between the PR and main branch. If you could update/sync the PR to the latest version. I would be fine with committing. For further reducing memory usage i think there are two approaches:
Both would be pretty large overhauls (1. being the largest). If you guys want to take a stab at it that would be great. Otherwise, if your too busy with other projects, no worries. Time permitting I can try to implement it, with the goal having it merged sometime in September (hopefully) |
I agree the option 1 makes the more sense, could really flatten the memory footprint It's a big refactoring, and difficult. |
Hello and thank you for the amazing work on the library.
We're working on an iOS forensics app and currently prototyping some code that will allow our researchers to pull specific, pinpointed events from UnifiedLogs.
We based our code on one of your exemples, so it currently revolves around calling the
build_log
method to assembleLogData
and "send" them to be consumed by other parts of our code.In that context, and because we want our program to be as portable as possible, we're trying to have the lowest possible memory footprint and as such would like to be able to work with iterators rather than allocating an entire Vector of LogData which can be quite heavy.
We were thinking of breaking down the
build_logs
method so that it can build vectors from the results of a newiter_logs
function that itself would return an iterator.Is there any footgun you can think of that we should be worrying about, and would you be open to us contributing such changes to the main codebase ?
The text was updated successfully, but these errors were encountered: