Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Reduce memory usage and calculate statistics online/incrementally #103

Merged
merged 5 commits into from
Jul 29, 2018

Conversation

smarr
Copy link
Owner

@smarr smarr commented Jul 29, 2018

This PR changes the way ReBench handles data.
It fixes #100.

The key idea is to avoid keeping all data points and measurements in memory.
This is achieved by calculating statistics incrementally.

One major change related to it is how we discard data for reruns, which is now fused with loading that data file. This may speedup data loading, too.

@daumayr this should hopefully fix your issue.

This avoids keeping all data points in memory, and should not have any drawbacks for ReBench’s functionality.

However, this also removes the scipy/numpy usage, and all mentioning of medians.

For support of quartile/median reporting to codespeed, we’d need to keep the datapoints, at least temporarily.
Though, I would really prefer to safe them all in codespeed.

Signed-off-by: Stefan Marr <git@stefan-marr.de>
This avoids a second path over the data, and avoids keeping the data points around indefinitely.

Signed-off-by: Stefan Marr <git@stefan-marr.de>
Signed-off-by: Stefan Marr <git@stefan-marr.de>
Signed-off-by: Stefan Marr <git@stefan-marr.de>
@smarr smarr added the Bug label Jul 29, 2018
@smarr smarr added this to the v1.0 - Foundation milestone Jul 29, 2018
@smarr smarr self-assigned this Jul 29, 2018
@coveralls
Copy link

coveralls commented Jul 29, 2018

Coverage Status

Coverage increased (+0.9%) to 86.862% when pulling 20c48b6 on memory-usage into 4b105d2 on master.

This is an attempt to optimize, but I think it also makes the code less convoluted.

Signed-off-by: Stefan Marr <git@stefan-marr.de>
@smarr smarr merged commit 43a1d15 into master Jul 29, 2018
@smarr smarr deleted the memory-usage branch July 29, 2018 20:32
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
Status: Done
Development

Successfully merging this pull request may close these issues.

Memory usage for long running benchmarks with many data points
2 participants