Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Document e2e logging performance for time series data #4889

Open
nikolausWest opened this issue Jan 23, 2024 · 2 comments
Open

Document e2e logging performance for time series data #4889

nikolausWest opened this issue Jan 23, 2024 · 2 comments
Assignees
Labels
📖 documentation Improvements or additions to documentation 🚀 performance Optimization, memory use, etc
Milestone

Comments

@nikolausWest
Copy link
Member

nikolausWest commented Jan 23, 2024

We want to benchmark logging scalars, including setting a timeline value for each logged scalar, i.e. something like

for frame_nr in range(0, 1_000_000) {
    rr. set_time_sequence("frame", frame_nr)
    rr.log("scalar", rr.TimeSeriesScalar(sin(frame_nr / 1000.0)))
}

We have the tool for it:

just rs-plot-dashboard --num-plots 10 --num-series-per-plot 5 --num-points-per-series 5000 --freq 1000

For each language (C++, Python, Rust), measure the max throughputs (scalars per second), end-to-end (logging -> visualization) for single-threaded/single-plot and multi-threaded logging (so 3 x 2 throughput figures).

We also want to check the memory use in the viewer when we have logged 100M scalars or so, to measure the RAM overhead.


manually document this somewhere in our docs, i.e.:

On a 2023 MacBook M1:

Language Single-threaded Multi-threaded
C++ ? kHz ? kHz
Python ? kHz ? kHz
Rust ? kHz ? kHz

Viewing 100M scalars use up ?GB of RAM in the native viewer.

Very rough numbers is fine, e.g. "~10 M scalars / second"

@nikolausWest nikolausWest added 📖 documentation Improvements or additions to documentation 🚀 performance Optimization, memory use, etc labels Jan 23, 2024
@emilk emilk changed the title User documented performance for time series data Document e2e logging performance for time series data Jan 23, 2024
@emilk
Copy link
Member

emilk commented Jan 29, 2024

We should link to #4423 too

@teh-cmc teh-cmc added this to the 0.13 milestone Jan 30, 2024
@nikolausWest nikolausWest self-assigned this Jan 30, 2024
@nikolausWest nikolausWest modified the milestones: 0.13, Triage Feb 5, 2024
@emilk
Copy link
Member

emilk commented Feb 6, 2024

I know there was some decision to punt on this (and it was moved to Triage), so I'm moving this down in urgency.

It would be nice with a short comment explaining why we are punting on this though.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
📖 documentation Improvements or additions to documentation 🚀 performance Optimization, memory use, etc
Projects
None yet
Development

No branches or pull requests

3 participants