Metric AggregatorStore optimizations for sorting tags #2805

utpilla · 2022-01-22T03:39:38Z

Fixes item 1 of #2374

Changes

The existing two-level lookup structure saves memory by storing each distinct set of tag keys only once. However, it makes it difficult to avoid sorting the tags by key on every update call.
This PR uses just one dictionary, whose key is Tags (a struct that holds both string[] tagKeys and object[] tagValues) and whose value is an int denoting the assigned MetricPoint index.

Here is the new look-up algorithm:

Check if the dictionary contains the given tags
- If yes, return the value
- If not, sort the given tag keys and check if the dictionary contains the sorted order of the tags
  - If yes, then return the value
  - If not, then
    - for tagsLength > 1:
      1. Increment metricPointIndex
      2. Add the sorted order of tags with the newly incremented metricPointIndex
      3. Add the given order of tags with the newly incremented metricPointIndex
      4. Return the newly incremented metricPointIndex
    - for tagsLength == 0 or 1:
      1. Increment metricPointIndex
      2. Add the given order of tags with the newly incremented metricPointIndex
      3. Return the newly incremented metricPointIndex

This means that for any given set of tags, the dictionary stores the sorted order of the tags and at most one additional ordering of them.
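The look-up steps above can be sketched as follows. This is a minimal, single-threaded illustration written in Python for brevity rather than the PR's C#. The class name `MetricPointLookup`, the tuple-based stand-in for the `Tags` struct, and the starting index are all assumptions made for the example, not the SDK's actual types.

```python
from typing import Dict, Sequence, Tuple

# Hypothetical stand-in for the PR's Tags struct: a (keys, values) pair of tuples.
TagsKey = Tuple[Tuple[str, ...], Tuple[object, ...]]


class MetricPointLookup:
    """Illustrative single-dictionary lookup; not the SDK's actual class."""

    def __init__(self) -> None:
        self._index_by_tags: Dict[TagsKey, int] = {}
        self._metric_point_index = 0  # starting value is arbitrary in this sketch

    def lookup(self, keys: Sequence[str], values: Sequence[object]) -> int:
        given: TagsKey = (tuple(keys), tuple(values))

        # Fast path: this exact ordering was seen before, so no sort is needed.
        index = self._index_by_tags.get(given)
        if index is not None:
            return index

        # Slow path: sort the key/value pairs by key and retry.
        pairs = sorted(zip(keys, values), key=lambda kv: kv[0])
        sorted_tags: TagsKey = (
            tuple(k for k, _ in pairs),
            tuple(v for _, v in pairs),
        )
        index = self._index_by_tags.get(sorted_tags)
        if index is not None:
            return index

        # First time this tag set is seen in any order: assign a new index.
        self._metric_point_index += 1
        index = self._metric_point_index
        if len(given[0]) > 1:
            # Only multi-tag sets can have a sorted order distinct from the given one.
            self._index_by_tags[sorted_tags] = index
        self._index_by_tags[given] = index
        return index


store = MetricPointLookup()
first = store.lookup(["b", "a"], [2, 1])   # new tag set: stores sorted and given order
second = store.lookup(["a", "b"], [1, 2])  # sorted order hits the fast path
print(first == second, len(store._index_by_tags))
```

In this sketch, a tag set that arrives in a third ordering always takes the sort-and-retry path, which is what keeps the dictionary at no more than two entries per tag set.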

Performance Improvement:

This PR improves performance by up to ~63% for higher numbers of tags, as shown in the benchmarks comment on this PR (#2805).
The stress test results have improved significantly: throughput increased by ~12M loops/sec, from 14.7M loops/sec on the main branch to 27.3M loops/sec, as shown in the stress test comment on this PR (#2805).

Follow-up issues to track:

utpilla · 2022-01-22T03:39:58Z

Updating the benchmark numbers with the latest changes of the PR:

Benchmarks

There is up to ~63% improvement in performance for higher numbers of tags (updated)

// * Summary *

BenchmarkDotNet=v0.13.1, OS=Windows 10.0.22000
Intel Core i7-9700 CPU 3.00GHz, 1 CPU, 8 logical and 8 physical cores
.NET SDK=6.0.101
[Host] : .NET 6.0.1 (6.0.121.56705), X64 RyuJIT
DefaultJob : .NET 6.0.1 (6.0.121.56705), X64 RyuJIT

main

Method	AggregationTemporality	Mean	Error	StdDev	Allocated
CounterHotPath	Cumulative	15.82 ns	0.078 ns	0.069 ns	-
CounterWith1LabelsHotPath	Cumulative	71.28 ns	0.270 ns	0.252 ns	-
CounterWith3LabelsHotPath	Cumulative	390.50 ns	1.550 ns	1.450 ns	-
CounterWith5LabelsHotPath	Cumulative	594.25 ns	11.624 ns	13.386 ns	-
CounterWith6LabelsHotPath	Cumulative	720.17 ns	7.979 ns	6.663 ns	-
CounterWith7LabelsHotPath	Cumulative	768.72 ns	5.820 ns	4.544 ns	-
CounterHotPath	Delta	15.68 ns	0.097 ns	0.086 ns	-
CounterWith1LabelsHotPath	Delta	68.67 ns	0.233 ns	0.195 ns	-
CounterWith3LabelsHotPath	Delta	375.43 ns	3.357 ns	3.140 ns	-
CounterWith5LabelsHotPath	Delta	588.71 ns	5.730 ns	5.360 ns	-
CounterWith6LabelsHotPath	Delta	683.18 ns	3.388 ns	3.169 ns	-
CounterWith7LabelsHotPath	Delta	782.50 ns	5.883 ns	5.503 ns	-

With the new changes

Method	AggregationTemporality	Mean	Error	StdDev	Allocated
CounterHotPath	Cumulative	15.77 ns	0.049 ns	0.041 ns	-
CounterWith1LabelsHotPath	Cumulative	57.13 ns	0.138 ns	0.115 ns	-
CounterWith3LabelsHotPath	Cumulative	140.92 ns	0.602 ns	0.563 ns	-
CounterWith5LabelsHotPath	Cumulative	225.50 ns	0.527 ns	0.440 ns	-
CounterWith6LabelsHotPath	Cumulative	256.97 ns	1.173 ns	1.097 ns	-
CounterWith7LabelsHotPath	Cumulative	288.46 ns	1.003 ns	0.889 ns	-
CounterHotPath	Delta	15.77 ns	0.019 ns	0.017 ns	-
CounterWith1LabelsHotPath	Delta	62.79 ns	0.168 ns	0.149 ns	-
CounterWith3LabelsHotPath	Delta	141.30 ns	0.276 ns	0.230 ns	-
CounterWith5LabelsHotPath	Delta	228.05 ns	0.443 ns	0.415 ns	-
CounterWith6LabelsHotPath	Delta	256.34 ns	1.902 ns	1.686 ns	-
CounterWith7LabelsHotPath	Delta	285.86 ns	0.436 ns	0.407 ns	-
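As a rough cross-check of the headline figure, the reduction in mean time can be recomputed from the two tables above. The sketch below is in Python and only restates the tabulated means; the dictionary layout and the helper name `reduction_pct` are choices made for this example.

```python
# Mean times in nanoseconds, copied from the two tables above.
# Keys are (label count, aggregation temporality).
main_ns = {
    (0, "Cumulative"): 15.82, (0, "Delta"): 15.68,
    (1, "Cumulative"): 71.28, (1, "Delta"): 68.67,
    (3, "Cumulative"): 390.50, (3, "Delta"): 375.43,
    (5, "Cumulative"): 594.25, (5, "Delta"): 588.71,
    (6, "Cumulative"): 720.17, (6, "Delta"): 683.18,
    (7, "Cumulative"): 768.72, (7, "Delta"): 782.50,
}
new_ns = {
    (0, "Cumulative"): 15.77, (0, "Delta"): 15.77,
    (1, "Cumulative"): 57.13, (1, "Delta"): 62.79,
    (3, "Cumulative"): 140.92, (3, "Delta"): 141.30,
    (5, "Cumulative"): 225.50, (5, "Delta"): 228.05,
    (6, "Cumulative"): 256.97, (6, "Delta"): 256.34,
    (7, "Cumulative"): 288.46, (7, "Delta"): 285.86,
}


def reduction_pct(before: float, after: float) -> float:
    """Percentage reduction in mean time going from `before` to `after`."""
    return (before - after) / before * 100.0


reductions = {case: reduction_pct(main_ns[case], new_ns[case]) for case in main_ns}

for (labels, temporality), pct in sorted(reductions.items()):
    print(f"{labels} labels, {temporality}: {pct:.1f}% lower mean time")

print(f"largest reduction: {max(reductions.values()):.1f}%")
```

For three or more labels the reductions land in the low 60s percent, in line with the ~63% quoted above, while the no-label hot path is essentially unchanged.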

utpilla · 2022-01-22T03:44:16Z

Updating the Stress Test numbers with the latest changes of the PR:

Stress Test

There is an increase of ~12M loops/sec, from 14.7M on the main branch to 27.3M with the new changes

main

With the new changes (updated)

codecov · 2022-01-22T03:52:57Z

Codecov Report

Merging #2805 (39e237f) into main (840b24e) will increase coverage by 0.08%.
The diff coverage is 88.57%.

@@            Coverage Diff             @@
##             main    #2805      +/-   ##
==========================================
+ Coverage   83.73%   83.82%   +0.08%     
==========================================
  Files         251      250       -1     
  Lines        8866     8877      +11     
==========================================
+ Hits         7424     7441      +17     
+ Misses       1442     1436       -6

Impacted Files	Coverage Δ
src/OpenTelemetry/Metrics/Tags.cs	`72.00% <72.00%> (ø)`
src/OpenTelemetry/Metrics/AggregatorStore.cs	`82.85% <97.77%> (+1.09%)`	⬆️
...nTelemetry/Internal/OpenTelemetrySdkEventSource.cs	`74.52% <0.00%> (+2.83%)`	⬆️
...lemetry/Internal/SelfDiagnosticsConfigRefresher.cs	`89.42% <0.00%> (+2.88%)`	⬆️

TimothyMothra · 2022-01-28T23:24:23Z

Looking at the LookupAggregatorStore method, the if (length > 1) and else bodies are almost identical...

Lines 185-244 Perform the sort, and then add both givenTags and sortedTags.
Lines 251-294 Are only adding the givenTags.

If it's important to keep these bodies consistent, consider refactoring this out to a helper method.

alanwest

Nice work @utpilla! This looks like a solid improvement.

reyang

LGTM.

utpilla · 2022-02-02T01:55:34Z

> Looking at the LookupAggregatorStore method, the if (length > 1) and else bodies are almost identical...
>
> Lines 185-244 Perform the sort, and then add both givenTags and sortedTags.
>
> Lines 251-294 Are only adding the givenTags.
>
> If it's important to keep these bodies consistent, consider refactoring this out to a helper method.

I have created the issue #2843 to track this.

cijothomas

Great job improving the perf significantly! LGTM.
Please address the non-blocking comments as follow-ups.

CodeBlanch · 2022-02-03T18:33:20Z

src/OpenTelemetry/Metrics/Tags.cs

+                return false;
+            }
+
+            for (int i = 0; i < valuesLength; i++)


@utpilla Sorry, just noticed this. If we re-order this a bit so that we validate that the key and value lengths are equal first, then we could use one loop to check both the keys and the values for equality. Faster that way, I think.

utpilla · 2022-02-03

I did try this; it didn't affect the benchmark numbers much. I will still update it nonetheless.
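The suggested reordering can be sketched as below. This is an illustrative Python version, not the code in Tags.cs; the function name `tags_equal` and its four-sequence signature are assumptions for the example.

```python
from typing import Sequence


def tags_equal(
    keys_a: Sequence[str],
    values_a: Sequence[object],
    keys_b: Sequence[str],
    values_b: Sequence[object],
) -> bool:
    """Compare two tag sets using a single pass over keys and values."""
    # Validate every length up front so one index can walk all four sequences.
    length = len(keys_a)
    if len(values_a) != length or len(keys_b) != length or len(values_b) != length:
        return False

    # One loop checks the key and the value at each position.
    for i in range(length):
        if keys_a[i] != keys_b[i] or values_a[i] != values_b[i]:
            return False
    return True


print(tags_equal(["a", "b"], [1, 2], ["a", "b"], [1, 2]))
print(tags_equal(["a", "b"], [1, 2], ["b", "a"], [2, 1]))
```

The comparison is order-sensitive by design: two orderings of the same tag set count as different keys, which is why the look-up falls back to the sorted order.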

Refactor AggregatorStore to only use two concurrent dictionaries

820085e

utpilla requested a review from a team January 22, 2022 03:39

utpilla mentioned this pull request Jan 22, 2022

Metric AggregatorStore optimization for sorting Tag keys #2777

Closed

alanwest reviewed Jan 26, 2022

src/OpenTelemetry/Metrics/AggregatorStore.cs

Yun-Ting reviewed Jan 26, 2022

src/OpenTelemetry/Metrics/AggregatorStore.cs

Yun-Ting reviewed Jan 26, 2022

src/OpenTelemetry/Metrics/AggregatorStore.cs

cijothomas reviewed Jan 27, 2022

src/OpenTelemetry/Metrics/AggregatorStore.cs

CodeBlanch reviewed Jan 27, 2022

src/OpenTelemetry/Metrics/Tags.cs

CodeBlanch reviewed Jan 27, 2022

src/OpenTelemetry/Metrics/AggregatorStore.cs

utpilla added 2 commits January 27, 2022 13:57

Merge remote-tracking branch 'origin/main' into utpilla/Metric-AggregatorStore-Optimization-New

a6b35e6

Addressing PR suggestions

99c9ef4

cijothomas reviewed Jan 28, 2022

src/OpenTelemetry/Metrics/AggregatorStore.cs

CodeBlanch reviewed Jan 28, 2022

src/OpenTelemetry/Metrics/Tags.cs

cijothomas reviewed Jan 28, 2022

src/OpenTelemetry/Metrics/AggregatorStore.cs

utpilla and others added 5 commits January 28, 2022 11:49

Address PR comments for Perf optimization

27579dc

Merge remote-tracking branch 'origin/main' into utpilla/Metric-AggregatorStore-Optimization-New

0e10eab

Add CHANGELOG.md

6eb6089

Merge branch 'main' into utpilla/Metric-AggregatorStore-Optimization-New

b3663f7

Merge branch 'main' into utpilla/Metric-AggregatorStore-Optimization-New

0583b0b

cijothomas reviewed Jan 28, 2022

src/OpenTelemetry/CHANGELOG.md

Update src/OpenTelemetry/CHANGELOG.md

ad0410b

Co-authored-by: Cijo Thomas <cithomas@microsoft.com>

cijothomas reviewed Jan 28, 2022

src/OpenTelemetry/Metrics/AggregatorStore.cs

cijothomas reviewed Jan 28, 2022

src/OpenTelemetry/Metrics/AggregatorStore.cs

utpilla and others added 2 commits January 28, 2022 13:12

Update src/OpenTelemetry/Metrics/AggregatorStore.cs

3a46d68

Co-authored-by: Cijo Thomas <cithomas@microsoft.com>

Remove unreachable code

64b86e5

cijothomas reviewed Jan 28, 2022

src/OpenTelemetry/Metrics/AggregatorStore.cs

alanwest approved these changes Jan 28, 2022

reyang reviewed Jan 30, 2022

src/OpenTelemetry/Metrics/AggregatorStore.cs

reyang reviewed Jan 30, 2022

src/OpenTelemetry/Metrics/Tags.cs

reyang reviewed Jan 30, 2022

src/OpenTelemetry/CHANGELOG.md

reyang approved these changes Jan 30, 2022

utpilla and others added 3 commits January 31, 2022 11:03

Update src/OpenTelemetry/CHANGELOG.md

333680a

Co-authored-by: Reiley Yang <reyang@microsoft.com>

Merge branch 'main' into utpilla/Metric-AggregatorStore-Optimization-New

2476844

Merge branch 'main' into utpilla/Metric-AggregatorStore-Optimization-New

1bc337f

utpilla mentioned this pull request Feb 1, 2022

Avoid allocation in AggregatorStore for MetricPoint lookup #2838

Closed

Merge branch 'main' into utpilla/Metric-AggregatorStore-Optimization-New

c63e288

utpilla mentioned this pull request Feb 2, 2022

Follow-ups for Metric AggregatorStore refactoring and optimization #2843

Closed

utpilla changed the title ~~Metric AggregatorStore optimizations for sorting tags- New~~ Metric AggregatorStore optimizations for sorting tags Feb 2, 2022

cijothomas approved these changes Feb 2, 2022

utpilla added 2 commits February 2, 2022 12:26

Resolve merge conflicts

788389e

Merge branch 'utpilla/Metric-AggregatorStore-Optimization-New' of https://github.com/utpilla/opentelemetry-dotnet into utpilla/Metric-AggregatorStore-Optimization-New

39e237f

cijothomas merged commit c1c5436 into open-telemetry:main Feb 2, 2022

CodeBlanch reviewed Feb 3, 2022

reyang mentioned this pull request Mar 18, 2022

OpenTelemetryLoggerProvider is now unaffected by changes to OpenTelemetryLoggerOptions after the LoggerFactory is built. #3055

Merged

3 tasks

alanwest mentioned this pull request Jul 7, 2022

Add Utkarsh as maintainer #3432

Merged

cijothomas mentioned this pull request Nov 22, 2023

Metric stress test with unsorted attributes. open-telemetry/opentelemetry-rust#1396

Closed

4 tasks

utpilla deleted the utpilla/Metric-AggregatorStore-Optimization-New branch November 23, 2023 03:34

cijothomas mentioned this pull request Apr 18, 2024

Move Utkarsh to Approver #5547

Merged
