[Security Solution] Move calculation of rule source outside of applyRuleUpdate #199720

rylnd · 2024-11-11T22:07:13Z

Partially addresses: #195632

Summary

This is a small performance improvement that came out of this discussion on a previous PR. Note that the code in question is behind a feature flag (prebuiltRulesCustomizationEnabled). This issue relates to the Prebuilt Rule Import work, and its associated benchmarking effort.

Context

With the current implementation, there are instances where we call applyRuleUpdate but do not want/need it to calculate rule source (e.g. when called from importRules, which pre-calculates the rule_source for incoming rules before passing them to importRule).

Instead of adding a flag to conditionally call calculateRuleSource from within applyRuleUpdate, I've opted to separate the two functions, as these seem to be logically distinct actions.

The three existing calls to applyRuleUpdate have been updated to be functionally equivalent.
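
A minimal sketch of the separation described above, not the actual Kibana code. The `Rule` and `RuleSource` shapes, the set-based `calculateRuleSource` stand-in, and the positional signatures are simplified assumptions for illustration; only the function names and the `overrideFields == null` check come from this PR. `applyRuleUpdate` is shown as synchronous here for brevity.

```typescript
// Simplified, hypothetical shapes; the real rule types carry many more fields.
type RuleSource = { type: 'internal' } | { type: 'external'; is_customized: boolean };

interface Rule {
  rule_id: string;
  name: string;
  rule_source?: RuleSource;
}

// Stand-in for calculateRuleSource. The real function consults prebuilt rule
// assets; here a rule is "external" when its rule_id is in a known set, and
// customization detection is omitted (is_customized is always false).
const calculateRuleSource = async (
  rule: Rule,
  prebuiltRuleIds: ReadonlySet<string>
): Promise<RuleSource> =>
  prebuiltRuleIds.has(rule.rule_id)
    ? { type: 'external', is_customized: false }
    : { type: 'internal' };

// Merging only: no rule source calculation happens in here.
const applyRuleUpdate = (existingRule: Rule, ruleUpdate: Partial<Rule>): Rule => ({
  ...existingRule,
  ...ruleUpdate,
  rule_id: existingRule.rule_id,
});

// Update path: the caller recalculates the source right after merging.
const updateRule = async (
  existingRule: Rule,
  ruleUpdate: Partial<Rule>,
  prebuiltRuleIds: ReadonlySet<string>
): Promise<Rule> => {
  const nextRule = applyRuleUpdate(existingRule, ruleUpdate);
  return { ...nextRule, rule_source: await calculateRuleSource(nextRule, prebuiltRuleIds) };
};

// Import path: a precalculated rule_source, when supplied, is used as-is.
const importRule = async (
  existingRule: Rule,
  incomingRule: Partial<Rule>,
  prebuiltRuleIds: ReadonlySet<string>,
  overrideFields?: Pick<Rule, 'rule_source'>
): Promise<Rule> => {
  const nextRule = applyRuleUpdate(existingRule, incomingRule);
  if (overrideFields == null) {
    return { ...nextRule, rule_source: await calculateRuleSource(nextRule, prebuiltRuleIds) };
  }
  return { ...nextRule, ...overrideFields };
};
```

In this sketch every caller that omits `overrideFields` still ends up with a freshly calculated `rule_source`, which is what "functionally equivalent" would mean under these assumptions.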

Effect

The effect of this PR is that we will no longer unnecessarily call fetchAssetsByVersion for each individual rule being imported, which should improve performance of rule import.
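
To illustrate why the per-rule lookup matters, here is a hedged sketch that counts lookups against an in-memory stand-in. The `createAssetClient` helper, the asset shape, and the `fetchAssetsByVersion` signature used here are assumptions made for this example; the real client queries stored prebuilt rule assets.

```typescript
interface RuleVersionSpecifier {
  rule_id: string;
  version: number;
}

// Hypothetical in-memory client that records how often it is queried.
const createAssetClient = (assets: RuleVersionSpecifier[]) => {
  let calls = 0;
  return {
    fetchAssetsByVersion: async (
      specifiers: RuleVersionSpecifier[]
    ): Promise<RuleVersionSpecifier[]> => {
      calls += 1;
      return assets.filter((asset) =>
        specifiers.some((s) => s.rule_id === asset.rule_id && s.version === asset.version)
      );
    },
    getCallCount: () => calls,
  };
};

type AssetClient = ReturnType<typeof createAssetClient>;

// One lookup per imported rule: N rules cost N round trips.
const findAssetsPerRule = async (
  client: AssetClient,
  rules: RuleVersionSpecifier[]
): Promise<RuleVersionSpecifier[]> => {
  const found: RuleVersionSpecifier[] = [];
  for (const rule of rules) {
    found.push(...(await client.fetchAssetsByVersion([rule])));
  }
  return found;
};

// One lookup for the whole batch, as a bulk pre-calculation step would do.
const findAssetsInBulk = (
  client: AssetClient,
  rules: RuleVersionSpecifier[]
): Promise<RuleVersionSpecifier[]> => client.fetchAssetsByVersion(rules);
```

Both helpers return the same assets for the same input; the difference is only in how many times the client is queried.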

For maintainers

This was checked for breaking API changes and was labeled appropriately
This will appear in the Release Notes and follow the guidelines

rylnd · 2024-11-11T22:07:44Z

/ci

elasticmachine · 2024-11-12T20:40:48Z

Pinging @elastic/security-detection-rule-management (Team:Detection Rule Management)

elasticmachine · 2024-11-14T17:16:16Z

Pinging @elastic/security-detections-response (Team:Detections and Resp)

elasticmachine · 2024-11-14T17:16:16Z

Pinging @elastic/security-solution (Team: SecuritySolution)

xcrzx · 2024-11-20T11:07:57Z

...ver/lib/detection_engine/rule_management/logic/detection_rules_client/methods/import_rule.ts

+    // If no override fields are provided, we calculate the rule source
+    if (overrideFields == null) {
+      ruleWithUpdates.rule_source = await calculateRuleSource({
+        rule: ruleWithUpdates,
+        prebuiltRuleAssetClient,
+      });
+    } else {
+      ruleWithUpdates = { ...ruleWithUpdates, ...overrideFields };
+    }


This change raises some concerns about safety that I think we should discuss.

Previously, updating a rule triggered a rule source recalculation, ensuring the rule source was always in sync with the rule content.

Now, in some cases, we delegate the rule source recalculation to client users. If the applyRuleUpdate method usage isn’t closely monitored, we might see inconsistencies, with some rule sources being updated correctly while others might not.

This change seems to trade off implementation correctness for performance improvements, which could introduce potential issues. The root cause appears to be the splitting of the rule import logic between two clients: RuleSourceImporter and DetectionRulesClient. My suggestion is to adjust the DetectionRulesClient's import method so the overrideFields escape hatch is no longer necessary, and ensure that rule source recalculations are fully handled within a single method.

While performance improvements are important, it might be worth waiting until the rule customization feature flag is enabled by default before considering optimizations. This would also allow us to remove the legacy import method support and combine the RuleSourceImporter and DetectionRulesClient. Until then, without concrete evidence that performance is being impacted, premature optimizations might introduce more complexity than benefit.

@banderror, would love to hear your thoughts on this.

rylnd · 2024-11-22

> Previously, updating a rule triggered a rule source recalculation, ensuring the rule source was always in sync with the rule content.

This is only true because we happen to call applyRuleUpdate in those instances, right? From the perspective of the DetectionRulesClient, nothing has changed here: #importRule, #updateRule, and upgradeRule all follow the same logic as before, it's only the internal applyRuleUpdate whose responsibility has changed. Are you arguing that applyRuleUpdate needs to contain all of that logic?

> Now, in some cases, we delegate the rule source recalculation to client users.

I see this as an optimization rather than an inconsistency: in the special case of importing rules, rule_source is calculated in bulk as it's much more efficient.

If the general argument is that these extraneous calculations are acceptable/negligible: this code path is only hit when you're overwriting existing rules, which was about 45% slower than creating new rules in my recent testing. So it's an edge case, and it is slower, but not to the point where it's timing out (even at 4000 rules). 🤷

banderror · 2024-11-26T16:29:21Z

@elasticmachine merge upstream

banderror · 2024-11-27T14:06:54Z

@elasticmachine merge upstream

elasticmachine · 2024-11-27T15:57:25Z

💚 Build Succeeded

Buildkite Build
Commit: dc3e349

Metrics [docs]

✅ unchanged

History

💔 Build #254601 failed e760dd4
💚 Build #253753 succeeded 9e52e57
💚 Build #252501 succeeded 2966af2
💚 Build #251092 succeeded da7f9c8
💚 Build #250135 succeeded 5b67f44

cc @rylnd

banderror

Just for the record, I'll post my review, although in our meeting we decided to close this PR according to @xcrzx's recommendation.

LGTM! 👍 😆

banderror · 2024-11-27T15:52:13Z

...b/detection_engine/rule_management/logic/detection_rules_client/mergers/apply_rule_update.ts

@@ -43,10 +39,5 @@ export const applyRuleUpdate = async ({
    created_by: existingRule.created_by,
  };

-  nextRule.rule_source = await calculateRuleSource({


Now that this call is removed, applyRuleUpdate doesn't have to be async anymore.
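
A rough sketch of what a synchronous version could look like, assuming a heavily simplified `Rule` shape and argument object (both hypothetical). Only the function name and the `created_by: existingRule.created_by` line are taken from the diff above.

```typescript
// Hypothetical, trimmed-down rule shape for illustration only.
interface Rule {
  rule_id: string;
  name: string;
  created_by: string;
}

interface ApplyRuleUpdateArgs {
  existingRule: Rule;
  ruleUpdate: Partial<Rule>;
}

// With no awaited call left in the body, the function can return the merged
// rule directly instead of a Promise.
const applyRuleUpdate = ({ existingRule, ruleUpdate }: ApplyRuleUpdateArgs): Rule => ({
  ...existingRule,
  ...ruleUpdate,
  // Fields an update must not change are taken from the existing rule.
  rule_id: existingRule.rule_id,
  created_by: existingRule.created_by,
});
```

Callers would then drop their `await` on this call; any rule source calculation they still need stays an explicit, separately awaited step.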

rylnd · 2024-11-27T17:36:48Z

Closing this after some offline discussion: we concluded that the performance improvement gained here was not worth the decentralization of the rule source calculation. When the legacy import path is no longer needed, we can/should reexamine a similar optimization.

rylnd added the 8.17 candidate label Nov 11, 2024

rylnd self-assigned this Nov 11, 2024

rylnd added the release_note:skip Skip the PR/issue when compiling release notes label Nov 11, 2024

rylnd added Team:Detection Rule Management Security Detection Rule Management Team Feature:Prebuilt Detection Rules Security Solution Prebuilt Detection Rules area Feature:Rule Import/Export Security Solution Detection Rule Import & Export workflow labels Nov 12, 2024

rylnd marked this pull request as ready for review November 12, 2024 20:40

rylnd requested a review from a team as a code owner November 12, 2024 20:40

rylnd requested a review from jkelas November 12, 2024 20:40

rylnd added the backport:skip This commit does not require backporting label Nov 12, 2024

banderror requested review from xcrzx and removed request for jkelas November 12, 2024 21:02

banderror added v9.0.0 backport:version Backport to applied version labels v8.17.0 and removed backport:skip This commit does not require backporting 8.17 candidate labels Nov 12, 2024

banderror self-requested a review November 13, 2024 14:47

Merge branch 'main' into rylnd/rule_import_improvements

da7f9c8

banderror mentioned this pull request Nov 14, 2024

[Security Solution] Allow importing prebuilt rules at the API level #180168

Closed

banderror changed the title ~~[Rule Management] Move calculation of rule source outside of applyRuleUpdate~~ [Security Solution] Move calculation of rule source outside of applyRuleUpdate Nov 14, 2024

banderror added Team:Detections and Resp Security Detection Response Team Team: SecuritySolution Security Solutions Team working on SIEM, Endpoint, Timeline, Resolver, etc. labels Nov 14, 2024

Merge branch 'main' into rylnd/rule_import_improvements

2966af2

xcrzx reviewed Nov 20, 2024

Merge branch 'main' into rylnd/rule_import_improvements

9e52e57

Merge branch 'main' into rylnd/rule_import_improvements

e760dd4

Merge branch 'main' into rylnd/rule_import_improvements

dc3e349

banderror approved these changes Nov 27, 2024

rylnd closed this Nov 27, 2024

rylnd deleted the rylnd/rule_import_improvements branch November 27, 2024 17:36

This was referenced Nov 27, 2024

[Security Solution] Benchmark performance of importing a large number of prebuilt rules #195632

Closed

[Security Solution] Users can Customize Prebuilt Detection Rules: Milestone 4 (DRAFT) #179907

Open
