Query planner plugins #3177

Geal · 2023-05-30T09:53:05Z

Fixes #3150
Fixes #3199
Related: apollographql/apollo-rs#420

This reintroduces the query planner plugins we had in the past, but with a different API, and for internal use only for now, until we are sure about the API's shape.

Plugin API

The plugins need access to a compiler instance to observe and modify the query, so the caching query planner adds it as part of the query planner request (which means that the cache and the bridge now get different request types). That compiler holds both the schema's type info and the query.
For now, if a plugin needs to modify the query, it must serialize the modified version to a string and then reparse it, but the goal is to support modification directly inside the compiler: apollographql/apollo-rs#420
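
To make the shape of this flow concrete, here is a minimal sketch, not the router's actual API. The names `PlannerRequest`, `QueryPlannerPlugin`, `RemoveField` and `run_plugins` are hypothetical, and the toy plugin edits whitespace-separated tokens where a real plugin would work through the compiler and reparse the serialized query.

```rust
/// Hypothetical request handed to plugins (not the router's real type).
#[derive(Clone, Debug, PartialEq)]
struct PlannerRequest {
    /// Query as sent by the client; never modified.
    original_query: String,
    /// Query that will actually be planned; plugins may rewrite it.
    filtered_query: String,
}

/// Hypothetical plugin interface: inspect the request and optionally
/// return a rewritten query as a string, to be reparsed by the caller.
trait QueryPlannerPlugin {
    fn filter(&self, request: &PlannerRequest) -> Option<String>;
}

/// Toy plugin that drops one field name from the query text.
struct RemoveField(&'static str);

impl QueryPlannerPlugin for RemoveField {
    fn filter(&self, request: &PlannerRequest) -> Option<String> {
        let tokens: Vec<&str> = request.filtered_query.split_whitespace().collect();
        if !tokens.contains(&self.0) {
            // Nothing to remove: leave the query untouched.
            return None;
        }
        let kept: Vec<&str> = tokens.into_iter().filter(|t| *t != self.0).collect();
        Some(kept.join(" "))
    }
}

/// Runs every plugin in order, keeping the original query intact.
fn run_plugins(query: &str, plugins: &[Box<dyn QueryPlannerPlugin>]) -> PlannerRequest {
    let mut request = PlannerRequest {
        original_query: query.to_string(),
        filtered_query: query.to_string(),
    };
    for plugin in plugins {
        if let Some(rewritten) = plugin.filter(&request) {
            request.filtered_query = rewritten;
        }
    }
    request
}
```

Keeping both strings on the request is an assumption of this sketch; it only illustrates that later stages need access to the original and the filtered query.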

Once the request reaches the bridge query planner, it will generate:

a query plan from the filtered query
selections from the filtered query (the Query object used for response formatting)
selections from the original query

Execution therefore depends on the filtered query: if fields are added or removed, the generated query plan changes. The plan and selections are then cached in the same way as before.
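
As a rough illustration of the three outputs listed above, the sketch below derives them from a pair of query strings. `PlannedQuery`, `field_names` and `plan_request` are hypothetical names, the "plan" is only a placeholder string, and selections are reduced to a flat list of field names; the real bridge planner produces structured plans and `Query` objects.

```rust
/// Hypothetical bundle of what the bridge planner produces for one request.
#[derive(Clone, Debug, PartialEq)]
struct PlannedQuery {
    /// Stand-in for the query plan, built from the filtered query.
    plan: String,
    /// Selections of the filtered query (used for response formatting).
    filtered_selections: Vec<String>,
    /// Selections of the original client query.
    original_selections: Vec<String>,
}

/// Toy selection extraction: every token that is not a brace is a field name.
fn field_names(query: &str) -> Vec<String> {
    query
        .split_whitespace()
        .filter(|t| *t != "{" && *t != "}")
        .map(str::to_string)
        .collect()
}

/// Plans the filtered query and extracts selections from both versions.
fn plan_request(original: &str, filtered: &str) -> PlannedQuery {
    let filtered_selections = field_names(filtered);
    PlannedQuery {
        // Only the filtered query influences the plan.
        plan: format!("fetch({})", filtered_selections.join(",")),
        filtered_selections,
        original_selections: field_names(original),
    }
}
```

The point of the sketch is only that the plan is a function of the filtered query, while both sets of selections are kept alongside it.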

Details on selections

Query planner plugins can modify the query before sending it through
query planning. They might require different sets of fields on the same
entity, and the query plan might add its own list of fields (keys for
federated queries).
As an example, we could have a User entity with its email field used as
key, and want to remove private information from the query.

So we could have the query:

{
  topProducts {
    name
    reviews {
      author {
        email
        username
      }
    }
  }
}

So here we would filter the query as follows:

{
  topProducts {
    name
    reviews {
      author {
        username
      }
    }
  }
}

But since the email is used as key, it would still appear in the JSON object
that accumulates response data.
To avoid sending unrequested data to the client, the response
formatting phase uses selections extracted from the query to pick
only what is needed from the JSON object and send it to the client.

Here we need to apply the selections of the filtered query, to make sure
the email is not returned to the client (the original query would let it
go through). But we also need to apply the selections from the original
query, to have the null propagation algorithm apply and return a
response in the shape that matches the client query.

There is probably a better way to do it than applying selections twice,
but this way we are sure the behaviour will be correct. So far the
formatting phase is fast enough that we can spend a bit more time there.
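
The two-pass formatting described above can be sketched as follows. This is a simplified model under stated assumptions, not the router's formatting code: `Value`, `Selection`, `apply` and `format_response` are hypothetical, there is no schema, and a selected field that is missing from the data simply becomes null (non-null propagation to parent fields is ignored).

```rust
use std::collections::BTreeMap;

/// Minimal stand-in for a JSON value (hypothetical, std only).
#[derive(Clone, Debug, PartialEq)]
enum Value {
    Null,
    Str(String),
    List(Vec<Value>),
    Object(BTreeMap<String, Value>),
}

/// A selected field and its sub-selections (empty for leaf fields).
#[derive(Clone, Debug)]
struct Selection {
    name: &'static str,
    children: Vec<Selection>,
}

fn leaf(name: &'static str) -> Selection {
    Selection { name, children: Vec::new() }
}

fn node(name: &'static str, children: Vec<Selection>) -> Selection {
    Selection { name, children }
}

/// Keeps only selected fields; a selected field absent from the data becomes null.
fn apply(selections: &[Selection], data: &Value) -> Value {
    match data {
        Value::Object(map) => {
            let mut out = BTreeMap::new();
            for sel in selections {
                let value = match map.get(sel.name) {
                    Some(v) if sel.children.is_empty() => v.clone(),
                    Some(v) => apply(&sel.children, v),
                    None => Value::Null,
                };
                out.insert(sel.name.to_string(), value);
            }
            Value::Object(out)
        }
        Value::List(items) => Value::List(items.iter().map(|v| apply(selections, v)).collect()),
        other => other.clone(),
    }
}

/// First pass drops what the filtered query does not select (e.g. key fields),
/// second pass reshapes the result to match the original client query.
fn format_response(filtered: &[Selection], original: &[Selection], data: &Value) -> Value {
    let without_private = apply(filtered, data);
    apply(original, &without_private)
}
```

With the example query, the accumulated data contains `email` because it is the entity key. The first pass removes it, and the second pass puts the `email` key back with a null value so the response still has the shape the client asked for.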

router-perf · 2023-05-30T09:53:36Z

CI performance tests

const - Basic stress test that runs with a constant number of users
no-graphos - Basic stress test, no GraphOS.
step - Basic stress test that steps up the number of users over time
reload - Reload test over a long period of time at a constant rate of users
xlarge-request - Stress test with 10Mb request payload
large-request - Stress test with a 1Mb request payload
xxlarge-request - Stress test with 100Mb request payload

We want the filtered query to appear under the original query's Studio entry, which is indexed by the `statsReportKey` field in the usage reporting structure. We call the bridge again with the original query to generate that signature without going through the entire planning process again.

SimonSapin · 2023-06-06T12:26:44Z

> The plugins need access to a compiler instance to observe and modify the query, so the caching query planner adds it as part of the query planner request (which means that now the cache and bridge get different request types).

This sounds similar to #3200, how do the two interact?

Geal · 2023-06-06T14:56:53Z

> This sounds similar to #3200, how do the two interact?

Once #3200 is merged, this should be updated to use the compiler coming from the supergraph request.

SimonSapin · 2023-06-09T12:49:09Z

apollo-router/src/query_planner/caching_query_planner.rs

+        );
+
+        let mut planner =
+            CachingQueryPlanner::new(delegate, schema, &configuration, IndexMap::new()).await;


This hard-codes the map of query planner plugins to empty, which makes most of this PR dead code. I’d feel better with at least one smoke test showing a query planner plugin in action but we can do that in a follow-up PR.

I agree, adding a test to exercise the API would be useful

SimonSapin · 2023-06-09T13:13:28Z

apollo-router/src/services/query_planner.rs

+/// [`Context`] for the request.
+#[derive(Clone, Derivative)]
+#[derivative(Debug)]
+pub(crate) struct CachingRequest {


This looks identical to struct Request above, why a separate struct?

Before #3200 was merged, the caching request did not have the compiler. But it's better to keep the types separate, because at some point CachingRequest will have an additional field to change the cache key.

Geal added 3 commits May 30, 2023 11:50

Reintroduce query planner plugins

b4ba8fc

We may need to observe and edit the query between the query plan caching and the query planner, so this brings back the query planner plugins we had initially

create a compiler and pass it around

7db1c4e

plan the filtered query

eaf5ce0

apollo-bot2 assigned Geal May 30, 2023

Geal added 2 commits May 30, 2023 14:47

lint

e6f4f4d

Geal changed the title ~~Geal/query planner plugins~~ query planner plugins May 30, 2023

Geal changed the title ~~query planner plugins~~ Query planner plugins May 30, 2023

changeset

5914b8d

Geal marked this pull request as ready for review May 30, 2023 15:33

fix salsa's log level in the test harness

093e5c0

the compiler is not created inside a blocking spawned task so now it appears again in the spans. Salsa's events are filtered in the router, but were not in the test harness, which failed some tests

Geal requested review from a team, garypen, SimonSapin, BrynCooke and o0Ignition0o and removed request for BrynCooke May 30, 2023 16:19

Geal added the component/query-planning label May 30, 2023

Geal added 6 commits May 31, 2023 10:11

filter salsa in tracing tests

69819d1

Merge branch 'dev' into geal/query-planner-plugins

7ea1da6

fix salsa filtering again

a54583e

Merge branch 'dev' into geal/query-planner-plugins

d6a8045

Merge branch 'dev' into geal/query-planner-plugins

59a216e

Geal and others added 2 commits June 7, 2023 16:13

Merge branch 'dev' into geal/query-planner-plugins

8a797eb

Fix rustc/clippy errors

8ea0393

SimonSapin reviewed Jun 9, 2023

o0Ignition0o approved these changes Jun 13, 2023

.changesets/feat_geal_query_planner_plugins.md (outdated, resolved)

Geal and others added 3 commits June 13, 2023 12:00

Apply suggestions from code review

f08d9dd

Co-authored-by: Jeremy Lempereur <jeremy.lempereur@iomentum.com> Co-authored-by: Simon Sapin <simon@apollographql.com>

fix mutex usage

3ee3355

rename query to original_query

60dc155

SimonSapin approved these changes Jun 13, 2023

SimonSapin added a commit that referenced this pull request Jun 13, 2023

Add visitor traits for transforming or traversing queries

b96c0a9

We expect to use this inside query planner plugins: #3177

SimonSapin mentioned this pull request Jun 13, 2023

Add visitor traits for transforming or traversing queries #3252

Merged

6 tasks

Merge branch 'dev' into geal/query-planner-plugins

ed35752

garypen reviewed Jun 13, 2023

.changesets/feat_geal_query_planner_plugins.md (outdated, resolved)

apollo-router/src/query_planner/bridge_query_planner.rs (resolved)

Geal and others added 2 commits June 14, 2023 09:47

Update .changesets/feat_geal_query_planner_plugins.md

313846f

Co-authored-by: Gary Pennington <gary@apollographql.com>

Merge branch 'dev' into geal/query-planner-plugins

1d27607

Geal enabled auto-merge (squash) June 14, 2023 08:11

Geal merged commit ae50056 into dev Jun 14, 2023

Geal deleted the geal/query-planner-plugins branch June 14, 2023 09:33

Geal mentioned this pull request Jun 15, 2023

generate an operation signature from the original query, not the filtered query #3199

Closed

o0Ignition0o mentioned this pull request Jun 20, 2023

prep release: v1.21.0 #3280

Merged
