Query deduplication #285
Conversation
In the perf tests (https://github.com/apollographql/router/pull/285/checks?check_run_id=4562777771):

This should get faster once #284 is merged.
A few local benchmarks. The router has two cores; 300k requests were issued by 1000 concurrent clients with "hey". (Response time histograms and latency distributions not reproduced here; only the totals survive.)

- main, 0146576 (using serde_json_bytes): Total data: 3576900000 bytes
- bc14a79, query deduplication without serde_json_bytes: Total data: 3576900000 bytes
- 1102ccf, query deduplication with serde_json_bytes: Total data: 3576900000 bytes
- 90cd1de, query deduplication with serde_json_bytes and using the Request as key in the wait map (couldn't do that before because serde_json::Value does not implement Hash): Total data: 3576900000 bytes
This looks similar to the caching mechanism Gary made... 🤔 The difference is that we don't store anything in a cache here, but other than that we queue identical requests the same way. Am I wrong? Maybe with a bit of refactoring it would be possible to share more of the code.
you're right, I copied that code and removed the storage part 😁
Ah!! But I didn't mean you should do caching! I meant we could generalize our implementation more so it supports this scenario (0-length cache?) too
OK, so as I understand it, this is a temporary boost until we also get caching on top of it. Looks good!
Those performance improvement figures are impressive.
apollo-router/src/http_subgraph.rs
```rust
async fn dedup(
    &self,
    request: graphql::Request,
) -> Result<graphql::Response, graphql::FetchError> {
    loop {
        let mut locked_wait_map = self.wait_map.lock().await;
        match locked_wait_map.get_mut(&request) {
            Some(waiter) => {
                // Register interest in the key
                let mut receiver = waiter.subscribe();
                drop(locked_wait_map);

                match receiver.recv().await {
                    Ok(value) => return value,
                    // there was an issue with the broadcast channel, retry fetching
                    Err(_) => continue,
                }
            }
            None => {
                let (tx, _rx) = broadcast::channel(1);
                locked_wait_map.insert(request.clone(), tx.clone());
                drop(locked_wait_map);

                let res = self.fetch(request.clone()).await;

                {
                    let mut locked_wait_map = self.wait_map.lock().await;
                    locked_wait_map.remove(&request);
                }

                // Let our waiters know
                let broadcast_value = res.clone();
                // Our use case is very specific, so we are sure that
                // we won't get any errors here.
                tokio::task::spawn_blocking(move || {
                    tx.send(broadcast_value)
                        .expect("there is always at least one receiver alive, the _rx guard; qed")
                })
                .await
                .expect("can only fail if the task is aborted or if the internal code panics, neither is possible here; qed");
                return res;
            }
        }
    }
}
```
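For context, a minimal sketch of the wait map shape the method above relies on, kept generic here so it stands alone (in the PR the key and value would be graphql::Request and Result<graphql::Response, graphql::FetchError>):

```rust
use std::collections::HashMap;
use std::sync::Arc;
use tokio::sync::{broadcast, Mutex};

// One broadcast sender per in-flight request, shared behind an async mutex:
// the first caller inserts a sender and fetches, later callers subscribe and wait.
type WaitMap<K, V> = Arc<Mutex<HashMap<K, broadcast::Sender<V>>>>;
```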
I'm definitely beginning to feel there is some kind of generic functionality which we can extract for common use. I might try and prototype something up...
yup, that could be generalized
I'm waiting on apollographql/federation#1423 before I continue on this
not compiling for now, will be reimplemented as a layer
Force-pushed from 0ff36a3 to 18c20c5
@Geal can you please ping me when I can review this? 🙏
it's far from ready 😅
Ah, I'm going to mute notifications; I dunno why but GitHub keeps asking me to review it 😂
mutations and subscriptions should not be deduplicated
```rust
let broadcast_value = res
    .as_ref()
    .map(|response| response.clone())
    .map_err(|e| e.to_string());
```
We use `BoxError` all over the code (mainly because `Buffer` enforces `BoxError`), and `BoxError` is not `Clone`. But here we want to copy the response to all of the waiting queries, even if we got an error. Since we cannot access the underlying type, the best we can do is convert it to a String and pass that around.
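A minimal sketch of that constraint, assuming tower's `BoxError` alias (the helper name is illustrative, not from the PR):

```rust
// tower's alias: a boxed trait object is not Clone, so a
// Result<T, BoxError> cannot be cloned once per waiting query.
type BoxError = Box<dyn std::error::Error + Send + Sync>;

// Downgrade the error to its Display output to get a cloneable result.
fn cloneable<T: Clone>(res: &Result<T, BoxError>) -> Result<T, String> {
    res.as_ref()
        .map(|value| value.clone())
        .map_err(|e| e.to_string())
}
```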
Is it OK as a String? Would it be better as a JSON Value?
We could serialize a JSON value to a string and pass it as the error. Unfortunately, the only way to interact with `std::error::Error` is through strings: https://doc.rust-lang.org/nightly/std/error/trait.Error.html
Could we convert to String and preserve the error as source()? If we did, would it deliver any value?
(I'm just trying to avoid the lowest common denominator of error types, but I guess we don't have many options.)
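One shape that idea could take, sketched under assumptions (the `SharedError` type is hypothetical, not in the PR): putting the original error behind an `Arc` makes the wrapper cloneable while still exposing the underlying error through `source()`:

```rust
use std::error::Error;
use std::fmt;
use std::sync::Arc;

// Hypothetical wrapper: Clone works because the original error is shared
// behind an Arc, and source() still hands back the underlying error.
#[derive(Clone, Debug)]
struct SharedError {
    message: String,
    source: Arc<dyn Error + Send + Sync>,
}

impl fmt::Display for SharedError {
    fn fmt(&self, f: &mut fmt::Formatter<'_>) -> fmt::Result {
        write!(f, "{}", self.message)
    }
}

impl Error for SharedError {
    fn source(&self) -> Option<&(dyn Error + 'static)> {
        // &(dyn Error + Send + Sync) coerces to &(dyn Error + 'static)
        Some(self.source.as_ref())
    }
}
```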
IIRC we discussed (with @BrynCooke I think?) using `FetchError` as the error type for all subgraph services and layers, converting to `BoxError` just before passing it back to the `Buffer` layer, but we deferred that because it would be a big change to introduce in this PR.
```rust
pub struct SubgraphRequest {
    pub http_request: http_compat::Request<Request>,

    pub context: Context,

    // added in this PR
    pub operation_kind: OperationKind,
}
```
I'm adding the operation kind here for now, as I am not sure yet whether it should be in the `Context` object: this is data that's only needed for this request and does not need to be shared with other requests, and there's no place in the HTTP request to put it.
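A hedged sketch of the idea (the enum's exact definition in the router may differ): carrying the kind on the request lets the dedup layer skip mutations and subscriptions, which must not be deduplicated:

```rust
// Illustrative definition; derives and variants in the PR may differ.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
pub enum OperationKind {
    Query,
    Mutation,
    Subscription,
}

// Only queries are safe to deduplicate: mutations have side effects,
// and subscriptions have long-lived semantics of their own.
fn should_deduplicate(kind: OperationKind) -> bool {
    matches!(kind, OperationKind::Query)
}
```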
Oh, that's interesting. We have it deep in the query planner; I use a rough variant of this in the HTTP GET related PR, but that doesn't fit your use case.
It has to be in the query planner, and from there it's used in subgraph queries. So both use cases are linked.
```rust
let dedup_layer = QueryDeduplicationLayer;
let mut subgraph_service = BoxService::new(dedup_layer.layer(
```
Adding the dedup layer here has less impact on the types than making the subgraph service a `BoxCloneService`, then adding the layer, then boxing it again.
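For readers unfamiliar with tower, a minimal sketch of what such a layer looks like (the names mirror the PR, but the bodies here are illustrative and the service internals are elided):

```rust
use tower::Layer;

// The wrapping service; its wait-map plumbing is elided in this sketch.
struct QueryDeduplicationService<S> {
    inner: S,
}

impl<S> QueryDeduplicationService<S> {
    fn new(inner: S) -> Self {
        QueryDeduplicationService { inner }
    }
}

struct QueryDeduplicationLayer;

// A tower Layer just wraps the inner subgraph service in the deduplicating
// service, which is what `dedup_layer.layer(...)` does above.
impl<S> Layer<S> for QueryDeduplicationLayer {
    type Service = QueryDeduplicationService<S>;

    fn layer(&self, service: S) -> Self::Service {
        QueryDeduplicationService::new(service)
    }
}
```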
Looks good generally. I have some questions; when you have time to look at them, let me know how you want to proceed and I can approve the PR from there.
```rust
// this assumes headers are in the same order
for (name, value) in self.inner.headers() {
    name.hash(state);
    value.hash(state);
}
```
I really dislike HeaderMap. We can't even sort it...
I suppose the downside of not spotting duplicates is that our query can't be de-duplicated. Not a bug, but it defeats the purpose of de-duplicating.
We can do something a bit crafty here. `HeaderValue` does implement `Ord`, and `HeaderName` always converts to `&str`, so:
```rust
// Map header names into &str so we can sort
let mut tmp: Vec<(&str, &HeaderValue)> = self
    .inner
    .headers()
    .iter()
    .map(|(k, v)| (k.as_str(), v))
    .collect();
tmp.sort();
for (name, value) in tmp {
    name.hash(state);
    value.hash(state);
}
```
would give us a consistent ordering for hashing purposes. I think we could do the same for `Eq` as well.
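The `Eq` counterpart could look like this (a sketch under the same assumptions; `headers_eq` is an illustrative name, not code from the PR):

```rust
use http::header::{HeaderMap, HeaderValue};

// Compare two header maps order-insensitively by sorting copies of the
// (name, value) pairs before comparing them.
fn headers_eq(a: &HeaderMap, b: &HeaderMap) -> bool {
    let mut left: Vec<(&str, &HeaderValue)> =
        a.iter().map(|(k, v)| (k.as_str(), v)).collect();
    let mut right: Vec<(&str, &HeaderValue)> =
        b.iter().map(|(k, v)| (k.as_str(), v)).collect();
    left.sort();
    right.sort();
    left == right
}
```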
We should refrain from sorting the headers, since their order can have an impact. Example: the `Accept` header can have multiple values, which can come either comma-separated in one header value or as multiple separate `Accept` headers. In that second case, if we sort the headers, that might reorder the values and change the behaviour.
The assumption for the cache key here is that similar queries coming from the same client will have the same shape (same user agent, same list of headers in the same order...).
What we could do, though, is decide which headers we consider for the cache key, as sketched below. Once we do that, we would get stronger guarantees, and we could expose the ordering issues in the docs. Could we explore that in another PR though?
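A hypothetical version of that allowlist idea (the header names and the helper are illustrative only):

```rust
use std::hash::{Hash, Hasher};

use http::header::HeaderMap;

// Hypothetical allowlist: only these headers feed the cache key, visited in
// a fixed order, so incidental headers can't defeat deduplication.
const HASHED_HEADERS: &[&str] = &["accept", "content-type", "authorization"];

fn hash_selected_headers<H: Hasher>(headers: &HeaderMap, state: &mut H) {
    for name in HASHED_HEADERS {
        // get_all preserves the insertion order of repeated headers
        for value in headers.get_all(*name) {
            name.hash(state);
            value.hash(state);
        }
    }
}
```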
We are only sorting our copy; the original headers are unaffected. I agree that we can't touch the original headers, which we couldn't sort anyway since `HeaderName` doesn't implement `Ord`.
Does the original order of headers matter for hashing purposes? I.e., don't we want the ordering to be consistent to help improve our de-duplication chances?
I'm fine with moving this discussion to a follow-up. I think it's important to make the comparison less fragile, but it doesn't need to be decided before merging to the tower branch. If you don't want to put my suggestion in, perhaps preserve it in the follow-up issue?
yes, that's definitely a good idea to revisit it and find a robust solution
we will land this after #429
LGTM, I'd favor serde camelCase directives over "rename" but it's just a nit
alright, now that #429 is merged, I could remove the manual deserialization for …
related:
This is an experiment around query deduplication. It is done crudely by having a mutex and a wait map in the subgraph fetcher (as is done in the cache), and hashing over a JSON serialization of the request (since `Hash` is not implemented on `serde_json::Value`).
In my local tests, using the products and reviews subgraphs from the perf test project, assigning 2 cores to the router, with 100 concurrent clients, I see:
So even a naïve implementation gets a 1.8x performance boost, with a 10x reduction of CPU time in subgraphs.
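A minimal sketch of that crude key, under the stated assumptions (`request_key` is an illustrative name; the point is hashing the serialized request, since `serde_json::Value` has no `Hash` impl):

```rust
use std::collections::hash_map::DefaultHasher;
use std::hash::{Hash, Hasher};

// Serialize the request to a JSON string and hash that string,
// working around the missing Hash impl on serde_json::Value.
fn request_key(request: &serde_json::Value) -> u64 {
    let serialized =
        serde_json::to_string(request).expect("a Value always serializes to JSON");
    let mut hasher = DefaultHasher::new();
    serialized.hash(&mut hasher);
    hasher.finish()
}
```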
Current issues:
- both cores are stuck at 80%, so I suspect some contention over the mutex (resolved: artefact of the benchmarks I was doing, I can saturate the cores easily now)
- serializing the JSON value instead of a Hash implementation is a hack (fixed with Store JSON strings in a Bytes instance #284)