Add query batching capabilities to the schema stitching layer #524

michaelstaib · 2019-01-23T00:09:24Z

Batching

Introduction

The current schema stitching layer send request to the remote queries as they appear. This can be problematic since we will run into the same n+1 issues then with database calls. With this issue we will introduce a new batching layer that will be hidden behind the IRemoteQueryClient.

Rewriting the Query

The IRemoteQueryClient is the way query against a remote schema. Each IRemoteQueryClient instance represents one remote schema. The stitching layer will delegate parts of a query against the stitching layer to a remote schema.

We now want the IRemoteQueryClient to act like a DataLoader and merge requests into one request and batch this one to the remote schema. There are a view things to consider here:

The batch size has to be configurable
This is important since the remote schema might have a max allowed complexity.
We have auto generated remote requests and we have requests written by developers themeselfs.

Let us say we have three requests agains one remote schema. The first and second requests are auto-generated requests by the stitching layer.
The third request is created by a developer.

Request 1:

query foo($global: String, $arg_var: String) @__hc_auto {
  a(a: $global) {
    b(b: $arg_var) {
      c
      ...abc
    }
  }
}

fragment abc on C {
  d
}

Request 2:

query bar($global: String, $arg_var: String) @__hc_auto {
  b: a(a: $global) {
    b(b: $arg_var) {
      c
    }
  }
  c: a(a: $global) {
    b(b: $arg_var) {
      c
    }
  }
}

Request 3:

query baz($a: String $b: String) {
  d(a: $a) {
    e(b: $b) {
      ... def
    }
  }
}

fragment def on E {
  f {
    .. abc
  }
}

fragment abc on F {
  g
}

Request 1 and request 2 are basically branches from the original query whereas the developer request might be something completly different.

Variables from the original request are not rewritten and are merged in the new request so if request 1 and 2 are both using the variable '$global' from the original request than we just have to declare this variable once in the merged request without changing this. Variables that are defined by the user or generated by the stitching engine will be rewritten to have a name prefix that identifies the request from which they stem from.

In order to avoid field collisions and in order to be able to pick the result apart we have to apply field aliases to the root fields. Like with local variables we will combine the request prefix with the response name in the following way: {requestPrefix}_{responseName}.

The response name is the alias name of a field if the alias name is specified; otherwise the response name is the field name.

Lastly, fragment definitions from the original request are not rewritten and are integrated and merges as they are. Fragment definitions from user-defined queries are rewritten to use the request prefix in the way root field aliases are rewritten to accomodate the request prefix.

query merged($global: String $__req_1_arg_var: String $__req_2_arg_var: String $__req_3_a: String $__req_3_b: String) {
  __req_1_a: a(a: $global) {
    b(b: $__req_1_arg_var) {
      c
    }
  }

  __req_2_b: a(a: $global) {
    b(b: $__req_2_arg_var) {
      c
    }
  }

  __req_2_c: a(a: $global) {
    b(b: $__req_2_arg_var) {
      c
    }
  }

  __req_3_d: d(a: $_req_3_a) {
    e(b: $_req_3_b) {
      ... _req_3_def
    }
  }
}

fragment abc on C {
  d
}

fragment _req_3_def on E {
  f {
    .. _req_3_abc
  }
}

fragment _req_3_abc on F {
  g
}

Handling the Response

Errors

Field errors that have the path property defined will be delegated to the response of their request since the first path element will tell us to which request we have to delegate the error.

Errors that do not have the path property defined will be delegated to one of the results so that they are not outputted multiple times.

If the remote schema does only return errors without returning data then we will send exceptions to the result tasks.

Data

The data can be easily divided by using the root response name since we have used request aliases.

Extensions

For now we will ignore any extension data.

The text was updated successfully, but these errors were encountered:

michaelstaib · 2019-01-23T00:09:58Z

#341

michaelstaib · 2019-02-07T15:31:18Z

This one is now implemented and will be included with 0.8.0-preview.1

michaelstaib · 2019-02-07T15:34:33Z

We opted to not mark operations with @__hc_auto since we would have to parse the query that to get this information.

We now are using the request properties and added a property IsAutoGenerated.

We could make the merged queries smaller, but I would lead to a more complex rewriter, so for now we are living with the slightly larger queries and let leave it to the remote schema to optimize these.

Also we might want to deactivate batching or fix the batch size. Or maybe in future we want to have a fixed batch complexity.

michaelstaib added the enhancement label Jan 23, 2019

michaelstaib added this to the 0.7.0 milestone Jan 23, 2019

michaelstaib self-assigned this Jan 23, 2019

michaelstaib mentioned this issue Jan 23, 2019

Schema Stitching Part 1 #341

Closed

michaelstaib modified the milestones: 0.7.0, 0.7.1, 0.8.0 Jan 29, 2019

michaelstaib assigned rstaib Feb 4, 2019

michaelstaib added the design label Feb 4, 2019

michaelstaib mentioned this issue Feb 4, 2019

Auto-Stitching #561

Closed

michaelstaib added in progress and removed design labels Feb 4, 2019

michaelstaib mentioned this issue Feb 4, 2019

Stitching Merge Queries #585

Merged

michaelstaib closed this as completed Feb 7, 2019

michaelstaib removed the in progress label Feb 7, 2019

michaelstaib removed the 🎉 enhancement label Oct 28, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add query batching capabilities to the schema stitching layer #524

Add query batching capabilities to the schema stitching layer #524

michaelstaib commented Jan 23, 2019 •

edited

Loading

michaelstaib commented Jan 23, 2019

michaelstaib commented Feb 7, 2019

michaelstaib commented Feb 7, 2019

Add query batching capabilities to the schema stitching layer #524

Add query batching capabilities to the schema stitching layer #524

Comments

michaelstaib commented Jan 23, 2019 • edited Loading

Batching

Introduction

Rewriting the Query

Handling the Response

Errors

Data

Extensions

michaelstaib commented Jan 23, 2019

michaelstaib commented Feb 7, 2019

michaelstaib commented Feb 7, 2019

michaelstaib commented Jan 23, 2019 •

edited

Loading