feat: add operation orchestrator #694

milesziemer · 2024-04-09T21:48:32Z

Companion PR: awslabs/aws-sdk-swift#1435

Description of changes

Client libraries

Adds Orchestrator which implements the operation execution pipeline.
It uses the following new "runtime components" SelectAuthScheme, ApplyEndpoint,
ApplySigner, and ExecuteRequest, in addition to the existing RetryStrategy and
Interceptor components. Orchestrator supports executing a full request, in
addition to creating a presigned request - same as OperationStack.

I say "runtime components" because these new interfaces are essentially proxies
for the actual components. For example, SelectAuthScheme is just a convenient
way of using identity and auth within Orchestrator. This makes it easier to
migrate to Interceptors & Orchestrator, because the middlewares that currently
implement these components can just conform to the new interfaces. Again take
as an example SelectAuthScheme - AuthSchemeMiddleware currently does all the
work, so we just adapt it to the SelectAuthScheme interface and we can pass it
directly to Orchestrator. In the future, we may not need these interfaces at
all because there will be no middleware. But for now they help with transition.

All existing middleware (except for RetryMiddleware) have been extended to
either implement one of the new runtime component interfaces, or Interceptor.
RetryMiddleware's implementation was just copied (roughly) into Orchestrator because
it ended up being simpler than trying to abstract it into a pseudo component.
This does have the unfortunate consequence that there are essentially two
implementations of the core retry functionality for the time being. I also
adapted SdkHttpClient to be useable as ExecuteRequest.

OrchestratorBuilder is used in generated operation method bodies to build
the operation Orchestrator with the required runtime components. This is
analagous to how an OperationStack is currently used to tie together the
middleware needed for an operation.

There are a few other new interfaces too, under Message. These are the starting
points for Requests and Responses, and will be a unifying interface for all
'transport' message types. They were added here because Orchestrator actually
needs to use some pieces of the request, like the host for example, and it also
helps with the transition from middleware. Currently, serialization is done in
multiple steps, operation on a request builder. In order to replicate this
in Orchestrator, I've added RequestMessageSerializer which is an interface that
takes an operation input, and a RequestMessageBuilder, and modifies the builder
with whatever serialized data it pulled from the input. Because Orchestrator
needs to store the serializers, we also need a type for the builder, which is
where RequestMessageBuilder comes in. These interfaces are minimal right now,
but will be expanded in the future so we can move more middleware/interceptors
to being protocol-agnostic (right now they're all http specific).
but will be expanded in the future so we can move more middleware/interceptors
to being protocol-agnostic (right now they're all http specific).

A comparison of a current generated operation body with what it will look like
using Orchestrator (I've omitted some details for readability):

public func getFoo(input: GetFooInput) async throws GetFooOutput {
    let context = HttpContextBuilder()
        ...
        .build()
    var operation = OperationStack(id: "getFoo")
    operation.initializeStep.intercept(position: .after, middleware: URLPathMiddleware(...))
    operation.initializeStep.intercept(position: .after, middleware: URLHostMiddleware())
    operation.serializeStep.intercept(position: .after, middleware: QueryItemMiddleware(...))
    operation.serializeStep.intercept(position: .after, middleware: HeaderMiddleware(...))
    operation.buildStep.intercept(position: .before, middleware: EndpointResolverMiddleware(...))
    operation.buildStep.intercept(position: .before, middleware: AuthSchemeMiddleware())
    operation.finalizeStep.intercept(position: .after, middleware: RetryMiddleware(...))
    operation.finalizeStep.intercept(position: .before, middleware: SignerMiddleware())
    operation.deserializeStep.intercept(position: .after, middleware: DeserializeMiddleware(...))
    operation.deserializeStep.intercept(position: .after, middleware: LoggerMiddleware(...))
    return try await operation.handleMiddleware(context: context, input: input, next: client.getHandler())
}

public func getFoo(input: GetFooInput) async throws GetFooOutput {
    let context = HttpContextBuilder()
        ...
        .build()
    let builder = OrchestratorBuilder()
    builder.interceptors.add(URLPathMiddleware(...))
    builder.interceptors.add(URLHostMiddleware())
    builder.serialize(QueryItemMiddleware(...))
    builder.serialize(HeaderMiddleware(...))
    builder.applyEndpoint(EndpointResolverMiddleware(...))
    builder.selectAuthScheme(AuthSchemeMiddleware())
    // These used to be passed to retry middleware
    builder.retryStrategy(...)
    builder.retryErrorInfoProvider(...)
    builder.applySigner(SignerMiddleware())
    builder.deserialize(DeserializeMiddleware(...))
    builder.interceptors.add(LoggerMiddleware(...))
    let op = builder.attributes(context) // context can be changed in the function body
        .executeRequest(client)
        .build()
    return try await op.execute(input: input)
}

Some other changes were made to the existing interceptors interfaces:

Made InterceptorContext a reference type so we don't need to use inout everywhere
Switch InterceptorContext 'output' -> 'result' which is a Result type containing
the operation output. This allows us to capture error responses and other errors
that occur during execution, so we don't immediately throw from wherever the error
occurs.

Codegen

Updates the codegenerator to be able to generate operations that use
Orchestrator and Interceptors instead of OperationStack. This is
configurable using the useInterceptors codegen setting. Because most
middleware was extended to be useable by Orchestrator, codegen only
needed to be able to render middleware a little differently based on
whether useInterceptors was set.

Ultimately we won't need useInterceptors, and we can fully refactor
the codegenerator + libraries, but for now this makes it easier and
safer to migrate.

Some test cases had to be updated to expect generic params where they
were previously inferred (they can't be inferred with interceptors,
but the middleware initializer is the same for both code paths)

Scope

Bug fix (non-breaking change which fixes an issue)
New feature (non-breaking change which adds functionality)

By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.

syall · 2024-04-30T15:28:25Z

Sources/ClientRuntime/Orchestrator/Orchestrator.swift

+    // TODO: This only throws because getting the partitionId and initial token are fallible.
+    //  That's fine, but where do we want to go if they fail? Do we just skip retries entirely
+    //  and go to an attempt?


I think it should just fail. I'm not sure where partitionId is populated, but failing to acquire the initial retry token is an indication of a buggy retry strategy implementation.

smithy-swift-codegen/src/main/kotlin/software/amazon/smithy/swift/codegen/SwiftSettings.kt

...hy-swift-codegen/src/main/kotlin/software/amazon/smithy/swift/codegen/MiddlewareGenerator.kt

Tests/ClientRuntimeTests/OrchestratorTests/OrchestratorTests.swift

dayaffe · 2024-05-03T17:08:42Z

Sources/ClientRuntime/Interceptor/Interceptors.swift

+/// These methods throw the last error that occurred when executing a given hook, so if multiple
+/// interceptors fail, only the last error is thrown. The rest are logged (TODO)


We have a goal to remove all TODOs in our SDK, if we plan to log the additional errors let's either include that logic or create a tracking ticket for that work and pull it into sprint

Nice catch, forgot about this one.

dayaffe · 2024-05-03T17:25:35Z

Sources/ClientRuntime/Interceptor/Interceptors.swift

+        var error: Error?
+        for i in interceptors {
+            do {
+                try await i.readBeforeExecution(context: context)
+            } catch let e {
+                error = e
+            }
+        }
+        if let error = error {
+            throw error
+        }


Lines 184-194 are repeated in each of these funcs with only one line differing, can we create a higher-order function that takes the relevant function as input? Should reduce code duplication and shorten this file by a lot

Ex. I didn't test this but maybe something like the following (also note potential suggestion on accomplishing logging)

internal func executeInterceptors<T>( // can make the name better context: InterceptorType.InterceptorContextType, operation: @escaping (InterceptorType) async throws -> T // ) async throws { var error: Error? // suggestion: you could change this to a list for i in interceptors { do { try await operation(i) } catch let e { error = e // append to list } } if let error = error { throw error // loop through list of errors and log the errors then throw the last one } }

then would modify these functions like so:

internal func readBeforeExecution(context: InterceptorType.InterceptorContextType) async throws { try await executeInterceptors(context: context) { interceptor in try await interceptor.readBeforeExecution(context: context) } }

dayaffe · 2024-05-03T17:28:20Z

Sources/ClientRuntime/Networking/Http/Middlewares/ContentLengthMiddleware.swift

        case .data(let data):
-            input.headers.update(name: "Content-Length", value: String(data?.count ?? 0))
+            builder.headers.update(name: "Content-Length", value: String(data?.count ?? 0))


nit: could move data?.count ?? 0 but I'll leave that up to you

let contentLength = data?.count ?? 0 builder.headers.update(name: "Content-Length", value: String(contentLength))

dayaffe · 2024-05-03T17:36:10Z

Sources/ClientRuntime/Networking/Http/Middlewares/RequestBody/EnumBodyMiddleware.swift

+
+    public func apply(input: OperationStackInput, builder: SdkHttpRequestBuilder, attributes: HttpContext) throws {
+        let bodyString = input[keyPath: keyPath]?.rawValue ?? ""
+        builder.withBody(.data(Data(bodyString.utf8)))


nit: can use a variable for readability

// Convert to utf8 let bodyData = Data(bodyContent.utf8) builder.withBody(.data(bodyData))

dayaffe · 2024-05-03T17:45:42Z

...amazon/smithy/swift/codegen/integration/middlewares/RequestTestEndpointResolverMiddleware.kt

+        if (ctx.settings.useInterceptors) {
+            writer.write(
+                """
+                builder.applyEndpoint({ (request, scheme, attributes) in
+                    return request.toBuilder()
+                        .withMethod(attributes.getMethod())
+                        .withPath(attributes.getPath())
+                        .withHost("\(attributes.getHostPrefix() ?? "")\(attributes.getHost() ?? "")")
+                        .build()
+                })
+                """.trimMargin()
+            )
+        } else {
+            writer.openBlock(
+                "$operationStackName.${middlewareStep.stringValue()}.intercept(position: ${position.stringValue()}, id: \"${name}\") { (context, input, next) -> \$N<$outputShapeName> in", "}",
+                ClientRuntimeTypes.Middleware.OperationOutput
+            ) {
+                writer.write("input.withMethod(context.getMethod())")
+                writer.write("input.withPath(context.getPath())")
+                writer.write("let host = \"\\(context.getHostPrefix() ?? \"\")\\(context.getHost() ?? \"\")\"")
+                writer.write("input.withHost(host)")
+                writer.write("return try await next.handle(context: context, input: input)")
+            }


Very basic Q: Because I am unfamiliar with middlewares vs interceptors, is interceptors something that wont always be used (hence the ctx.settings.useInterceptors)? In which case, do customizations need to follow this same code logic of if interecptors then x else y for middleware?

In the short term, yes. Ultimately we won't have useInterceptors config setting, and the SDK will only use interceptors. But to make the transition easier, I made it configurable here.

syall · 2024-05-07T01:15:11Z

Sources/ClientRuntime/Interceptor/Interceptors.swift

+        for i in interceptors {
+            do {
+                try await i.readBeforeExecution(context: context)
+            } catch let e {
+                error = e
+            }
+        }
+        if let error = error {
+            throw error
+        }


This is something I didn't catch, but is there a reason why we don't fail on the first error encountered?

I think its a safer contract than failing on the first error encountered, in case you have interceptors dependent on something occurring in a previous interceptor, like setting up some resources. There's still issues, but at least you know your interceptor will run.

Adds Orchestrator which implements the operation execution pipeline. It uses the following new "runtime components" SelectAuthScheme, ApplyEndpoint, ApplySigner, and ExecuteRequest, in addition to the existing RetryStrategy and Interceptor components. Orchestrator supports executing a full request, in addition to creating a presigned request - same as OperationStack. I say "runtime components" because these new interfaces are essentially proxies for the actual components. For example, SelectAuthScheme is just a convenient way of using identity and auth within Orchestrator. This makes it easier to migrate to Interceptors & Orchestrator, because the middlewares that currently implement these components can just conform to the new interfaces. Again take as an example SelectAuthScheme - AuthSchemeMiddleware currently does all the work, so we just adapt it to the SelectAuthScheme interface and we can pass it directly to Orchestrator. In the future, we may not need these interfaces at all because there will be no middleware. But for now they help with transition. All existing middleware (except for RetryMiddleware) have been extended to either implement one of the new runtime component interfaces, or Interceptor. RetryMiddleware's implementation was just copied into Orchestrator because it ended up being simpler than trying to abstract it into a pseudo component. This does have the unfortunate consequence that there are essentially two implementations of the core retry functionality for the time being. I also adapted SdkHttpClient to be useable as ExecuteRequest. OrchestratorBuilder is used in generated operation method bodies to build the operation Orchestrator with the required runtime components. This is analagous to how an OperationStack is currently used to tie together the middleware needed for an operation. There are a few other new interfaces too, under Message. These are the starting points for Requests and Responses, and will be a unifying interface for all 'transport' message types. They were added here because Orchestrator actually needs to use some pieces of the request, like the host for example, and it also helps with the transition from middleware. Currently, serialization is done in multiple steps, operation on a request builder. In order to replicate this in Orchestrator, I've added RequestMessageSerializer which is an interface that takes an operation input, and a RequestMessageBuilder, and modifies the builder with whatever serialized data it pulled from the input. Because Orchestrator needs to store the serializers, we also need a type for the builder, which is where RequestMessageBuilder comes in. These interfaces are minimal right now, but will be expanded in the future so we can move more middleware/interceptors to being protocol-agnostic (right now they're all http specific). but will be expanded in the future so we can move more middleware/interceptors to being protocol-agnostic (right now they're all http specific). A comparison of a current generated operation body with what it will look like using Orchestrator (I've omitted some details for readability): ``` public func getFoo(input: GetFooInput) async throws GetFooOutput { let context = HttpContextBuilder() ... .build() var operation = OperationStack(id: "getFoo") operation.initializeStep.intercept(position: .after, middleware: URLPathMiddleware(...)) operation.initializeStep.intercept(position: .after, middleware: URLHostMiddleware()) operation.serializeStep.intercept(position: .after, middleware: QueryItemMiddleware(...)) operation.serializeStep.intercept(position: .after, middleware: HeaderMiddleware(...)) operation.buildStep.intercept(position: .before, middleware: EndpointResolverMiddleware(...)) operation.buildStep.intercept(position: .before, middleware: AuthSchemeMiddleware()) operation.finalizeStep.intercept(position: .after, middleware: RetryMiddleware(...)) operation.finalizeStep.intercept(position: .before, middleware: SignerMiddleware()) operation.deserializeStep.intercept(position: .after, middleware: DeserializeMiddleware(...)) operation.deserializeStep.intercept(position: .after, middleware: LoggerMiddleware(...)) return try await operation.handleMiddleware(context: context, input: input, next: client.getHandler()) } ``` ``` public func getFoo(input: GetFooInput) async throws GetFooOutput { let context = HttpContextBuilder() ... .build() let builder = OrchestratorBuilder() builder.interceptors.add(URLPathMiddleware(...)) builder.interceptors.add(URLHostMiddleware()) builder.serialize(QueryItemMiddleware(...)) builder.serialize(HeaderMiddleware(...)) builder.applyEndpoint(EndpointResolverMiddleware(...)) builder.selectAuthScheme(AuthSchemeMiddleware()) // These used to be passed to retry middleware builder.retryStrategy(...) builder.retryErrorInfoProvider(...) builder.applySigner(SignerMiddleware()) builder.deserialize(DeserializeMiddleware(...)) builder.interceptors.add(LoggerMiddleware(...)) let op = builder.attributes(context) // context can be changed in the function body .executeRequest(client) .build() return try await op.execute(input: input) } ``` Note: some middleware, like EndpointResolverMiddleware, are codegenerated, which will be handled in a followup commit. Some other changes were made to the existing interceptors interfaces: - Made InterceptorContext a reference type so we don't need to use `inout` everywhere - Switch InterceptorContext 'output' -> 'result' which is a Result type containing the operation output. This allows us to capture error responses and other errors that occur during execution, so we don't immediately throw from wherever the error occurs. One other change was made to the existing interceptors interface InterceptorContext, to make it require a reference type, so we don't need to use `inout` everywhere.

Updates the codegenerator to be able to generate operations that use Orchestrator and Interceptors instead of OperationStack. This is configurable using the `useInterceptors` codegen setting. Because most middleware was extended to be useable by Orchestrator, codegen only needed to be able to render middleware a little differently based on whether `useInterceptors` was set. Ultimately we won't need `useInterceptors`, and we can fully refactor the codegenerator + libraries, but for now this makes it easier and safer to migrate. Some test cases had to be updated to expect generic params where they were previously inferred (they can't be inferred with interceptors, but the middleware initializer is the same for both code paths)

This was referenced Apr 9, 2024

feat: add operation orchestrator #693

Closed

feat: support orchestrator awslabs/aws-sdk-swift#1435

Merged

milesziemer mentioned this pull request Apr 17, 2024

feat: add interceptors interfaces #685

Merged

2 tasks

milesziemer force-pushed the interceptors-impl branch 2 times, most recently from 1193633 to 76eb2a8 Compare April 25, 2024 17:46

milesziemer marked this pull request as ready for review April 25, 2024 21:53

milesziemer force-pushed the interceptors-impl branch from 76eb2a8 to 4375597 Compare April 30, 2024 13:34

milesziemer requested review from dayaffe and jbelkins April 30, 2024 15:30

milesziemer force-pushed the interceptors-impl branch from 4375597 to 1664363 Compare April 30, 2024 15:36

syall approved these changes Apr 30, 2024

View reviewed changes

Tests/ClientRuntimeTests/OrchestratorTests/OrchestratorTests.swift Show resolved Hide resolved

milesziemer force-pushed the interceptors-impl branch 2 times, most recently from 66d995a to 3c81a81 Compare May 1, 2024 15:59

dayaffe requested changes May 3, 2024

View reviewed changes

syall mentioned this pull request May 6, 2024

SRA Observability awslabs/aws-sdk-swift#696

Closed

4 tasks

syall reviewed May 7, 2024

View reviewed changes

milesziemer force-pushed the interceptors-impl branch 2 times, most recently from b30177c to 804529c Compare May 7, 2024 15:50

dayaffe approved these changes May 7, 2024

View reviewed changes

milesziemer added 5 commits May 8, 2024 12:45

resolve merge conflicts

f185839

Update orchestrator for generic input/output in interceptors

b17f276

reduce duplication in Interceptors, log interceptor errors

2c87fe7

milesziemer force-pushed the interceptors-impl branch from 804529c to 2c87fe7 Compare May 8, 2024 16:45

milesziemer merged commit cf52198 into main May 8, 2024
17 checks passed

milesziemer deleted the interceptors-impl branch May 8, 2024 17:55

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: add operation orchestrator #694

feat: add operation orchestrator #694

milesziemer commented Apr 9, 2024 •

edited

Loading

syall Apr 30, 2024

dayaffe May 3, 2024

milesziemer May 7, 2024

milesziemer May 7, 2024

dayaffe May 3, 2024

milesziemer May 7, 2024

dayaffe May 3, 2024

dayaffe May 3, 2024

dayaffe May 3, 2024

milesziemer May 7, 2024

syall May 7, 2024

milesziemer May 7, 2024

		/// These methods throw the last error that occurred when executing a given hook, so if multiple
		/// interceptors fail, only the last error is thrown. The rest are logged (TODO)

feat: add operation orchestrator #694

feat: add operation orchestrator #694

Conversation

milesziemer commented Apr 9, 2024 • edited Loading

Description of changes

Client libraries

Codegen

Scope

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

milesziemer commented Apr 9, 2024 •

edited

Loading