Return transport errors to the caller #399

bruwozniak · 2023-03-16T17:25:34Z

resolves #389

This is just cherry picking relevant commits from https://github.com/tikue/tarpc/tree/gats and adding a test with a mock transport that errors out on sending.

tikue

Awesome, thanks so much! Just a couple of comments :)

tarpc/src/server.rs

tarpc/src/client.rs

tarpc/tests/compile_fail/must_use_request_dispatch.stderr

bruwozniak · 2023-03-21T17:23:05Z

@tikue Rebased and extended the test to all cases (I tried to compress it into one tabular test not to cause bloat). Please take anoter look.

tikue · 2023-03-22T00:06:16Z

tarpc/src/client.rs

-        let (tx, mut rx) = oneshot::channel();
-        let resp = send_request(&mut channel, "hi", tx, &mut rx).await;
-        assert!(dispatch.as_mut().poll(cx).is_pending());
+        for (error, cause) in vec![


Ugh, I'm sorry about this, I just realized I had left my comment about test coverage on the wrong enum. This test, which is testing RpcError::Send, was totally fine as you had it before.

What I meant to say, but failed to, was that RpcError had some other variants added in this PR, and those new variants could use test coverage—are they returned in the proper circumstances? For example, Disconnected was replaced by Shutdown and Receive errors.

The additional test cases I had in mind:

RpcError::Shutdown is returned when the dispatch task has shut down, preventing the client from:

initiating an RPC

completing an RPC

An RPC should be completed with RpcError::Receive if the dispatch task receives an error when attempting to receive a message from the transport. Failing to receive is a permanent error, which shouldn't change, but the dispatch task should still send the more specific Receive error to all pending RPCs (of which there could be a few).

I've added comments to the specific places in the code that correspond to these test cases. I actually think RpcError::Receive is unused in this PR, so you can remove it if you'd like, and I can file an issue to add it back later.

No worries @tikue , it happens. I think now I added what you had in mind. I don't think it makes sense to delete what I already added though? It seems useful for the future and more tests usually don't hurt, WDYT?

That sounds good to me, though I think these are really 5 different test cases (should be different #[test] fns). It's the match expression that leads me to think that — each error variant has different test logic.

I like the test coverage, but would prefer to put the test cases in different fns.

tarpc/src/client.rs

Previously, InFlightRequests required the client response type to be a server response. However, this prevented injection of non-server responses: for example, if the client fails to send a request, it should complete the request with an IO error rather than a server error.

Previously, a client channel would immediately disconnect when encountering an error in Transport::try_send. One kind of error that can occur in try_send is message validation, e.g. validating a message is not larger than a configured frame size. The problem with shutting down the client immediately is that debuggability suffers: it can be hard to understand what caused the client to fail. Also, these errors are not always fatal, as with frame size limits, so complete shutdown was extreme. By bubbling up errors, it's now possible for the caller to programmatically handle them. For example, the error could be walked via anyhow::Error: 2023-01-10T02:49:32.528939Z WARN client: the client failed to send the request Caused by: 0: could not write to the transport 1: frame size too big

tikue

Just 2 minor things and then I think we're good to merge :)

tarpc/src/client/in_flight_requests.rs

tikue · 2023-03-24T06:40:56Z

tarpc/src/client.rs

-        let (tx, mut rx) = oneshot::channel();
-        let resp = send_request(&mut channel, "hi", tx, &mut rx).await;
-        assert!(dispatch.as_mut().poll(cx).is_pending());
+        for (error, cause) in vec![


That sounds good to me, though I think these are really 5 different test cases (should be different #[test] fns). It's the match expression that leads me to think that — each error variant has different test logic.

I like the test coverage, but would prefer to put the test cases in different fns.

tikue

Thanks so much for making this great improvement!

bruwozniak · 2023-03-27T07:39:19Z

@tikue thanks you for very helpful guidance. May I ask for a release with this change so we can include it in our project?

tikue · 2023-03-27T07:46:53Z

I published v0.32.0 immediately after merging this :)

* Make client::InFlightRequests generic over result. Previously, InFlightRequests required the client response type to be a server response. However, this prevented injection of non-server responses: for example, if the client fails to send a request, it should complete the request with an IO error rather than a server error. * Gracefully handle client-side send errors. Previously, a client channel would immediately disconnect when encountering an error in Transport::try_send. One kind of error that can occur in try_send is message validation, e.g. validating a message is not larger than a configured frame size. The problem with shutting down the client immediately is that debuggability suffers: it can be hard to understand what caused the client to fail. Also, these errors are not always fatal, as with frame size limits, so complete shutdown was extreme. By bubbling up errors, it's now possible for the caller to programmatically handle them. For example, the error could be walked via anyhow::Error: ``` 2023-01-10T02:49:32.528939Z WARN client: the client failed to send the request Caused by: 0: could not write to the transport 1: frame size too big ``` * Some follow-up work: right now, read errors will bubble up to all pending RPCs. However, on the write side, only `start_send` bubbles up. `poll_ready`, `poll_flush`, and `poll_close` do not propagate back to pending RPCs. This is probably okay in most circumstances, because fatal write errors likely coincide with fatal read errors, which *do* propagate back to clients. But it might still be worth unifying this logic. --------- Co-authored-by: Tim Kuehn <tikue@google.com>

tikue self-assigned this Mar 16, 2023

tikue self-requested a review March 16, 2023 19:38

tikue requested changes Mar 16, 2023

View reviewed changes

tarpc/src/server.rs Outdated Show resolved Hide resolved

tarpc/src/client.rs Outdated Show resolved Hide resolved

tikue reviewed Mar 17, 2023

View reviewed changes

tarpc/tests/compile_fail/must_use_request_dispatch.stderr Outdated Show resolved Hide resolved

tikue requested changes Mar 22, 2023

View reviewed changes

tikue and others added 6 commits March 23, 2023 12:59

Add test for handling transport errors, unify ChannelError

a30b97f

Test all error cases for the client

99ae67b

Add tests for RpcError::Shudown

46a0ad8

Add tests for RpcError::Receive

02b4c66

tikue requested changes Mar 24, 2023

View reviewed changes

Add comment, split tests

664d135

bruwozniak requested a review from tikue March 24, 2023 13:38

tikue approved these changes Mar 24, 2023

View reviewed changes

tikue merged commit 93f3880 into google:master Mar 24, 2023

bruwozniak deleted the client_errors branch March 26, 2023 08:16

axos88 mentioned this pull request Apr 10, 2023

Bubble up server side transport errors to the client #403

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Return transport errors to the caller #399

Return transport errors to the caller #399

bruwozniak commented Mar 16, 2023

tikue left a comment

bruwozniak commented Mar 21, 2023

tikue Mar 22, 2023

bruwozniak Mar 22, 2023

tikue Mar 24, 2023

tikue left a comment

tikue Mar 24, 2023

tikue left a comment

bruwozniak commented Mar 27, 2023

tikue commented Mar 27, 2023

Return transport errors to the caller #399

Return transport errors to the caller #399

Conversation

bruwozniak commented Mar 16, 2023

tikue left a comment

Choose a reason for hiding this comment

bruwozniak commented Mar 21, 2023

tikue Mar 22, 2023

Choose a reason for hiding this comment

bruwozniak Mar 22, 2023

Choose a reason for hiding this comment

tikue Mar 24, 2023

Choose a reason for hiding this comment

tikue left a comment

Choose a reason for hiding this comment

tikue Mar 24, 2023

Choose a reason for hiding this comment

tikue left a comment

Choose a reason for hiding this comment

bruwozniak commented Mar 27, 2023

tikue commented Mar 27, 2023