Add client launch retries #218

sewerynplazuk · 2024-11-18T13:32:31Z

Add client-side launch retries. This is helpful to overcome instability of IPC based approach in case of running multiple tests in parallel.

acecilia · 2024-11-18T13:50:51Z

Sources/SBTUITestTunnelClient/SBTUITestTunnelClient.m

        }
    }
-
-    NSLog(@"[SBTUITestTunnel] Tunnel ready after %fs", CFAbsoluteTimeGetCurrent() - self.launchStart);


[question] Why remove this? Maybe needs to be moved higher, next to the attemptLaunchcall?

Good catch, I will revert this back once we agree on the solution.

acecilia · 2024-11-18T13:54:37Z

Sources/SBTUITestTunnelClient/include/SBTUITunneledApplication.h

+ *  @param options List of options to be passed on launch.
+ *  Valid options:
+ *  SBTUITunneledApplicationLaunchOptionResetFilesystem: delete app's filesystem sandbox
+ *  SBTUITunneledApplicationLaunchOptionDisableUITextFieldAutocomplete disables UITextField's autocomplete functionality which can lead to unexpected results when typing text.


I would not copy/paste here the options, will become hard to maintain. Maybe you can just say something like see method launchTunnelWithOptions for details about the options

Options are listed in the twin method. I don't want to introduce discrepancies on the interface.

tcamin · 2024-11-18T15:45:26Z

Could you elaborate and give some further context for the change? Is this on CI or locally on developers machines? I took a look at the last 10k test in CI and couldn't find a single case, and we're also running test concurrently (5 sims per node on a dozen of mac minis). I'm kind of unsure whether this changes could introduce other type of instabilities due to the fact that we're retrying only on just one side of the bridge. Did you investigate more in depth the reasons for your launch connection issues?

Partially related, UI tests are inherently unstable as there are many issues that can cause something to break and it is generally advisable to have an additional component that ensures that tests are automatically retried. Just as an example we built Mendoza which automatically retries our tests covering any type of instability, launch or test implementation related.

tcamin · 2024-11-18T15:49:10Z

Did you try disabling IPC and use HTTP tunneling? Does that make any difference? https://github.com/Subito-it/SBTUITestTunnel/blob/master/Documentation/Setup.md#tunneling-mode?

sewerynplazuk · 2024-11-19T12:02:49Z

@tcamin This comes out as a solution for launch exceptions on Failed getting IPC proxy. It happens on CI and is pretty rare (3 cases out of ~3000 tests on a single job).

@try {
	//Send a ping to check if the connection is still alive while waiting.
	[_connection.otherConnection.rootProxy _ping];
} @catch (NSException *exception) {
	if(_errorBlock)
	{
		_errorBlock([NSError errorWithDomain:DTXIPCErrorDomain code:1 userInfo:@{NSLocalizedDescriptionKey: exception.reason}]);
	}
	else
	{
		[exception raise];
	}
	somethingWentWrong = YES;
}

I understand that the ping above fails. Unfortunately I wasn't able to narrow down the issue any further so I anticipate some sort of race condition (perhaps more than a single ping should be performed?)

Currently, there is no way to reliably recover from this error (while a retry helps in our case) other than restarting the test, which has some overhead I'd like to avoid. Using client's delegate to capture the issue is too late.

Did you try disabling IPC and use HTTP tunneling? Does that make any difference?

I saw this issue #127 and switching to HTTP solves it but it is not possible in our case.

tcamin · 2024-11-21T10:26:27Z

I tried to manually replicate the exception path the first time - [_DTXIPCDistantObject forwardInvocation:] is invoked but the retry logic did not seem to work. Moreover when running the Debug build configuration I see the - [SBTUITestTunnelServer takeOffOnceIPCWithServiceIdentifier] fail at synchronousRemoteObjectProxyWithErrorHandler. Is there any way we could write a test that verifies the retry logic?

@tcamin This comes out as a solution for launch exceptions on Failed getting IPC proxy. It happens on CI and is pretty rare (3 cases out of ~3000 tests on a single job).
@try {
	//Send a ping to check if the connection is still alive while waiting.
	[_connection.otherConnection.rootProxy _ping];
} @catch (NSException *exception) {
	if(_errorBlock)
	{
		_errorBlock([NSError errorWithDomain:DTXIPCErrorDomain code:1 userInfo:@{NSLocalizedDescriptionKey: exception.reason}]);
	}
	else
	{
		[exception raise];
	}
	somethingWentWrong = YES;
}
I understand that the ping above fails. Unfortunately I wasn't able to narrow down the issue any further so I anticipate some sort of race condition (perhaps more than a single ping should be performed?)

Currently, there is no way to reliably recover from this error (while a retry helps in our case) other than restarting the test, which has some overhead I'd like to avoid. Using client's delegate to capture the issue is too late.

Did you try disabling IPC and use HTTP tunneling? Does that make any difference?

I saw this issue #127 and switching to HTTP solves it but it is not possible in our case.

sewerynplazuk · 2024-11-21T16:54:57Z

Thanks. I'll have a look into this, as well as into unit testing the retries.

tcamin · 2024-11-21T17:09:51Z

Thanks. I'll have a look into this, as well as into unit testing the retries.

Thanks! It would be great if you could add an integration test, similarly to the other ones that have already been written for the library.

Add client launch retries

729feb5

sewerynplazuk mentioned this pull request Nov 18, 2024

Add client launch retries revolut-mobile/SBTUITestTunnel#1

Merged

acecilia approved these changes Nov 18, 2024

View reviewed changes

DROP

70b2a79

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add client launch retries #218

Add client launch retries #218

sewerynplazuk commented Nov 18, 2024

acecilia Nov 18, 2024

sewerynplazuk Nov 19, 2024

acecilia Nov 18, 2024

sewerynplazuk Nov 19, 2024

tcamin commented Nov 18, 2024 •

edited

Loading

tcamin commented Nov 18, 2024

sewerynplazuk commented Nov 19, 2024 •

edited

Loading

tcamin commented Nov 21, 2024

sewerynplazuk commented Nov 21, 2024

tcamin commented Nov 21, 2024

Add client launch retries #218

Are you sure you want to change the base?

Add client launch retries #218

Conversation

sewerynplazuk commented Nov 18, 2024

acecilia Nov 18, 2024

Choose a reason for hiding this comment

sewerynplazuk Nov 19, 2024

Choose a reason for hiding this comment

acecilia Nov 18, 2024

Choose a reason for hiding this comment

sewerynplazuk Nov 19, 2024

Choose a reason for hiding this comment

tcamin commented Nov 18, 2024 • edited Loading

tcamin commented Nov 18, 2024

sewerynplazuk commented Nov 19, 2024 • edited Loading

tcamin commented Nov 21, 2024

sewerynplazuk commented Nov 21, 2024

tcamin commented Nov 21, 2024

tcamin commented Nov 18, 2024 •

edited

Loading

sewerynplazuk commented Nov 19, 2024 •

edited

Loading